Next Article in Journal
Entangling Lattice-Trapped Bosons with a Free Impurity: Impact on Stationary and Dynamical Properties
Next Article in Special Issue
Physical Limitations on Fundamental Efficiency of SET-Based Brownian Circuits
Previous Article in Journal
Coupled Feedback Loops Involving PAGE4, EMT and Notch Signaling Can Give Rise to Non-Genetic Heterogeneity in Prostate Cancer Cells
Previous Article in Special Issue
Computational Abstraction
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Conditional Action and Imperfect Erasure of Qubits

by
Heinz-Jürgen Schmidt
Fachbereich Physik, Universität Osnabrück, D-49069 Osnabrück, Germany
Entropy 2021, 23(3), 289; https://doi.org/10.3390/e23030289
Submission received: 26 January 2021 / Revised: 20 February 2021 / Accepted: 23 February 2021 / Published: 26 February 2021
(This article belongs to the Special Issue Physical Information and the Physical Foundations of Computation)

Abstract

:
We consider state changes in quantum theory due to “conditional action” and relate these to the discussion of entropy decrease due to interventions of “intelligent beings” and the principles of Szilard and Landauer/Bennett. The mathematical theory of conditional actions is a special case of the theory of “instruments”, which describes changes of state due to general measurements and will therefore be briefly outlined in the present paper. As a detailed example, we consider the imperfect erasure of a qubit that can also be viewed as a conditional action and will be realized by the coupling of a spin to another small spin system in its ground state.

1. Introduction

According to a widespread opinion, there are two types of state change in quantum mechanics: time evolution in closed systems and state changes due to measurements. The mathematical description of these two processes is known in principle:
(i)
Time evolution in closed systems can be described by means of unitary operators U ( t ) according to:
ρ U ( t ) ρ U ( t ) ,
where U ( t ) is obtained by the solutions of the time-dependent Schrödinger equation and ρ denotes any statistical operator.
(ii)
Conditional state changes according to the outcome of a measurement of an observable A will be described, in the simplest case, by maps of the form:
ρ P n ρ P n ,
where P n n N is the family of eigenprojections of a self-adjoint operator A = n a n P n . Without selection according to the outcomes of the measurement, the total state change will be:
ρ n N P n ρ P n ,
In a recent article [1], we suggested a third type of state change, called “conditional action”, that combines the two aforementioned ones insofar as it describes a state change depending on the result of a preceding measurement.
(iii)
In the simplest case, a conditional action is mathematically described by maps of the form:
ρ U n P n ρ P n U n ,
with the same notation as in (2) and a family of unitary operators U n n N . Without selection according to the outcomes of the measurement, the total state change will be:
ρ n N U n P n ρ P n U n ,
We emphasize that the above form of state changes in (2)–(5) represents only the simplest cases and will be generalized in the following sections. Before explaining the details of this suggestion and suitable generalizations, we will fix some general notation used in the present paper. A measurement leading to the state change (2) is called a “Lüders measurement” in accordance with [2], sometimes also called “projective measurements” in the literature. In order not to have to go into the technical intricacies, the quantum system Σ will be described by a finite-dimensional Hilbert space H . In this case, the index set N will also be finite. Let B ( H ) denote the real linear space of Hermitian operators A : H H and B 1 + ( H ) the convex subset of statistical operators, i.e., Hermitian operators ρ with non-negative eigenvalues and Tr ρ = 1 . The state changes considered above in (2) and (3) can be viewed as a map L : N × B ( H ) B ( H ) defined by:
L ( n ) ( ρ ) : = P n ρ P n ,
which will be called a “Lüders instrument”, and the corresponding map:
L ( N ) ( ρ ) : = n N P n ρ P n ,
the “total Lüders operation”. A difference of the two state changes according to (i) and (ii) arises when we consider the change of the von Neumann entropy:
S ( ρ ) : = Tr ρ log ρ , for ρ B 1 + ( H ) .
Under unitary time evolutions (i), the entropy remains constant,
S ( ρ ) = S U t ρ U t ,
whereas for Lüders measurements (ii), the entropy may increase, and we can only state that:
S ( ρ ) S ( L ( N ) ( ρ ) ) ;
see [3,4,5], in accordance with the second law of thermodynamics.
In contrast to closed systems, time evolution in open systems can take a more general form. An obvious model to account for the time evolution in open systems is to consider the extension of the system Σ with Hilbert space H by another, auxiliary system E (environment, heat bath, measurement apparatus, etc.) with Hilbert space K and the unitary time evolution V of the total system Σ + E . If the total system is initially in the state ρ σ , it will generally evolve into an entangled state V ρ σ V . In the end, we again consider the system Σ and find its reduced state ρ 1 given by the partial trace:
ρ 1 = Tr K V ρ σ V .
The corresponding state change ρ ρ 1 will, in general, not be of the unitary type (1), but represents a natural extension (ie) of the state changes according to (i). In general, the entropy balance for these state changes is ambivalent: S ( ρ 1 ) can be smaller or larger than S ( ρ ) . In fact, the initial entropy of the total system is S ( ρ ) + S ( σ ) , and the unitary time evolution V leaves this invariant. However, the separation of the total system into its parts ρ 1 according to (11) and:
ρ 2 = Tr H V ρ σ V ,
increases the entropy (or leaves it constant) according to the “subadditivity” of S (see [4], 11.3.4.), and hence:
S ( ρ ) + S ( σ ) S ( ρ 1 ) + S ( ρ 2 ) .
However, S ( ρ 1 ) S ( ρ ) may assume positive or negative values. This can be physically understood as the phenomenon that, apart from a possible increase of the total entropy according to (13), there may be an entropy flow from the system Σ into the environment E or vice versa.
An analogous extension of the system Σ to Σ + E can also be considered for Lüders measurements. We again start with an initial total state ρ σ , where σ B 1 + ( K ) , and a unitary time evolution V of the total system. Then, a Lüders measurement corresponding to a complete family Q n n N of mutually orthogonal projections of the auxiliary system is performed, and the post-measurement total state is reduced to the system Σ leading to the final state:
I ( n ) ( ρ ) : = Tr K 𝟙 Q n V ρ σ V 𝟙 Q n ,
or, without selection, to:
I ( N ) ( ρ ) : = Tr K n 𝟙 Q n V ρ σ V 𝟙 Q n .
Thus, we obtain extensions (iie) of the state changes (ii) due to Lüders measurements by maps I : N × B ( H ) B ( H ) of the form (14), which are called “instruments” in the literature; see [6] and Section 2 for more precise mathematical definitions. Lüders instruments are idealized special cases of general instruments that, in some sense, minimize the perturbation of the Σ system by the measurement, but “real measurements” are better described by general instruments. Analogous remarks as in the case of open systems apply for the entropy balance: It is well known (see [4] or [7], Exercise 11.15) that general measurements may decrease the system’s entropy.
The latter observation leads us to the suggestion [1] that the entropy decrease of systems due to the “intervention of intelligent beings” as, e.g., Maxwell’s demon, can be explained by the same mechanism. Originally, the notion of “conditional action” was developed to describe the intervention of Maxwell’s demon in the energy distribution of a gas with two chambers: depending on the result of an energy measurement on a gas molecule approaching the partition between the two chambers, a door is opened or shut. Thus, the further time evolution of the gas depends on the result of the measurement. Similarly, the result of measuring whether a single molecule is in the left or right chamber can be used to trigger an isothermal expansion to the right or left (Szilard’s engine). Szilard argues [8] that the entropy decrease of the system is compensated by the entropy costs of acquiring information about the position of the gas particle (“Szilard’s principle”). His arguments are formulated within classical physics and not easy to understand; see also the analysis and reconstruction of Szilard’s reasoning in [9,10,11]. Nevertheless, it seems possible that the entropy decrease due to such external interventions is a special case of the well-understood entropy decrease due to state changes described by general instruments.
In fact, it can be easily confirmed that the maps of the form (4) describing “conditional action” are special cases of instruments and hence were called “Maxwell instruments” in [1]. The mathematical notion of state changes described by instruments is sufficiently general to cover not only changes due to inevitable measurement disturbances, but also “deliberate” state changes depending on the result of a measurement.
This notion of “conditional action” will be slightly generalized in the present paper and then comprises not only interventions of Maxwell’s demon or cycles of Szilard’s engine [1,8,12], but also quantum teleportation ([4], Ch. 1.3.7), quantum error correction ([4], Ch. 10), or the erasure of qubits [1]. With respect to the latter example, there is some relationship with the recently proposed definition of “generalized erasure channels” in [13].
Relative to the choice of a suitable basis, a qubit has two values, “0” or “1”. Consider a “yes-no”-measurement corresponding to said basis. If the result is “1”, the two states are swapped; hence, 1 0 . If the result is “0”, then nothing is done; hence 0 0 . This constitutes the conditional action, which sets the qubit state to its default value “0” for any case, and hence, it can be legitimately considered as an “erasure of a qubit”.
The latter example contains an ironic punch line in that the erasure of memory contents with measurement results and the corresponding entropy costs are usually considered to resolve the apparent contradiction between the actions of Maxwell’s demon and the second law (Landauer’s principle). If memory erasure itself were taken as a conditional action, we would seem to enter an infinite circle of creating and erasing new memory contents. The obvious resolution to this problem is the observation that the entropy decrease in the system Σ is due to some flow of entropy from Σ to the auxiliary system E as described above. If E can be viewed as a “memory device”, then at the end of the conditional action, it already contains the missing entropy. It is not necessary to erase the content of the memory. The latter would not create the missing entropy, but only make it visible.
As in [1], it seems sensible to distinguish between the principle that erasure of memory produces entropy [14] (“Landauer’s principle” in the narrow sense) and the position that this effect constitutes the solution of the apparent paradox of Maxwell’s demon [15] (“Landauer/Bennett principle”). Moreover, it will be a matter of substantiating our critique of the Landauer/Bennett principle (not of the Landauer principle) outlined above with a more realistic model of qubit erasure than that given in [1].
To this end, we realize the qubit (the system Σ ) by a single spin with spin quantum number s = 1 / 2 described by a Hilbert space H C 2 s + 1 = C 2 and model the erasure of the qubit by the coupling of the single spin with a “heat bath” E consisting of N = 6 spins such that the time evolution of the total system can be analytically calculated. The quotation marks refer to the fact that the “heat bath” is pretty small and not macroscopic, as usually required, and that, moreover, it is rather a “cold bath”. This is due to the choice of the default value ”0” of the qubit as the ground state ↓ of the single spin. Thus, erasing the qubit is physically equivalent to cooling the system Σ to the temperature T = 0 . Although this is, strictly speaking, impossible due to the third law of thermodynamics, it can be approximately accomplished by coupling the single spin to a system of N = 6 spins in its ground state. Here, we ignore the physical impossibility to prepare a system in its ground state and consider the ground state of the “heat bath” as a suitable approximation to a state of very low temperature. This approximation has the advantage of providing fairly simple expressions for the relevant quantities considered in this paper. The corresponding calculations are presented in Section 5.
As a side effect of this account results the necessity to define the concept of “conditional action” somewhat more generally than in [1]. This is done in Section 2, where we also recapitulate the basic notions of quantum measurement theory required for the present work. A critical account of the Szilard principle in the realm of quantum theory is given in Section 3, where we also formulate an upper bound for the entropy decrease due to conditional action that is compatible with Szilard’s reasoning, but only valid under certain restrictions. A similar bound is derived in Section 4, where the connections of the present theory with the observer-local referential (OLR) approach [16] are considered. The proofs are moved to the Appendix, as is the explicit construction of a “standard” measurement dilation for a general instrument. This measurement dilation is well known, but nevertheless reproduced here since some arguments given in this paper depend on its details. We close with a summary and outlook in Section 6.

2. General Definitions and Results

In the following, we will heavily rely on the mathematical notions of operations and instruments. Although these notions are well known (see, e.g., [6,17,18,19]), it will be in order to recall the pertinent definitions adapted to the present purposes and their interpretations in the context of measurement theory. For readability, we sometimes will repeat definitions already presented in the Introduction.
Let H be a d-dimensional Hilbert space, B ( H ) denote the space of Hermitian operators A : H H , and B + ( H ) the cone of positively semi-definite operators, i.e., having only non-negatives eigenvalues. The convex subset B 1 + ( H ) B + ( H ) consists of statistical operators ρ with Tr ρ = 1 . Such operators physically describe (mixed) states. Pure states are represented by one-dimensional projectors P ψ , where ψ H with ψ = 1 .
According to [4], 8.2.1, there are three equivalent ways to define operations:
  • By considering the system Σ coupled to environment E,
  • by an operator-sum representation, or
  • via physically motivated axioms.
Here, we follow the second approach and define an “operation” to be a map A : B ( H ) B ( H ) of the form:
A ( ρ ) = i I A i ρ A i ,
with the Kraus operators A i : H H and a finite index set I ,
such that:
Tr A ( ρ ) 1 for   all ρ B 1 + ( H ) ;
see [17]. It follows that an operation is linear and maps B + ( H ) into itself. It is mathematically convenient not to require that an operation preserves the trace. The normalized state after the operation would be obtained as A ( ρ ) Tr A ( ρ ) .
Operations are intended to describe state changes due to measurements. For example, the total Lüders operation (7) is a trace-preserving operation in the above sense with I = N and A n = P n for all n N . An operation A : B ( H ) B ( H ) will be called pure iff the representation (16) of A can be reduced to a single Kraus operator, i.e.,
A ( ρ ) = A 1 ρ A 1 .
Physically, this means that a pure operation maps pure states onto pure states, up to a positive factor.
There exists a so-called statistical duality between states and observables; see [6], Chapter 23.1. In the finite-dimensional case, B ( H ) can be identified with its dual space B ( H ) by means of the Euclidean scalar product Tr ( A B ) . Physically, we may distinguish between the two spaces in the sense that B ( H ) is spanned by the subset of statistical operators representing states and B ( H ) is spanned by the subset of operators with eigenvalues in the interval [ 0 , 1 ] representing effects. Effects describe yes-no measurements including the subset of projectors, which are the extremal points of the convex set of effects; see [6].
Every operation A : B ( H ) B ( H ) , viewed as a transformation of states (Schrodinger picture), gives rise to the dual operation A : B ( H ) B ( H ) viewed as a transformation of effects (Heisenberg picture). Reconsider the representation (16) of the operation A by means of the Kraus operators A i . Then, the dual operation A has the corresponding representation:
A ( X ) = i I A i X A i ,
for all X B ( H ) .
Let N be a finite set of outcomes. Then, the map I : N × B ( H ) B ( H ) will be called an instrument iff:
  • I ( n ) is an operation for all n N and
  • Tr n N I ( n ) ( ρ ) = Tr ρ for all ρ B ( H ) .
The first condition can be re-written as:
I ( n ) ( ρ ) = i I n A n i ρ A n i for   all n N ,
with suitable Kraus operators A n i : H H . The second condition can be rephrased by saying that the total operation I ( N ) defined by:
I ( N ) ( ρ ) n N I ( n ) ( ρ )
will be trace-preserving. An instrument I will be called “pure” iff each operation I ( n ) , n N , is pure.
Examples of pure instruments are given by Lüders instruments (6) and “Maxwell instruments” (4).
Similar as for operations, every instrument I gives rise to a dual instrument I : N × B ( H ) B ( H ) defined by:
I ( n ) ( X ) : = I ( n ) ( X )
for all n N and X B ( H ) . The condition that the total operation (21) will be trace-preserving translates into:
I ( N ) ( 𝟙 ) = n N I ( n ) ( 𝟙 ) = n N i I n A n i A n i = 𝟙 .
Thus, every dual instrument yields a resolution of the identity by means of effects:
F n : = I ( n ) ( 𝟙 ) = i I n A n i A n i ,
and hence to a generalized observable in the sense of a positive operator-valued measure F = F n n N ; see [6]. Note, however, that compared to the general definition in [6], we will have to consider generalized observables only in the discrete, finite-dimensional case. The traditional notion of “sharp” observables represented by self-adjoint operators corresponds to the special case of a projection-valued measure P n n N satisfying n N P n = 𝟙 . From now on, we will correspondingly distinguish between “sharp observables” and “generalized observables”.
It can be shown [1] that “Maxwell instruments” are just pure instruments corresponding to sharp observables. The example of imperfect erasure of qubits considered in Section 5 suggest that the class of “Maxwell instruments” is too narrow to describe realistic conditional actions. First, imperfect erasure cannot be described by a pure instrument, since the initial state of the “heat bath” is not pure. Moreover, the measurement of a sharp “heat bath” observable does not give rise to a sharp qubit observable. Hence, it seems sensible to use general instruments to describe conditional action. Fortunately, the main results on the entropy balance of conditional action in [1] can be easily generalized to general instruments.
To this end, we reconsider the map I defined in (14) by means of coupling the system Σ to some environment E. Recall that the environment E is described by some Hilbert space K and an initial state σ B 1 + K . Moreover, V denotes the unitary time evolution of the total system and Q n n N a sharp environment observable. It can be shown that (i) (14) defines an instrument in the above sense and (ii) every instrument can be obtained in this way (see Theorem 7. 14 of [6], Exercise 8. 9 of [4], or Appendix A). The special instrument defined in (14) will be referred to as a “measurement dilation” D K , σ , V , Q of a given instrument I . If the initial state σ of the environment is pure, σ = P ϕ , the measurement dilation will also be denoted by D K , ϕ , V , Q . A measurement dilation of a given instrument I is hence a physical realization of I by a Lüders instrument of the extended system Σ + E and a subsequent reduction to Σ .
Let a conditional action be described by the instrument I with measurement dilation I = D K , σ , V , Q . With respect to this measurement dilation, we define the two reduced states:
ρ 1 : = Tr K n N 𝟙 Q n V ρ σ V 𝟙 Q n ,
and:
ρ 2 : = Tr H n N 𝟙 Q n V ρ σ V 𝟙 Q n .
Then, the analogous arguments leading to the entropy balance (13) also prove the following:
Proposition 1.
Under the preceding conditions, the following holds:
Δ S : = S ( ρ ) S ( ρ 1 ) S ( ρ 2 ) S ( σ ) .
In connection with the Szilard principle discussed in the next section, the following proposition will be of some interest:
Proposition 2.
The total operation I ( N ) of an instrument with measurement dilation I = D K , σ , V , Q is independent of the sharp environment observable Q.
This means in particular that I ( N ) could even be realized by a coupling of Σ to some environment E, unitary time evolution and final state reduction, without any measurement at all. The proof of Proposition 2 can be found in Appendix B.1.

3. The Szilard Principle Revisited

As mentioned in the Introduction, the ideas of L. Szilard [8] to resolve the apparent contradiction between the results of the “intervention of intelligent beings” and the second law were published more than nine decades ago and are confined to classical physics. Therefore, the reconstruction of “Szilard’s principle” for quantum mechanics could appear as somewhat daring. In this section, nevertheless, we will reconsider what we understand by “Szilard’s principle” from the point of view developed in the present article.
According to this principle, the entropy decrease of the system is compensated by the entropy costs of acquiring information about the system’s state. Recall that we have considered a so-called measurement dilation of the instrument I describing state changes due to conditional action, which extends the system Σ by an auxiliary system E (environment). For the “standard dilation” given in Appendix A, the dimension of the Hilbert space K corresponding to the auxiliary system E equals the number of outcomes N of the Lüders measurement of Q if the instrument I is pure. It is therefore tempting to consider the auxiliary system E as a “memory” that holds the information about the result of the measurement and to interpret the “entropy cost of information acquisition” as the entropy S ( ρ 2 ) of the final state ρ 2 of E after the measurement. The probabilities p n , n N , of the various outcomes are given by p n = Tr ρ F n , where F = F n n N is the generalized observable (24) corresponding to the conditional action and ρ is the initial state of the system Σ . The Shannon entropy H ( p ) of the probability distribution p n n N is independent of any measurement dilation and will be called the “Shannon entropy of the experiment”:
H ( ρ , F ) : = H ( p ) : = n N p n log p n .
Then, we can prove the following inequality, which confirms Szilard’s principle in the above given version:
Theorem 1. 
(Szilard’s principle: quantum case)
The entropy decrease Δ S = S ( ρ ) S ( ρ 1 ) of a conditional action corresponding to a pure instrument I is bounded by the Shannon entropy of the experiment, i.e.,
Δ S H ( ρ , F ) .
For the proof, see Appendix B.2. It is worth noting that the bound in (29) is independent of the pure instrument describing the conditional action and depends only on the probabilities p n = Tr ρ F n . The theorem is trivially satisfied if the conditional action leads to an increase of entropy, i.e., Δ S 0 , as in the case of a Lüders measurement without any conditional action. Another trivial case is given if the observable F is sharp and the projections F n are one-dimensional. In this case, S ( ρ ) S n F n ρ F n = H ( ρ , F ) , and the theorem holds since S ( ρ 1 ) 0 . In other words: entropy cannot fall below the value of zero.
Otherwise, the bound (29) is non-trivial. Consider the example of H = C 3 with three mutually orthogonal one-dimensional projections P 1 , P 2 , P 3 ,
ρ = 1 2 P 1 + 3 10 P 2 + 1 5 P 3 ,
and F = ( P 1 , P 2 + P 3 ) . It follows that:
S ( ρ ) = 1 2 log 1 2 3 10 log 3 10 1 5 log 1 5 1.02965 > H ( ρ , F ) = log 2 0.693147 ,
and hence, in this example, Theorem 1 says more than just that the entropy of ρ cannot drop to negative values. It is straightforward to construct a Maxwell instrument corresponding to the generalized observable F such that ρ 1 = 4 5 P 1 + 1 5 P 2 , and hence, Δ S 0.529251 < log 2 0.693147 , in accordance with Theorem 1.
A slight generalization of Theorem 1 is the following:
Corollary 1.
The upper bound (29) also holds if the instrument I can be written as a convex linear combination of pure instruments with the same set of outcomes N .
For the proof, see Appendix B.3. A pure instrument has a standard dilation with one-dimensional projections Q n , n N , and a pure initial state σ = P ϕ of E. If we extend this standard dilation by considering a real mixed initial state σ of E, we obtain a convex combination of pure instruments for which Corollary 1 holds. However, not every instrument is a convex linear combination of pure ones. Actually, there exist instruments where the bound of entropy decrease given in (29) is violated.
To provide an example, we consider the (perfect) erasure of two qubits. Thus, H = C 2 C 2 C 4 , and we consider an orthonormal basis of H denoted by ψ 1 = , ψ 2 = , ψ 3 = , ψ 4 = . The conditional action maps all these four basis states onto the default state ψ 4 = . It has the following measurement dilation: K = H , initial auxiliary state ϕ = , unitary time evolution V of the total system defined by V ( Φ Ψ ) = Ψ Φ , and Lüders measurement of the auxiliary sharp observable Q = Q ν ν = 1 , , 4 = | ψ ν ψ ν | ν = 1 , , 4 . The corresponding instrument I = D K , ϕ , V , Q is pure and hence satisfies (29).
Then, we consider another instrument I ˜ by changing the auxiliary sharp observable to Q ˜ = Q 1 + Q 2 , Q 3 + Q 4 = | ψ 1 ψ 1 | 𝟙 , | ψ 2 ψ 2 | 𝟙 . The corresponding system observable F ˜ = | | 𝟙 , | | 𝟙 is also sharp, two-valued, and corresponds to a measurement of the first qubit w.r.t. the considered basis. All other components of D K , ϕ , V , Q are left unchanged. Consider the initial state ρ = 1 4 𝟙 of the system with S ( ρ ) = log 4 and H ( ρ , F ˜ ) = log 2 . It follows that V ( ρ P ϕ ) V = P ϕ ρ , and hence, ρ 1 = P ϕ and S ( ρ 1 ) = 0 . Consequently, Δ S = S ( ρ ) S ( ρ 1 ) = log 4 > log 2 = H ( ρ , F ˜ ) , in contrast to (29).
Similar examples abound: whenever the entropy decrease Δ S due to a conditional action is larger than log 2 , then a corresponding measurement dilation can be modified to yield S ( ρ , F ˜ ) = log 2 without changing Δ S due to Proposition 2. The modified instrument I ˜ cannot be written as a convex linear combination of pure instruments according to Corollary 1.
As a conclusion for the evaluation of Szilard’s principle, we can state that there are examples of conditional actions where the entropy decrease in the system can be explained by an entropy increase at least as large in a memory, as well as counterexamples. In the counterexamples, however, we have no violation of the second law, but only an impossibility to reduce the auxiliary system to its function as a memory. This is especially true for the limiting case of an entropy reduction without measurement.
The example of Section 5 shows that the upper bound (29) of the entropy decrease holds for a larger class of conditional actions than given by Theorem 1 or Corollary 1. It remains an open task to determine this class more precisely.

4. Connections to the OLR Approach

There exists a vast amount of literature on Maxwell’s demon and related questions [10,11,20]. Among them is an article that comes rather close to the results of the present work, namely [21], that deals with Szilard’s engine and where we read in the abstract:
In this paper, Maxwell’s Demon is analyzed within a “referential” approach to physical information that defines and quantifies the Demon’s information via correlations between the joint physical state of the confined molecule and that of the Demon’s memory. On this view […] information is erased not during the memory reset step of the Demon’s cycle, but rather during the expansion step, when these correlations are destroyed.
The mentioned notion of “observer-local referential (OLR) information” is further outlined in [16]. A detailed comparison of the “conditional action approach” and the “OLR approach” is beyond the scope of this paper. Arguably, a key difference is that we could not model the formation of a correlation by a measurement and the subsequent destruction of that correlation by a conditional action as a sequence of state changes, and instead had to use measurement dilation as a surrogate; see Section 6.
To illustrate the nevertheless existing connections between the two approaches, we will derive another upper bound for the entropy decrease analogous to Theorem 1 using the notion of OLR information. We will restrict ourselves to the case of conditional actions described by Maxwell instruments M , i.e., instruments of the form (4). Let L denote the corresponding Lüders instrument of the form (2) such that M and L share the same Hilbert space H and the same sharp observable P = P n n N . Further, consider the standard measurement dilations:
L = D K , ϕ , V , Q , and M = D K , ϕ , V , Q ,
as explicitly constructed in Appendix A, but specialized to the case of pure instruments, see also [1]. With respect to these measurement dilations, we further define:
ρ 12 : = n N 𝟙 Q n V ρ P ϕ V 𝟙 Q n ,
ρ 1 : = Tr K ρ 12 , and ρ 2 : = Tr H ρ 12 ,
and analogously for the primed quantities:
ρ 12 : = n N 𝟙 Q n V ρ P ϕ V 𝟙 Q n ,
ρ 1 : = Tr K ρ 12 , and ρ 2 : = Tr H ρ 12 .
In accordance with [16], we define the OLR information:
I : = S ( ρ 1 ) + S ( ρ 2 ) S ( ρ 12 ) = : S 1 + S 2 S 12 ,
I : = S ( ρ 1 ) + S ( ρ 2 ) S ( ρ 12 ) = : S 1 + S 2 S 12 ,
Δ I : = I I .
Further, let:
S 0 : = S ( ρ ) = S ( ρ P ϕ ) = S ( ρ P ϕ ) ,
and:
Δ S : = S 0 S 1 .
Then, we can prove the following:
Proposition 3.
Under the preceding conditions, the entropy decrease Δ S is bounded from above by:
Δ S Δ I .
The proof of Proposition 3 can be found in Appendix B.4. Of course, this is only a first step to analyze the mentioned relations, since the conditions of Proposition 3 are rather limited, e.g., by the fact that only the standard dilation is considered and not an arbitrary measurement dilation.

5. Imperfect Erasure of a Qubit

The role of the detailed example considered in this section is twofold: First, we can explain and illustrate the definitions of the previous sections using a non-trivial, but still computable example. Second, this reasonably realistic case also demonstrate the viability of the general theory.

5.1. Definition of the Model

We consider a system of N spins with spin quantum number s = 1 / 2 equipped with a uniform anti-ferromagnetic Heisenberg coupling and a Zeeman term. This leads to a Hamiltonian:
H N = J 1 μ < ν N s s μ · s s ν + B μ = 1 N s s μ z ,
where J > 0 and B > 0 are dimensionless physical parameter characterizing the spin system. s s μ = s s μ x , s s μ y , s s μ z represents the vector of spin operators at the site μ . It is well known that the corresponding time evolution can be analytically calculated since we may write the Hamiltonian in the form:
H N = J 2 S 2 3 N 4 𝟙 + B S z ,
where S S : = μ = 1 N s s μ denotes the total vector of spin operators and S S z its z-component. Since S S 2 and S S z commute, they possess a system of common eigenvectors α ; S , M satisfying the eigenvalue equations:
S S 2 α ; S , M = S ( S + 1 ) α ; S , M ,
S S z α ; S , M = M α ; S , M ,
and hence:
H N α ; S , M = J 2 S ( S + 1 ) 3 N 4 + B M α ; S , M = : E N ( S , M ) α ; S , M .
The theory of coupling angular momenta treated in many textbooks yields that the quantum number S assumes the values 1 2 , 1 , 3 2 , , N 2 for odd N and 0 , 1 , 2 , , N 2 for even N and M = S , S + 1 , , S 1 , S . The symbol “ α ” stands for further quantum numbers that allow for the degeneracy D N ( S ) of the eigenspaces with common eigenvalues S ( S + 1 ) and M of S S 2 and S S z , resp., such that the normalized eigenvectors α ; S , M will be unique up to a phase. For a selection of such degeneracies, see Figure 1.
We will consider a single spin with s = 1 / 2 with Hilbert space H C 2 representing one qubit and try to realize the erasure of the qubit by coupling the single spin to a “heat bath” consisting of N = 6 uniformly coupled spins such that the total Hamiltonian H will be of the form H 7 . Moreover, we choose J = B = 1 , thereby fixing a natural energy unit and corresponding physical units of time and temperature by setting = k B = 1 . The choice of the “heat bath” with N = 6 spins has the pleasant consequence that all relevant quantities can be directly calculated by computer-algebraic means without resorting to the theory of coupling angular momenta.
The “heat bath” with Hilbert space K C 64 has a ground state with energy E 6 ( 0 , 0 ) = E 6 ( 1 , 1 ) = 9 4 that is 14-fold degenerate. This follows from the degeneracies D 6 ( S = 0 ) = 5 and D 6 ( S = 1 ) = 9 ; see Figure 1. Let Q 0 denote the projector onto the corresponding eigenspace and Q 1 the complementary projector such that Q 0 + Q 1 = 𝟙 K .
We will assume that initially, the “heat bath” is in its ground state σ : = 1 14 Q 0 corresponding to the temperature T = 0 , whereas the single spin is in an arbitrary mixed state ρ . Then, a unitary time evolution U t : = exp i t H takes place followed by a Lüders measurement of the sharp heat bath observable ( Q 0 , Q 1 ) . After this measurement, we consider the two reduced states:
ρ 1 = Tr K n = 0 , 1 𝟙 Q n U t ρ σ U t 𝟙 Q n ,
and:
ρ 2 = Tr H n = 0 , 1 𝟙 Q n U t ρ σ U t 𝟙 Q n .
Obviously, ρ 1 is the result of the total operation ρ 1 = I ( N ) ( ρ ) corresponding to the instrument:
I ( n ) ( ρ ) : = Tr K 𝟙 Q n U t ρ σ U t 𝟙 Q n ,
where n N = { 0 , 1 } .
It turns out that for the special model, we considered that the matrix elements of ρ n are 4 π -periodic functions of t. Instead of dwelling on a debate for how to cope with these oscillating terms, we simply make the choice t = 2 π , i.e., we consider the time evolution of a half period before performing the final measurement. This choice gives reasonable results, which suffice to construct an example of the general theory outlined in this paper. Now, all parameters of our model for imperfect erasure are fixed, and we proceed by presenting the relevant results without explicating the further details of the computer-algebraic calculation.

5.2. Results on the Instrument I

The first results concern the calculation and visualization of the total trace-preserving operation ρ ρ 1 = I ( N ) ( ρ ) . Recall that ρ B ( H ) where the latter is a four-dimensional space spanned by the four Pauli matrices:
σ 0 = 1 0 0 1 , σ 1 = 0 1 1 0 , σ 2 = 0 i i 0 , σ 3 = 1 0 0 1 .
They are mutually orthogonal w.r.t. the Euclidean scalar product ( A , B ) Tr A B of Hermitian 2 × 2 -matrices and have the length 2 . With respect to this basis, the total operation I ( N ) can be represented by the 4 × 4 matrix:
I = 1 7 7 0 0 0 0 1 0 0 0 0 1 0 4 0 0 3 .
Note that I ( N ) being trace-preserving is equivalent to the property I ( N ) ( σ 0 ) = σ 0 of the dual instrument I . The latter is represented by the transposed matrix I , and hence, the first row of I , which corresponds to the first column of I , must be necessarily of the form ( 1 , 0 , 0 , 0 ) .
The density matrices ρ in the Hilbert space H C 2 can be represented by the points ( x 1 , x 2 , x 3 ) of a unit ball in R 3 such that the pure states corresponding to one-dimensional projectors form its surface S 2 , the so-called “Bloch sphere”. This representation is given by:
ρ = 1 2 σ 0 + i = 1 3 x i σ i .
Under the total operation I ( N ) , which is an affine map on states, the Bloch sphere is mapped onto an ellipsoid E lying inside S 2 ; see Figure 2. Due to det ( I ) = 1 7 × 1 7 × 3 7 = 3 343 , the volume of S 2 is compressed to less than one percent. This volume compression is typical for conditional action. The invariance of E under rotations about the three-axis is due to the azimuthal symmetry of the Hamiltonian (43) and of the initial state of the “heat bath”.
Some properties of the mapping I ( N ) can be read off the matrix (52): The south pole of S 2 is mapped onto itself, and this is the only point where E touches the Bloch sphere. Since this south pole corresponds to the state ρ = 1 2 σ 0 σ 3 , its invariance under I ( N ) is reflected by the equation I ( 1 , 0 , 0 , 1 ) = ( 1 , 0 , 0 , 1 ) . Physically, the south pole represents the ground state of the qubit corresponding to the Gibbs state with temperature T = 0 . Its invariance under I ( N ) hence means that the qubit remains in its ground state if it is coupled to a “heat bath” of temperature T = 0 , which is very plausible.
The orientation relative to the coordinate system and the semi-axes ( 1 7 , 1 7 , 3 7 ) of the ellipsoid E can be read off the lower right 3 × 3 -submatrix of I ; see (52). The center of E lies at x 3 = 4 7 corresponding to the state:
ρ 1 = 1 2 σ 0 4 7 σ 3 = 1 14 3 0 0 11 ,
and the north pole of S 2 is mapped onto:
ρ 1 = 1 2 σ 0 1 7 σ 3 = 1 7 3 0 0 4 .
By a perfect erasure of a qubit, the Bloch sphere would be completely mapped onto the south pole; a more realistic scenario corresponds to a mapping onto a small ellipsoid close to the south pole. The present example may not yield the best possible result; however, its virtue lies in the fact that E can be analytically calculated and is rather simple in form.
After having analyzed the total operation I ( N ) , we proceed by considering the two components I ( n ) , n = 0 , 1 , of the instrument I . We will determine the corresponding Kraus operators A n m j such that:
I ( n ) ( ρ ) = m j A n m j ρ A n m j , n N .
The index i occurring in (20) is replaced here by a multi-index i = ( m , j ) . According to the general theory, the Kraus operators A n m j can be derived from the measurement dilation I = D K , σ , V , Q by means of the equation:
a A n m j b = q j a ϕ m V b ψ j , a , b H , m M n ;
see [4], 8.35. Here, we use the spectral decomposition of the initial state σ of the auxiliary system:
σ = j q j ψ j ψ j ,
and that of the projector Q n :
Q n = m M n ϕ m ϕ m .
The latter is defined w.r.t. a suitable partition M = n M n of the index set M corresponding to an orthonormal basis ϕ m m M of K adapted to the sharp observable Q n n N .
For the present example, it appears at first sight that we would need 14 × 64 = 896 Kraus operators. Fortunately, only 94 Kraus operators do not vanish. Furthermore, they can be combined and simplified so that only the following three operators remain:
A 1 = 5 14 0 0 5 14 , A 2 = 1 14 0 0 3 14 , A 3 = 0 0 2 7 0 .
Here, the first two operators A 1 and A 2 belong to I ( 0 ) and A 3 to I ( 1 ) . Hence, the generalized observable F = ( F 0 , F 1 ) given by the instrument I is obtained as:
F 0 = A 1 A 1 + A 2 A 2 = 3 7 0 0 1 a n d F 1 = A 3 A 3 = 4 7 0 0 0 ,
satisfying F 0 + F 1 = 𝟙 , as is required for F being an generalized observable. The fact that F is not a sharp observable means that, despite energy conservation, there is no perfect correlation between the energy of the individual spin and that of the “heat bath”, as would be the case for a standard measurement dilation of an instrument leading to a sharp observable.

5.3. Results on the Entropy Balance

We now turn to the entropy balance. First, we plot S 1 : = S ( ρ 1 ) versus S 0 : = S ( ρ ) ; see Figure 3. For the one-parameter family of states:
ρ ( p ) : = p 0 0 1 p ,
where 0 p 1 , we obtain a curve with parametric representation:
S 0 ( ρ ( p ) ) = p log ( p ) ( 1 p ) log ( 1 p ) ,
S 1 ( ρ ( p ) ) = 1 7 ( 3 p 7 ) log 1 3 p 7 3 p log 3 p 7 ;
see the green curve in Figure 3. The value p = 1 / 2 corresponds to the maximum log 2 of S 0 and the value:
S 1 / 2 = 1 7 11 2 log 14 11 + 3 2 log 14 3
of S 1 corresponding to the entropy of the center of the ellipsoid E . This curve is the envelope of the set of all points S 0 ( ρ ) , S 1 ( ρ ) , as can be seen as follows.
The surfaces with constant entropy (“adiabatic surfaces”) are the concentric spheres S inside the Bloch sphere (together with the center considered as a degenerate sphere). The set of states ρ corresponding to such a concentric sphere S is mapped under I ( N ) onto an ellipsoid E E that is also invariant under rotations about the three-axis. The north pole N of S corresponding to the state ρ ( p ) is mapped onto the north pole N of E . Similarly, the south pole S of S corresponding to the state ρ ( 1 p ) is mapped onto the south pole S of E . The total ellipsoid E is bounded by the two concentric spheres through N and S , and hence, the entropy of all states corresponding to E is bounded by S 1 ( ρ ( p ) ) and S 1 ( ρ ( 1 p ) ) .
The decrease of entropy, i.e., S ( ρ 1 ) < S ( ρ 0 ) , will not always occur. For example, the north pole of S 2 corresponds to a pure state of vanishing entropy and is mapped onto a mixed state with positive entropy. For the states, the ρ ( p ) decrease of entropy is equivalent to:
0 < p < p 1 : = 1 7 11 2 log 14 11 + 3 2 log 14 3 0.51958 ;
see Figure 3. For the value p = p 1 , the initial state ρ ( p 1 ) is mapped under I ( N ) onto ρ ( 1 p 1 ) , which has the same entropy. Recall that within the family ρ ( p ) , 0 p p 1 , only states with 0 < p < 1 / 2 have a positive temperature, and hence, for these, the entropy decrease is guaranteed.
Although the instrument I does not satisfy the conditions of Theorem 1 or Corollary 1, its entropy decrease satisfies the same bound given in (29); see Figure 3.
According to the considerations of Section 2 and Proposition 1, it is clear that a possible decrease of entropy will be compensated by an equal or larger increase of the entropy of the auxiliary system, i.e., of the “heat bath”. Nevertheless, it will be instructive to check this result for the considered example; see Figure 4. We plotted the initial total entropy S 0 + S i (red curve) and the final entropy S 1 + S 2 (blue curve) calculated for the one-parameter family ρ ( p ) according to (62) such that S 0 = S ( ρ ( p ) ) . Obviously, S 0 + S i < S 1 + S 2 for 0 < p 1 in accordance with the second law. S i = log 14 denotes the initial entropy of the “heat bath” due to the 14-fold degeneracy of the ground state of H 6 . At p = 1 , the final entropy S 1 + S 2 assumes the value:
S f = 4 7 log 7 4 + 3 7 log 7 3 + 5 log ( 14 ) 14 + 4 7 log 63 4 + log ( 126 ) 14 3.54621 ;
see Figure 4.

6. Summary and Outlook

In this paper, we elaborate on a recent proposal [1] to describe the “intervention of intelligent beings” in quantum systems in terms of “conditional action”. This is a genuinely physical concept. Mathematically, the notion of general “instruments”, originally intended to explain state changes due to measurements, is already broad enough to include conditional action.
A fundamental assumption here is that it is not necessary to describe the inner life of “intelligent beings” in more detail; it is sufficient to analyze the workings of apparatuses built to realize measurements and conditional actions. Ideally, such an analysis includes the original measurement on the system Σ , the storage of the measurement result in a classical memory, and the subsequent unitary time evolution of Σ conditioned by the memory contents. However, the construction of such a complete model of a conditional action would, in our opinion, require a solution of the quantum measurement problem and hence is impossible at present.
We must therefore confine ourselves to considering physical realizations of conditional actions restricted to so-called “measurement dilations”. These are well-known mathematical constructions [6] that reduce general instruments acting on Σ to special Lüders instruments acting on a larger system Σ + E . These tools also open the way to understanding the (possible) entropy decrease in Σ due to the conditional action as an entropy flow from Σ to the auxiliary system E, in the same sense as the (possible) entropy decrease due to a general measurement can be explained.
The latter explanation can also be related to existing approaches to resolving the apparent contradiction of said entropy decrease with a tentative second law of quantum thermodynamics. Among such approaches are the Szilard principle, the Landauer–Bennett principle, and the recent OLR approach. Due to the Szilard principle, the entropy decrease in Σ is, at least, compensated by the entropy production associated with the measurement of the system’s state. This principle is confirmed by the present conditional action approach in the special case where the auxiliary system E can be conceived as a memory device (see Theorem 1), but not in general. Furthermore, a partial compatibility to the OLR approach is shown, in so far as, in special cases, the entropy decrease in Σ is bounded by the loss of mutual information due to conditional action; see Proposition 3.
We analyze the imperfect erasure of a qubit by means of a physical model. This model descries the cooling of a single spin by coupling it to a “cold bath” consisting of six other spins such that the total time evolution can be analytically calculated. This model thus represents a more or less realistic measurement dilatation of imperfect erasure conceived as a conditional action and as such motivates the slight generalization of this concept compared to [1]. At the same time, this example reveals some problems of the mentioned principles based on acquisition or deletion of information, since imperfect erasure of a qubit can be achieved without any measurement at all. This is even more plausible if one considers the physical interpretation of the erasure as a cooling of a single spin. The OLR approach appears to avoid this problem because it relies on an information concept that is independent of possible measurements; see [22] for a corresponding treatment of imperfect memory erasure.
In summary, a first achievement of this work, also prepared in [1], is the proposal of a new physical notion of ”conditional action” that encompasses certain external interventions in a system and is mathematically represented by the well-known notion of an ”instrument”. Physical systems Σ subject to a conditional action can no longer be considered adiabatically closed and therefore may lose entropy flowing into the auxiliary system E needed to perform the conditional action. Moreover, if the conditional action is realized according to a “measurement dilation”, then the entropy loss can be calculated, and it follows that the total system, Σ plus E, obeys the second law, cf. Proposition 1.
This analysis sheds critical light on the Landauer–Bennett principle, which is now widely accepted by the mainstream of physicists. From our point of view, erasing memory contents appears as another conditional action that transfers entropy from E to another second auxiliary system E , but is neither necessary nor sufficient to solve Maxwell’s paradox.
For future investigations, it seems to be a desirable goal to extend the conditional action approach to the field of classical physics. The first steps toward this goal restricted to discrete state spaces were made in [1]. The role of measurement is different in classical theories because, unlike in quantum theory, there are always idealized measurements that do not change the state of the system. However, there exist non-trivial instruments describing conditional action even in classical theories, and it should be possible to realize them by extending the system analogously to the quantum case.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

I thank all members of the DFG Research Unit FOR 2692, as well as Thomas Bröcker for stimulating and insightful discussions. I dedicate this work to the memory of my friend and mentor Marshall Luban (1936–2021).

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Construction of the Standard Measurement Dilation for a General Instrument

Let an instrument of the form (20) be given, i.e.,
J ( n ) ( ρ ) = i I n A n i ρ A n i , n N ,
such that the corresponding generalized observable F = F n n N is given by:
F n = i I n A n i A n i , for   all n N ,
satisfying n N F n = 𝟙 H .
Following [4], we want to explicitly construct a measurement dilation of J of the form (15); see also the analogous construction for a Maxwell instrument in [1].
To this end, we define N : = { ( n , i ) n N and i I n } and choose K = C N and an orthonormal basis | n i n N , i I n in K . Let ϕ K be one of these basis vectors, say ϕ = | 11 . Further, let Q n n N denote the complete family of projectors in the Hilbert space K defined by:
Q n = i I n | n i n i | , for   all n N .
Moreover, let Q ˇ n i be the subspace of H K formed by vectors of the form ψ | n i for all ψ H and fixed n N , i I n , and define a linear map V 11 : Q ˇ 11 H K by:
V 11 ψ 11 : = V 11 ψ | 11 : = n N , i I n A n i ψ | n i ,
for all ψ H .
Lemma A1.
The map V 11 : Q ˇ 11 H K is a partial isometry, i.e., it satisfies V 11 V 11 = 𝟙 Q ˇ 11 .
Proof. 
Let φ , ψ be two arbitrary vectors of H , and consider the scalar products:
φ 11 V 11 V 11 ψ 11 = ( A 4 ) n i m j φ A n i A m j ψ n i m j δ n m δ i j
= n i φ A n i A n i ψ
= ( A 2 ) φ n F n 𝟙 H ψ = φ 11 ψ 11 ,
which completes the proof of Lemma A1. □
Next, we extend the partial isometry V 11 to a unitary operator V : H K H K . This completes the definition of the quantities K , ϕ , V , Q required for the measurement dilation. It remains to show that J = D K , ϕ , V , Q . To this end, we introduce an orthonormal basis = 1 , , d in H and write:
ρ = k | k k | ρ | | .
Hence:
ρ P ϕ = k | k 11 k | ρ | 11 | ,
and, further,
V ρ P ϕ V = ( A 9 ) k V | k 11 k | ρ | 11 | V = ( A 4 ) k n i m j A n i | k k | ρ | | A m j | n i m j |
= ( A 8 ) n i m j A n i ρ A m j | n i m j | .
Using:
Q r | n i m j | Q r = δ r n δ r m | r i r j | ,
for all r N , (A11) implies:
𝟙 Q r V ρ P ϕ V 𝟙 Q r = i j A r i ρ A r j | r i r j | ,
and:
D K , ϕ , V , Q ( r ) ( ρ ) = Tr K 𝟙 Q r V ρ P ϕ V 𝟙 Q r
= ( A 13 ) i j A r i ρ A r j Tr K | r i r j |
= i A r i ρ A r i ,
since Tr K | r i r j | = δ i j for all r N . The latter expression equals:
J ( r ) ( ρ ) = ( A 1 ) i I r A r i ρ A r i ,
thereby proving that the above construction is a correct measurement dilation of J .

Appendix B. Proofs

Appendix B.1.

Proof of Proposition 2.
We define:
ρ 1 : = I ( N ) ( ρ ) = n N Tr K 𝟙 Q n ρ 𝟙 Q n ,
where:
ρ : = V ρ σ V ,
and will prove Proposition 2 by showing that ρ 1 = Tr K ρ . Assume some orthonormal basis , | α , , | β , in H and another orthonormal basis | m m M in K that is adapted to the sharp environment observable Q in the sense that:
Q n = m M n | m m | , for   all n N ,
w.r.t. a partition M = n N M n of the index set M . It follows that:
𝟙 Q n β m = | β | m : m M n , 0 : else , ,
for all m M and any base vector β and analogously for α m 𝟙 Q n . Hence, an arbitrary matrix element of ρ 1 assumes the form:
α ρ 1 β = ( A 17 ) n N m M α m 𝟙 Q n ρ 𝟙 Q n β m
= ( A 20 ) n N m M n α m ρ β m
= m M α m ρ β m
= α Tr K ρ β ,
thereby completing the proof of Proposition 2. □

Appendix B.2.

Proof of Theorem 1.
For a pure instrument, the construction of a standard dilation given in Appendix A is simplified by omitting the indices i , j I n and the corresponding sums. Especially, we obtain:
ρ 12 : = r 𝟙 Q r V ρ P ϕ V 𝟙 Q r = ( A 13 ) r A r ρ A r | r r | .
It follows that:
S ( ρ ) S ( ρ 12 ) ,
since the von Neumann entropy vanishes for pure state like P ϕ , which is additive for tensor products and invariant under unitary transformations. Moreover, it is non-decreasing under Lüders measurements. Further, we consider the two reduced states of ρ 12 :
ρ 1 = Tr K ρ 12 = ( A 25 ) r A r ρ A r ,
ρ 2 = Tr H ρ 12 = ( A 25 ) r Tr A r ρ A r | r r | = ( A 24 ) r Tr ρ F r | r r | = r p r | r r | .
Due to the subadditivity of the von Neumann entropy (see [4], 11.3.4), we conclude:
S ( ρ ) ( A 26 ) S ( ρ 12 ) S ( ρ 1 ) + S ( ρ 2 ) ,
(see also Proposition 1) and hence:
Δ S = S ( ρ ) S ( ρ 1 ) S ( ρ 2 ) = H ( ρ , F ) .
The latter equation follows from the spectral composition of ρ 2 due to (49), which implies:
S ( ρ 2 ) = r p r log p r = ( 28 ) H ( ρ , F ) ,
thereby completing the proof of Theorem 1. □

Appendix B.3.

Proof of Corollary 1.
Turning to the proof of Corollary 1, we assume that the instrument I can be written as a convex sum of pure instruments, i.e.,
I ( n ) ( ρ ) = i I λ i I ( i ) ( n ) ( ρ ) ,
for all n N and ρ B 1 + ( H ) , such that:
λ i > 0 for   all i I and i I λ i = 1 .
We will apply Theorem 1 for each pure instrument I ( i ) and obtain:
Δ S ( i ) = S ( ρ ) S ρ 1 ( i ) H p ( i ) ,
using some self-explaining notation. In particular, the Shannon entropy H p ( i ) is calculated for the probability distribution:
p n ( i ) = Tr I ( i ) ( n ) ( ρ ) ,
satisfying:
n p n ( i ) = 1 .
Moreover,
i λ i p n ( i ) = ( A 32 ) , ( A 35 ) Tr I ( n ) ( ρ ) = p n ,
for all n N . Due to concavity of the Shannon entropy (see [4], Ex. 11.21) (A37) implies:
H ( ρ , F ) = H ( p ) i λ i H p ( i ) .
Similarly, the concavity of the von Neumann entropy (see [4], 11.3.5.) yields:
S ( ρ 1 ) = S I ( N ) ( ρ ) = ( A 32 ) S i λ i I ( i ) ( N ) ( ρ ) i λ i S I ( i ) ( N ) ( ρ ) = i λ i S ρ 1 ( i ) .
Finally,
Δ S = S ( ρ ) S ρ 1 ( A 39 ) S ( ρ ) i λ i S ρ 1 ( i ) = i λ i S ( ρ ) S ρ 1 ( i ) ( A 34 ) i λ i H p ( i ) ( A 38 ) H ( p ) ,
thereby completing the proof of Corollary 1. □

Appendix B.4.

Proof of Proposition 3.
By setting A n = U n P n , resp. A n = P n , for all n N , we obtain from (A25):
ρ 12 = n U n P n ρ P n U n | n n | = : n p n ρ n | n n | ,
where p n : = Tr ρ P n , and:
ρ 12 = n P n ρ P n | n n | .
Moreover, by means of (49),
ρ 2 = ρ 2 = n p n | n n | ,
and hence:
S 2 = S 2 = H ( p ) = H ( ρ , P ) .
Since the ρ n | n n | in (A41), as well as the P n ρ P n | n n | in (A42) have orthogonal support, Theorem 11.8 (4) of [4] can be applied and yields:
S 12 = S ρ 12 = n p n S ρ n | n n | + H ( p )
= n p n S 1 p n U n P n ρ P n U n + H ( p )
= n p n S 1 p n P n ρ P n + H ( p )
= S 12 ,
using the invariance of the von Neumann entropy under unitary transformations in (A47). Finally,
Δ I = ( 37 ) ( 39 ) S 1 + S 2 S 12 S 1 + S 2 S 12
= ( A 48 ) S 1 + S 2 S 1 S 2 = ( A 44 ) S 1 S 1
S 0 S 1 = Δ S ,
where we used S 1 S 0 in (A51) since a total Lüders operation never decreases entropy. This completes the proof of Proposition 3. □

References

  1. Schmidt, H.-J. Conditional action and quantum versions of Maxwell’s demon. Found. Phys. 2020, 50, 1480–1508. [Google Scholar] [CrossRef]
  2. Busch, P.; Lahti, P.J.; Mittelstädt, P. The Quantum Theory of Measurement, 2nd revised ed.; Springer: Berlin, Germany, 1996. [Google Scholar]
  3. von Neumann, J. Mathematische Grundlagen der Quantenmechanik; Springer: Berlin, Germany, 1932. [Google Scholar]
  4. Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
  5. Schmidt, H.-J.; Gemmer, J. Sequential measurements and entropy. J. Phys. Conf. Ser. 2020, 1638, 012007. [Google Scholar] [CrossRef]
  6. Busch, P.; Lahti, P.J.; Pellonpää, J.P.; Ylinen, K. Quantum Measurement; Springer: Berlin, Germany, 2016. [Google Scholar]
  7. Lindblad, G. Entropy, Information and Quantum Measurements. Commun. Math. Phys. 1973, 33, 305–322. [Google Scholar] [CrossRef]
  8. Szilard, L. Über die Entropieverminderung in einem thermodynamischen System bei Eingriffen intelligenter Wesen. Z. Phys. 1929, 53, 11–12. [Google Scholar] [CrossRef]
  9. Leff, H.S.; Rex, A.F. Entropy of Measurement and Erasure: Szilard’s Membrane Model Revisited. Am. J. Phys. 1994, 62, 994–1000. [Google Scholar] [CrossRef]
  10. Earman, J.; Norton, J.D. Exorcist XIV: The Wrath of Maxwell’s Demon. Part I. From Maxwell to Szilard. Stud. Hist. Philos. Mod. Phys. 1998, 29, 435–471. [Google Scholar] [CrossRef]
  11. Earman, J.; Norton, J.D. Exorcist XIV: The Wrath of Maxwell’s Demon. Part II. From Szilard to Landauer and Beyond. Stud. Hist. Philos. Mod. Phys. 1999, 30, 1–40. [Google Scholar] [CrossRef]
  12. Zurek, W.H. Maxwell’s demon, Szilard’s engine and quantum measurements. In Frontiers of Nonequilibrium Statistical Mechanics; Moore, G.T., Seully, M.O., Eds.; Plenum Press: New York, NY, USA, 1984; pp. 151–161. [Google Scholar]
  13. Filippov, S.N. Capacity of trace decreasing quantum operations and superadditivity of coherent information for a generalized erasure channel. arXiv 2021, arXiv:2101.05686. [Google Scholar]
  14. Landauer, R. Irreversibility and heat generation in the computing process. IBM J. Res. Dev. 1961, 5, 183–191. [Google Scholar] [CrossRef]
  15. Bennett, C.H. The Thermodynamics of Computation—A Review. Int. J. Theor. Phys. 1982, 21, 905–940. [Google Scholar] [CrossRef]
  16. Anderson, N.G. Information as a physical quantity. Inf. Sci. 2017, 415–416, 397–413. [Google Scholar] [CrossRef]
  17. Kraus, K. States, Effects, and Operations—Fundamental Notions of Quantum Theory; Springer: Berlin, Germany, 1983. [Google Scholar]
  18. Pellonpää, J.-P. Quantum instruments: I. Extreme instruments. J. Phys. A: Math. Theor. 2013, 46, 025302. [Google Scholar] [CrossRef]
  19. Pellonpää, J.-P. Quantum instruments: II. Measurement theory. J. Phys. A: Math. Theor. 2013, 46, 025303. [Google Scholar] [CrossRef] [Green Version]
  20. Leff, H.; Rex, A. Maxwell’s Demon 2: Entropy, Classical and Quantum Information, Computing; Institute of Physics: Bristol, UK, 2003. [Google Scholar]
  21. Anderson, N.G. Conditioning, Correlation and Entropy Generation in Maxwell’s Demon. Entropy 2013, 15, 4243–4265. [Google Scholar] [CrossRef]
  22. Anderson, N.G. Information erasure in quantum systems. Phys. Lett. A 2008, 372, 5552–5555. [Google Scholar] [CrossRef]
Figure 1. “Half Galton Board”: The degeneracies D N ( S ) of states S , M generated by coupling N spins with s = 1 / 2 can be obtained by the (red) number of coupling paths that start at ( N = 0 , S = 0 ) and terminate at ( N , S ) .
Figure 1. “Half Galton Board”: The degeneracies D N ( S ) of states S , M generated by coupling N spins with s = 1 / 2 can be obtained by the (red) number of coupling paths that start at ( N = 0 , S = 0 ) and terminate at ( N , S ) .
Entropy 23 00289 g001
Figure 2. Visualization of the total operation I ( N ) of the imperfect erasure of a qubit as the affine mapping of the Bloch sphere S 2 onto an ellipsoid E that touches S 2 at the south pole.
Figure 2. Visualization of the total operation I ( N ) of the imperfect erasure of a qubit as the affine mapping of the Bloch sphere S 2 onto an ellipsoid E that touches S 2 at the south pole.
Entropy 23 00289 g002
Figure 3. Plot of the final entropy S 1 versus the initial one S 0 for an imperfect erasure of a qubit. The blue dots correspond to 10,000 randomly chosen initial states ρ of the single spin. The enveloping green curve is analogously calculated for the one-dimensional family ρ ( p ) according to (63,64) and reaches its maximum of S 0 = log 2 for p = 1 2 (dashed red lines). The corresponding value S 1 / 2 of S 1 is given by (65). Only for the points below the (red) line S 0 = S 1 , there occurs a decrease of entropy due to the conditional action. For the green curve, this will happen if 0 < p < p 1 , where p 1 is given by (66). The dashed blue curve represents S 0 H ( ρ ( p ) , F ) according to (28). Therefore, the bound (29) holds, although the conditions of Theorem 1 or Corollary 1 are not satisfied.
Figure 3. Plot of the final entropy S 1 versus the initial one S 0 for an imperfect erasure of a qubit. The blue dots correspond to 10,000 randomly chosen initial states ρ of the single spin. The enveloping green curve is analogously calculated for the one-dimensional family ρ ( p ) according to (63,64) and reaches its maximum of S 0 = log 2 for p = 1 2 (dashed red lines). The corresponding value S 1 / 2 of S 1 is given by (65). Only for the points below the (red) line S 0 = S 1 , there occurs a decrease of entropy due to the conditional action. For the green curve, this will happen if 0 < p < p 1 , where p 1 is given by (66). The dashed blue curve represents S 0 H ( ρ ( p ) , F ) according to (28). Therefore, the bound (29) holds, although the conditions of Theorem 1 or Corollary 1 are not satisfied.
Entropy 23 00289 g003
Figure 4. Plot of the initial total entropy S 0 + S i (red curve) and the final entropy S 1 + S 2 (blue curve) calculated for the one-parameter family ρ ( p ) according to (62) such that S 0 = S ( ρ ( p ) ) . Obviously, S 0 + S i < S 1 + S 2 for 0 < p 1 . S i = log 14 denotes the initial entropy of the “heat bath”, and hence, S 0 + S i assumes its maximum log 2 + log 14 = log 28 3.3322 at p = 1 / 2 . At p = 1 , the final entropy S 1 + S 2 assumes the value S f 3.54621 according to (67).
Figure 4. Plot of the initial total entropy S 0 + S i (red curve) and the final entropy S 1 + S 2 (blue curve) calculated for the one-parameter family ρ ( p ) according to (62) such that S 0 = S ( ρ ( p ) ) . Obviously, S 0 + S i < S 1 + S 2 for 0 < p 1 . S i = log 14 denotes the initial entropy of the “heat bath”, and hence, S 0 + S i assumes its maximum log 2 + log 14 = log 28 3.3322 at p = 1 / 2 . At p = 1 , the final entropy S 1 + S 2 assumes the value S f 3.54621 according to (67).
Entropy 23 00289 g004
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Schmidt, H.-J. Conditional Action and Imperfect Erasure of Qubits. Entropy 2021, 23, 289. https://doi.org/10.3390/e23030289

AMA Style

Schmidt H-J. Conditional Action and Imperfect Erasure of Qubits. Entropy. 2021; 23(3):289. https://doi.org/10.3390/e23030289

Chicago/Turabian Style

Schmidt, Heinz-Jürgen. 2021. "Conditional Action and Imperfect Erasure of Qubits" Entropy 23, no. 3: 289. https://doi.org/10.3390/e23030289

APA Style

Schmidt, H. -J. (2021). Conditional Action and Imperfect Erasure of Qubits. Entropy, 23(3), 289. https://doi.org/10.3390/e23030289

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop