1. Introduction
According to a widespread opinion, there are two types of state change in quantum mechanics: time evolution in closed systems and state changes due to measurements. The mathematical description of these two processes is known in principle:
- (i)
Time evolution in closed systems can be described by means of unitary operators
according to:
where
is obtained by the solutions of the time-dependent Schrödinger equation and
denotes any statistical operator.
- (ii)
Conditional state changes according to the outcome of a measurement of an observable
A will be described, in the simplest case, by maps of the form:
where
is the family of eigenprojections of a self-adjoint operator
. Without selection according to the outcomes of the measurement, the total state change will be:
In a recent article [
1], we suggested a third type of state change, called “conditional action”, that combines the two aforementioned ones insofar as it describes a state change depending on the result of a preceding measurement.
- (iii)
In the simplest case, a conditional action is mathematically described by maps of the form:
with the same notation as in (
2) and a family of unitary operators
. Without selection according to the outcomes of the measurement, the total state change will be:
We emphasize that the above form of state changes in (
2)–(
5) represents only the simplest cases and will be generalized in the following sections. Before explaining the details of this suggestion and suitable generalizations, we will fix some general notation used in the present paper. A measurement leading to the state change (
2) is called a “Lüders measurement” in accordance with [
2], sometimes also called “projective measurements” in the literature. In order not to have to go into the technical intricacies, the quantum system
will be described by a finite-dimensional Hilbert space
. In this case, the index set
will also be finite. Let
denote the real linear space of Hermitian operators
and
the convex subset of statistical operators, i.e., Hermitian operators
with non-negative eigenvalues and
. The state changes considered above in (
2) and (
3) can be viewed as a map
defined by:
which will be called a “Lüders instrument”, and the corresponding map:
the “total Lüders operation”. A difference of the two state changes according to (i) and (ii) arises when we consider the change of the von Neumann entropy:
Under unitary time evolutions (i), the entropy remains constant,
whereas for Lüders measurements (ii), the entropy may increase, and we can only state that:
see [
3,
4,
5], in accordance with the second law of thermodynamics.
In contrast to closed systems, time evolution in open systems can take a more general form. An obvious model to account for the time evolution in open systems is to consider the extension of the system
with Hilbert space
by another, auxiliary system
E (environment, heat bath, measurement apparatus, etc.) with Hilbert space
and the unitary time evolution
V of the total system
. If the total system is initially in the state
, it will generally evolve into an entangled state
. In the end, we again consider the system
and find its reduced state
given by the partial trace:
The corresponding state change
will, in general, not be of the unitary type (
1), but represents a natural extension (ie) of the state changes according to (i). In general, the entropy balance for these state changes is ambivalent:
can be smaller or larger than
. In fact, the initial entropy of the total system is
, and the unitary time evolution
V leaves this invariant. However, the separation of the total system into its parts
according to (
11) and:
increases the entropy (or leaves it constant) according to the “subadditivity” of
S (see [
4], 11.3.4.), and hence:
However,
may assume positive or negative values. This can be physically understood as the phenomenon that, apart from a possible increase of the total entropy according to (
13), there may be an entropy flow from the system
into the environment
E or vice versa.
An analogous extension of the system
to
can also be considered for Lüders measurements. We again start with an initial total state
, where
, and a unitary time evolution
V of the total system. Then, a Lüders measurement corresponding to a complete family
of mutually orthogonal projections of the auxiliary system is performed, and the post-measurement total state is reduced to the system
leading to the final state:
or, without selection, to:
Thus, we obtain extensions (iie) of the state changes (ii) due to Lüders measurements by maps
of the form (
14), which are called “instruments” in the literature; see [
6] and
Section 2 for more precise mathematical definitions. Lüders instruments are idealized special cases of general instruments that, in some sense, minimize the perturbation of the
system by the measurement, but “real measurements” are better described by general instruments. Analogous remarks as in the case of open systems apply for the entropy balance: It is well known (see [
4] or [
7], Exercise 11.15) that general measurements may decrease the system’s entropy.
The latter observation leads us to the suggestion [
1] that the entropy decrease of systems due to the “intervention of intelligent beings” as, e.g., Maxwell’s demon, can be explained by the same mechanism. Originally, the notion of “conditional action” was developed to describe the intervention of Maxwell’s demon in the energy distribution of a gas with two chambers: depending on the result of an energy measurement on a gas molecule approaching the partition between the two chambers, a door is opened or shut. Thus, the further time evolution of the gas depends on the result of the measurement. Similarly, the result of measuring whether a single molecule is in the left or right chamber can be used to trigger an isothermal expansion to the right or left (Szilard’s engine). Szilard argues [
8] that the entropy decrease of the system is compensated by the entropy costs of acquiring information about the position of the gas particle (“Szilard’s principle”). His arguments are formulated within classical physics and not easy to understand; see also the analysis and reconstruction of Szilard’s reasoning in [
9,
10,
11]. Nevertheless, it seems possible that the entropy decrease due to such external interventions is a special case of the well-understood entropy decrease due to state changes described by general instruments.
In fact, it can be easily confirmed that the maps of the form (
4) describing “conditional action” are special cases of instruments and hence were called “Maxwell instruments” in [
1]. The mathematical notion of state changes described by instruments is sufficiently general to cover not only changes due to inevitable measurement disturbances, but also “deliberate” state changes depending on the result of a measurement.
This notion of “conditional action” will be slightly generalized in the present paper and then comprises not only interventions of Maxwell’s demon or cycles of Szilard’s engine [
1,
8,
12], but also quantum teleportation ([
4], Ch. 1.3.7), quantum error correction ([
4], Ch. 10), or the erasure of qubits [
1]. With respect to the latter example, there is some relationship with the recently proposed definition of “generalized erasure channels” in [
13].
Relative to the choice of a suitable basis, a qubit has two values, “0” or “1”. Consider a “yes-no”-measurement corresponding to said basis. If the result is “1”, the two states are swapped; hence, . If the result is “0”, then nothing is done; hence . This constitutes the conditional action, which sets the qubit state to its default value “0” for any case, and hence, it can be legitimately considered as an “erasure of a qubit”.
The latter example contains an ironic punch line in that the erasure of memory contents with measurement results and the corresponding entropy costs are usually considered to resolve the apparent contradiction between the actions of Maxwell’s demon and the second law (Landauer’s principle). If memory erasure itself were taken as a conditional action, we would seem to enter an infinite circle of creating and erasing new memory contents. The obvious resolution to this problem is the observation that the entropy decrease in the system is due to some flow of entropy from to the auxiliary system E as described above. If E can be viewed as a “memory device”, then at the end of the conditional action, it already contains the missing entropy. It is not necessary to erase the content of the memory. The latter would not create the missing entropy, but only make it visible.
As in [
1], it seems sensible to distinguish between the principle that erasure of memory produces entropy [
14] (“Landauer’s principle” in the narrow sense) and the position that this effect constitutes the solution of the apparent paradox of Maxwell’s demon [
15] (“Landauer/Bennett principle”). Moreover, it will be a matter of substantiating our critique of the Landauer/Bennett principle (not of the Landauer principle) outlined above with a more realistic model of qubit erasure than that given in [
1].
To this end, we realize the qubit (the system
) by a single spin with spin quantum number
described by a Hilbert space
and model the erasure of the qubit by the coupling of the single spin with a “heat bath”
E consisting of
spins such that the time evolution of the total system can be analytically calculated. The quotation marks refer to the fact that the “heat bath” is pretty small and not macroscopic, as usually required, and that, moreover, it is rather a “cold bath”. This is due to the choice of the default value ”0” of the qubit as the ground state ↓ of the single spin. Thus, erasing the qubit is physically equivalent to cooling the system
to the temperature
. Although this is, strictly speaking, impossible due to the third law of thermodynamics, it can be approximately accomplished by coupling the single spin to a system of
spins in its ground state. Here, we ignore the physical impossibility to prepare a system in its ground state and consider the ground state of the “heat bath” as a suitable approximation to a state of very low temperature. This approximation has the advantage of providing fairly simple expressions for the relevant quantities considered in this paper. The corresponding calculations are presented in
Section 5.
As a side effect of this account results the necessity to define the concept of “conditional action” somewhat more generally than in [
1]. This is done in
Section 2, where we also recapitulate the basic notions of quantum measurement theory required for the present work. A critical account of the Szilard principle in the realm of quantum theory is given in
Section 3, where we also formulate an upper bound for the entropy decrease due to conditional action that is compatible with Szilard’s reasoning, but only valid under certain restrictions. A similar bound is derived in
Section 4, where the connections of the present theory with the observer-local referential (OLR) approach [
16] are considered. The proofs are moved to the Appendix, as is the explicit construction of a “standard” measurement dilation for a general instrument. This measurement dilation is well known, but nevertheless reproduced here since some arguments given in this paper depend on its details. We close with a summary and outlook in
Section 6.
2. General Definitions and Results
In the following, we will heavily rely on the mathematical notions of operations and instruments. Although these notions are well known (see, e.g., [
6,
17,
18,
19]), it will be in order to recall the pertinent definitions adapted to the present purposes and their interpretations in the context of measurement theory. For readability, we sometimes will repeat definitions already presented in the Introduction.
Let be a d-dimensional Hilbert space, denote the space of Hermitian operators , and the cone of positively semi-definite operators, i.e., having only non-negatives eigenvalues. The convex subset consists of statistical operators with . Such operators physically describe (mixed) states. Pure states are represented by one-dimensional projectors , where with .
According to [
4], 8.2.1, there are three equivalent ways to define operations:
By considering the system coupled to environment E,
by an operator-sum representation, or
via physically motivated axioms.
Here, we follow the second approach and define an “operation” to be a map
of the form:
with the Kraus operators
and a finite index set
,
such that:
see [
17]. It follows that an operation is linear and maps
into itself. It is mathematically convenient not to require that an operation preserves the trace. The normalized state after the operation would be obtained as
.
Operations are intended to describe state changes due to measurements. For example, the total Lüders operation (
7) is a trace-preserving operation in the above sense with
and
for all
. An operation
will be called pure iff the representation (
16) of
A can be reduced to a single Kraus operator, i.e.,
Physically, this means that a pure operation maps pure states onto pure states, up to a positive factor.
There exists a so-called statistical duality between states and observables; see [
6], Chapter 23.1. In the finite-dimensional case,
can be identified with its dual space
by means of the Euclidean scalar product
. Physically, we may distinguish between the two spaces in the sense that
is spanned by the subset of statistical operators representing states and
is spanned by the subset of operators with eigenvalues in the interval
representing effects. Effects describe yes-no measurements including the subset of projectors, which are the extremal points of the convex set of effects; see [
6].
Every operation
, viewed as a transformation of states (Schrodinger picture), gives rise to the dual operation
viewed as a transformation of effects (Heisenberg picture). Reconsider the representation (
16) of the operation
A by means of the Kraus operators
. Then, the dual operation
has the corresponding representation:
for all
.
Let be a finite set of outcomes. Then, the map will be called an instrument iff:
The first condition can be re-written as:
with suitable Kraus operators
. The second condition can be rephrased by saying that the total operation
defined by:
will be trace-preserving. An instrument
will be called “pure” iff each operation
is pure.
Examples of pure instruments are given by Lüders instruments (
6) and “Maxwell instruments” (
4).
Similar as for operations, every instrument
gives rise to a dual instrument
defined by:
for all
and
. The condition that the total operation (
21) will be trace-preserving translates into:
Thus, every dual instrument yields a resolution of the identity by means of effects:
and hence to a generalized observable in the sense of a positive operator-valued measure
; see [
6]. Note, however, that compared to the general definition in [
6], we will have to consider generalized observables only in the discrete, finite-dimensional case. The traditional notion of “sharp” observables represented by self-adjoint operators corresponds to the special case of a projection-valued measure
satisfying
. From now on, we will correspondingly distinguish between “sharp observables” and “generalized observables”.
It can be shown [
1] that “Maxwell instruments” are just pure instruments corresponding to sharp observables. The example of imperfect erasure of qubits considered in
Section 5 suggest that the class of “Maxwell instruments” is too narrow to describe realistic conditional actions. First, imperfect erasure cannot be described by a pure instrument, since the initial state of the “heat bath” is not pure. Moreover, the measurement of a sharp “heat bath” observable does not give rise to a sharp qubit observable. Hence, it seems sensible to use general instruments to describe conditional action. Fortunately, the main results on the entropy balance of conditional action in [
1] can be easily generalized to general instruments.
To this end, we reconsider the map
defined in (
14) by means of coupling the system
to some environment
E. Recall that the environment
E is described by some Hilbert space
and an initial state
. Moreover,
V denotes the unitary time evolution of the total system and
a sharp environment observable. It can be shown that (i) (
14) defines an instrument in the above sense and (ii) every instrument can be obtained in this way (see Theorem 7. 14 of [
6], Exercise 8. 9 of [
4], or
Appendix A). The special instrument defined in (
14) will be referred to as a “measurement dilation”
of a given instrument
. If the initial state
of the environment is pure,
, the measurement dilation will also be denoted by
. A measurement dilation of a given instrument
is hence a physical realization of
by a Lüders instrument of the extended system
and a subsequent reduction to
.
Let a conditional action be described by the instrument
with measurement dilation
. With respect to this measurement dilation, we define the two reduced states:
and:
Then, the analogous arguments leading to the entropy balance (
13) also prove the following:
Proposition 1. Under the preceding conditions, the following holds: In connection with the Szilard principle discussed in the next section, the following proposition will be of some interest:
Proposition 2. The total operation of an instrument with measurement dilation is independent of the sharp environment observable Q.
This means in particular that
could even be realized by a coupling of
to some environment
E, unitary time evolution and final state reduction, without any measurement at all. The proof of Proposition 2 can be found in
Appendix B.1.
3. The Szilard Principle Revisited
As mentioned in the Introduction, the ideas of L. Szilard [
8] to resolve the apparent contradiction between the results of the “intervention of intelligent beings” and the second law were published more than nine decades ago and are confined to classical physics. Therefore, the reconstruction of “Szilard’s principle” for quantum mechanics could appear as somewhat daring. In this section, nevertheless, we will reconsider what we understand by “Szilard’s principle” from the point of view developed in the present article.
According to this principle, the entropy decrease of the system is compensated by the entropy costs of acquiring information about the system’s state. Recall that we have considered a so-called measurement dilation of the instrument
describing state changes due to conditional action, which extends the system
by an auxiliary system
E (environment). For the “standard dilation” given in
Appendix A, the dimension of the Hilbert space
corresponding to the auxiliary system
E equals the number of outcomes
of the Lüders measurement of
Q if the instrument
is pure. It is therefore tempting to consider the auxiliary system
E as a “memory” that holds the information about the result of the measurement and to interpret the “entropy cost of information acquisition” as the entropy
of the final state
of
E after the measurement. The probabilities
of the various outcomes are given by
, where
is the generalized observable (
24) corresponding to the conditional action and
is the initial state of the system
. The Shannon entropy
of the probability distribution
is independent of any measurement dilation and will be called the “Shannon entropy of the experiment”:
Then, we can prove the following inequality, which confirms Szilard’s principle in the above given version:
Theorem 1. (Szilard’s principle: quantum case)
The entropy decrease of a conditional action corresponding to a pure instrument is bounded by the Shannon entropy of the experiment, i.e., For the proof, see
Appendix B.2. It is worth noting that the bound in (
29) is independent of the pure instrument describing the conditional action and depends only on the probabilities
. The theorem is trivially satisfied if the conditional action leads to an increase of entropy, i.e.,
, as in the case of a Lüders measurement without any conditional action. Another trivial case is given if the observable
F is sharp and the projections
are one-dimensional. In this case,
, and the theorem holds since
. In other words: entropy cannot fall below the value of zero.
Otherwise, the bound (
29) is non-trivial. Consider the example of
with three mutually orthogonal one-dimensional projections
,
and
. It follows that:
and hence, in this example, Theorem 1 says more than just that the entropy of
cannot drop to negative values. It is straightforward to construct a Maxwell instrument corresponding to the generalized observable
F such that
, and hence,
, in accordance with Theorem 1.
A slight generalization of Theorem 1 is the following:
Corollary 1. The upper bound (29) also holds if the instrument can be written as a convex linear combination of pure instruments with the same set of outcomes . For the proof, see
Appendix B.3. A pure instrument has a standard dilation with one-dimensional projections
and a pure initial state
of
E. If we extend this standard dilation by considering a real mixed initial state
of
E, we obtain a convex combination of pure instruments for which Corollary 1 holds. However, not every instrument is a convex linear combination of pure ones. Actually, there exist instruments where the bound of entropy decrease given in (
29) is violated.
To provide an example, we consider the (perfect) erasure of two qubits. Thus,
, and we consider an orthonormal basis of
denoted by
. The conditional action maps all these four basis states onto the default state
. It has the following measurement dilation:
, initial auxiliary state
, unitary time evolution
V of the total system defined by
, and Lüders measurement of the auxiliary sharp observable
. The corresponding instrument
is pure and hence satisfies (
29).
Then, we consider another instrument
by changing the auxiliary sharp observable to
. The corresponding system observable
is also sharp, two-valued, and corresponds to a measurement of the first qubit w.r.t. the considered basis. All other components of
are left unchanged. Consider the initial state
of the system with
and
. It follows that
, and hence,
and
. Consequently,
, in contrast to (
29).
Similar examples abound: whenever the entropy decrease due to a conditional action is larger than , then a corresponding measurement dilation can be modified to yield without changing due to Proposition 2. The modified instrument cannot be written as a convex linear combination of pure instruments according to Corollary 1.
As a conclusion for the evaluation of Szilard’s principle, we can state that there are examples of conditional actions where the entropy decrease in the system can be explained by an entropy increase at least as large in a memory, as well as counterexamples. In the counterexamples, however, we have no violation of the second law, but only an impossibility to reduce the auxiliary system to its function as a memory. This is especially true for the limiting case of an entropy reduction without measurement.
The example of
Section 5 shows that the upper bound (
29) of the entropy decrease holds for a larger class of conditional actions than given by Theorem 1 or Corollary 1. It remains an open task to determine this class more precisely.
4. Connections to the OLR Approach
There exists a vast amount of literature on Maxwell’s demon and related questions [
10,
11,
20]. Among them is an article that comes rather close to the results of the present work, namely [
21], that deals with Szilard’s engine and where we read in the abstract:
In this paper, Maxwell’s Demon is analyzed within a “referential” approach to physical information that defines and quantifies the Demon’s information via correlations between the joint physical state of the confined molecule and that of the Demon’s memory. On this view […] information is erased not during the memory reset step of the Demon’s cycle, but rather during the expansion step, when these correlations are destroyed.
The mentioned notion of “observer-local referential (OLR) information” is further outlined in [
16]. A detailed comparison of the “conditional action approach” and the “OLR approach” is beyond the scope of this paper. Arguably, a key difference is that we could not model the formation of a correlation by a measurement and the subsequent destruction of that correlation by a conditional action as a sequence of state changes, and instead had to use measurement dilation as a surrogate; see
Section 6.
To illustrate the nevertheless existing connections between the two approaches, we will derive another upper bound for the entropy decrease analogous to Theorem 1 using the notion of OLR information. We will restrict ourselves to the case of conditional actions described by Maxwell instruments
, i.e., instruments of the form (
4). Let
denote the corresponding Lüders instrument of the form (
2) such that
and
share the same Hilbert space
and the same sharp observable
. Further, consider the standard measurement dilations:
as explicitly constructed in
Appendix A, but specialized to the case of pure instruments, see also [
1]. With respect to these measurement dilations, we further define:
and analogously for the primed quantities:
In accordance with [
16], we define the OLR information:
Then, we can prove the following:
Proposition 3. Under the preceding conditions, the entropy decrease is bounded from above by: The proof of Proposition 3 can be found in
Appendix B.4. Of course, this is only a first step to analyze the mentioned relations, since the conditions of Proposition 3 are rather limited, e.g., by the fact that only the standard dilation is considered and not an arbitrary measurement dilation.
6. Summary and Outlook
In this paper, we elaborate on a recent proposal [
1] to describe the “intervention of intelligent beings” in quantum systems in terms of “conditional action”. This is a genuinely physical concept. Mathematically, the notion of general “instruments”, originally intended to explain state changes due to measurements, is already broad enough to include conditional action.
A fundamental assumption here is that it is not necessary to describe the inner life of “intelligent beings” in more detail; it is sufficient to analyze the workings of apparatuses built to realize measurements and conditional actions. Ideally, such an analysis includes the original measurement on the system , the storage of the measurement result in a classical memory, and the subsequent unitary time evolution of conditioned by the memory contents. However, the construction of such a complete model of a conditional action would, in our opinion, require a solution of the quantum measurement problem and hence is impossible at present.
We must therefore confine ourselves to considering physical realizations of conditional actions restricted to so-called “measurement dilations”. These are well-known mathematical constructions [
6] that reduce general instruments acting on
to special Lüders instruments acting on a larger system
. These tools also open the way to understanding the (possible) entropy decrease in
due to the conditional action as an entropy flow from
to the auxiliary system
E, in the same sense as the (possible) entropy decrease due to a general measurement can be explained.
The latter explanation can also be related to existing approaches to resolving the apparent contradiction of said entropy decrease with a tentative second law of quantum thermodynamics. Among such approaches are the Szilard principle, the Landauer–Bennett principle, and the recent OLR approach. Due to the Szilard principle, the entropy decrease in is, at least, compensated by the entropy production associated with the measurement of the system’s state. This principle is confirmed by the present conditional action approach in the special case where the auxiliary system E can be conceived as a memory device (see Theorem 1), but not in general. Furthermore, a partial compatibility to the OLR approach is shown, in so far as, in special cases, the entropy decrease in is bounded by the loss of mutual information due to conditional action; see Proposition 3.
We analyze the imperfect erasure of a qubit by means of a physical model. This model descries the cooling of a single spin by coupling it to a “cold bath” consisting of six other spins such that the total time evolution can be analytically calculated. This model thus represents a more or less realistic measurement dilatation of imperfect erasure conceived as a conditional action and as such motivates the slight generalization of this concept compared to [
1]. At the same time, this example reveals some problems of the mentioned principles based on acquisition or deletion of information, since imperfect erasure of a qubit can be achieved without any measurement at all. This is even more plausible if one considers the physical interpretation of the erasure as a cooling of a single spin. The OLR approach appears to avoid this problem because it relies on an information concept that is independent of possible measurements; see [
22] for a corresponding treatment of imperfect memory erasure.
In summary, a first achievement of this work, also prepared in [
1], is the proposal of a new physical notion of ”conditional action” that encompasses certain external interventions in a system and is mathematically represented by the well-known notion of an ”instrument”. Physical systems
subject to a conditional action can no longer be considered adiabatically closed and therefore may lose entropy flowing into the auxiliary system
E needed to perform the conditional action. Moreover, if the conditional action is realized according to a “measurement dilation”, then the entropy loss can be calculated, and it follows that the total system,
plus
E, obeys the second law, cf. Proposition 1.
This analysis sheds critical light on the Landauer–Bennett principle, which is now widely accepted by the mainstream of physicists. From our point of view, erasing memory contents appears as another conditional action that transfers entropy from E to another second auxiliary system , but is neither necessary nor sufficient to solve Maxwell’s paradox.
For future investigations, it seems to be a desirable goal to extend the conditional action approach to the field of classical physics. The first steps toward this goal restricted to discrete state spaces were made in [
1]. The role of measurement is different in classical theories because, unlike in quantum theory, there are always idealized measurements that do not change the state of the system. However, there exist non-trivial instruments describing conditional action even in classical theories, and it should be possible to realize them by extending the system analogously to the quantum case.