1. Introduction
In his seminal paper on the path integral formulation of quantum mechanics [
1], Feynman started the introduction of his new theory by pointing out that the law of total probability in classical probability theory must be replaced by a new form of rule. Specifically, in a slightly different notation, the classical law of probability,
where
is the probability of obtaining measurement result
y given measurement result
x, is no longer true in quantum theory and must be replaced by
where
is a complex number called probability amplitude and related to classical probability by Born’s rule
. From this key idea, Feynman continued to expand the theory that led to the path integral formulation of quantum mechanics. He also discussed when the new rule of summation over probability amplitude can fall back to the classical law of probability. This is when one “attempts to perform” intermediate measurements that obtain results of all
b. In modern terms, what Feynman means by “attempting to perform measurement” can be understood as the decoherence phenomenon [
2].
The above example shows that it has been long known that the law of total probability cannot be taken for granted in quantum theory. Indeed, many other classical probability rules are only upheld in specific conditions. For instance, a joint probability can be definitely assigned only when the two measurement operators are commutative [
3,
4,
5,
6]. There are many variants of definitions of the conditional probability in quantum theory (for a review, see [
7]). However, a family of no-go theorems recently published [
8,
9,
10,
11] appears to rely on the total law of probability one way or another without considering the sufficient conditions. These no-go theorems are related to the extensively discussed Wigner’s friend experiments. In quantum mechanics, the Wigner’s friend [
12,
13] thought experiment has been widely discussed, as it tests the validity of many quantum interpretation theories. The significance of such experiments is that Wigner and his friend give two different descriptions of the same physical process happening inside the lab. Deutsch further extended the thought experiment to be applicable to macroscopic system such as the lab system [
14] itself. Based on that, a more sophisticated extended Wigner’s friend experiment is put forwarded by Brukner [
8,
15]. Such an experimental setup involves two remotely separated labs. Each lab contains half of an entangled pair of spins and a local observer. Outside each lab there is a super-observer who can choose to perform different types of measurements on the lab as a whole. The intention of such an experimental setup is to prove, through a no-go theorem, that measured facts are observer-dependent in quantum theory. A subsequent experiment [
16] has been carried out to confirm the inequality developed in [
8]. A stronger version of the no-go theorem is further proposed for reaching a similar conclusion [
9]. The statement that measured facts are observer-dependent was considered important for the quantum foundation and deserved rigorous theoretical proving and experimental testing. However, proving the no-go theorems by taking the law of total probability for granted casts doubt on their theoretical rigorousness.
The fact that there is still ambiguity in using the total law of probability in quantum theory—though it has long been recognized as not being upheld in quantum mechanics—shows the need to provide a rigorous formulation of the law of total probability in quantum theory and to clarify under what conditions it holds true. This is indeed the motivation behind the present work. Formulation of the law of total probability depends on a clear definition of conditional probability in quantum theory. There is already extensive research on how conditional probability is defined [
7,
17,
18,
19,
20,
21,
22,
23,
24,
25]. However, these formulations are either based on projection measurements or only consider simultaneous measurements with commutative operators. In this work, I extend a two-time conditional probability formulation from projection measurement to more generic POVM measurements. Generalization for POVM measurement is needed because some of the no-go theorems choose POVM operators in their proofs. I then give several sufficient conditions for the law of total probability to become true. The theory is applied to analyze several no-go theorems related to the extended Wigner’s friend scenario. Logical loopholes are shown in these no-go theorems because their proofs rely on the law of total probability one way or another, but the conditions to validate the law are not met. Thus, these no-go theorems do not really prove the results they expect, such as “measured facts are observer-dependent”. Instead, they just indirectly confirm that the law of total probability does not not hold in quantum theory.
It is worth mentioning that other concerns regarding these no-go theorems have already been pointed out [
26,
27]. In particular, only when a measurement is completed should a probability distribution be assigned. Assigning probability distribution for pre-measurement without results leads to contradiction [
26]. The analysis in this work will go one step further by showing that even assigning a probability distribution for completed measurements still leaves logical loopholes in the no-go theorem. This is because the law of total probability that the proofs rely on does not hold true with the specific measurement operators and initial quantum state being chosen. Lastly, it is important to emphasize that I do not take a stand on the assertions of the no-go theorems themselves. For instance, it could still be a valid statement that “measured facts are observer-dependent”. What I only show here is that there are logical loopholes in the proof of the no-go theorems.
In summary, this paper extends the formulation of conditional probability to generic POVM measurements and clarifies the conditions under which the law of total probability can be valid in quantum theory. Applying the theory developed in this work to the extended Wigner friend scenario reveals logical loopholes in several no-go theorems that take for granted the validity of the law of total probability. The contradictions in these no-go theorems only reconfirm the invalidity of the law of total probability in quantum theorem rather than invalidating the physical statements that the no-go theorems are intended to refute, such as “measured facts are independent of the observer”. I hope the results presented here inspire further research to find more convincing proof and experimental testing. This is important because the implications of the extended Wigner’s friend scenario are conceptually fundamental in quantum theory.
2. The Law of Total Probability in Quantum Theory
First, I briefly review classical probability theory. Suppose there are two random variables X and Y. Without loss of generality, I assume X and Y are discrete random variables. Measuring
X (or
Y) will obtain one of the values in
(or in
), which is finite or countable infinite. Denote the joint probability of measuring X with result
, measuring Y with result
as
, and the conditional probability of obtaining
given that
as
. They are related by the following axioms:
where
is the marginal probability of measuring X with result
, and similarly for
. Axiom (3) ensures the joint probability is defined uniquely regardless if it is defined by (1) or (2). We explicitly call out (3) since it is not always true in quantum theory.
The law of total probability can be derived (Axioms (1)–(3) give
, which is Bayes’ law. Summing over
i on both sides and using identity
, one obtains (4)) from axioms (1)–(3), expressed as following,
What I want to investigate here is how the equivalent version of (4) in quantum theory can be formulated.
To start with, I need to examine how conditional probability is constructed in quantum theory. The subtlety of constructing conditional probability in quantum theory has been investigated long ago. G. Bobo gives an extensive review and discussion [
7]. The generally accepted formulation of conditional probability in quantum theory is provided by Lüders rule [
18], where the measurements are associated with projection operators. Lüders rule is based on Gleason’s theorem, which mathematically justifies Born’s rule. Here I wish to follow a similar approach to generalize the formulation for conditional probability when the measurements are associated with POVM operators.
Mathematical proofs for generalizing Gleason’s theorem to POVM measurements are given by [
28,
29], which is our starting point. Suppose a quantum system
S is prepared such that its state is described by density operator
.
S could be a composite system, which I will discuss later. Let
be a POVM for
S. The probability of measurement with element
resulting in value
is [
28,
29].
and the post-measurement density operators
are given by [
4]
Let
be another POVM for
S. Given post measurement state
, the probability of measurement with element
resulting in value
is, by recursively applying (5),
. Substituting the expression for
in (6), I obtain the conditional probability
There is an underlying assumption in this definition that the probability is assigned only after the measurements are completed. In particular, the first POVM measurement
must be completed in order to be qualified as a condition. We strictly follow this assumption as opposed to assigning a probability with only “pre-measurement”. Pre-measurement refers only to the unitary process that entangles the measured system and measuring apparatus [
30] but without the projection process to single out a particular outcome.
Given the same initial state
, if I swap the order of measurements such that
goes first, followed by
, I obtain a conditional probability
Note
and
; Equations (7) and (8) can be rewritten as
Equations (9) and (10) are not necessarily equal, which indicate that the quantum version of Bayes’ theorem,
does not hold in general in quantum theory. This posts a difficulty to define a joint probability as either
or
because it depends on the order of measurement events. Another consequence is that the laws of total probability, i.e., the quantum version of (4)
does not hold in general either. This is because from (9),
, while
, and these are not equal in general (Note that on the other hand, given (7) and the completeness of POVM elements,
, where
I is the identity operator, it is straightforward to verify that
). We are interested in finding the conditions under which (12) becomes true.
It is well-known that when , i.e., and commute, from (7) and (8), one gets . Consequently, the law of total probability (12) becomes true and a joint probability can be well-defined. However, the situation becomes much complicated when .
Strictly speaking, due to the uncertainty principle, when
and
are non-commutative, the two measurements cannot be performed to obtain definite outcomes at the same time. The conditional probability defined in (7) or (8) needs to be extended to a two-time formulation of conditional probability in order to be applicable when
. There is extensive research on how to construct two-time conditional probability in quantum theory [
7,
17,
18,
19,
20,
21,
22,
23,
24,
25]. One noticeable approach is based on the Page–Wootters timeless formulation [
21,
22,
23,
24,
25]. However, this work will continue to be based on the generalized Gleason theorem for POVM [
28,
29] to derive the two-time conditional probability, and will leave discussion of the Page–Wootters mechanism for
Section 4.
For conceptual clarity, I start the analysis by considering that there is finite nonzero duration for each measurement. After I construct the conditional probability formulation, for practical purpose of calculation, I can approximate the measurement duration to zero. Suppose the first measurement starts at
and completes at
. Here
covers the time duration for both the pre-measurement unitary phase that entangles the measured system and the measuring apparatus, and the projection phase. The measurement process (Theorem 5.2 of [
4] gives a detailed account on how this POVM measurement is physically realized through indirect measurement) is represented by a POVM element
associated with outcome
. Similarly, the second measurement starts at
and completes at
. Between
and
there is a free time evolution for the measured system
S, described by operator
, where
. Since it is only meaningful to assign a probability distribution after a measurement is completed, the two-time conditional probability I want to construct is “given the measurement outcome of
at
where
, what is the probability of measurement outcome
b at
". Mathematically, this two-time conditional probability can be written as
, where
is the initial density operator of
S when the first measurement starts. After the first measurement with POVM element
, the post-measurement state is
. The quantum system
S then time evolves from
to
to a new state
. At
, the second measurement occurs. This is represented by applying POVM element
on
and obtaining outcome
at
with probability
. Substituting
, the two-time conditional probability is
For practical purposes of calculation, I can assume the measurement duration is very small compared to the free evolution time, i.e., and . Then, I can denote as , as , and .
Suppose the two POVM elements
and
are projection measurements,
and
; one can verify that the conditional probability defined in (13) gives the correct transition probability in standard quantum mechanics:
However, Equation (13) is more generic as it is defined with general POVM operators. Note that the denominator in (13)
; Equation (13) can be rewritten as
To analyze the two-time version of the total law of probability, which can be expressed as
I consider a series of two-time measurements
on
N copies of measured system
S with the same initial state
. Each two-time measurement consists a first measurement from one possible POVM element from the complete set
at time
and the same second measurement
at time
. For
, from (15) I have
However, by definition,
. We can see (16) is not true in general. The Theorem next attempts to address the question of under what conditions (16) is valid.
Theorem 1. Let be the density operator for a quantum system S before the measurements. Let and be two POVM elements to measure S at time and , respectively, and is the unitary time evolution operator from to . Select and such that . The law of total probability (16) is true if one of the following conditions is met.
- C1.
, ,
- C2.
and ,
- C3.
is a pure state, given by , is a projection operator and .
The proof of Theorem 1 is in
Appendix A, but a few comments are in order here. First, Condition
implies
. The sequence of operations for
means performing measurement
at
, time evolving the post-measurement state from
to
, performing measurement
at
, and reversing time evolution of the post-measurement state back to
. The sequence of operations
means time evolving the state from
to
, performing measurement
at
, then reversing time evolution of the state back to
, and performing measurement
at
. Condition
says that if these two sequences of operations are equivalent, then the law of total probability (16) holds true.
Second, if the post-measurement state
after the first measurement does not change during free time evolution, such as the case of a spin state in free space, one will have
. Then, Equation (13) can be written as
Equation (18) appears the same as (7), but the precise meaning is different in that the two measurements
and
in (18) are taken at two different times. With such a special post-measurement quantum state, the sufficient conditions in Theorem 1 become
- C1′.
, ,
- C2′.
and ,
- C3′.
, is a projection operator and .
A couple of comments are in order before closing this section. First, when two measurement operations are not commutative, the conditional probability needs to be defined in the two-time formulation. Second, I can give an intuitive explanation of why (16) does not hold in general in quantum theory. As shown in (17), the right-hand side of (16) refers to the summation of traces of multiplication of operators from a series of experiments where two measurements are carried out in a sequence. In the case of a special post-measurement state where (18) holds, this is . Measurement of changes the initial quantum state such that it affects the probability of outcome for a subsequent measurement . However, the term on the left-hand side of (16) refers to the probability of an experiment where only measurement is carried out with the same initial quantum state. There is no reason to assume both sides are equal. Equation (16) holds only in special conditions such as those specified in Theorem 1.
The conclusion here is that one should not take for granted that the law of total probability holds true in general. Instead, sufficient conditions, such as those provided in Theorem 1, need to be clearly called out. Failing to do so may leave a loophole in logical deduction when applying the law of total probability.
3. Application to Composite Systems
In this subsection, I will apply the conditional probability definition to composite quantum systems and reexamine Theorem 1 when measuring composite systems. Suppose the measured system
S consists of two subsystems
and
that are space-like separated. Define
, where
is a local POVM element on subsystem
, and
is an identity operator on subsystem
. Similarly, define
, where
is a local POVM element on subsystem
. By the principle of locality, a local measurement on a subsystem should not impact the other remote subsystem. Therefore,
. For measurement outcomes of two such local measurements, Equations (7) and (8) are correct formulations for conditional probability; the joint probability is well-defined. Consequently, Equations (11) and (12) hold true. There is no need to use the two-time formulation of conditional probability. This is the case for typical Bell tests and has been used to derive the Bell–CHSH inequalities (On the other hand, in the derivation of Bell–CHSH inequalities, identity (1) is further expressed as
where
is a hidden variable. This is known as the
outcome independence assumption [
31,
32]).
However, suppose
, where
is another local POVM element on subsystem
, and
. In this case, Equation (7) is incorrect for conditional probability. The two-time conditional probability formulation is needed and can be calculated as
where
.
Next, I wish to apply the two-time conditional probability to the extended Wigner’s friend (EWF) scenario introduced in [
8]. As shown in
Figure 1, the EWF scenario consists of two space-like separated laboratories
and
. Each laboratory contains half of an entangled pair of systems
and
.
also contains a friend Charlie who can perform measurements on
. Outside
there is a super-observer Alice who can perform different types of measurements on
as a whole. Similarly, there is a friend Debbie in
and a super-observer Bob outside
. Here, four POVM measurements are needed and represented by POVM elements
, where operators
A and
C act on Hilbert space
, and
B and
D act on Hilbert space
. I drop the subscripts of the operators and
for simplifying notations. In a typical EWF experiment, the chosen operators are not all commutative with one another. Specifically,
and
, while
and
. The two-time probability formulation to compute the conditional probability is needed because measurements
C and
D are taken before measurements
A and
B. Since
and
, I can assume measurements
C and
D are taken at the same time,
, as
, while measurement
A and
B are taken at the same, later time
as
. Without loss of clarity, I drop the symbol ⊗ hereafter. Then, the conditional probability for
is given by
where
is the time evolution operator from
to
. The law of total probability I am interested in is
From (21), the R.H.S. of (22) becomes
The summation is over POVM element sets for
and
. Since
, the L.H.S. of (22) is
. Both sides are not equal in general.
In the case that the post-measurement state after the first measurement is unchanged during free time evolution (this is indeed the assumption in the no-go theorems I will analyze in the next section), Equation (21) becomes
Equation (23) is simplified to
and
. In this case, one can derive the following corollary based on Theorem 1.
Corollary 1. In the Extended Wigner’s Friend scenario setup, suppose the post-measurement state is unchanged during free time evolution from to . Select and such that . The law of total probability (22) is true if one of the following conditions is met.
- C4.
and , ,
- C5.
, C and D are projection operators, and
Condition
is quite obvious. Proof of condition (26) is given in
Appendix B.