1. Introduction
A measurement has three characteristics. One is the measurement’s content: this is the question the measurement answers, or equivalently, the physical property whose value the measurement determines. The second characteristic of a measurement is its context: this includes other measurements made together with this one, and the spatiotemporal relations among them. The “togetherness” of two measurements means that there is an empirical rule by which the outcomes of these measurements are paired. The third characteristic of a measurement is the probability distribution of its values. More precisely, all the measurements made in the same context possess a joint distribution which determines the distribution of any given measurement.
Counterfactual definiteness (CFD) of a measurement is its compliance with the following counterfactual statement: had the measurement with the same content been made in another context, its outcome would have been the same. We will argue that under an assumption commonly accepted in quantum mechanics, CFD is always satisfied.
We begin by illustrating the terms and notions mentioned above (and to be formally defined in
Section 2) using a toy example. It is based on the parable of the seer of Nineveh that was introduced by Ernst Specker [
1] and subsequently used by others as a simple example of contextuality [
2]. Omitting the colorful story line, the seer of Nineveh had three boxes obeying the following Magic Box Rules: (MBR1) only two of them could be opened at any given time; (MBR2) regardless of which two boxes were opened, one and only one of them contained a gem, and (MBR3) the gem could be contained in either of the two with equal probabilities. We can take these Magic Box Rules as an analogue of the laws of quantum mechanics. A formal representation of this situation is by the following
system of random variables:
It represents six measurements
having three
contents q and made pairwise in three
contexts c. The content
q of
can be thought of as the question “does the box
q contain a gem?” (
). This question is answered
or
, which are the possible values of
. Equivalently, we can view
q as the property “the contents of the box
q” (
), in which case the possible values of
are “gem” and “no gem.” Irrespective, we will denote the values of
as
and
. The context
c indicates which two boxes have been opened. All other conditions under which the measurement are made (e.g., the shape of the boxes) are the same in the three contexts, so we do not list them in the definition of a context. Contexts are always mutually exclusive, by definition: random variables measured in different conditions never co-occur, there is no empirical rule for pairing the values of, say,
and
.
The rules MBR2 and MBR3 say that in each of the three contexts, the two measurements
and
made in it have the joint distribution
The individual distribution of each
, by MBR3, is one and the same for all random variables:
and
with equal probabilities. In particular, it is the same for any two measurements answering the same question in different contexts, such as
and
. If one repeatedly observes openings of the box
, and sees no other boxes, one has no way of determining in which context the box is being opened, in the one with the box
or with the box
. This is a special case of an assumption we are going to make throughout this paper: in the quantum mechanical literature it is known under variety of names, such as
non-signaling or
non-disturbance [
3,
4,
5]. In
Section 2, we will define a strong version of this notion following Abramsky and Brandenburger [
6].
1.1. Noncontextual Representation of Variables
A standard way of introducing the notion of contextuality, applying it to our example, is to ask:
- Q0:
is it possible to treat all random variables in the system as if any two variables with the same content were identical?
For instance, and have the same content, both answer the question : “Does the box #1 contain a gem?”. The other boxes opened together with the box #1 (i.e., the box #2 in context or the box #3 in context ) in no way affect the distribution of the possible answers to the question . Someone who observes the box #1 repeatedly, without seeing the other boxes, has no way of determining the context of the box #1 when it is opened. Therefore it seems it should be possible to simply view and as one and the same variable. Analogous reasoning applies to other pairs of measurements sharing a content, and .
However, it is easy to see that this
noncontextual representation of the variables in our example is not possible. Let us begin by renaming
into
X, and then proceed by identifying other random variables following the Magic Box Rules and noncontextual representation. The first step will yield
where the
in the first row follows MBR2, and the second
X in the first column follows noncontextual representability. Proceeding in this manner, we obtain
and we see that the cell with “?” cannot be filled, as it should be
to maintain noncontextual representation in the third column but it should be
X to follow MBR2 in the second row. The conclusion is that no noncontextual representation of the random variables in our system exists. When this happens, a system is said to be
contextual (otherwise it is
noncontextual).
1.2. Counterfactual Definiteness
One can, however, approach our system in a different way. Given that a box was opened in some context (that we will call the factual context), one can ask: had this box been opened in another context (called counterfactual), would the outcome have been the same? The term “outcome” has two meanings: “random variable” and “value of a random variable.” In the present context, however, the two are interchangeable, and the counterfactual question can also be formulated thus: had this box been opened in another context, would the counterfactual variable have been representable by the same random variable as the factual one, R? The reason for this is that we can think of the counterfactual question about values of the variables being asked repeatedly, and and R can always have the same value if and only if .
In the contextuality literature the counterfactual question above is considered to be logically equivalent to Q0 [
7,
8,
9,
10,
11,
12]. However, a detailed analysis shows this is not the case. Using our example (
1), consider the situation when the factual context is
, i.e., we observe the values of the
factual variables and
. One can then ask two counterfactual questions:
- Q1:
if instead of one had recorded (the same box in context ), would have been the same as ?
- Q2:
if instead of one had recorded (the same box in context ), would have been the same as ?
It is easy to see that in our example the answer to both questions is affirmative, in the sense that there is nothing in the Magic Box Rules that would prevent one from considering a counterfactual variable identical to the corresponding factual one. Denoting
by
X, we have, for Q1,
Moreover, this representation can never come into a conflict with other variables in the same counterfactual context:
We can repeat the same reasoning for Q2:
We have here a special case of the general theorem proved in the next section: it says that if a system satisfies the no-disturbance condition, then for any factual context and any counterfactual one, the variables in the latter can be chosen so that the same-content variables in the two contexts are identical. That is, any system with no disturbance has the property of CFD.
Returning to our example, however, we have a natural question to ask: What if the questions Q1 and Q2 are answered together? Would we not run into a contradiction then? What we have is
And it seems that we indeed have run into a contradiction, because in the first column the variables are not the same. However, one can notice this only if one compares two counterfactual contexts to each other with the purpose of determining if they comply with noncontextual representability. In other words, one notices this contradiction if the question one answers is Q0 rather than Q1 and Q2.
We already know that the system is contextual, i.e., Q0 is answered in the negative. What we should be interested in now is whether a contradiction occurs if we deal only with the counterfactual questions, without explicitly involving noncontextual representability. It is clear, however, that one cannot formulate purely counterfactual questions to compare two counterfactual contexts without making one of them factual. It is logically impossible.
The principal difference between noncontextual representability and CFD is that the latter puts the system into a frame of reference formed by the choice of a factual context. Changing the factual context changes the frame of reference. In this picture, noncontextual representability can be viewed as the possibility of reconciling all different frames of reference. However, this is an additional and different question—about contextuality. A system may be contextual or noncontextual, but CFD is satisfied always.
3. Concluding Remarks and Possible Generalizations
3.1. Counterfactual Definiteness and Noncontextuality
We have shown that compliance with CFD and noncontextuality of a system of random variables are related but different concepts: CFD is always satisfied as a consequence of non-disturbance (s.c.c.) property, irrespective of whether the system is contextual. The mathematical reason for this is that a counterfactual question creates a single frame of reference, a factual context, which, together with the variables in the counterfactual contexts to which the question pertains, forms a noncontextual subsystem. The overall (non)contextuality is a property without such a frame of reference, or one in which all different frames of reference are reconciled. However, to achieve such a reconciliation, or to establish that it is not possible, one has to use conceptual means that cannot be presented in the form of counterfactual questions.
Interestingly, David Mermin, in his well-known paper [
20], makes a distinction between
Strong Baseball Principle, corresponding to CFD, and
Very Strong Baseball Principle, corresponding to overall noncontextuality. Except for terminological and expository differences, the Strong Baseball Principle is introduced as in our example (
8), by choosing a factual context and creating two counterfactual ones following CFD. Then Mermin compares the two counterfactual contexts to each other, and says: “This last application of the Strong Baseball Principle, by comparing hypothetical cases, has a different character than the first two, which compare a hypothetical case with the real one, and here it might more accurately be termed the Very Strong Baseball Principle.” Mermin proceeds, however, by arguing that the latter should not be treated separately from the Strong Baseball Principle, the argument being that comparing counterfactual contexts is as “reasonable” and “permissible” as comparing them with a factual context. One of the authors of the present paper maintained the same position in Ref. [
12], using similar arguments. Our position here is that being equally reasonable and permissible, CFD and noncontextuality are logically distinct principles, of which only the latter may fail to hold in s.c.c. systems.
We find a similar situation in a 1981 paper by John Clauser and Abner Shimony [
9]. They analyze Henry Stapp’s [
7] approach to the derivation of Bell’s inequalities, according to which they follow from certain four equations. These equations can be interpreted as stating that CFD holds for the four F-CF subsystems of the EPR/Bohm system, i.e.,
in our example (
10). Thus, Stapp’s position is that whenever
is contextual, it must contravene CFD. Clauser and Shimony point out that Stapp’s equations only apply to the pairs of contexts (F-CF pairs, in our terminology), and that “Stapp has not given a reason for demanding the existence of a quadruple of possible worlds which mesh together [these four pairs]”. However, they seem to accept Stapp’s response to this objection, which we do not present here as, in our view, it misses the point. Clauser and Shimony’s objection is accepted as valid by Bernard d’Espagnat [
21], but his position, in contrast to ours, seems to be skeptical of the meaningfulness of CFD altogether.
For completeness, we should mention that Robert Griffiths [
22,
23] also argues that CFD is always satisfied in quantum mechanics. His argumentation, however, is very different from ours. Moreover, unlike in our paper, Griffiths considers CFD to be completely unrelated to noncontextuality.
3.2. Systems with Signaling/Disturbance
CbD provides a generalization of the notion of (non)contextuality to systems that are not necessarily s.c.c. In fact, the distributions of the random variables in the system can be arbitrary. It is interesting to see if CFD generalizes similarly, and if so, what the relations are between generalized (non)contextuality and generalized CFD.
Definition 6. A system in (11) is considered noncontextual in CbD if it has a multimaximally connected
coupling, defined as an indexed set of jointly distributed variablessuch that, (1) for any and any with for all , the distribution of is the same as the distribution of ; and (2) the probability of , for any , is maximal possible. For s.c.c. systems, this definition specializes to Definition 2, with the maximal probability of being 1 (because of which they both can be renamed into ).
One can now apply Definition 5 to an arbitrary system :
Definition 7. A system is said to satisfy generalized CFD (gCFD), if all of its F-CF subsystems are noncontextual.
The intuitive meaning of gCFD is that the outcome of a counterfactual measurement, had it been made, would have been the same as the factual one with the highest possible probability (given the distributions of the two measurements).
We now have no analogue of Theorem 1. Based on the general properties of (non)contextuality, one can only say that (A) if a system is noncontextual then so are all of its F-CF subsystems (i.e., the system satisfies gCFD); and (B) if a system does not satisfy gCFD, then it is contextual. Moreover, all s.c.c. systems provide evidence that compliance with gCFD (in the form of CFD) does not imply noncontextuality. All
cyclic systems [
24], even if not s.c.c. [
25], demonstrate the same. For instance, the system
in our example (
10) is contextual whenever
with the same meaning of the terms as in (
14), which is a special case of (
25). At the same time, every F-CF subsystem of this system is trivially noncontextual (because single-variable rows cannot contribute to contextuality):
More work is needed to find out if gCFD and more generally F-CF systems may productively complement the notion of (non)contextuality in the theory of systems of random variables.