Quantum Theory from Rules on Information Acquisition

Höhn, Philipp Andres

doi:10.3390/e19030098

Open Access Review


by Philipp Andres Höhn ¹,²

¹ Vienna Center for Quantum Science and Technology, University of Vienna, Boltzmanngasse 5, 1090 Vienna, Austria

² Institute for Quantum Optics and Quantum Information, Austrian Academy of Sciences, Boltzmanngasse 3, 1090 Vienna, Austria

Entropy 2017, 19(3), 98; https://doi.org/10.3390/e19030098

Submission received: 23 January 2017 / Accepted: 17 February 2017 / Published: 3 March 2017

(This article belongs to the Special Issue Quantum Information and Foundations)


Abstract:

We summarize a recent reconstruction of the quantum theory of qubits from rules constraining an observer’s acquisition of information about physical systems. This review is accessible and fairly self-contained, focusing on the main ideas and results and not the technical details. The reconstruction offers an informational explanation for the architecture of the theory and specifically for its correlation structure. In particular, it explains entanglement, monogamy and non-locality compellingly from limited accessible information and complementarity. As a by-product, it also unravels new ‘conserved informational charges’ from complementarity relations that characterize the unitary group and the set of pure states.

Keywords:

reconstruction of quantum theory; entanglement; monogamy; quantum non-locality; conserved informational charges; limited information; complementarity; characterization of unitary group and state spaces

1. Introduction

Why is the physical world described by quantum theory? If we wish to sensibly address this question, we have to step beyond quantum theory and consider it within a landscape of alternative theories. This, after all, permits us to ponder how the world could have been different, possibly described by modifications of quantum theory. Such an endeavor forces us to leave behind the usual textbook formulation of quantum theory, and everything we take for granted about it, and to develop a more general language that also applies to alternative theories. Ideally, this language should be operational, encompassing the interactions of some observer with physical systems in a plethora of conceivable, physically distinct worlds.

If we also wish to provide a possible answer to the above question, we then have to find physical properties of quantum theory that single it out, at least within the given landscape of alternatives. In particular, the goal should be to find an operational justification for the textbook axioms, i.e., ultimately for complex Hilbert spaces, unitary dynamics, tensor product structure for composite systems, Born rule, and so on. The result would be a reconstruction of quantum theory from operational axioms [1,2,3,4,5,6,7,8,9,10] and should ideally yield a better understanding of what quantum theory tells us about Nature and why it is the way it is.

In this manuscript, we shall review and summarize how the quantum formalism for arbitrarily many qubits can be reconstructed from operational rules restricting an observer’s acquisition of information about a set of observed systems [1,2]. The goal of this summary is to provide a didactic and easily accessible overview of this reconstruction. Its underlying framework is especially engineered for unraveling the architecture of quantum theory, and so many reconstruction steps are instructive for understanding the origin of quantum properties. As we shall see, this reconstruction provides a transparent, informational explanation for the structure of qubit quantum theory and especially for its paradigmatic features, such as entanglement, monogamy and non-locality. The approach also produces novel ‘conserved informational charges’, which indeed appear in quantum theory, turn out to characterize the unitary group and the set of pure states, and might find practical applications in quantum information.

The premise of the summarized approach is to only speak about information that the observer has access to. It is thus purely operational and survives without any ontological commitments. This approach is inspired, in part, by Rovelli’s relational quantum mechanics [11] and the Brukner–Zeilinger informational interpretation of quantum theory [12,13]; this successful reconstruction can be viewed as a completion of these ideas for qubit systems.

The rest of the manuscript is organized as follows. In Section 2, we review the landscape of alternative theories; in Section 3, we formulate the operational quantum axioms; in Section 4, we summarize the key steps of the reconstruction itself and, finally, conclude in Section 5.

2. Overview of a Landscape of Theories

We shall begin with an overview of a landscape of alternative theories, which was developed in [1,2], to which we also refer for further details.

2.1. From Questions and Answers to Probabilities and States

Our first aim is to define a notion of a state both for a single system and an ensemble of systems.

Consider an observer O who interrogates an ensemble of (identically prepared [1]) systems $\{S_a\}_{a=1}^{n}$, coming out of a preparation device, with binary questions $Q_i$ from some set $\mathcal{Q}$. For example, in the case of quantum theory, such a question could read “is the spin of the electron up in x-direction?” This set $\mathcal{Q}$ shall only contain repeatable questions in the sense that O will receive the same answer $m \in \mathbb{N}$ times whenever asking any $Q_i \in \mathcal{Q}$ m times in immediate succession to a single system $S_a$. We shall assume any $S_a$ to always give a definite answer if asked some $Q_i \in \mathcal{Q}$, which moreover is not independent of $S_a$’s preparation. Accordingly, $\mathcal{Q}$ can only contain physically implementable questions, which are ‘answerable’ by the $\{S_a\}$, and not arbitrary logically conceivable binary questions. Furthermore, since we assume definite answers, we do not address the measurement problem. The answers to the $Q_i \in \mathcal{Q}$ given by the $\{S_a\}$ shall follow specific statistics for each way of preparing the $\{S_a\}$ (for n sufficiently large). The set of all the possible answer statistics for all $Q_i \in \mathcal{Q}$ for all preparations is denoted by Σ.

O, being a good experimenter, has developed, through his experiments, a theoretical model for $\mathcal{Q}$ and Σ, which he employs to interpret the outcomes of his interrogations (and to decide whether a question is in $\mathcal{Q}$ or not). This permits O to assign, for the next $S_a$ to be interrogated, a prior probability $y_i$ that $S_a$’s answer to $Q_i \in \mathcal{Q}$ will be ‘yes’. Namely, O determines $y_i$ through a belief updating, in a broadly Bayesian spirit, according to his model of Σ, any prior information on the way of preparation and possibly the frequencies of ‘yes’ answers to questions from $\mathcal{Q}$, which he may have recorded in previous interrogation runs on systems identically prepared to $S_a$. (We add “broadly” here as we also consider the typical laboratory situation of an ensemble of systems.) In particular, O may also not have carried out previous interrogations on systems identically prepared to $S_a$ (e.g., if the ensemble contains only the single $S_a$), in which case he will estimate the prior $y_i$ for the single $S_a$ solely according to his model of Σ and any prior information about the preparation (this and the update rules are discussed further in Section 2.3 and Section 2.4).

While $\mathcal{Q}$ need not necessarily contain all binary measurements that O could, in principle, perform on the $\{S_a\}$, we shall assume that $\mathcal{Q}$ is ‘tomographically complete’ in the sense that the $\{y_i\}_{\forall Q_i \in \mathcal{Q}}$ are sufficient to compute the probabilities for all other physically realizable measurements, including those not contained in $\mathcal{Q}$. Hence, the $y_i$ encode everything O could possibly say about the future outcomes of arbitrary experiments on the $\{S_a\}$ in his laboratory. It will therefore be sufficient to henceforth restrict O to acquire information about the $S_a$ solely through the $Q_i \in \mathcal{Q}$. It is also natural to identify O’s ‘catalog of knowledge’ about the given $S_a$, i.e., the collection of $\{y_i\}_{\forall Q_i \in \mathcal{Q}}$, with the state of $S_a$ relative to O. This is a state of information and an element of Σ. Conversely, any element in Σ assigns a probability $y_i$ to all $Q_i \in \mathcal{Q}$. Thus, we identify Σ with the state space of $S_a$.

The state $\{y_i\}_{\forall Q_i \in \mathcal{Q}}$ is the prior state for the single $S_a$ to be interrogated next, but also coincides with the state O assigns to the ensemble $\{S_a\}$ (which may only contain a single member) given that its members are identically prepared [1].

2.2. Time Evolution of O’s “Catalog of Knowledge”

We permit O to subject the $\{S_a\}$ to interactions, which cause a state $\{y_i(t_0)\}_{\forall Q_i \in \mathcal{Q}}$ at time $t_0$ to evolve in time to another legitimate state. Any permitted time evolution shall be temporally translation invariant, thus defining a one-parameter map $T_{\Delta t}(\{y_i(t_0)\}_{\forall Q_i \in \mathcal{Q}}) = \{y_i(t_0 + \Delta t)\}_{\forall Q_i \in \mathcal{Q}}$ from Σ to itself, which only depends on the time interval $\Delta t$, but not on $t_0$. We denote by $\mathcal{T}$ the set of all time evolutions to which we allow O to expose the $\{S_a\}$.

Clearly, $\mathcal{T}$ is a further crucial ingredient of O’s world model; his model for describing his interrogations of the $\{S_a\}$ is thus encoded in the triple $(\mathcal{Q}, \Sigma, \mathcal{T})$.

2.3. Convexity and State of No Information

It will be our challenge to unravel what O’s world model is. This requires us to subject the triple $(\mathcal{Q}, \Sigma, \mathcal{T})$ to a number of further operational conditions that are ‘natural’ in the context of information acquisition in a broadly Bayesian spirit. Upon imposing the quantum postulates, this will turn out to restrict $\mathcal{Q}$ and $\mathcal{T}$ to incorporate only a ‘natural’ subset of all possible quantum measurements and time evolutions, namely projective binary measurements and unitaries, respectively (rather than arbitrary positive operator-valued measures (POVMs) and completely positive maps). However, this suffices for our purpose of reconstructing the textbook quantum formalism.

To account for the possibility of randomness in the method of preparation, we assume Σ to be convex. Consider a collection of identical systems (i.e., with identical $(\mathcal{Q}, \Sigma, \mathcal{T})$) that are not necessarily in identical states and for which O uses a cascade of biased coin tosses to decide which system to interrogate. O can then assign a single prior state to this collection, which is a convex combination of their individual states.

Next, we assume the existence of a special method of preparation, which generates completely random answer statistics over all $Q_i \in \mathcal{Q}$. This preparation is described by a special state in Σ, namely $y_i = \frac{1}{2}$, $\forall Q_i \in \mathcal{Q}$, and shall be called the state of no information. The existence of this distinguished state is a constraint on the pair $(\mathcal{Q}, \Sigma)$. (E.g., in quantum theory, the pair $(\{\text{binary POVMs}\}, \{\text{density matrices}\})$ does not satisfy this condition because there exist inherently biased POVMs, while $(\{\text{projective binary measurements}\}, \{\text{density matrices}\})$ does.) The state of no information plays two crucial roles: it defines (1) the prior state of $S_a$ that O will start with in a Bayesian updating when he has no ‘prior information’ about the $\{S_a\}$ (except what his model $(\mathcal{Q}, \Sigma, \mathcal{T})$ is); and (2) an unambiguous notion of the (in-)dependence of questions (cf. Section 2.4), which otherwise would be state dependent. (E.g., in quantum theory, the questions $Q_{x_1} =$ “Is the spin of Qubit 1 up in x-direction?” and $Q_{x_2} =$ “Is the spin of Qubit 2 up in x-direction?” are independent relative to the completely mixed state, but not relative to a state with entanglement in x-direction.)

2.4. State Updating and (In)Dependence and Compatibility of Questions

There are two kinds of state update rules, one for the state of the ensemble $\{S_a\}$ (which coincides with the prior state assigned to the next $S_a$ to be interrogated) and one for the posterior state of a given ensemble member $S_a$. In a single shot interrogation, O receives a single $S_a$, assigns a prior state to it according to his prior information (cf. Section 2.1), interrogates it with some questions from $\mathcal{Q}$ (without intermediate re-preparation) and, depending on the answers, updates the prior to a posterior state valid for this specific $S_a$ only. This requires a consistent posterior state update rule, which permits O to update the probabilities $y_i$ for all $Q_i \in \mathcal{Q}$ in a manner that respects the structure of Σ and the repeatability of questions (i.e., an answer $Q_i =$ ‘yes’ or ‘no’ must have a posterior $y_i = 1$ or 0 as a consequence, respectively). This is also a belief updating, but about the single $S_a$, and is not the same as in Section 2.1 and Section 2.3. Specifically, the posterior state of $S_a$ may differ significantly from its prior state if O has experienced an information gain on at least some $Q_i \in \mathcal{Q}$ (this will necessarily happen when complementary questions are involved; see below). This is the ‘collapse’ of the state: it is merely O’s update of information about the specific $S_a$ [1].

By contrast, in a multiple shot interrogation, O carries out a single shot interrogation on each member of an entire (identically prepared [1]) ensemble $\{S_a\}$ to do ensemble state tomography and estimate the state of the ensemble from his prior information about the preparation and the collection of posterior states from the single shot interrogations. With every further interrogated $S_a$, O updates the ensemble state, which coincides with the prior state of the next system from the ensemble to be interrogated. Accordingly, this requires a prior state update rule. This is the belief updating about the ensemble $\{S_a\}$ alluded to in Section 2.1 and Section 2.3.

It will not be necessary to specify these two update rules in detail; we just assume O uses consistent ones. Specifically, given a posterior state update rule, we shall call $Q_i, Q_j \in \mathcal{Q}$:

(maximally) independent if, after having asked $Q_i$ to S in the state of no information, the posterior probability $y_j = \frac{1}{2}$. That is, if the answer to $Q_i$ relative to the state of no information tells O ‘nothing’ about the answer to $Q_j$.

dependent if, after having asked $Q_i$ to S in the state of no information, the posterior probability $y_j \neq \frac{1}{2}$ (if $y_j = 0$ or 1, they are maximally dependent). That is, if the answer to $Q_i$ relative to the state of no information gives O at least partial information about the answer to $Q_j$.

(maximally) compatible if O may know the answers to both $Q_i, Q_j$ simultaneously, i.e., if there exists a state in Σ such that $y_i, y_j$ can be simultaneously zero or one.

(maximally) complementary if every state in Σ, which features $y_i = 0, 1$, necessarily implies $y_j = \frac{1}{2}$. Notice that complementarity implies independence (but not vice versa).

(One can also define partial compatibility similarly [1].) These relations shall be symmetric; e.g., $Q_i$ is independent of $Q_j$ if and only if $Q_j$ is independent of $Q_i$, etc.

We impose a final condition on the posterior state update rule: if $Q_i, Q_j$ are maximally compatible and independent, then asking $Q_i$ shall not change $y_j$, i.e., O’s information about $Q_j$.

2.5. Informational Completeness

The fundamental building blocks of the theories in the landscape that we are constructing are to be sets of pairwise independent questions. This will help to render the convoluted parametrization of a state by $\{y_i\}_{\forall Q_i \in \mathcal{Q}}$ more economical. Consider a set of pairwise independent questions $\mathcal{Q}_M := \{Q_1, \dots, Q_D\}$; it is called maximal if no question from $\mathcal{Q} \setminus \mathcal{Q}_M$ can be added to $\mathcal{Q}_M$ without destroying the pairwise independence of its elements. We shall assume that any maximal $\mathcal{Q}_M$ is informationally complete in the sense that all $\{y_i\}_{\forall Q_i \in \mathcal{Q}}$ can be computed from the corresponding probabilities $\{y_i\}_{i=1}^{D}$ for all states in Σ. Any such $\mathcal{Q}_M$ features D elements [1] such that Σ becomes a D-dimensional convex set and states become vectors:

$$\vec{y} = \begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_D \end{pmatrix}.$$

2.6. Information Measure

Our focus is O’s acquisition of information, so we need to quantify O’s information about the systems. Since $Q_i \in \mathcal{Q}$ is binary, we quantify O’s information about $S_a$’s answer to it by a function $\alpha(y_i)$ with $0 \leq \alpha(y_i) \leq 1$ bit and $\alpha(y) = 0$ bit ⇔ $y = \frac{1}{2}$ and $\alpha(1) = \alpha(0) = 1$ bit. O’s total information about an $S_a$ must be a function of the state; we make an additive ansatz:

$$I(\vec{y}) := \sum_{i=1}^{D} \alpha(y_i). \qquad (1)$$

The quantum postulates will single out the specific function α.

Consider a set $\{Q_1, \dots, Q_n\}$ of mutually (maximally) complementary questions. It is clear that whenever O has maximal information $\alpha(y_i) = 1$ bit about $Q_i$ from this set, he must have zero bits of information about all other questions in the set. We require more generally that such a set cannot support more than one bit of information, regardless of the state:

$$\alpha(y_1) + \dots + \alpha(y_n) \leq 1 \text{ bit}, \qquad (2)$$

for otherwise O could, for some states, reduce his total information about such a set by asking another question from it. These complementarity inequalities represent informational uncertainty relations that describe how the information gain about one question enforces an information loss about questions complementary to it (see also the state ‘collapse’ in Section 2.4).
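To make inequality (2) concrete, here is a minimal numerical sketch for a single qubit. It rests on two assumptions that are not part of the text up to this point: the quadratic measure α(y) = (2y − 1)², used here only as a candidate satisfying the stated boundary conditions, and the standard Bloch-ball parametrization of qubit states, in which the three spin questions along x, y and z are mutually complementary. All function names are hypothetical.

```python
import random

def alpha(y):
    """Assumed information measure in bits: 0 at y = 1/2, 1 at y = 0 or y = 1."""
    return (2.0 * y - 1.0) ** 2

def random_bloch_vector(rng):
    """Rejection-sample a point of the unit ball (an assumed qubit state)."""
    while True:
        r = [rng.uniform(-1.0, 1.0) for _ in range(3)]
        if sum(c * c for c in r) <= 1.0:
            return r

def total_information(r):
    """Left-hand side of (2) for the three complementary spin questions."""
    ys = [(1.0 + c) / 2.0 for c in r]  # 'yes' probabilities y_x, y_y, y_z
    return sum(alpha(y) for y in ys)

rng = random.Random(0)
samples = [random_bloch_vector(rng) for _ in range(1000)]
max_info = max(total_information(r) for r in samples)
print("largest total information found:", max_info)
```

Under these assumptions the total equals the squared length of the Bloch vector, so the bound of one bit is saturated exactly by the states on the surface of the ball.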

2.7. Composite Systems and (Classical) Rules of Inference

O must be able to decompose a composite system into its constituents purely by means of the information accessible to him through interrogation and thus ultimately by means of the question sets. Let systems $S_A, S_B$ have question sets $\mathcal{Q}_A, \mathcal{Q}_B$. It is then natural to say that they define a composite system $S_{AB}$ if any $Q_a \in \mathcal{Q}_A$ is maximally compatible with any $Q_b \in \mathcal{Q}_B$ and if:

$$\mathcal{Q}_{AB} = \mathcal{Q}_A \cup \mathcal{Q}_B \cup \tilde{\mathcal{Q}}_{AB}, \qquad (3)$$

where $\tilde{\mathcal{Q}}_{AB}$ only contains composite questions, which are iterative compositions, $Q_a *_1 Q_b$, $Q_a *_2 (Q_{a'} *_3 Q_b)$, $(Q_a *_4 Q_b) *_5 Q_{b'}$, $(Q_a *_6 Q_b) *_7 (Q_{a'} *_8 Q_{b'})$, $\dots$, via some logical connectives $*_1, *_2, *_3, \dots$, of individual questions $Q_a, Q_{a'}, \dots \in \mathcal{Q}_A$ about $S_A$ and $Q_b, Q_{b'}, \dots \in \mathcal{Q}_B$ about $S_B$. This definition is extended recursively to composite systems with more than two subsystems.

Since O can never test the truthfulness of statements about the logical connectives of complementary questions through interrogations and since all propositions must have operational meaning, we shall permit O to logically connect two (possibly composite) questions directly with some * only if they are compatible. For the same reason, O is allowed to apply classical rules of inference (in terms of Boolean logic) exclusively to sets of mutually-compatible questions.

We stress that this definition of composite systems is distinct from the usual state tensor product rule in generalized probabilistic theories coming from local tomography [3,4,5]. In particular, this composition rule admits non-locally tomographic composites (see Section 4.3).

2.8. Computing Probabilities and Questions as Vectors

Thanks to informational completeness, the probability function $Y(Q | \vec{y}) \in [0, 1]$ that $Q =$ ‘yes’, given the state $\vec{y}$, exists for all $Q \in \mathcal{Q}$ and $\vec{y} \in \Sigma$. As shown in [2], the exhibited structure yields:

$$Y(Q | \vec{y}) = Y(\vec{q} | \vec{y}) = \frac{1}{2} \left( \vec{q} \cdot (2 \vec{y} - \vec{1}) + 1 \right), \qquad (4)$$

where $\vec{q} \in \mathbb{R}^{D}$ is a question vector encoding $Q \in \mathcal{Q}$ and $\vec{1}$ is a vector with each coefficient equal to one in the basis corresponding to $\mathcal{Q}_M$. This equation gives rise to (part of) the Born rule.
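As a quick sanity check of Equation (4), the following sketch evaluates it for a few inputs. The function name, the choice D = 3 and the sample state are illustrative assumptions; the one structural observation is that a unit basis vector as question vector returns the corresponding component of the state, as one would expect for a question taken from the informationally complete set itself.

```python
def yes_probability(q, y):
    """Equation (4): Y(q | y) = (q . (2y - 1) + 1) / 2."""
    if len(q) != len(y):
        raise ValueError("question vector and state must have equal length")
    overlap = sum(qi * (2.0 * yi - 1.0) for qi, yi in zip(q, y))
    return 0.5 * (overlap + 1.0)

D = 3                               # illustrative dimension
no_information = [0.5] * D          # state of no information: all y_i = 1/2
basis_question = [0.0, 1.0, 0.0]    # assumed encoding of the second basis question
state = [0.9, 0.25, 0.5]            # hypothetical state vector

p_no_info = yes_probability(basis_question, no_information)
p_basis = yes_probability(basis_question, state)
print(p_no_info, p_basis)
```

In the state of no information every question vector yields probability 1/2, because the factor 2y − 1 vanishes componentwise.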

Suppose $Q, Q' \in \mathcal{Q}$ were both encoded by the same $\vec{q}$. Then, by (4), they would be probabilistically indistinguishable, and O must view them as logically equivalent. O is free to remove any such redundancy from his description of $\mathcal{Q}$, upon which every permissible question vector $\vec{q}$ will encode a unique $Q \in \mathcal{Q}$. Finally, for every $Q \in \mathcal{Q}$, there exists a state $\vec{y}_Q$, which is the updated posterior state of $S_a$ after O received a ‘yes’ answer to the single question Q from $S_a$ in the (prior) state of no information. O had zero bits of information before, and $\vec{y}_Q$ encodes a single independent question answer, so we naturally require that it encodes one independent bit. Hence, for every $Q \in \mathcal{Q}$, there exists $\vec{y}_Q \in \Sigma$ with $I(\vec{y}_Q) = 1$ bit, such that $Y(Q | \vec{y}_Q) = 1$. (In quantum theory, the $\vec{y}_Q$ will turn out to be pure states only for a single qubit; e.g., for two qubits and $Q =$ ‘Is the spin of Qubit 1 up in z-direction?’, represented by the rank-two projector $P_{z_1} = \frac{1}{2} (\mathbf{1} + \sigma_z \otimes \mathbf{1}_{2 \times 2})$, $\vec{y}_Q$ corresponds to the mixed state $\rho_{z_1} = \frac{1}{4} (\mathbf{1} + \sigma_z \otimes \mathbf{1}_{2 \times 2})$. Clearly, $\mathrm{tr}(P_{z_1} \rho_{z_1}) = 1$.)
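The two-qubit example in the parenthesis can be verified directly. The sketch below builds the projector and the state from the formulas quoted above with small hand-written matrix helpers (all helper names are hypothetical, and nested lists stand in for matrices) and checks the trace, the rank of the projector and the purity of the state.

```python
def kron(a, b):
    """Kronecker product of two square matrices given as nested lists."""
    return [[a[i][j] * b[k][l] for j in range(len(a)) for l in range(len(b))]
            for i in range(len(a)) for k in range(len(b))]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

I2 = [[1, 0], [0, 1]]
SIGMA_Z = [[1, 0], [0, -1]]

I4 = kron(I2, I2)
sz1 = kron(SIGMA_Z, I2)                 # sigma_z acting on Qubit 1 only

P = scale(0.5, add(I4, sz1))            # projector P_{z_1}
rho = scale(0.25, add(I4, sz1))         # state rho_{z_1}

prob_yes = trace(matmul(P, rho))        # Born rule probability for 'yes'
rank_P = trace(P)                       # trace of a projector equals its rank
purity = trace(matmul(rho, rho))        # equals 1 only for pure states
print(prob_yes, rank_P, purity)
```

A purity below one confirms that this one-bit state is mixed, in line with the remark that the states $\vec{y}_Q$ are pure only for a single qubit.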

3. The Quantum Principles as Rules Constraining O’s Information Acquisition

In the sequel, we consider the most elementary of information carriers. Within the introduced landscape of theories, we now establish rules on O’s acquisition of information that single out the quantum theory of a composite system $S_N$ of $N \in \mathbb{N}$ qubits, modeled in our language by a triple $(\mathcal{Q}_N, \Sigma_N, \mathcal{T}_N)$. Effectively, these rules constitute a set of ‘coordinates’ for quantum theory on this landscape. The rules are spelled out first colloquially, then mathematically and are motivated in more detail in [1,2].

Empirically, the information accessible to an experimenter about (characteristic properties of) elementary systems is limited. For example, an experimenter may know one binary proposition about an electron (e.g., its spin in x-direction), but nothing fully independent of it (and similarly for a classical bit). We shall characterize a composition of N elementary systems according to how much information is, in principle, simultaneously available to O.

Rule 1. (Limited information) “The observer O can acquire maximally $N \in \mathbb{N}$ independent bits of information about the system $S_N$ at any moment of time.”

There exists a maximal set $Q_i$, $i = 1, \dots, N$, of N mutually maximally independent and compatible questions in $\mathcal{Q}_N$.

O can thereby distinguish maximally $2^N$ states of $S_N$ in a single shot interrogation.

However, empirically, elementary systems admit more independent propositions than what, due to the information limit, they are able to answer at a time. This is Bohr’s complementarity. The unanswered properties must be random (and so ‘in superposition’) because the information limit makes it impossible to ascribe definite outcomes to them. For example, an experimenter may also inquire about the spin of the electron in y-direction. Yet doing so is at the total expense of his information about its spin in the x- and z-directions, and subsequent such measurements have random outcomes. For the N elementary systems, we assert the existence of complementarity.

Rule 2. (Complementarity) “The observer O can always get up to N new independent bits of information about the system $S_N$. However, whenever O asks $S_N$ a new question, he experiences no net loss in his total amount of information about $S_N$.”

There exists another maximal set $Q_i'$, $i = 1, \dots, N$, of N mutually maximally independent and compatible questions in $\mathcal{Q}_N$, such that $Q_i', Q_i$ are maximally complementary and $Q_i', Q_{j \neq i}$ are maximally compatible.

The peculiar mathematical form of Rule 2 becomes intuitive upon recalling that $S_N$ is a composite system, such that complementarity should exist per elementary system [1].

Rules 1 and 2 are conceptually inspired by (non-technical) proposals made by Rovelli [11] and Zeilinger and Brukner [12,13]. These rules say nothing about what happens in-between interrogations. Naturally, we demand O not to gain or lose information without asking questions.

Rule 3. (Information preservation) “The total amount of information O has about (an otherwise non-interacting) $S_N$ is preserved in-between interrogations.”

$I(\vec{y})$ is constant in time in-between interrogations for (an otherwise non-interacting) $S_N$.

Hence, O’s total information $I(\vec{y})$ is a ‘conserved charge’ of any time evolution $T_{\Delta t} \in \mathcal{T}_N$.

The more interactions are available to which O may subject $S_N$, the more ways in which any state may, in principle, change in time and, thus, the more ‘interesting’ O’s world. We therefore demand that any time evolution is physically realizable as long as it is consistent with the other rules (since $\Sigma_N, \mathcal{T}_N$ are interdependent, this is distinct from ‘maximizing the number’ of states).

Rule 4. (Time evolution) “O’s ‘catalog of knowledge’ about $S_N$ evolves continuously in time in-between interrogations, and every consistent such evolution is physically realizable.”

$\mathcal{T}_N$ is the maximal set of transformations $T_{\Delta t}$ on states such that, for any fixed state $\vec{y}$, $T_{\Delta t}(\vec{y})$ is continuous in $\Delta t$ and compatible with Rules 1–3 (and the structure of the theory landscape).

(If we did not require this ‘maximality’ of $\mathcal{T}_N$, we would still ultimately obtain a linear, unitary evolution, but not necessarily the full unitary group. This is the sole reason for demanding ‘maximality’. Note that Rules 3 and 4 are not equivalent to the axiom of ‘continuous reversibility’ of generalized probabilistic theories [3,4,5].)

We shall also allow O to ask any question to $S_N$ which ‘makes (probabilistic) sense’.

Rule 5. (Question unrestrictedness) “Every question that yields legitimate probabilities for every way of preparing $S_N$ is physically realizable by O.”

Every question vector $\vec{q} \in \mathbb{R}^{D_N}$ that satisfies $Y(\vec{q} | \vec{y}) \in [0, 1]$ $\forall \vec{y} \in \Sigma_N$ and for which there exists $\vec{y}_Q \in \Sigma_N$ with $I(\vec{y}_Q) = 1$ bit, such that $Y(\vec{q} | \vec{y}_Q) = 1$, corresponds to a $Q \in \mathcal{Q}_N$.

(Without Rule 5, we would still obtain the structure of an informationally complete set $\mathcal{Q}_{M_N}$, finding that it encodes a basis of projective Pauli operator measurements [2]; Rule 5 legalizes all such measurements.)

These five rules turn out to leave two solutions for the triple $(\mathcal{Q}_N, \Sigma_N, \mathcal{T}_N)$. Remarkably, they cannot distinguish between complex and real numbers: the two solutions are qubit quantum theory and rebit quantum theory, the latter describing two-level systems over real Hilbert spaces [1,2]. Since the latter is both mathematically and physically a subcase of the former, these five rules can be regarded as sufficient. However, if one also wishes to rule out rebits operationally, then an extra rule, adapted from [3,4,5] and imposed solely for this purpose (it is partially redundant), succeeds.

Rule 6. (Tomographic locality) “O can determine the state of the composite system $S_N$ by interrogating only its subsystems.”

As shown in [1,2], Rules 1–6 are equivalent to the textbook axioms. More precisely:

Claim. The only solution to Rules 1–6 is qubit quantum theory where:

$\Sigma_N \simeq \text{convex hull of } \mathbb{CP}^{2^N - 1}$ is the space of $2^N \times 2^N$ density matrices over $\mathbb{C}^{2^N}$,

states evolve unitarily according to $\mathcal{T}_N \simeq \mathrm{PSU}(2^N)$ and the equation describing the state dynamics is (equivalent to) the von Neumann evolution equation,

$\mathcal{Q}_N \simeq \mathbb{CP}^{2^N - 1}$ is (isomorphic to) the set of projective measurements onto the $+1$ eigenspaces of N-qubit Pauli operators (a Hermitian operator on $\mathbb{C}^{2^N}$ is a Pauli operator iff it has two eigenvalues $\pm 1$ of equal multiplicity), and the probability for $Q \in \mathcal{Q}_N$ to be answered with ‘yes’ in some state is given by the Born rule for projective measurements.

4. Synopsis of the Reconstruction Steps and Key Results

Since the approach gives rise to a constructive derivation of the explicit architecture of qubit quantum theory, it involves a large number of individual steps compared to the rather abstract reconstructions [3,4,5,6,7,8,9,10]. However, this is also rewarding as it offers novel informational explanations for typical features of quantum theory, and so many reconstruction steps are actually quite instructive. We now provide a summary of key results and reconstruction steps from [1,2] (to which we refer for technical details) needed for proving the claim of the previous section.

4.1. Logical Connectives for Building Informationally Complete Sets

The first task is to build informationally complete sets $\mathcal{Q}_{M_N}$ [1]. The conjunction of Rules 1 and 2 implies that $\mathcal{Q}_{M_1} = \{Q_1, Q_2, \dots, Q_{D_1}\}$ for a single elementary system must be a maximal mutually complementary set with $D_1 \geq 2$. We changed notation slightly compared to Rules 1 and 2, labeling complementary questions by numbers, not primes. Of course, in quantum theory, $D_1 = 3$; the more involved $N = 2$ case will entail this. The structure (3) of a composite system implies that $\mathcal{Q}_{M_2}$ should contain individual questions about its subsystems. Continuing with a slight change of notation, we denote $\mathcal{Q}_{M_1}$ for System 1 by $\{Q_1, Q_2, \dots, Q_{D_1}\}$ and for System 2 with a prime by $\{Q_1', Q_2', \dots, Q_{D_1}'\}$. Apart from these individual questions, $\mathcal{Q}_{M_2}$ should contain composite questions $Q_i * Q_j'$ for some connective *. Pairwise independence of $\mathcal{Q}_{M_2}$ enforces that * must satisfy the following truth table, where ‘yes’ $= 1$ and ‘no’ $= 0$ ($Q_i, Q_j'$ are compatible) [1]:

$$\begin{array}{ccc} Q_i & Q_j' & Q_i * Q_j' \\ \hline 0 & 1 & a \\ 1 & 0 & a \\ 1 & 1 & b \\ 0 & 0 & b \end{array} \qquad a \neq b, \quad a, b \in \{0, 1\} \qquad (5)$$

Hence, * is either the XNOR ↔ (for $a = 0$, $b = 1$) or its negation, the XOR ⊕ (for $a = 1$, $b = 0$). Up to an overall negation ¬, the two connectives are logically equivalent, and so we henceforth adopt the convention of building composite questions (for informationally complete sets) using only the XNOR. The composite question $Q_{ij} := Q_i \leftrightarrow Q_j'$ is a ‘correlation question’, representing “are the answers to $Q_i, Q_j'$ the same?” Ultimately, in quantum theory, ↔ will turn out to correspond to the tensor product ⊗ in $\sigma_i \otimes \sigma_j$, where $\sigma_i$ is a Pauli matrix; $Q_{ij}$ will then correspond to “are the spins of Qubit 1 in the i- and of Qubit 2 in the j-direction correlated?”
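The selection of XNOR and XOR by pairwise independence can be illustrated with a brute-force search over all sixteen binary connectives. This is only a classical toy model, not the argument of [1]: it assumes that, relative to the state of no information, the answers to the two compatible and independent questions behave like two fair, uncorrelated bits, and it calls a connective admissible if learning either input alone leaves the composite answer at probability 1/2. All names are hypothetical.

```python
from fractions import Fraction
from itertools import product

INPUTS = list(product((0, 1), repeat=2))   # all answer pairs (Q_i, Q_j')

def conditional_yes(table, position, value):
    """P(composite = 1 | the input at `position` equals `value`), other input uniform."""
    outcomes = [table[pair] for pair in INPUTS if pair[position] == value]
    return Fraction(sum(outcomes), len(outcomes))

def is_pairwise_independent(table):
    """True if knowing either single input tells nothing about the composite answer."""
    half = Fraction(1, 2)
    return all(conditional_yes(table, position, value) == half
               for position in (0, 1) for value in (0, 1))

independent_connectives = []
for outputs in product((0, 1), repeat=4):  # the 16 possible truth tables
    table = dict(zip(INPUTS, outputs))
    if is_pairwise_independent(table):
        independent_connectives.append(table)

xnor = {(a, b): int(a == b) for a, b in INPUTS}
xor = {(a, b): int(a != b) for a, b in INPUTS}
print(len(independent_connectives))
```

Within this toy model exactly two truth tables survive, and they are the ones with equal entries on the rows (0, 1), (1, 0) and the opposite entry on the rows (0, 0), (1, 1), as in table (5).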

4.2. Question Graphs, Independence and Compatibility for $N = 2$ and Entanglement

It is convenient to represent questions graphically: individual questions are represented as vertices and bipartite correlation questions as edges between them. For instance, we may have: [Figure: example question graphs]

Since O is only allowed to connect compatible questions logically, there can be no edge between individual questions of the same system.

Using only Rules 1 and 2 and logical arguments, the following result is proven in [1]:

Lemma 1.

Q_{i}, Q_{j}^{'}, Q_{i j}

are pairwise independent for all

i, j = 1, \dots, D_{1}

and will thus be part of an informationally complete set

Q_{M_{2}}

. Furthermore:

(i): $Q_{i}$ is compatible with $Q_{i j}$ , $\forall j = 1, \dots, D_{1}$ and complementary to $Q_{k j}$ , $\forall k \neq i$ and $\forall j = 1, \dots, D_{1}$ . That is, graphically, an individual question $Q_{i}$ is compatible with a correlation question $Q_{i j}$ if and only if its corresponding vertex is a vertex of the edge corresponding to $Q_{i j}$ . By symmetry, the analogous result holds for $Q_{j}^{'}$ .
(ii): $Q_{i j}$ and $Q_{k l}$ are compatible if and only if $i \neq k$ and $j \neq l$ . That is, graphically, $Q_{i j}$ and $Q_{k l}$ are compatible if their corresponding edges do not intersect in a vertex and complementary if they intersect in one vertex.

For example,

Q_{1}

in the third question graph above is compatible with

Q_{11}

and complementary to

Q_{22}

, while

Q_{11}

and

Q_{22}

are compatible and

Q_{11}

and

Q_{31}

are complementary.

This lemma has a striking consequence: it implies entanglement. Indeed, since, e.g.,

Q_{11}

and

Q_{22}

are independent and compatible, O may spend his maximally accessible amount of

N = 2

independent bits of information (Rule 1) over correlation questions only. Since non-intersecting edges do not share a common vertex, the lemma implies that no individual question is simultaneously compatible with two correlation questions that are compatible. Hence, when knowing the answers to

Q_{11}, Q_{22}

, O will be entirely ignorant about the individual questions; O has then maximal information about

S_{2}

, but purely composite information. This is entanglement in the very sense of Schrödinger (“...the best possible knowledge of a whole does not necessarily include the best possible knowledge of all its parts...” [14]). For example, in quantum theory, a state with

Q_{11} = Q_{22} =

‘yes’ will coincide with a Bell state having the spins of Qubits 1 and 2 correlated in x- and y-direction (and anti-correlated in z-direction). Of course, there is nothing special about

Q_{11}, Q_{22}

, and the argument works similarly for other composite question pairs and can be extended also to states with non-maximal entanglement (see [1] for details).
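The graphical rules of Lemma 1 and the entanglement argument can be illustrated with a small data structure. The following is a minimal sketch for the case of three individual questions per system; the tuple encoding of vertices and edges and all function names are assumptions made for this example.

```python
from itertools import product

D1 = 3  # individual questions per system (the case later called 'qubit')

# Vertices: ("A", i) stands for Q_i of System 1, ("B", j) for Q_j' of System 2.
# Edges: ("E", i, j) stands for the correlation question Q_ij.
VERTICES = [("A", i) for i in range(1, D1 + 1)] + [("B", j) for j in range(1, D1 + 1)]
EDGES = [("E", i, j) for i, j in product(range(1, D1 + 1), repeat=2)]


def endpoints(edge):
    """The two vertices joined by a correlation question."""
    _, i, j = edge
    return {("A", i), ("B", j)}


def vertex_edge_compatible(vertex, edge):
    """Lemma 1 (i): compatible if and only if the vertex lies on the edge."""
    return vertex in endpoints(edge)


def edge_edge_compatible(edge1, edge2):
    """Lemma 1 (ii): distinct edges are compatible iff they share no vertex."""
    return not (endpoints(edge1) & endpoints(edge2))


def individuals_compatible_with_both(edge1, edge2):
    """Individual questions that are compatible with both correlation questions."""
    return [v for v in VERTICES
            if vertex_edge_compatible(v, edge1) and vertex_edge_compatible(v, edge2)]


Q11, Q22, Q31 = ("E", 1, 1), ("E", 2, 2), ("E", 3, 1)
print(edge_edge_compatible(Q11, Q22))              # True
print(edge_edge_compatible(Q11, Q31))              # False
print(individuals_compatible_with_both(Q11, Q22))  # []
```

The empty list in the last line is the graph version of the entanglement argument: once O knows the answers to two compatible correlation questions, no individual question remains compatible with both.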

For systems with limited information content, entanglement is therefore a direct consequence of complementarity; without it there would be no independent and compatible composite questions sufficient to saturate the information limit [1]. For instance, two classical bits satisfy Rule 1, as well, but admit no complementarity so that

Q_{M_{2}}^{cbit} = {Q_{1}, Q_{1}^{'}, Q_{11}}

and the maximum amount of

N = 2

independent bits cannot be spent on composite questions only.

We also note that Rules 1 and 2 offer a simple, intuitive explanation for monogamy of entanglement. Consider, for a moment,

N = 3

elementary systems

S_{A}, S_{B}, S_{C}

, and suppose

S_{A}

and

S_{B}

are maximally entangled (say, because O received the answer

Q_{11} = Q_{22} =

‘yes’ from

S_{A B}

). Noting that

S_{A B}

is a composite bipartite system inside the tripartite

S_{A B C}

, O has then already spent his maximal amount of information of

N = 2

independent bits, which he may know about

S_{A B}

and can therefore not know anything else that is independent, including non-trivial correlations with

S_{C}

, about the pair. To saturate the

N = 3

independent bit limit for the tripartite system

S_{A B C}

, he may then only inquire about individual information about

S_{C}

. This is monogamy in its extreme form: the maximally entangled pair

S_{A B}

cannot be entangled with any other system

S_{C}

. This heuristic argument can be made rigorous in terms of the compatibility and independence structure of questions for

N \geq 3

and can be extended to the non-extremal case using informational monogamy inequalities [1].

4.3. A Logical Explanation for the Three-Dimensionality of the Bloch Ball

A key result of the reconstruction, proven in [1], is the following. Since its proof is instructive and representative of this approach, we shall rephrase it here.

Theorem 1.

D_{1} = 2

or 3.

Proof.

Consider the

N = 2

case. Lemma 1 implies that any maximal set of pairwise compatible correlation questions has

D_{1}

elements. Indeed, there are maximally

D_{1}

non-intersecting edges between the

D_{1}

vertices of System 1 and the

D_{1}

vertices of System 2; e.g., the

D_{1}

‘diagonal’

Q_{i i}

:

are pairwise independent and compatible. The constraints on the posterior state update rule in Section 2.4 entail that they are also mutually compatible (Specker’s principle) [1] such that O may simultaneously know the answers to all

D_{1}

Q_{i i}

. Since O may not know more than

N = 2

independent bits (Rule 1), the

D_{1}

Q_{i i}

cannot be mutually independent if

D_{1} > 2

. Thus, assuming the

Q_{i i}

are of equivalent status, the answers to any pair of them, say

Q_{11}, Q_{22}

, must imply the answers to all others, say

Q_{i i}

,

i = 3, \dots, D_{1}

. Hence,

Q_{j j} = Q_{11} * Q_{22}

,

j \neq 1, 2

, for a connective * that preserves pairwise independence of

Q_{11}, Q_{22}, Q_{j j}

. Reasoning as in (5) implies that either:

\begin{matrix} Q_{j j} = Q_{11} \leftrightarrow Q_{22}, or Q_{j j} = \neg (Q_{11} \leftrightarrow Q_{22}), j = 3, \dots, D_{1} \end{matrix}

(6)

so that for

D_{1} > 3

Q_{j j}

,

j = 3, \dots, D_{1}

could not be pairwise independent. Arguing identically for all other sets of

D_{1}

pairwise independent and compatible

Q_{i j}

, we conclude that

D_{1} \leq 3

. ☐

This theorem has several crucial repercussions. We may already suggestively call

D_{1} = 2

and

D_{1} = 3

the ‘rebit’ (two-level systems over real Hilbert spaces) and ‘qubit’ case, respectively. Reasoning as in (6) shows that the

Q_{i j}

are logically closed under ↔; as demonstrated in [1]:

Theorem 2.

If

D_{1} = 3

, then

Q_{M_{2}} : = {Q_{i}, Q_{j}^{'}, Q_{i j}}_{i, j = 1, 2, 3}

is logically closed under ↔ and, thus, constitutes an informationally complete set for

N = 2

with

D_{2} = 15

.

If

D_{1} = 2

, then

Q_{M_{2}} = {Q_{i}, Q_{j}^{'}, Q_{i j}, Q_{11} \leftrightarrow Q_{22}}_{i, j = 1, 2}

is logically closed under ↔ and, thus, constitutes an informationally complete set for

N = 2

with

D_{2} = 9

. Furthermore,

Q_{11} \leftrightarrow Q_{22}

is complementary to the individual questions

Q_{i}, Q_{j}^{'}

,

i, j = 1, 2

.

Indeed,

D_{2} = 9, 15

are the correct numbers of degrees of freedom for

N = 2

rebits and qubits, respectively. However, since the composite question

Q_{11} \leftrightarrow Q_{22}

is complementary to all individual questions in the rebit case (this is not true in the qubit case!), it is impossible for O to do ensemble state tomography by asking only individual questions

Q_{i}, Q_{j}^{'}

, thereby violating Rule 6. We are left with the qubit case and shall henceforth ignore rebits (for rebits see [1]).

4.4. Ruling out Local Hidden Variables and the Correlation Structure for $N = 2$

Using (6) and repeating the argument leading to it for ‘non-diagonal’

Q_{i j}

show that either:

\begin{matrix} Q_{11} \leftrightarrow Q_{22} = Q_{12} \leftrightarrow Q_{21}, or Q_{11} \leftrightarrow Q_{22} = \neg (Q_{12} \leftrightarrow Q_{21}) . \end{matrix}

(7)

The first case (without relative negation) is the case of classical logic and compatible with local hidden variables for the individual questions

Q_{i}, Q_{j}^{'}

. Namely, note that

Q_{11} \leftrightarrow Q_{22} = Q_{12} \leftrightarrow Q_{21}

can be rewritten in terms of the individuals as:

\begin{matrix} (Q_{1} \leftrightarrow Q_{1}^{'}) \leftrightarrow (Q_{2} \leftrightarrow Q_{2}^{'}) = (Q_{1} \leftrightarrow Q_{2}^{'}) \leftrightarrow (Q_{2} \leftrightarrow Q_{1}^{'}) . \end{matrix}

(8)

Suppose for a moment that

Q_{1}, Q_{1}^{'}, Q_{2}, Q_{2}^{'}

had simultaneous definite values (although not accessible to O). It is easy to convince oneself that any distribution of simultaneous truth values over the

Q_{i}, Q_{j}^{'}

satisfies (8) [1]. In fact, (8) is a classical logical identity and can be argued to follow from classical rules of inference [1]. However, it involves complementary individual questions, thereby violating our premise from Section 2.7 that O may apply classical rules of inference exclusively to mutually compatible questions. This classical case is thus ruled out.

One can check that the second case,

Q_{11} \leftrightarrow Q_{22} = \neg (Q_{12} \leftrightarrow Q_{21})

, does not admit a local hidden variable interpretation, but is consistent with the structure of the theory landscape and rules [1]. Since one of the two cases (7) must be true, we conclude that this second case holds. In fact, for any complementary pairs

Q, Q^{'}

and

Q^{″}, Q^{‴}

such that both Q and

Q^{'}

are compatible with both

Q^{″}, Q^{‴}

, one finds similarly [1]:

\begin{matrix} (Q \leftrightarrow Q^{″}) \leftrightarrow (Q^{'} \leftrightarrow Q^{‴}) = \neg ((Q \leftrightarrow Q^{‴}) \leftrightarrow (Q^{'} \leftrightarrow Q^{″})) . \end{matrix}

(9)

This precludes reasoning classically about the distribution of truth values over O’s questions.
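That (8) is a classical identity, and that its negated counterpart admits no assignment of simultaneous truth values, can be confirmed by enumerating all sixteen assignments. The sketch below is only a bookkeeping aid; the function names are chosen for this example.

```python
from itertools import product


def xnor(a, b):
    """'Are the two answers the same?' with 'yes' = 1 and 'no' = 0."""
    return int(a == b)


def left_side(q1, q1p, q2, q2p):
    # (Q_1 <-> Q_1') <-> (Q_2 <-> Q_2')
    return xnor(xnor(q1, q1p), xnor(q2, q2p))


def right_side(q1, q1p, q2, q2p):
    # (Q_1 <-> Q_2') <-> (Q_2 <-> Q_1')
    return xnor(xnor(q1, q2p), xnor(q2, q1p))


assignments = list(product((0, 1), repeat=4))

# Number of assignments satisfying the classical case (8).
classical_case = sum(left_side(*v) == right_side(*v) for v in assignments)

# Number of assignments satisfying the case with relative negation.
negated_case = sum(left_side(*v) == 1 - right_side(*v) for v in assignments)

print(classical_case, negated_case)  # 16 0
```

All sixteen assignments satisfy (8) and none satisfies the negated relation, which is why the second case in (7) cannot be modelled by simultaneous definite values of the individual questions.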

Equation (9) permits us to unravel the complete correlation structure for

Q_{M_{2}}

. In fact, it turns out that there are two distinct representations of this correlation structure: one corresponding to quantum theory in its standard representation, the other to its ‘mirror’ representation, related by a passive (not a physical) transformation, reassigning

Q_{1} \mapsto \neg Q_{1}

(in quantum theory tantamount to a partial transpose on qubit 1) [1]. The two distinct representations turn out to be physically equivalent, and so, a convention has to be made. Choosing the ‘standard’ case and using (9), one finds that the compatibility and correlation structure of

Q_{M_{2}}

can be represented graphically as in Figure 1. For

Q, Q^{'}, Q^{″}

compatible, we shall henceforth distinguish between:

even correlation:	if $Q = Q^{'} \leftrightarrow Q^{″}$ and
odd correlation:	if $Q = \neg (Q^{'} \leftrightarrow Q^{″})$ .

One can easily check that quantum theory satisfies this correlation structure for projective spin measurements if one replaces

i = 1, 2, 3

by

x, y, z

. For instance,

Q_{11} = Q_{22} =

‘yes’ implies, by Figure 1, the dependent

Q_{33} =

‘no’. In quantum theory, this corresponds to the (unnormalized) Bell state with spin correlation in the x- and y-direction and anti-correlated spins in the z-direction:

\begin{matrix} | x_{+} x_{+} 〉 - | x_{-} x_{-} 〉 = - i | y_{+} y_{+} 〉 + i | y_{-} y_{-} 〉 = | z_{+} z_{-} 〉 + | z_{-} z_{+} 〉 . \end{matrix}
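The quantum side of this correlation structure can be checked numerically. The sketch below assumes the identification of `Q_ij` with the Pauli product σ_i ⊗ σ_j (with ‘yes’ as eigenvalue +1) and the basis ordering |z+ z+〉, |z+ z−〉, |z− z+〉, |z− z−〉; the helper functions are written out in plain Python and are not taken from [1].

```python
from math import sqrt

I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]


def kron(a, b):
    """Kronecker product of two matrices given as nested lists."""
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]


def matmul(a, b):
    """Matrix product of two nested-list matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


def expectation(op, psi):
    """<psi|op|psi> for a normalized state vector psi."""
    n = len(psi)
    return sum(psi[i].conjugate() * op[i][j] * psi[j]
               for i in range(n) for j in range(n)).real


XX, YY, ZZ = kron(SX, SX), kron(SY, SY), kron(SZ, SZ)
XY, YX = kron(SX, SY), kron(SY, SX)

# Odd correlation: (s1 x s1)(s2 x s2) = -(s3 x s3).
odd_triangle = matmul(XX, YY) == [[-z for z in row] for row in ZZ]
# Even correlation: (s1 x s2)(s2 x s1) = +(s3 x s3).
even_triangle = matmul(XY, YX) == ZZ

# Normalized version of |z+ z-> + |z- z+>.
bell = [0, 1 / sqrt(2), 1 / sqrt(2), 0]
correlations = [expectation(op, bell) for op in (XX, YY, ZZ)]
local_values = ([expectation(kron(s, I2), bell) for s in (SX, SY, SZ)]
                + [expectation(kron(I2, s), bell) for s in (SX, SY, SZ)])

print(odd_triangle, even_triangle)
print(correlations)
print(local_values)
```

Both operator identities hold, the three correlations come out as +1, +1 and −1 up to rounding, and all six local expectation values vanish, in line with `Q_{11} = Q_{22} =` ‘yes’, `Q_{33} =` ‘no’ and complete ignorance about the individual questions.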

4.5. Compatibility, Independence and Informational Completeness for Arbitrary N

Consider N elementary systems in the ‘qubit’ (

D_{1} = 3

) case and the XNOR conjunction:

\begin{matrix} Q_{μ_{1} μ_{2} \dots μ_{N}} : = Q_{μ_{1}} \leftrightarrow Q_{μ_{2}} \leftrightarrow \dots \leftrightarrow Q_{μ_{N}} \end{matrix}

(10)

of individual questions, where

μ_{a} = 0, 1, 2, 3

and

Q_{0} : =

‘yes’. The conjunction yields ‘yes’ and ‘no’ if an even and odd number of

Q_{μ_{a}} =

‘no’, respectively, and thus, does not represent “are the answers to all

Q_{μ_{a}}

the same?” As shown in [1], these conjunctions are informationally complete:

Theorem 3.

(Qubits) The

4^{N} - 1

questions

Q_{μ_{1} \dots μ_{N}}

,

μ_{a} = 0, 1, 2, 3

(we deduct the trivial question

Q_{000 \dots 000}

), are pairwise independent and logically closed under ↔ and, thus, form an informationally complete set

Q_{M_{N}}

with

D_{N} = 4^{N} - 1

. Moreover,

Q_{μ_{1} \dots μ_{N}}

and

Q_{ν_{1} \dots ν_{N}}

are compatible if they differ by an even number (including zero) of non-zero indices and complementary otherwise.

We note that an N-qubit density matrix has precisely

4^{N} - 1

degrees of freedom.
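The counting and the compatibility rule of Theorem 3 can be compared with the commutation of Pauli strings for small N. The sketch below makes two assumptions: questions are identified with tensor products of Pauli matrices (as stated for quantum theory in Section 4.11), and “differ by an even number of non-zero indices” is read as counting the positions where both labels are non-zero and unequal. Function names are illustrative.

```python
from itertools import product

PAULI = {
    0: [[1, 0], [0, 1]],
    1: [[0, 1], [1, 0]],
    2: [[0, -1j], [1j, 0]],
    3: [[1, 0], [0, -1]],
}


def kron(a, b):
    """Kronecker product of two matrices given as nested lists."""
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]


def matmul(a, b):
    """Matrix product of two nested-list matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


def pauli_string(indices):
    """Tensor product of the Pauli matrices labelled by the index tuple."""
    op = [[1]]
    for mu in indices:
        op = kron(op, PAULI[mu])
    return op


def compatible_by_rule(mu, nu):
    """Assumed reading of Theorem 3: even number of positions where both
    indices are non-zero and different."""
    differing = sum(1 for a, b in zip(mu, nu) if a != 0 and b != 0 and a != b)
    return differing % 2 == 0


def check(n):
    """Return (number of questions, rule agrees with commutation?) for n qubits."""
    labels = [mu for mu in product(range(4), repeat=n) if any(mu)]
    ops = {mu: pauli_string(mu) for mu in labels}
    agree = all(
        compatible_by_rule(mu, nu) == (matmul(ops[mu], ops[nu]) == matmul(ops[nu], ops[mu]))
        for mu in labels for nu in labels)
    return len(labels), agree


def complementary_counts(n):
    """For each question, the number of questions complementary to it."""
    labels = [mu for mu in product(range(4), repeat=n) if any(mu)]
    return [sum(not compatible_by_rule(mu, nu) for nu in labels) for mu in labels]


print(check(1), check(2))
```

For N = 1 and N = 2 this yields 3 and 15 questions, respectively, with the rule agreeing with commutation in every case; `complementary_counts(2)` returns 8 for each of the 15 questions.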

4.6. Linear, Reversible Time Evolution and a Quadratic Information Measure

Thus far, the summarized results invoked only Rules 1 and 2 (and in one instance, Rule 6). Rules 3 and 4, on the other hand, can be demonstrated to entail a linear and reversible evolution of the generalized Bloch vector

R^{4^{N} - 1} ∋ \vec{r} = 2 \vec{y} - \vec{1}

that already appeared in (4),

\begin{matrix} \vec{r} (Δ t + t_{0}) = T (Δ t) \vec{r} (t_{0}), \end{matrix}

(11)

where

T (Δ t) \subset T_{N}

defines a one-parameter matrix group [1]. Suppose

T (Δ t), T^{'} (Δ t^{'}) \in T_{N}

correspond to two distinct interactions to which O may subject

S_{N}

. By Rule 4,

T (Δ t) \cdot T^{'} (Δ t^{'})

must likewise be contained in

T_{N}

, and since both

T, T^{'}

are invertible, also the entire set

T_{N}

must be a group. We shall henceforth often represent states with Bloch vectors

\vec{r}

.

Rules 3 and 4, together with elementary operational conditions on the information measure, enforce it to be quadratic

α (y_{i}) = {(2 y_{i} - 1)}^{2}

so that O’s total information (1):

\begin{matrix} I_{N} (\vec{y}) = \sum_{i = 1}^{4^{N} - 1} {(2 y_{i} - 1)}^{2} = {| \vec{r} |}^{2} \end{matrix}

(12)

is simply the square norm of the Bloch vector [1]. Interestingly, this derivation would not work without the continuity of time evolution (Rule 4). Crucially, (12) is not the Shannon entropy (see [1] for a discussion about why the Shannon entropy is also conceptually not suitable for quantifying O’s information). This reconstruction thereby corroborates an earlier proposal for a quadratic information measure for quantum theory by Brukner and Zeilinger [13,15,16].

This quadratic information measure becomes key for the remaining steps of the reconstruction. Given that (12) is a ‘conserved charge’ of time evolution (Rule 3), we can already infer that

T_{N} \subset SO (4^{N} - 1)

because time evolution must be connected to the identity.

4.7. Pure and Mixed States

Suppose O knows

S_{N}

’s answers to N mutually compatible questions from

Q_{M_{N}}

, thereby saturating the information limit of N independent bits (Rule 1). He will then also know the answers to each of their bipartite, tripartite, ..., and N-partite XNOR conjunctions which, by Theorem 3, are also in

Q_{M_{N}}

(and compatible). In total, he then knows the answers to:

\begin{matrix} \binom{N}{1} + \binom{N}{2} + \dots + \binom{N}{N} = \sum_{i = 1}^{N} \binom{N}{i} = 2^{N} - 1 \end{matrix}

questions from

Q_{M_{N}}

. Thus, O’s total information (12) is

2^{N} - 1

bits in this case. It contains dependent bits of information because the questions in

Q_{M_{N}}

are pairwise, but not all mutually independent. Thanks to Rule 3, this is invariant under time evolution.

This allows us to distinguish two kinds of states [1];

\vec{y}

is called a:

pure state: if it is a state of maximal information and, hence, of maximal length:

$\begin{matrix} I_{N} (\vec{y}) = \sum_{i = 1}^{4^{N} - 1} {(2 y_{i} - 1)}^{2} = (2^{N} - 1) bits, \end{matrix}$

(13)
mixed state: if it is a state of non-maximal information,

$\begin{matrix} 0 bit \leq I_{N} (\vec{y}) = \sum_{i = 1}^{4^{N} - 1} {(2 y_{i} - 1)}^{2} < (2^{N} - 1) bits . \end{matrix}$

(14)

The square length of the Bloch vector thus corresponds to the number of answered questions. The state of no information

\vec{y} = \frac{1}{2} \vec{1}

has length zero bits.

As can be easily checked, quantum theory satisfies this characterization. In particular, an N-qubit density matrix, corresponding to a pure state, has a Bloch vector with square norm equal to

2^{N} - 1

. This peculiar mathematical fact now has a clear informational interpretation.
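This characterization can be checked directly for one and two qubits. The sketch below computes the Bloch vector components as expectation values of Pauli strings and sums their squares as in (12); the example states, the basis ordering and all helper names are assumptions of this illustration.

```python
from itertools import product
from math import sqrt

PAULI = {
    0: [[1, 0], [0, 1]],
    1: [[0, 1], [1, 0]],
    2: [[0, -1j], [1j, 0]],
    3: [[1, 0], [0, -1]],
}


def kron(a, b):
    """Kronecker product of two matrices given as nested lists."""
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]


def pauli_string(indices):
    """Tensor product of the Pauli matrices labelled by the index tuple."""
    op = [[1]]
    for mu in indices:
        op = kron(op, PAULI[mu])
    return op


def bloch_vector(psi, n):
    """All 4^n - 1 expectation values <psi|sigma_mu|psi> of an n-qubit pure state."""
    dim = len(psi)
    vec = []
    for indices in product(range(4), repeat=n):
        if any(indices):  # skip the trivial all-identity string
            op = pauli_string(indices)
            value = sum(psi[i].conjugate() * op[i][j] * psi[j]
                        for i in range(dim) for j in range(dim))
            vec.append(value.real)
    return vec


def information(r):
    """Quadratic information measure (12): squared length of the Bloch vector."""
    return sum(x * x for x in r)


product_state = [1, 0, 0, 0]
bell_state = [0, 1 / sqrt(2), 1 / sqrt(2), 0]
generic_state = [a / sqrt(7) for a in (1, 1j, 2, -1)]

# Bloch vectors are linear in the state, so an equal mixture of two pure
# states has the average of their Bloch vectors.
mixture = [(a + b) / 2 for a, b in zip(bloch_vector([1, 0, 0, 0], 2),
                                       bloch_vector([0, 0, 0, 1], 2))]

for state in (product_state, bell_state, generic_state):
    print(information(bloch_vector(state, 2)))
print(information(mixture))
print(information(bloch_vector([1, 0], 1)))
```

The three pure two-qubit states each carry 3 bits up to rounding, the equal mixture of the first and last basis states carries 1 bit, and the single-qubit pure state carries 1 bit.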

4.8. The Bloch Ball and Unitary Group for a Single Qubit from a Conserved Informational Charge

Since

D_{1} = 3

(cf. Section 4.3), we have that

Q_{M_{1}} = {Q_{1}, Q_{2}, Q_{3}}

is a maximal set of mutually complementary questions, i.e., no further

Q \in Q_{1}

can be added to

Q_{M_{1}}

without destroying mutual complementarity in the set (cf. Section 4.1). According to (13), a pure state satisfies:

\begin{matrix} I_{N = 1} (\vec{y}) = r_{1}^{2} + r_{2}^{2} + r_{3}^{2} = {(2 y_{1} - 1)}^{2} + {(2 y_{2} - 1)}^{2} + {(2 y_{3} - 1)}^{2} = 1 bit . \end{matrix}

(15)

For later, we thus observe: for pure states, the maximal mutually complementary set carries exactly 1 bit of information, and this is a conserved charge of time evolution (Rule 3).

Rule 1 implies that, e.g., the pure state

{\vec{y}}_{*} = (1, 0, 0)

exists in

Σ_{1}

, and we know

T_{1} \subset SO (3)

. However, it is clear that applying any

T \in SO (3)

to

{\vec{y}}_{*}

, according to (11), yields only states that are also compatible with all Rules 1–3 (and the landscape). Hence, by Rule 4, we must actually have

T_{1} = SO (3) ≃ PSU (2)

. Clearly,

T_{1}

then generates all quantum pure states from

{\vec{y}}_{*}

, i.e., it yields the entire Bloch sphere (the image of any legal state under a legal time evolution is also a legal state). Recalling that

Σ_{1}

is convex, we obtain that

Σ_{1} = B^{3} ≃ convex hull of {CP}^{1}

is the entire unit Bloch ball with mixed states (14) lying inside; the completely mixed state equals the state of no information at the center.

Σ_{1}, T_{1}

coincide exactly with the set of density matrices

ρ = \frac{1}{2} (1 + \vec{r} \cdot \vec{σ})

and the set of unitary transformations

ρ \mapsto U ρ U^{†}

,

U \in SU (2)

, respectively, for a single qubit in its adjoint (i.e., Bloch vector) representation, where

\vec{σ} = (σ_{1}, σ_{2}, σ_{3})

is the vector of Pauli matrices. Finally, from the assumptions in Section 2.8 and Rule 5, it is also clear that

Q_{1} = {\vec{q} \in R^{3} | | \vec{q} |^{2} = 1 bit} ≃ {CP}^{1}

. This coincides with the set of projectors

P_{\vec{q}} = \frac{1}{2} (1 + \vec{q} \cdot \vec{σ})

onto the

+ 1

eigenspaces of the Pauli operators

\vec{q} \cdot \vec{σ}

. Noting that:

\begin{matrix} Tr (ρ P_{\vec{q}}) = \frac{1}{2} (1 + \vec{r} \cdot \vec{q}) \equiv Y (Q | \vec{y}) \end{matrix}

(16)

we also recover that (4) yields the Born rule for projective measurements. We thus have the claim of Section 3 for

N = 1

(for details see [1,2]).
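Relation (16) is easy to confirm numerically for a single qubit. The following sketch builds the density matrix and the projector from their Bloch vectors and compares the trace with the Bloch-vector formula; the example vectors and function names are chosen for illustration.

```python
SIGMA = (
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
)
IDENTITY = [[1, 0], [0, 1]]


def half_one_plus(v):
    """Return (1 + v . sigma) / 2 as a 2x2 nested list."""
    return [[(IDENTITY[a][b] + sum(v[k] * SIGMA[k][a][b] for k in range(3))) / 2
             for b in range(2)] for a in range(2)]


def trace_of_product(a, b):
    """Tr(a b) for 2x2 matrices."""
    return sum(a[i][k] * b[k][i] for i in range(2) for k in range(2))


def born_probability(r, q):
    """Tr(rho P_q) with rho and P_q built from the Bloch vectors r and q."""
    return trace_of_product(half_one_plus(r), half_one_plus(q)).real


def bloch_formula(r, q):
    """(1 + r . q) / 2, the right-hand side of (16)."""
    return (1 + sum(x * y for x, y in zip(r, q))) / 2


r = (0.3, -0.2, 0.5)   # a mixed state inside the Bloch ball
q = (0.0, 0.6, 0.8)    # a unit vector labelling a question
print(born_probability(r, q), bloch_formula(r, q))
```

Both expressions evaluate to 0.64 up to rounding for this choice; for `q = r` on the Bloch sphere the probability is 1, and for `q = -r` it is 0.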

4.9. Unitary Group and Density Matrices for Two Qubits from Conserved Informational Charges

Also for

N = 2

, it is rewarding to consider maximal mutually complementary sets within

Q_{M_{2}}

. Using Lemma 1, one can check that there are exactly six maximal complementarity sets containing five questions and twenty containing three [2]; e.g., two graphical representatives are: [inline figure of two maximal complementarity sets, not reproduced]

The six maximal complementarity sets of five elements can be represented as a lattice of pentagons; see Figure 2 (which also contains four green triangles, each representing one of the twenty maximal complementarity sets of three questions) [2].

Each of these sets has to satisfy the complementarity inequalities (2); specifically

0 bits \leq I ({Pent}_{a}) : = \sum_{i \in {Pent}_{a}} r_{i}^{2} \leq 1 bit

for the information carried by the five questions in pentagon a. Since any

Q \in Q_{M_{2}}

is contained in precisely two pentagons (cf. Figure 2), we find:

\begin{matrix} \sum_{a = 1}^{6} I ({Pent}_{a}) = 2 (\sum_{i = 1, 2, 3} (r_{i_{1}}^{2} + r_{i_{2}}^{2}) + \sum_{i, j = 1, 2, 3} r_{i j}^{2}) = 2 I_{N = 2} (\vec{r}) . \end{matrix}

(17)

Since, for pure states,

I_{N = 2} ({\vec{r}}_{pure}) = 3

bits, the six pentagon terms in (17) sum to 6 bits; as each of them is bounded by 1 bit, this produces the pentagon equalities [2]:

\begin{matrix} pure states : I ({Pent}_{a}) \equiv 1 bit, a = 1, \dots, 6 . \end{matrix}

(18)

Any pure state must satisfy (18), and

T_{2}

evolves pure states to pure states (Rule 3). Hence, in analogy to

N = 1

: for pure states, these six maximal mutually complementary sets carry exactly one bit of information, and these are six conserved charges of time evolution. There are further interesting constraints on the distribution of O’s information over

Q_{M_{2}}

[2].

It can be straightforwardly checked that quantum theory actually satisfies (18). Indeed, in the case of quantum theory, the identity for

{Pent}_{1}

reads in more familiar language (pure states):

\begin{matrix} I ({Pent}_{1}) = {〈 σ_{2} \otimes 1 〉}^{2} + {〈 σ_{3} \otimes 1 〉}^{2} + {〈 σ_{1} \otimes σ_{1} 〉}^{2} + {〈 σ_{1} \otimes σ_{2} 〉}^{2} + {〈 σ_{1} \otimes σ_{3} 〉}^{2} = 1, \end{matrix}

etc. Remarkably, these identities of quantum theory seem not to have been reported before in the literature. These novel conserved informational charges are a prediction of our reconstruction, underscoring the benefits of taking this informational approach. Additionally, these informational charges are indispensable for deriving the unitary group and the state space, as we shall now see.
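The pentagon equalities can be tested numerically on example states. In the sketch below, each pentagon consists of two individual questions of one system together with the three correlation questions carrying the remaining index of that system; the first entry reproduces the operators listed for `{Pent}_{1}` above, while the ordering of the other five pentagons and the example state are assumptions of this illustration.

```python
from math import sqrt

PAULI = {
    0: [[1, 0], [0, 1]],
    1: [[0, 1], [1, 0]],
    2: [[0, -1j], [1j, 0]],
    3: [[1, 0], [0, -1]],
}


def kron(a, b):
    """Kronecker product of two matrices given as nested lists."""
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]


def expectation(indices, psi):
    """<psi| sigma_mu1 x sigma_mu2 |psi> for a normalized two-qubit state."""
    op = kron(PAULI[indices[0]], PAULI[indices[1]])
    return sum(psi[i].conjugate() * op[i][j] * psi[j]
               for i in range(4) for j in range(4)).real


def pentagons():
    """Six five-element sets; (i, 0) is Q_i, (0, j) is Q_j', (i, j) is Q_ij."""
    pents = []
    for c in (1, 2, 3):
        others = [a for a in (1, 2, 3) if a != c]
        pents.append([(a, 0) for a in others] + [(c, j) for j in (1, 2, 3)])
        pents.append([(0, a) for a in others] + [(j, c) for j in (1, 2, 3)])
    return pents


def pentagon_information(psi):
    """Sum of squared expectation values within each pentagon."""
    return [sum(expectation(q, psi) ** 2 for q in pent) for pent in pentagons()]


generic_state = [a / sqrt(7) for a in (1, 1j, 2, -1)]
bell_state = [0, 1 / sqrt(2), 1 / sqrt(2), 0]

print(pentagon_information(generic_state))
print(pentagon_information(bell_state))
```

For both pure states, all six sums equal 1 up to rounding, so the pentagon terms add up to 6 bits as required by (17).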

Using that

I ({Pent}_{a} (\vec{r}))

is conserved under

T_{2} \subset SO (15)

entails (with new index

i = 1, \dots, 15

):

\begin{matrix} \sum_{i \in {Pent}_{a}, 1 \leq j \leq 15} r_{i} G_{i j} r_{j} = 0, a = 1, \dots 6, \end{matrix}

(19)

where

T (Δ t) = \exp (Δ t G)

for

G \in s o (15)

[2]. The correlation structure of Figure 1 enforces [2]:

\begin{matrix} G_{i j} = 0, whenever Q_{i}, Q_{j} are compatible . \end{matrix}

(20)

Each of the 15

Q_{i} \in Q_{M_{2}}

is complementary to eight others, and since

G_{i j} = - G_{j i}

, there could be maximally 60 linearly independent

G_{i j}

of

T_{2}

.

These are constructed as follows. For every pair of pentagons, there is a unique information swap transformation that preserves (18). For instance, the red arrows in Figure 2 represent the complete information swap between pentagons

{Pent}_{1}

and

{Pent}_{2}

(⟷ is not the XNOR):

r_{2}^{2} ⟷ r_{31}^{2} ({Pent}_{5}), r_{3}^{2} ⟷ r_{21}^{2} ({Pent}_{3}), r_{12}^{2} ⟷ {r_{3}^{'}}^{2} ({Pent}_{4}), r_{13}^{2} ⟷ r_{2}^{' 2} ({Pent}_{6})

(21)

that keeps all other components fixed. The pentagon equalities (18) are preserved because every swap in (21) occurs within a pentagon. The correlation structure of Figure 1 fixes the corresponding generator to [2]:

\begin{matrix} G_{i j}^{{Pent}_{1}, {Pent}_{2}} = δ_{i 2} δ_{j (31)} - δ_{i 3} δ_{j (21)} + δ_{i (12)} δ_{j 3^{'}} - δ_{i (13)} δ_{j 2^{'}} - (i ⟷ j) . \end{matrix}

(22)

One can repeat the argument for all 15 pentagon pairs, producing 15 linearly independent generators [2]. Remarkably, they turn out to coincide exactly with the adjoint representation of the 15 fundamental generators of

SU (4)

[2]. In particular, (22) is the generator of entangling unitaries leaving

r_{11}

invariant. The other 45 independent generators satisfying (20) are ruled out by the correlation structure so that

T_{2}

cannot be generated by anything else than these 15 pentagon swaps [2]. One can show that the exponentiation of (linear combinations of) these 15 pentagon swaps generates

PSU (4)

and that this group abides by all rules and forms a maximal subgroup of

SO (15)

[2]. Rule 4 then implies

T_{2} ≃ PSU (4)

, which is the correct set of unitary transformations

ρ \mapsto U ρ U^{†}

,

U \in SU (4)

, for two qubits.

It turns out that the set of Bloch vectors satisfying all six pentagon equalities (18) and the conservation equations (19) for the 15 pentagon swaps splits into two sets on each of which

T_{2} = PSU (4)

acts transitively [2]. These two sets correspond precisely to the two possible conventions of building up composite questions either using the XNOR or XOR (cf. Section 4.1) and are therefore physically equivalent. Adhering to the XNOR convention, we conclude that the surviving set of Bloch vectors solving (18) and (19) is the set of

N = 2

states admitted by the rules. Indeed, it coincides exactly with the set of quantum pure states, which forms a

{CP}^{3}

of which

PSU (4)

is the isometry group [2]. Employing convexity of

Σ_{2}

, one finally finds:

\begin{matrix} Σ_{2} = closed convex hull of C P^{3}, \end{matrix}

which is exactly the set of normalized

4 \times 4

density matrices over

C^{2} \otimes C^{2}

.

Concluding, the new conserved informational charges (18), in analogy to (15) for

N = 1

, define both the unitary group and the set of states for two qubits (for neglected details, see [2]).

4.10. Unitaries and States for $N > 2$ Elementary Systems

According to Theorem 3,

Σ_{N}

is $(4^{N} - 1)$-dimensional and

T_{N} \subset SO (4^{N} - 1)

(cf. Section 4.6). The reconstruction of the unitary group uses a universality result from quantum computation: two-qubit unitaries

PSU (4)

(between any pair) and single-qubit unitaries

PSU (2) ≃ SO (3)

generate the full projective unitary group

PSU (2^{N})

for N qubits [17,18]. Given that

S_{N}

is a composite system, all of these bipartite and local unitaries must be in

T_{N}

. One can check that

PSU (2^{N})

again abides by all rules and constitutes a maximal subgroup of

SO (4^{N} - 1)

[2]. Thanks to Rule 4, this yields

T_{N} ≃ PSU (2^{N})

, which coincides with the set of unitary transformations on N-qubit density matrices. In analogy to the previous case, one obtains as the state space:

\begin{matrix} Σ_{N} = closed convex hull of C P^{2^{N} - 1}, \end{matrix}

which agrees with the set of normalized N-qubit density matrices (for details, see [2]).

4.11. Questions as Projective Measurements and the Born Rule

The assumptions in Section 2.8 and Rule 5 yield the following question set characterization [2]:

\begin{matrix} Q_{N} ≃ {\vec{q} \in R^{4^{N} - 1} | Y (\vec{q} | \vec{r}) \in [0, 1] \forall \vec{r} \in Σ_{N} and \vec{q} is a 1 bit quantum state} . \end{matrix}

(23)

As shown in [2], this set is isomorphic to the set of projectors

P_{\vec{q}} = \frac{1}{2} (1 + \vec{q} \cdot \vec{σ})

onto the

+ 1

eigenspaces of the Pauli operators

\vec{q} \cdot \vec{σ} = \sum_{μ_{1} \dots μ_{N}} q_{μ_{1} \dots μ_{N}} σ_{μ_{1} \dots μ_{N}}

, where

σ_{μ_{1} \dots μ_{N}} = σ_{μ_{1}} \otimes \dots \otimes σ_{μ_{N}}

and

σ_{0} = 1

. Noting that

q_{μ_{1} \dots μ_{N}}

corresponds to (10) reveals that the XNOR at the question level corresponds to the tensor product ⊗ at the operator level. One also finds that (16) again holds, such that (4) yields the Born rule for projective measurements for arbitrary N (for the neglected details and many further interesting properties of

Q_{N}

, we refer to [2]).

4.12. The von Neumann Evolution Equation

We thus obtain qubit quantum theory in its adjoint (i.e., Bloch vector) representation. Lastly, we note that

\vec{r} (t) = T (t) \vec{r} (0)

with

T (t) = e^{t G} \in PSU (2^{N})

is equivalent to the adjoint action:

\begin{matrix} ρ (t) = U (t) ρ (0) U^{†} (t), \end{matrix}

(24)

of

U (t) = e^{- i H t} \in SU (2^{N})

for some Hermitian operator H on

C^{2^{N}}

, where

ρ (t) = \frac{1}{2^{N}} (1 + \vec{r} (t) \cdot \vec{σ})

[2]. Equation (24), in turn, is equivalent to

ρ (t)

solving the von Neumann evolution equation:

\begin{matrix} i \frac{\partial ρ}{\partial t} = [H, ρ] . \end{matrix}

(25)

We have therefore also recovered the correct time evolution equation for quantum states.
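For a single qubit, the equivalence between the rotation of the Bloch vector, the adjoint action (24) and the von Neumann equation (25) can be illustrated with a simple example. The sketch below assumes the particular Hamiltonian H = (ω/2) σ_3, for which U(t) is diagonal, and checks (25) by a central finite difference; the frequency, the initial state and the step size are arbitrary choices for this illustration.

```python
import cmath
from math import cos, sin


def density_matrix(r):
    """rho = (1 + r . sigma) / 2 for a single qubit."""
    r1, r2, r3 = r
    return [[(1 + r3) / 2, (r1 - 1j * r2) / 2],
            [(r1 + 1j * r2) / 2, (1 - r3) / 2]]


def bloch_vector(rho):
    """Recover (r1, r2, r3) from a single-qubit density matrix."""
    return (2 * rho[1][0].real, 2 * rho[1][0].imag, (rho[0][0] - rho[1][1]).real)


def evolve(rho, omega, t):
    """rho(t) = U rho U^dagger with U = exp(-i H t) and H = (omega / 2) sigma_3."""
    u = [cmath.exp(-0.5j * omega * t), cmath.exp(0.5j * omega * t)]  # diagonal of U
    return [[u[a] * rho[a][b] * u[b].conjugate() for b in range(2)] for a in range(2)]


def commutator_with_h(rho, omega):
    """[H, rho] for the diagonal H = (omega / 2) sigma_3."""
    h = [omega / 2, -omega / 2]
    return [[(h[a] - h[b]) * rho[a][b] for b in range(2)] for a in range(2)]


def von_neumann_residual(rho0, omega, t, step=1e-5):
    """Largest entry of i d(rho)/dt - [H, rho], using a central difference."""
    plus = evolve(rho0, omega, t + step)
    minus = evolve(rho0, omega, t - step)
    comm = commutator_with_h(evolve(rho0, omega, t), omega)
    return max(abs(1j * (plus[a][b] - minus[a][b]) / (2 * step) - comm[a][b])
               for a in range(2) for b in range(2))


omega, t = 1.3, 0.7
r0 = (0.6, 0.0, 0.8)
rho0 = density_matrix(r0)

rotated = bloch_vector(evolve(rho0, omega, t))
expected = (r0[0] * cos(omega * t) - r0[1] * sin(omega * t),
            r0[0] * sin(omega * t) + r0[1] * cos(omega * t),
            r0[2])

print(rotated, expected)
print(von_neumann_residual(rho0, omega, t))
```

In this example, the adjoint action rotates the Bloch vector about the 3-axis by the angle ωt, which is an element of SO(3), and the finite-difference residual of (25) vanishes up to discretization error.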

5. Conclusions

We have reviewed and summarized the key steps from [1,2] necessary to prove the claim of Section 3. This yields a reconstruction of the explicit formalism of qubit quantum theory from rules constraining an observer’s acquisition of information about a system [1,2]. The derivation corroborates the consistency of interpreting the state as the observer’s ‘catalog of knowledge’ and shows that it is sufficient to speak only about the information accessible to him for reproducing quantum theory. In fact, for qubits, this derivation accomplishes an informational reconstruction of the type proposed in Rovelli’s relational quantum mechanics [11] and in the Brukner-Zeilinger informational interpretation of quantum theory [12,13].

As a key benefit, this reconstruction also provides a novel informational explanation for the architecture of qubit quantum theory. In particular, it explains the logical structure of a basis of spin measurements, the dimensionality and structure of quantum state spaces, the correlation structure and the unitarity of time evolution from the perspective of information acquisition. This unravels previously unknown structural properties: conserved ‘informational charges’ from complementarity relations define and explain the unitary group and the set of pure states.

Acknowledgments

The author thanks Christopher S. P. Wever for an enjoyable collaboration on [2]. The project leading to this publication has received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie Grant Agreement No. 657661.

Conflicts of Interest

The author declares no conflict of interest.

References

1. Höhn, P.A. Toolbox for reconstructing quantum theory from rules on information acquisition. arXiv, 2014; arXiv:1412.8323.
2. Höhn, P.A.; Wever, C.S.P. Quantum theory from questions. Phys. Rev. A 2017, 95, 012102.
3. Hardy, L. Quantum Theory From Five Reasonable Axioms. arXiv, 2001; arXiv:quant-ph/0101012.
4. Dakic, B.; Brukner, C. Quantum Theory and Beyond: Is Entanglement Special? In Deep Beauty; Halvorson, H., Ed.; Cambridge University Press: Cambridge, UK, 2011; p. 365.
5. Masanes, L.; Müller, M.P. A derivation of quantum theory from physical requirements. New J. Phys. 2011, 13, 063001.
6. Chiribella, G.; D’Ariano, G.M.; Perinotti, P. Informational derivation of quantum theory. Phys. Rev. A 2011, 84, 012311.
7. Barnum, H.; Müller, M.P.; Ududec, C. Higher-order interference and single-system postulates characterizing quantum theory. New J. Phys. 2014, 16, 123029.
8. De la Torre, G.; Masanes, L.; Short, A.J.; Müller, M.P. Deriving Quantum Theory from Its Local Structure and Reversibility. Phys. Rev. Lett. 2012, 109, 090403.
9. Goyal, P. From information geometry to quantum theory. New J. Phys. 2010, 12, 023012.
10. Appleby, M.; Fuchs, C.A.; Stacey, B.C.; Zhu, H. Introducing the Qplex: A Novel Arena for Quantum Theory. arXiv, 2016; arXiv:1612.03234.
11. Rovelli, C. Relational quantum mechanics. Int. J. Theor. Phys. 1996, 35, 1637–1678.
12. Zeilinger, A. A Foundational Principle for Quantum Mechanics. Found. Phys. 1999, 29, 631–643.
13. Brukner, C.; Zeilinger, A. Information and fundamental elements of the structure of quantum theory. In Time, Quantum and Information; Castell, L., Ischebeck, O., Eds.; Springer: Berlin/Heidelberg, Germany, 2003.
14. Schrödinger, E. Discussion of Probability Relations between Separated Systems. Math. Proc. Camb. Philos. Soc. 1935, 31, 555–563.
15. Brukner, C.; Zeilinger, A. Operationally Invariant Information in Quantum Measurements. Phys. Rev. Lett. 1999, 83, 3354.
16. Brukner, C.; Zeilinger, A. Conceptual inadequacy of the Shannon information in quantum measurements. Phys. Rev. A 2001, 63, 022113.
17. Bremner, M.J.; Dawson, C.M.; Dodd, J.L.; Gilchrist, A.; Harrow, A.W.; Mortimer, D.; Nielsen, M.A.; Osborne, T.J. Practical Scheme for Quantum Computation with Any Two-Qubit Entangling Gate. Phys. Rev. Lett. 2002, 89, 247902.
18. Harrow, A.W. Exact universality from any entangling gate without inverses. Quant. Inf. Comput. 2009, 9, 773–777.

Figure 1. The compatibility and correlation structure of the informationally complete set

Q_{M_{2}}

for the

N = 2

qubit case. Two questions are compatible if connected by a triangle edge and complementary otherwise. Red and green triangles denote odd and even correlation, respectively; e.g.,

Q_{33} = \neg (Q_{11} \leftrightarrow Q_{22}) = Q_{12} \leftrightarrow Q_{21}

. (Taken from [1].)

Figure 2. The six maximal complementarity sets represented as pentagons. Two questions are complementary if they share a pentagon or are connected by an edge and compatible otherwise. Every pentagon is connected to all of the other five because any $Q \in Q_{M_{2}}$ is contained in precisely two pentagons. The red arrows represent the information swap (21) between Pentagons 1 and 2 that preserves all pentagon equalities (18) and defines the time evolution generator (22). (Figure adapted from [2]. Reprinted with permission from P. Höhn and C. Wever, Phys. Rev. A 95, 012102 (2017). Copyright (2017) by the American Physical Society.)
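The counting stated in the caption of Figure 2 can be reproduced by brute force. This is a sketch under an assumption that is not stated in the caption: the 15 questions of $Q_{M_{2}}$ are identified with the 15 non-trivial two-qubit Pauli products, and complementarity with anticommutation. The labels and the names `QUESTIONS`, `complementary` and `PENTAGONS` are hypothetical, introduced only for illustration.

```python
from itertools import combinations, product

# Single-qubit labels: 0 = identity, 1, 2, 3 = the three Pauli matrices.
# A two-qubit question is a pair (a, b), excluding the trivial pair (0, 0).
QUESTIONS = [(a, b) for a, b in product(range(4), repeat=2) if (a, b) != (0, 0)]

def factors_anticommute(a, b):
    """Two single-qubit labels anticommute iff both are non-identity and distinct."""
    return a != 0 and b != 0 and a != b

def complementary(p, q):
    """Assumed criterion: tensor products anticommute iff an odd number of factors do."""
    return (factors_anticommute(p[0], q[0]) + factors_anticommute(p[1], q[1])) % 2 == 1

# All sets of five mutually complementary questions (the pentagons).
PENTAGONS = [set(s) for s in combinations(QUESTIONS, 5)
             if all(complementary(p, q) for p, q in combinations(s, 2))]

# In how many pentagons does each question appear?
membership = {q: sum(q in s for s in PENTAGONS) for q in QUESTIONS}

# How many questions do two distinct pentagons share?
overlaps = {len(a & b) for a, b in combinations(PENTAGONS, 2)}

print(len(PENTAGONS), set(membership.values()), overlaps)
```

Under this identification the enumeration finds six pentagons, each question lies in exactly two of them, and any two pentagons share exactly one question, which is why every pentagon is connected to all of the other five.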

© 2017 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
