1. Introduction
The experimental characterization of physical processes is an important component of the scientific method. Such a characterization, known as process tomography, is widely adopted in classical [
1] and quantum technologies [
2,
3,
4,
5,
6,
7,
8,
9,
10,
11,
12,
13,
14]. In general, one can regard the in-principle feasibility of process tomography as a requirement for the construction of new physical theories: arguably, a sensible physical theory should describe processes that—at least in principle—can be characterized experimentally.
Operational axioms inspired by process tomography were first proposed by D’Ariano in a sequence of works [
15,
16,
17,
18], where they featured as potential candidates for an axiomatization of quantum theory. In this paper, we will explore the conditions that guarantee the feasibility of process tomography in general physical theories, independently of the quantum axiomatization problem. Specifically, our goal will be to identify physical conditions that guarantee the achievability of process tomography using a finite number of auxiliary systems and a finite number of input states.
To understand the role of auxiliary systems in process tomography, it is useful to consider first a popular approach known as
standard process tomography [
2,
3], where no auxiliary system is used. Standard process tomography aims at characterizing processes by letting them act on a set of input states and by analyzing the output with a set of measurements, as in the following schematic:
Here, an unknown process
(with input system A and output system B) is tested on a set of fiducial states
with a set of fiducial measurements
. We use the boldface notation
to indicate that the
j-th measurement has multiple outcomes, labelled by some index that is not written down explicitly. By choosing a complete set of fiducial states and measurements, one can then identify the action of the process
on its input system—that is, one can uniquely determine the function
that maps a generic input state of system A into the corresponding output state of system B.
In classical and quantum physics, standard process tomography is sufficient to completely characterize physical processes: by determining the function
, one can identify the action of the process
in every possible experiment, even including experiments where the process
acts locally on a part of a composite system. For example, this includes experiments of the form
where
is applied locally on a joint state Ψ of systems A and system C, and a joint measurement
M is performed on systems B and C.
In classical and quantum theory, the outcome probabilities of all experiments of Type (2) are uniquely determined by the outcome probabilities of experiments of Type (1). The origin of this favourable situation is a property, known as
Local Tomography, whereby the states of every composite system can be uniquely identified by performing local measurements on the components [
16,
17,
19,
20,
21,
22,
23,
24,
25,
26,
27]. In quantum foundations, Local Tomography has often been taken as an axiom for the characterization of standard quantum theory (on complex Hilbert spaces) [
21,
25,
28,
29,
30,
31,
32,
33,
34] and as a principle for the construction of new physical theories [
22,
23,
24]. In locally tomographic theories, the action of a process on its input system uniquely determines the action of the process on every composite system [
24]. As a result, standard process tomography provides a complete characterization of physical processes.
The situation is radically different when Local Tomography does not hold. A simple counterexample arises in real-vector-space quantum theory [
35], a variant of standard quantum theory that violates Local Tomography [
19,
20] and deviates from standard quantum theory in a number of operational tasks [
20,
24,
36,
37,
38,
39,
40,
41]. In real-vector-space quantum theory, one can find explicit examples of processes that have the same input-output function, and yet act in completely different ways on composite systems [
42].
When Local Tomography fails, the only way to unambiguously identify a process is to test its action on composite systems, using an approach known as
ancilla-assisted process tomography [
4,
5,
6,
7,
8]. This approach consists in performing experiments of the type of Equation (2). Now, the crucial question is: which auxiliary systems C have to be taken into account? In the worst case, the answer could be “
all possible systems”: the characterization of an unknown physical process may require experiments performed, as it were, on the whole universe. It seems natural to demand, as a basic principle, that physical theories should be free from this pathology. In a sensible physical theory, it should be in principle possible to identify any given process up to any desired level of accuracy using a finite number of experimental settings. This means: testing the process with a finite number of auxiliary systems, preparing a finite number of states, and performing a finite number of measurements, each with a finite number of outcomes.
In this paper, we will focus on the requirements of a finite number of auxiliary systems/finite number of inputs states, and we will consider the case of process tomography with ideally perfect accuracy. We will require that, at least in principle, physical processes can be completely characterized by probing their action with a finite number of auxiliary systems. Note that, in turn, every
finite set of auxiliary systems
can be replaced without loss of generality by a
single auxiliary system R, for example by merging all the systems of the finite set into a composite system. Indeed, most of the frameworks for general probabilistic theories include a notion of system composition, denoted by ⊗ and corresponding to a generalization of the tensor product in quantum theory [
21,
22,
23,
24,
27,
28,
34,
42,
43,
44,
45,
46,
47,
48]. As long as the number of components is finite, the composite system is well defined in all these frameworks. At the mathematical level, the composition ⊗ is defined as the tensor product in a monoidal category describing physical systems and processes between them [
49,
50,
51,
52]. When monoidal categories are used as a framework for general physical theories, they are often called
process theories [
53].
These observations motivate the following requirement:
Principle 1 (Dynamically Faithful Systems). For every pair of systems A and B, there exists an auxiliary system R such that all processes with input A and output B can be completely characterized by their action on the states of the composite system A ⊗ R.
We will also consider a slightly stronger requirement, inspired by D’Ariano’s early axiomatization works [
15,
16,
17,
18]. The requirement stipulates the achievability of process tomography with a
single state of a suitable composite system.
Principle 2 (Dynamically Faithful States). For every pair of systems A and B, there exists an auxiliary system R such that all processes with input A and output B can be completely characterized by their action on a single state of the composite system A ⊗ R.
Dynamically Faithful Systems/Dynamically Faithful States are strictly weaker requirements than Local Tomography. For example, real-vector-space quantum theory violates Local Tomography but satisfies Dynamically Faithful States (and therefore Dynamically Faithful Systems): every real-vector-space process with input A can be uniquely characterized by its action on an entangled state of the composite system A ⊗ A [
24]. It is also worth noting that Dynamically Faithful Systems/Dynamically Faithful States do not imply finite dimensionality: for example, all the processes on a quantum system with separable Hilbert space can be characterized by preparing a single bipartite state [
6]. Nevertheless, in this paper we will be mostly concerned with finite dimensional systems for simplicity of presentation. Note that we do not make any assumption on the size (however defined) of the system R. A further strengthening of our requirements would by to demand that the size of system R be bounded in terms of the sizes of systems A and B. For example, one could demand that R be embeddable into A ⊗ B, or that R be operationally equivalent to system A, as it happens in quantum theory and in classical probability theory.
The core result of the paper is a mathematical characterization of the theories satisfying Dynamically Faithful States. The characterization is valid for a broad class of physical theories—so broad that, in fact, it even includes non-causal theories where the choices of experiments performed in the future can affect the outcome probabilities of experiments performed in the past. After providing the mathematical characterization, we identify physical conditions that guarantee Dynamically Faithful States. Specifically, we show that Dynamically Faithful States holds in all theories where all correlations of a given physical system with its environment can be probabilistically generated by local operations on a single “universally correlated state”. We call this property Universal Extension. Two important classes of theories satisfying Universal Extension are (1) the theories allowing for conclusive teleportation [
49,
54], and (2) the theories satisfying three requirements of Causality, Pure Product States, and Purification. Informally, Causality is the requirement that the outcome probabilities of present experiments are independent on the choice of future experiments [
24,
26,
27,
28,
42,
46,
53,
55,
56]. Pure Product States is the requirement that if the parts of a composite system are in pure states, then the whole composite system is in a pure state. Purification is the requirement that every mixed state can be extended to a pure state, and that such extension is unique up to local symmetry transformations [
24,
26,
27,
28,
42,
46,
57,
58]. Remarkably, the symmetry property of purifications guarantees that every purification is a universal extension, and therefore every theory with purification has Dynamically Faithful States. Our main results can be summarized by the following logical implications:
Conclusive Teleportation ⟹ Universal Extension;
Causality, Pure Product States, and Purification ⟹ Universal Extension;
Universal Extension ⟹ Dynamically Faithful States ⟹ Dynamically Faithful Systems.
The overall conclusion of our results is that Universal Extension guarantees the in-principle characterizability of physical processes, without invoking any assumption of Local Tomography.
2. Materials and Methods
In this section, we briefly introduce the framework and the notation used in the paper.
2.1. Operational-Probabilistic Theories
A major trend in quantum foundations is the study of information-processing tasks in a broad class of general probabilistic theories, which include quantum theory as a special case [
21,
22,
23,
24,
27,
28,
42,
43,
44,
45,
46,
47]. A convenient approach to general probabilistic theories is provided by the framework of
operational-probabilistic theories (OPTs) [
24,
27,
28,
42,
44,
45,
46,
47,
48,
59]. The framework consists of two distinct conceptual ingredients: an operational structure, describing circuits that produce outcomes, and a probabilistic structure, which assigns probabilities to the outcomes. The operational structure is inspired by the approach of categorical quantum mechanics [
49,
50,
51,
52] (see also [
53] for a recent presentation), and follows it rather closely, although there are a few relevant differences in the way classical outcomes are treated [
48,
59].
The OPT framework describes a set of experiments that can be performed on a given set of systems with a given set of physical processes. The framework is based on a primitive notion of composition, whereby every pair of physical systems A and B can be combined into a
composite system A ⊗ B. The set of all physical systems is closed under composition, and will be denoted by
. Physical processes can be combined in sequence or in parallel to build circuits, such as
In this example, A,
, and
are
systems,
is a
state of the composite system A ⊗B,
,
and
are
transformations (a.k.a.
processes),
a and
b are
measurements. We use boldface fonts
a and
b for measurements in order to indicate that they generally have multiple outcomes. When necessary, we will explicitly write
and
, where
x and
y are measurement outcomes, and
and
are the sets of possible outcomes of the two measurements, respectively. Here
and
play the same role as the linear operators associated to the outcomes of quantum measurements. Following a traditional terminology dating back to Ludwig [
60], we will call them
effects.
Among the physical systems, every OPT includes a trivial system, denoted by I and corresponding to the degrees of freedom ignored by theory. States (resp. effects) are transformations with the trivial system as input (resp. output).
Transformations from the trivial system to itself are called
scalars. Physically, they are associated to circuits with no external wires, like the circuit in Equation (3). In the OPT framework, scalars are typically identified with numerical probabilities, valued in the interval
. More generally, other identifications are also possible, including signed probabilities or possibilities, as shown in the work of Gogioso and Scandolo [
48]. In the following, we will not make any assumption on the nature of the scalars, although we will often call them “probabilities” to facilitate the connection to the existing literature in quantum foundations.
For generic systems A and B, we denote by:
the set of transformations from A to B, also called transformations of type A → B;
the set of states of system A;
the set of effects on system A;
(or , for short) the sequential composition of two transformations and , with the input of matching the output of ;
the parallel composition of the transformations and ;
the identity transformation on system A;
the identity transformation on the trivial system.
Definition 1. We say that two transformations and are operationally indistinguishable [
24]
if they give rise to the same probabilities in all possible experiments, namely In this paper, we will focus our attention on
quotient theories [
42,
46], where physical transformations that are operationally indistinguishable are identified: in other words, we will assume that Condition (4) implies the equality
When the scalars are positive real numbers, the set of transformations
in the quotient theory can be regarded as elements of an ordered vector space [
24,
42,
46,
61].
Every transformation takes place in a test, that is, a non-deterministic process that generates an outcome out of a set of possible outcomes. Mathematically, a test of type A → B, with outcomes in , is a list of transformations of type A → B indexed by elements of the outcome set . We denote such a list by , with for every .
Tests of type I → A are called preparation tests, or sources. A preparation test describes the non-deterministic preparation of the state , heralded by an outcome . In quantum theory, the states in a preparation test are subnormalized density matrices, and the trace of each subnormalized density matrix is interpreted as the probability of the corresponding preparation.
Tests of type A → I are called measurements. A measurement is a collection of effects labelled by measurement outcomes.
A transformation is deterministic if it is part of a test with a single outcome. By definition, deterministic transformations can be performed deterministically by setting up the corresponding test. We use the notations:
for the set of deterministic transformations of type A → B;
for the set of deterministic states of system A;
for the deterministic effects of system A.
For the trivial system I, we assume that there exists only one deterministic transformation of type I → I, namely the identity transformation 1. This assumption was introduced by Coecke in Ref. [
62], motivated by the interpretation of the scalars as “probabilities”, which suggests that there should be only one deterministic scalar, corresponding to the notion of certainty.
We assume that the set of tests is closed under
coarse-graining, the operation of joining together two or more outcomes. Given a test
and a partition of the outcome set
into disjoint subsets
, we define the
coarse-grained test as the test with transformations
At this level, the symbol of sum is just a notation for the operation of coarse-graining: the only mathematical requirements on the coarse-graining operation is that it distributes over parallel and sequential composition, namely
for every transformation
,
for every transformation
with input matching the output of
, and
for every transformation
with output matching the input of
. Closely related notions of coarse-graining were proposed in the works of Tull [
59], Gogioso and Scandolo [
48], where Equations (
7)–(
9) were assumed together with a few additional requirements.
Since there is only one deterministic transformation of type I → I (the scalar 1), every test
of type I → I satisfies the normalization condition
This condition is the analogue of the normalization of a probability distribution. For this reason, in the following we will call the tests of type I → I “probability distributions”, even though in general the scalars may not be real numbers in the interval .
When the scalars are real numbers, the sum notation used above is consistent with the notion of sum in the ordered vector space containing the physical transformations in the quotient theory [
24,
42,
46,
61] (cf. Definition 1 and comments below it).
2.2. Framework Assumptions
In this paper, we will make four basic assumptions that are shared by most probabilistic theories in the literature. We spell out the assumptions explicitly because they will play a significant role in our results, and it is convenient to keep track of which assumption is used in which result. For example, we will write “Lemma 1 (A1, A2, A3)” to state that that Lemma 1 follows from Assumptions 1, 2, and 3.
Our first assumption is that all non-deterministic tests arise from deterministic processes followed by measurements. This assumption is compatible with the idea that the outcomes can be read-out from some physical system, like the display of a device, and that such system is also described by the theory. The readout process is then realized as a deterministic transformation, followed by a measurement on the display:
Assumption 1 (Displays [
46]).
Every test arises from a deterministic transformation followed by a measurement on one of the output systems.
Formally, Assumption 1 is that every test
of type A → B can be realized as:
for some system C, some deterministic transformation
, and some measurement
.
Our second assumption is that the set of systems described by the theory includes some “coin”, that is, some physical system that generates random outcomes. In the familiar setting where probabilities are numbers in the interval , the assumption is that there exists some two-outcome experiment whose outcome probabilities are both non-zero. In the general case, an analogue of non-zero probability is a cancellative scalar: a scalar s is cancellative if the condition implies for every pair of scalars p and .
Assumption 2 (Coins). There exist two-outcome experiments whose outcome probabilities are both cancellative.
We stress that we do not assume the existence of coins with arbitrary biases: even if the scalars are real numbers, we do not assume that they are the whole interval .
An important property of cancellative scalars is that they cancel out in equations involving arbitrary transformations:
Lemma 1. For a cancellative scalar s, the condition implies , for every pair of processes and of the same type.
Proof. Given two processes
and
of type A → B, the condition
implies
which in turn implies
because
s is cancellative. Hence, the processes
and
are operationally indistinguishable, in the sense of Definition 1. Since we are dealing with a quotient theory, this condition implies
. □
Our third assumption is that it is possible to perform randomized tests. Informally, a randomized test is a test where one tosses a coin and performs a test depending on the outcome of the coin toss. Here we give the formal definition for the randomization of two tests (the extension to more than two tests is straightforward):
Definition 2. Let and be two tests of type A → B,
with outcomes in and , respectively, and let be a probability distribution allowed by the theory (i.e.,
a test of type I → I).
The randomization of and with probabilities is a test of type A → B,
with outcomes in the disjoint union , in which all elements of are regarded as distinct from all elements of . The test is defined as Our third assumption is that all randomizations are valid tests:
Assumption 3 (Randomizations). The set of tests is closed under randomizations.
The Randomizations Assumption, together with the possibility of coarse graining, guarantees the existence of random mixtures, defined as follows:
Definition 3. Let σ and τ be two deterministic states of system A
and let be a probability distribution allowed by the theory. The mixture of σ and τ with probabilities is the deterministic state ρ defined by The fact that ρ is a deterministic state follows from the fact that ρ is the coarse-graining of the randomized test with and.
The notion of mixture introduces a pre-order relation on the set of deterministic states:
Definition 4. Let ρ and σ be two deterministic states of system A. We say that ρ contains σ, denoted as , if , where is a probability distribution allowed by the theory, is a cancellative scalar, and τ is a deterministic state.
Note that the relation ⊒ is transitive: if and , then .
Our final assumption is about the existence of complete states, defined as follows:
Definition 5. A deterministic state is complete if for every deterministic state.
The assumption is:
Assumption 4 (Complete States). For every system, there exists at least one complete state.
While Assumptions 1, 2, and 3 are satisfied by most probabilistic theories, including infinite dimensional ones, Assumption 4 is more specific to the finite dimensional setting. It is satisfied in the standard scenario where the set of deterministic states is a finite-dimensional convex set. In this scenario, the complete states are exactly the points in the interior of the convex set. Assumption 4 is also satisfied in some theories where the set of deterministic states is not convex. For example, it is satisfied in Spekkens’ toy theory [
63,
64], where some convex combinations are forbidden but nevertheless the set of deterministic states contains a complete state. However, Assumption 4 is generally not satisfied by classical and quantum theory infinite dimensions. An infinite-dimensional generalization of the notion of complete state will be discussed in
Section 3.2.
2.3. Local Tomography
In this section, we review the property of Local Tomography. The scope of this review is to clarify, by contrast, what it means to characterize physical processes in theories where Local Tomography does not hold.
The Local Tomography has been formulated in several forms and under different names [
16,
19,
20,
21,
22,
23,
24,
25]. A simple formulation is the following [
26].
Definition 6 (Local Tomography). A physical theory satisfies Local Tomography if the state of every composite system is uniquely determined by the joint statistics of local measurements on the components.
Mathematically, Local Tomography states that, for every pair of systems A and B, and for every pair of states of the composite system A ⊗ B, say
and
, the condition
implies the equality
2.4. Standard Process Tomography
Local Tomography has an important implication for the task of characterizing physical processes: in every locally tomographic theory, the action of a process on its input system determines the action of the process on all possible composite systems. Explicitly, one has the following
Proposition 1 (Lemma 14 of [
24]).
Suppose that the theory satisfies Local Tomography. Then, for every pair of systems A
and B
and for every pair of processes and of type A → B,
the condition implies the equality or equivalently, the equality When system
A is finite dimensional, Proposition 1 guarantees that one can characterize an arbitrary process of type A → B by applying it to a fiducial set of input states of system
A, and by characterizing the output states of system
B with a fiducial set of measurements. This result is the conceptual foundation underpinning standard process tomography [
2,
3].
Interestingly, a converse of Proposition 1 also holds, under the assumption that the all states of all composite systems can be probabilistically generated by acting locally on one part of a fixed bipartite state. This point is discussed in
Appendix A.
2.5. The Counterexample of Real-Vector-Space Quantum Theory
When Local Tomography does not hold, standard process tomography may not work. A simple example can be found in real-vector-space quantum theory [
35], a variant of standard quantum theory that violates Local Tomography [
19,
20] and deviates from standard quantum theory in a number of operational tasks [
20,
24,
36,
37,
38,
39,
40,
41]. The example, introduced in [
42], involves two processes,
and
, acting on a “rebit”, that is, a two-dimensional quantum system associated to the vector space
. Mathematically, the two processes are defined by the linear maps:
where
M is a generic 2 × 2 matrix and
are the three Pauli matrices. Now, consider the action of the two processes on a single-rebit, whose possible states are described by density matrices with real entries. Writing a generic real-valued density matrix
in the Bloch form
we can easily obtain the relation
meaning that the two processes
and
act in the same way on every single-rebit input state.
On the other hand, the processes
and
are clearly different. The difference can be detected by applying the processes on a maximally entangled state of
two rebits: for example, using the notation
one has
Note that the output states are not only different, but also
orthogonal, meaning that a joint measurement on the two rebits can tell the two states apart without any error. The two states in the r.h.s. of Equation (
25) were first studied by Wootters [
20] as an example of bipartite states that perfectly distinguishable by global measurements, and yet completely indistinguishable by local measurements in real-vector-space quantum theory.
Summarizing, standard process tomography does not work in real-vector-space quantum theory: in order to unambiguously characterize an unknown process on real-vector space quantum states, it is mandatory to test it on composite systems.
2.6. Process Tomography without the Assumption of Local Tomography
In the lack of Local Tomography, process tomography requires experiments on composite systems. Now, the key question is: which composites have to be tested? In principle, the answer could be “all”: a process with input system A may have to be tested on every composite system A ⊗ C for every possible auxiliary system C. Loosely speaking, the only way to completely characterize a process would be to make experiments on “the whole universe”.
This situation does not arise in real-vector-space quantum theory: there, a complete process tomography can be achieved by preparing a
single input state of a
single composite system. Specifically, process tomography can be achieved by preparing two identical copies of the input system in a maximally entangled state and by letting the process act on one copy. We do not provide a proof here because later in the paper we will give a general proof valid for all theories obeying the Purification Principle (see
Section 3.5.3 for the formal definition). Since real-vector-space quantum theory satisfies the Purification Principle, the general proof applies.).
Summarizing, real-vector-space quantum theory satisfies the principle of Dynamically Faithful States (Principle 2 in the introduction). In the rest of the paper we will investigate the conditions that guarantee the validity of Dynamically Faithful States in general physical theories.
3. Results
3.1. Four Levels of Process Tomography
The goal of tomography is to characterize the action of an unknown process on all possible inputs. Nevertheless, it is also useful to first consider intermediate tasks where the goal is to characterize the process on a subset of inputs. In this section we define three such tasks, listed in order of increasing strength. All together, these three tasks and the task of full process tomography define four levels of characterization of physical processes.
3.1.1. Equality on a Source
Consider the situation where a source prepares system A in a non-deterministic fashion. Such a source can be described by a preparation test , where the preparation of the state is heralded by the outcome x. The states in the source are then used as inputs:
Definition 7. We say that two processes and , of type A → B,
are equal on the source if one has:
When this is the case, we write.
It is easy to see that is an equivalence relation. The equivalence relation defines a weak notion of process tomography, where the goal is just to identify the action of processes on the fixed set of states .
3.1.2. Equality upon Input of a State
We now consider a stronger notion of process tomography. Instead of characterizing a process on a single source, one can try to characterize the process on
all sources with the same average state. The
average state of a source is the deterministic state
obtained by coarse-graining over all possible outcomes.
Definition 8. We say that two processes and , of type A → B, are equal upon input of ρ if they are equal on every source with average state equal to ρ. When this is the case, we write .
The condition
is equivalent to the condition
where the notation
means that
is
contained in
, that is, that there exists a source
such that
and
for some cancellative scalar
(Note that this definition is slightly more general than Definition 4: here do not require the state
to be deterministic).
In quantum information, the notion of equality upon input of
was introduced in [
65]. Its extension to general probabilistic theories was discussed in [
24,
27,
28].
It is easy to check that equality upon input of is an equivalence relation. Note that, by definition, identifying a process upon input of is more demanding than just identifying it on a specific source with average state .
3.1.3. Equality on the Extensions of a State
All notions of process tomography considered so far focussed on the action of a process on its input system alone. To identify the process, however, one also needs to characterize its local action on composite systems.
Definition 9. Let be a deterministic state of system, A.
An extension
of ρ on system E is a deterministic state
satisfying the relation:
for some deterministic effect . We denote by the set of all extensions of ρ on system E.
Our third level of process tomography is to identify the action of a process on all the extensions of a given state.
Definition 10. We say that two processes and are equal on the extensions of
if one has for every possible system E
and for every possible extension . In this case, we write .
In general, equality of all the extensions of implies equality upon input of . This is because all sources with average state can be viewed as extensions of involving an environment serving as a “display”. Explicitly, we have the following:
Lemma 2 (A1).
Let be a deterministic state of system A,
and let be an arbitrary source with average state ρ. Then, there exists an extension , for suitable system E,
such that, for every pair of processes and of type A → B,
with arbitrary system B,
the conditionimplies the condition The proof is provided in
Appendix B. An immediate consequence is that equality on all the extensions of
implies equality upon input of
:
Corollary 1 (A1).
Let be a deterministic state of system A.
Then, for every pair of processes and of type A → B,
with arbitrary system B,
one has the implication In
Appendix B we also show that the relations
and
coincide in the special case of theories satisfying Local Tomography and Causality, the principle that the outcome probabilities of present experiments are independent on the choice of future experiments [
24,
26,
27,
28,
42,
46,
53,
55,
56].
3.1.4. Equality on All States
The ultimate goal of process tomography is to identify the action of an unknown process on all possible states. Using the results collected so far, the condition for perfect identification can be expressed as follows:
Lemma 3 (A1).
Let and be two processes of type A → B.
Then, one has the condition The proof is provided in
Appendix C. Summarizing, two processes of type A → B are equal if and only if they coincide on all the extensions of all deterministic states of system A. In
Section 3.4 we will show that, in fact, the extensions of a single state are enough.
3.2. Tomographic Ordering and Dynamically Faithful States
A natural notion in the context of process tomography is the notion of tomographic ordering:
Definition 11. Let and be two (possibly non-deterministic) states. We say that Φ
is tomographically more powerful than Ψ
for processes of type A →B,
if, for every pair of processes and of type A → B,
the condition implies the condition.
When this is the case, we write .
In general, the relation of tomographic ordering (for a fixed systems A and B) may or may not have a maximum. When it does, this maximum is called a dynamically faithful state:
Definition 12 (Dynamically faithful state [
15,
16,
17,
18,
24,
27,
28,
42]).
A (possibly non-deterministic) state is dynamically faithful for processes of type A→ B
if, for every pair of processes and of type A → B,
the condition implies Originally [
15,
16,
17,
18], the term “dynamically faithful” was used to refer to states that determine the action of an unknown process on its input system, up to probabilistic rescalings, that is, states for which the condition (35) implies
for every state
. Later [
24,
27,
28,
42], the term was used to denote states that identify the action of the process on arbitrary composite systems, as per Definition 12.
In every theory satisfying the Displays Assumption 1, dynamically faithful states can be taken to be deterministic without loss of generality:
Lemma 4 (A1). For a generic pair of systems A and B, if there exists a dynamically faithful state for processes of type A → B, then there exists a deterministic dynamically faithful state for processes of type A →B.
Proof. Let be a (possibly non-deterministic) dynamically faithful state for processes of type A → B, and let be a source that produces the state , say for some outcome . Since is dynamically faithful, equality of two processes on the source implies equality of the processes themselves: for every pair of processes and of type A → B, the condition implies the condition . Then, Lemma 2 shows that there exists a deterministic state , for suitable system E, such that the condition implies (and therefore , in the present case). Hence, the deterministic state is dynamically faithful for processes of type A →B. □
3.3. Relation between Tomographic Ordering and Containment
There is an important connection between the tomographic ordering and the containment relation defined earlier in the paper. Recall the definition: a deterministic state contains another deterministic state , denoted as , if is a non-trivial mixture of and some other state , namely where are probabilities and is cancellative. Here we show that the containment relation orders the deterministic states according to the tomographic power of their extensions.
Lemma 5 (A1, A3).
Let ρ and σ be two deterministic states of system A.
If ρ contains σ, then for every extension of σ, say there exists an extension of ρ, say , such thatfor every system B.
A simple consequence of Lemma 5 is the following:
Corollary 2 (A1, A3). Let ρ and σ be two deterministic states of system A. If ρ contains σ, then, for every system B and for every pair of processes and of type A → B, the condition implies the condition .
The case of complete states is especially important here. In this case, we have the following:
Corollary 3 (A1, A3). If is a complete state, then, for every system B and every pair of processes and of type A → B, the condition implies the equality .
Proof. Immediate from Corollary 2, Lemma 3, and the definition of complete state. □
This result plays a crucial role in the rest of the paper. In passing, we note that it also suggests an infinite-dimensional generalization of the notion of complete state. The generalization is as follows: we say that a state is tomographically complete if the relation implies the equality for every pair of processes of type A → B with arbitrary output system B. Corollary 3 shows that every complete state is tomographically complete. In infinite dimensional settings, complete states in the sense of Definition 5 may not exist, and one could regard tomographically complete states as the appropriate notion. Notably, all the results in this paper hold if the notion of complete state is replaced by the more general notion of tomographically complete state.
3.4. Necessary and Sufficient Condition for Dynamically Faithful States
We are now ready to provide the core technical result of the paper:
Theorem 1 (A1, A2, A3, A4). For a generic pair of systems A and B, the following are equivalent:
there exists a dynamically faithful state for processes of type A → B
there exists a complete state such that the set of all the extensions of ω has a maximum with respect to the tomographic ordering ⪰A→B.
The proof is provided in
Appendix E. The important point of the above theorem is that one can focus the attention on the complete states. In summary, the existence of dynamically faithful states boils down to the existence of a
maximally powerful extension of a complete state.
3.5. Physical Conditions for Dynamically Faithful States
We have seen that Dynamically Faithful States holds if and only if every complete state has a maximally powerful extension. Still, this is just a mathematical requirement. Can we find physical reasons for this requirement to be satisfied? Here we give three physically motivated sufficient conditions.
3.5.1. Universal Extension
Heuristically, it is natural to imagine that the “maximally powerful extension” in Theorem 1 determines all the other ways in which the system can be correlated to an environment. Going further, we can imagine that all the other correlations can be physically generated from the “maximally powerful extension” using some (possibly non-deterministic) process. The idea motivates the notion of universal extension, from which all the extensions of a given state can be generated:
Definition 13 (Universal extension).
A state has a universal extension
if there exists a reference system R
and a state with the property that every other extension of ρ can be probabilistically generated from Ψ
by applying a transformation on R.
Explicitly: for every system E,
every extension can be obtained as where is a cancellative scalar and is a transformation of type R →E.
Using this definition, we can formulate a requirement for general physical theories:
Principle 3 (Universal Extension). Every physical system has a complete state with a universal extension.
Note that one could also consider the stronger requirement that every state have a universal extension. The reasons why we did not phrase Principle 3 in this way are (1) universal extensions of complete states are already sufficient for process tomography, and (2) as we will see later in the paper, the possibility of conclusive teleportation guarantees the existence of a universal extension of a a complete state, while in general it may not guarantee the existence of universal extensions of arbitrary states.
We now show that Universal Extension guarantees Dynamically Faithful States. The first step is the following lemma:
Lemma 6. Suppose that a deterministic state has a universal extension . Then, for every system B
and for every pair of processes of type A → B,
say and , the condition implies the condition Proof. Let
be an arbitrary extension of
, and let
be the transformation in Equation (40). By applying the transformation
on both sides of Equation (
41), we obtain the relation
Inserting Equation (40) into both sides of the equality, we then obtain:
and, since
p is cancellative,
(cf. Lemma 1). Since
is an arbitrary extension of
, we conclude that the equivalence
holds. □
Applying the above lemma to a complete state, we obtain the following
Theorem 2 (A1, A3). Universal Extension implies Dynamically Faithful States.
Proof. Universal Extension guarantees that there exists a complete state
with a universal extension
, for some auxiliary system R. Since
is a universal extension, for every system B and every pair of processes
and
of type A → B, the equality
implies the condition
(cf. Lemma 6). Now, since
is complete, the equality on the extensions of
implies
(Corollary 3). To summarize, we obtained that the condition
implies
. Hence, the state
is dynamically faithful for processes of type A → B. Since this construction works for arbitrary A and B, we conclude that the theory satisfies Dynamically Faithful States. □
Notably, Universal Extension implies the existence of a state which is dynamically faithful for processes of type A → B, with arbitrary output system B. In other words, the choice dynamically faithful state depends only on the input of the tested process and not on its output.
3.5.2. Conclusive Teleportation
Quantum teleportation [
66] is a protocol for transferring the state of a quantum system from a sender to a receiver using only classical communication and previously established quantum correlations. The most basic version of quantum teleportation is probabilistic: the sender performs a joint measurement on the system to be teleported and on part of a bipartite state, shared with the receiver. If the measurement yields a specific outcome, then the state of the system is transferred to the receiver without any alteration. Probabilistic teleportation is often called
conclusive. The analogue of conclusive teleportation in general theories was studied by Abramsky and Coecke in [
49] in the context of categorical quantum mechanics, by Barnum, Barrett, Leifer, and Wilce [
23] in the context of general probabilistic theories, and by Chiribella, D’Ariano, and Perinotti [
24,
27] in the context of theories satisfying Purification.
In a general physical theory, conclusive teleportation means that, for every system A there exists a system R, a deterministic state
, an effect
, and a cancellative scalar
such that:
The existence of conclusive teleportation is a valuable operational feature, and may be put forward as a desideratum for general physical theories:
Principle 4 (Conclusive Teleportation). For every physical system A, there exists a conclusive teleportation protocol as in Equation (48).
Modulo a few technical differences, Conclusive Teleportation coincides with the main axiom for compact closed categories in categorical quantum mechanics [
49,
50,
51,
52,
53]. In this context, it is known that the teleportation state
induces a one-to-one correspondence between processes and bipartite states. In other words,
is dynamically faithful for processes of type A → B, with arbitrary B. Here we show a stronger result, namely that the state
is the universal extension of a complete state:
Proposition 2 (A4). Let be a teleportation state for system A. Then, Φ is the universal extension of a complete state.
The proof is provided in
Appendix F, where we also observe that, in general, a theory with conclusive teleportation may not have universal extensions for arbitrary states.
A simple consequence of Proposition 2 is the following:
Corollary 4 (A4). Conclusive Teleportation implies Universal Extension.
Summarizing, we proved the chain of implications:
In short, the possibility of conclusive teleportation guarantees the feasibility of process tomography, even in theories that violate Local Tomography.
3.5.3. Purification
A
purification of a given state is an extension of that state to a pure state of a composite system [
24,
27,
42,
46,
58]. In the standard scenario where the state space of physical systems is a convex set, pure states are defined as the extreme points of the convex set. A diagrammatic definition of pure state, valid also for theories where the scalars are not real numbers, was provided in Ref. [
67]. The second definition is equivalent to the first in every theory satisfying Causality [
24,
26,
27,
28,
42,
46,
53,
55,
56], the requirement that it should be impossible to “send signals from the future to the past” (see below for a more precise mathematical statement) and Local Tomography, or Local Tomography on Pure States, a weakening of Local Tomography satisfied by quantum theory on real vector spaces. Specifically, Local Tomography on Pure States is the requirement that Equation (16) implies Equation (17) when at least one of the two states
and
is an extreme point of the state space.
In the following, we will not commit to a specific definition of pure state. Instead, we will simply assume that for every system A there exists a set of states
, called
pure, satisfying the condition
We call the above condition Pure Product States. Pure Product States is automatically satisfied by the diagrammatic definition of pure state in [
67]. It is also satisfied by the usual definition of pure states as extreme points, when it coincides with the diagrammatic definition.
Let us now review the notion of purification. In its basic form, purification is formulated in theories satisfying Causality. An equivalent condition for Causality is the existence of a unique deterministic effect:
Proposition 3 ([
24]).
A theory satisfies Causality if and only if for every system A
there exists a unique deterministic effect .
Theories satisfying Causality are often called
causal theories. In a causal theory, a state
is an extension of another state
if the condition
is satisfied.
We are now ready to state the Purification principle in its basic form:
Principle 5 (Purification). For a theory satisfying Causality and Pure Product States, we require that
every state has a purification: for every system and every deterministic state there exists a system R
and a deterministic pure state such that every two purifications with the same purifying system are interconvertible via a local symmetry transformation: for every system R,
and every pair of deterministic pure states Ψ
and in , the condition:
implies that there exists a reversible transformation such that
Remarkably, the symmetry property of purifications implies that every purification is a universal extension:
Proposition 4. Let be an arbitrary state of an arbitrary system A,
and let be an arbitrary extension of ρ. If the theory satisfies Causality, Pure Product States, and Purification, then the state Γ
can be generated as where is an arbitrary purification of ρ and is a deterministic transformation.
The proof is provided in
Appendix G. The obvious consequence of Proposition 4 is the following
Corollary 5 (A1, A3, A4). If a theory satisfies Causality, Pure Product States, and Purification, then it satisfies Universal Extension, and, in particular, Dynamically Faithful States.
Proof. Proposition 4 shows that every purification of a given state is a universal extension, in the sense of Definition 13. In particular, every complete state has a universal extension (complete states exist by Assumption 4). Hence, Universal Extension holds. Then, Theorem 2 guarantees that Dynamically Faithful States holds. □
Summarizing, we proved the chain of implications:
Note that this is the same chain of implications as in the previous subsection, except that “Conclusive Teleportation” is now replaced by “Causality + Pure Product States + Purification.” A natural question is whether there are relations between Purification and Conclusive Teleportation. When the scalars are real numbers in the interval
, it is known that Purification implies Conclusive Teleportation [
24,
27,
28,
42]. More generally, it is also possible to show that Purification implies Conclusive Teleportation in every theory that satisfies two additional axioms of Pure Sharpness and Purity Preservation [
58,
68,
69].
4. Conclusions
In this paper, we analyzed the task of process tomography in general physical theories, exploring the requirement that physical processes should be identifiable by their action on a finite set of auxiliary systems/a finite set of input states. Most of the paper focussed on the requirement that physical processes can be identified by their action on a single state, called dynamically faithful. The existence of dynamically faithful states is a broader condition than the usual principle of Local Tomography, and is satisfied in a number of variants of quantum theory, including quantum theory on real Hilbert spaces and a Fermionic version of quantum theory [
70,
71]. It is natural to conjecture that dynamically faithful states also exist in a recent extension of standard quantum theory that includes complex, real, and quaternionic Hilbert spaces in a single theory [
72]. A formal proof of this statement, however, is currently missing and remains an interesting direction for future research.
One of our main results is that the existence of dynamically faithful states can be guaranteed by a simple physical condition, namely that complete states have a universal extension from which all the other extensions can be generated with non-zero probability. Example of physical theories with this property are the set of theories where conclusive teleportation is possible, and the set of theories satisfying the Causality, Pure Product States, and Purification. For every theory satisfying Causality, Pure Product States, and Purification, process tomography can be achieved using a single bipartite state: specifically, the purification of any complete state of a given system A can be used to characterize all the processes with input system A and arbitrary output system B.
An important observation is that the existence of dynamically faithful states is independent of the Causality axiom, that is, the requirement that the outcome probabilities of present experiments are independent of the choice of future experiments. This observation suggests that the possibility of characterizing physical processes may have a more primitive role than considerations of causality, and that one could adopt Dynamically Faithful States as a foundational principle for general physical theories where causality is emergent.