1. Introduction
Quantum mechanics was originally developed for the world of atoms and electrons, where it has been very successful. The understanding of the microscopic world, let us call it “quantum world”, that has developed from this success, seems to be very different or even incompatible with the understanding of the everyday world of our immediate experience, which we can call “classical world”. This is unsatisfactory because one of the strongest feelings of a modern physicist is the belief in the unity of knowledge. It is even paradoxical because the bodies of the everyday world are composed of atoms and electrons, which ought to be described by quantum mechanics.
We can distinguish three problems that are met on the way from quantum to classical physics. Classical theories such as Newtonian mechanics, Maxwellian electrodynamics or thermodynamics are objective, in the sense that the systems they are studying can be considered as real objects, and the values of their observables, such as position, momentum, field strengths, charge current, temperature, etc., can be ascribed to the systems independently of whether they are observed or not. If we are going to construct quantum models of classical systems, the question naturally arises how such an objective world can emerge from quantum mechanics.
Indeed, in quantum mechanics, the values obtained by most registrations on microsystems (technically, they are values of observables) cannot be assumed to exist before the registrations, that is, to be objective properties of the microsystem on which the registration is made. An assumption of this kind would lead to contradictions with other assumptions of standard quantum mechanics and, ultimately, with observable facts (contextuality [
1,
2], Bell inequalities [
3], Hardy impossibilities [
4], Greenberger–Horne–Zeilinger equality [
5]).
This property of quantum mechanical observables has lead to growing popularity of various forms of weakened realism. For example, according to Bohr, realism applies only to the results of quantum measurements, which can be described by the relation between objective classical properties of real classical preparation and registration apparatuses. Various concepts of quantum mechanics itself, such as electron, wave function, observable, etc., do not possess any direct counterparts in reality; they are just instruments to keep order in our experience and to make it ready for application. A rigorous account of this kind of weakened realism is [
6]. Similar view is the so-called “statistical” or “ensemble” interpretation [
7]. This refuses to attribute any kind of reality to the quantum-mechanical probability amplitudes either at the microscopic or macroscopic level. According to this view, the amplitudes are simply intermediate symbols in a calculus whose only ultimate function is to predict the statistical probability of various directly observed macroscopic outcomes, and no further significance should be attributed to them. Another example of even weaker realism is the “constructive empiricism” by van Fraassen [
8], which proposes to take only empirical adequacy, but not necessarily “truth”, as the goal of science. Even the reality of classical systems and their properties is then only apparent. The focus is on describing “appearances” rather than how the world really is [
9]. Thus, quantum mechanics does not seem to allow a realist interpretation. Let us call this
Problem of Realist Interpretation.
The second problem for construction of quantum models of classical systems can be called
Problem of Classical Properties. This is the apparent absence of quantum superpositions, as well as the robustness, of classical properties (for detailed discussion, see Ref. [
10]). Clearly, classical properties such as a position do not allow linear superposition. Nobody has ever seen a table to be in a linear superposition of being simultaneously in the kitchen as well as in the bedroom. Also, observing the table in the kitchen does not shift it to the bedroom, while quantum registration changes properties of registered systems. That is roughly what is meant when one says that classical properties are
robust (for a better definition, see Ref. [
10]).
Finally, there is a serious problem at the quantum–classical boundary. For quantum measurements, evidence suggests the assumption that the registration apparatus always is in a well-defined classical state at the end of any quantum measurement indicating just one value of the registered observable. This is called
objectification requirement [
11]. However, if the initial state of the registered system is a linear superposition of different eigenvectors of the observable, then the linearity of Schrödinger equation implies that the end state of the apparatus is also a linear superposition of eigenstates of its pointer observable. (This holds, strictly speaking, only if the measured system
and the apparatus
constitute together an isolated system. But if they are not isolated from their environment
then the composite
can be considered as isolated and the difficulty reappears.) Thus, it turns out that realism in most cases leads to contradictions with the postulate of linear quantum evolution, see the analysis in [
10]. Let us call this
Problem of Quantum Measurement.
There is a vast literature dealing with the three problems containing many different proposals from various variants of weakened realism to radical changes of quantum mechanics (for clear reviews see [
11,
12,
13]). One also proposes that quantum mechanics is based on a kind of approximation that ceases to be valid for macroscopic systems, see e.g. [
10]. An even more radical approach is to look for the way quantum mechanics could be obtained from a kind of deeper theory with classical character (see e.g. the proceedings [
14]) These or other attempts in the literature do not seem to lead to a satisfactory solution. We shall not discuss this work because we shall look for the solution in a different and utterly novel direction. Our approach starts directly from quantum mechanics as it has been formulated by Bohr, Born, Dirac, Heisenberg, von Neumann, Pauli and Schrödinger. Our analysis has shown that they have delivered enough tools to deal with the problems. The aim of this review is to prove that our proposals of solutions to the three problems form a logically coherent whole with the rest of quantum mechanics.
Let us briefly describe the ideas from which our approach starts. First, it is true that values of observables are not objective. However, in [
15], we have shown that there are other observable properties of quantum systems that can be ascribed to the systems without contradictions and there is a sufficient number of them to describe the state of the systems completely. We shall introduce and discuss this approach in
Section 1.0.3. in general terms and then, in technical detail, in
Section 2.1.2..
Second, many attempts to derive classical theories from quantum mechanics are based on quantum states of minimal uncertainty (Gaussian wave packets, coherent states, etc.). But a sharp classical trajectory might be just a figment of imagination because each measurement of a classical trajectory is much fuzzier than the minimum quantum uncertainty. Thus, the experimental results of classical physics do not justify any requirement that we have to approximate absolutely sharp trajectories as accurately as possible. This was observed as early as 1822 by Exner [
16] and evolved further in 1955 by Born [
17]. Taking this as a starting point, one must ask next what the quantum states that correspond to realistically fuzzy classical ones are. In [
18], a new assumption about such quantum states has been formulated and studied. We shall discuss it in
Section 4.
Third, in [
19] it is shown that the measurement of observables such as a position, momentum, spin, angular momentum, energy, etc. on a quantum system must be strongly disturbed by all existing systems of the same type, at least according to the standard quantum mechanics. To eliminate the disturbance, and to give an account of what is in fact done during a quantum measurement process, the standard theory of observables must be rewritten. We shall do this in
Section 3.2 on identical quantum systems.
Fourth, the current theory of quantum measurement describes a registration apparatus by a microscopic quantum system, the so-called pointer, and assumes that a reading of the apparatus is an eigenvalue of a pointer observable. For example, Ref. [
20], p. 64, describes a measurement of energy eigenvalues with the help of scattering process similar to Stern–Gerlach experiment, and it explicitly states:
We can consider the centre of mass [of a microscopic system] as a ’special’ measuring apparatus...
Similarly, Ref. [
21], p. 17 describes Stern–Gerlach experiment:
The microscopic object under investigation is the magnetic moment μ of an atom.. . . The macroscopic degree of freedom to which it is coupled in this model is the centre of mass position .. . . I call this degree of freedom macroscopic because different final values of can be directly distinguished by macroscopic means, such as the detector.. . . From here on, the situation is simple and unambiguous, because we have entered the macroscopic world: The type of detectors and the detail of their functioning are deemed irrelevant.
Paradoxically, this notion of measurement apparatus (mostly called “meter”) is quite useful for the very precise modern quantum experiments, such as non-demolition measurements or weak measurements, see, e.g., Refs. [
22,
23]. Quite generally, these experiments utilize auxiliary microscopic systems called
ancillas. If the pointer is interpreted as an ancilla, then the old theory works well at least for the interaction of the measured system with the ancilla. However, the fact that ancillas themselves must be registered by macroscopic detectors is “deemed irrelevant”. A more detailed analysis is given in
Section 5.1.
In fact, detectors are very special macroscopic systems. We shall show that their role is to justify the state reduction (collapse of wave function) and to define the so-called “preferred basis” (see [
13]) that determines the form of state reduction. Further, the reduction can be assumed as a process within detectors at the time of the detector’s macroscopic signal. We shall discuss the design and role of detectors in
Section 4.3.
The present paper has grown from review [
24] by adding new results, by correcting many minor errors and by explaining many points in a clearer and more coherent way. It is a systematic exposition of the non-relativistic quantum mechanics (the space and time structure is everywhere assumed to be Newtonian).
1.0.1. Examples of quantum systems
To explain what quantum mechanics is about, this section describes some well-known quantum systems following Ref. [
24]. It also introduces some general notions, such as microsystem, macrosystem, type of system and structural property following Ref. [
15].
Quantum mechanics is a theory that describes certain class of properties of certain class of objects in a similar way as any other physical theory does. For example, among others, Newtonian mechanics describes bodies that can be considered as point-like in a good approximation and studies the motion of the bodies.
Quantum systems that we shall consider are photons, electrons, neutrons and nuclei, which we call particles, and systems containing some number of particles, such as atoms, molecules and macroscopic systems, which are called composite. (One can ask whether there is a non-relativistic limit of photons. In one such limit, photons may move with infinite velocity and therefore their position does not need to be very well defined. In another, photons may be represented by a classical electromagnetic wave.) Of course, neutrons and nuclei themselves are composed of quarks and gluons, but non-relativistic quantum mechanics can and does start from some phenomenological description of neutrons and nuclei.
Let us call particles and quantum systems that are composed of small number of particles microsystems. They are extremely tiny and they mostly cannot be perceived directly by our senses. We can observe directly only macroscopic quantum systems that are composed of very many particles. (It is true that the eye can recognize signals of just several photons, but it can be viewed as a quantum registration apparatus with macroscopic parts and only these are observed “directly”.) “Very many” is not too different from (the Avogadro number). Let us call these macrosystems. Some properties of most macroscopic systems obey classical theories. For example, shape and position of my chair belong to Euclidean geometry, its mass distribution to Newtonian mechanics, chemical composition of its parts to classical chemistry and thermodynamic properties of the parts such as phase or temperature to phenomenological thermodynamics. Such properties are called ’classical’. Thus, properties of microsystems can only be observed via classical properties of macrosystems; if microsystems interact with them, then this interaction changes their classical properties.
Microsystems are divided into types, such as electrons, hydrogen atoms, etc. Systems of one type are not distinguishable from each other in a sense not existing in classical physics. Systems of the same type are often called identical. Microsystems exist always in a huge number of identical copies. The two properties of microsystems, viz. 1) their inaccessibility to direct observations and 2) utter lack of individuality that is connected with the existence of a huge number of identical copies, make them rather different from classical systems or “things”. Each classical system can be observed directly by humans (in principle: for example, the distant galaxies) and each can be labelled and distinguished from other classical systems, because it is a quantum system composed of a huge number of particles and hence it is highly improbable that it has a kin of the same type in the world.
Objective properties that are common to all microsystems of the same type will be called structural. Thus, each particle has a mass, spin and electric charge. For example, the mass of electron is about 0.5 MeV, the spin 1/2 and the charge about C. In non-relativistic quantum mechanics, any composite system consists of definite numbers of different particles with their masses, spins and charges. (We do not view quasiparticles as particles but as auxiliary entities useful for description of the spectrum of some composite systems.) E.g., a hydrogen atom contains one electron and one proton (nucleus). The composition of a system is another structural property. The structural properties influence the dynamics of quantum systems; the way they do it and what dynamics is will be explained later. Only then, it will be clear what the meaning of these parameters is and that the type of each system can be recognized, if its dynamics is observed. When we shall know more about dynamics, further structural properties will emerge.
Structural properties are objective in the sense that they can be assumed to exist before and independently of any measurement. Such assumption does not lead to any contradictions with standard quantum mechanics and is at least tacitly made by most physicists. In fact, the ultimate goal of practically all experiments is to find structural properties of quantum systems.
From the formally logical point of view, all possible objective properties of given kind of objects ought to form a Boolean lattice. The structural properties satisfy this condition: systems with a given structural property form a subset of all systems. These subsets are always composed of whole type-classes of quantum systems. Clearly, the intersection of two such subsets and the complement of any such subset is again a structural property.
Structural properties characterize a system type completely but they are not sufficient to determine the dynamics of individual systems.
1.0.2. Examples of quantum experiments
The topic of this section plays an important role in understanding quantum mechanics. Specific examples of typical experiments will be given in some detail following Ref. [
24]. In this way, we gain access to the notions of preparation and registration, which are assumed by the basic ideas of our realist interpretation of quantum mechanics. Describing the experiments, we shall already use some of the language of the interpretation, which will be introduced and motivated in this way.
Let us first consider experiments with microsystems that are carried out in laboratories. Such an experiment starts at a source of microsystems that are to be studied. Let us give examples of such sources.
Electrons. One possible source (called field emission, see e.g. Ref. [
25], p. 38) consists of a cold cathode in the form of a sharp tip and a flat anode with an aperture in the middle at some distance from the cathode, in a vacuum tube. The electrostatic field of, say, few kV will enable electrons to tunnel from the metal and form an electron beam of about
electrons per second through the aperture, with a relatively well-defined average energy.
Neutrons can be obtained through nuclear reaction. This can be initiated by charged particles or gamma rays that can be furnished by an accelerator or a radioactive substance. For example the so-called Ra-Be source consists of finely divided RaCl mixed with powdered Be, contained in a small capsule. Decaying Ra provides alpha particles that react with Be. The yield for 1 mg Ra is about neutrons per second with broad energy spectrum from small energies to about 13 MeV. The emission of neutrons is roughly spherically symmetric centred at the capsule.
Atoms and molecules. A macroscopic specimen of the required substance in gaseous phase at certain temperature can be produced, e.g., by an oven. The gas is in a vessel with an aperture from which a beam of the atoms or molecules emerges.
Each source is defined by an arrangement of macroscopic bodies of different shapes, chemical compositions, temperatures and by electric or magnetic fields that are determined by their macroscopic characteristics, such as average field intensities: that is, by their classical properties. These properties determine uniquely what type of microsystem is produced. Let us call this description
empirical. It is important that the classical properties defining a source do not include time and position so that the source can be reproduced later and elsewhere. We call different sources that are defined by the same classical properties
equivalent. Empirical description is sufficient for reproducibility of experiments but it is not sufficient for understanding of how the sources work. If a source defined by an empirical description is set into action, we have an instance of the so-called
preparation.
Quantum mechanics assumes that these are general features of all sources, independently of whether they are arranged in a laboratory by humans or occur spontaneously in nature. For example, classical conditions at the centre of the Sun (temperature, pressure and plasma composition) lead to emission of neutrinos that reach the space outside the Sun.
Often, a source yields very many microsystems that are emitted in all possible directions, a kind of radiation. We stress that the detailed structure of the radiation as it is understood in classical physics, that is where each individual classical system exactly is at different times, is not determined for quantum systems and the question even makes no sense. Still, a fixed source gives the microsystems that originate from it some properties. In quantum mechanics, these properties are described on the one hand by the structural properties that define the prepared type, on the other, e.g., by the so-called
quantum state. The mathematical entity that is used in quantum mechanics to describe a state (the so-called state operator) will be explained in
Section 2.1. To determine the quantum state that results from a preparation with a given empirical description in each specific case requires the full formalism of quantum mechanics. Hence, we postpone this point to
Section 5.
After arranging the source, another stage of the experiment can start. Generally, only a very small part of the radiation from a source has the properties that are needed for the planned experiment. The next step is, therefore, to select the part and to block off the rest. This is done by the so-called collimator, mostly a set of macroscopic screens with apertures and macroscopic electric or magnetic fields. For example, the electron radiation can go through an electrostatic field that accelerates the electrons and through electron-microscope “lenses”, each followed by a suitable screen. A narrow part of the original radiation, a beam, remains. Another example is a beam of molecules obtained from an oven. It can also contain parts of broken molecules including molecules with different degrees of ionisation. The part with suitable composition can then be selected by a mass spectrometer and the rest blocked off by a screen. Again, the beam resulting from a raw source and a collimator consists of individual quantum objects with a well-defined type and quantum state. The process of obtaining these individual quantum objects can be viewed as a second stage of the preparation. Again, there is an empirical description that defines an equivalence class of preparations and equivalent preparations can be reproduced.
The final beam can be characterized not only by the quantum state of individual objects but also by its approximate current, that is how many individual objects it yields per second. The beam can be made very thin. For example, in the electron-diffraction experiment [
26], the beam that emerges from the collimator represents an electric current of about
, or
electrons per second. As the approximate velocity of the electrons and the distance between the collimator and the detector are known, one can estimate the average number of electrons that are there simultaneously at each time. In the experiment, it is less than one. One can understand in this way that it is an experiment with individual electrons.
Next, the beam can be lead through further arrangement of macroscopic bodies and fields. For example, to study the phenomenon of diffraction of electrons, each electron can be scattered by a thin slab of crystalline nickel or by an electrostatic biprism interference apparatus. The latter consists of two parallel plates and a wire in between with a potential difference between the wire on the one hand and the plates on the other. An electron object runs through between the wire and both the left and right plate simultaneously and interferes with itself afterwards (for details see [
26]). Again, the beam from the graphite or the biprism can be viewed as prepared by the whole arrangement of the source, collimator and the interference apparatus. This is another example of a reproducible preparation procedure.
Finally, what results from the original beam must be made directly perceptible by its interaction with another system of macroscopic bodies and fields. This process is called registration and the system registration apparatus. The division of an experimental arrangement into preparation and registration parts is not unique. For example, in the electron diffraction experiment, one example of a registration apparatus begins after the biprism interference, another one includes also the biprism interference apparatus. Similarly to preparations, the registrations are defined by an empirical description of their relevant classical properties in such a way that equivalent registrations can be reproduced.
An important, even definition, property of a registration is that it is applicable to an individual quantum system and that each empirical result of a registration is caused by just one individual system. In the above experiment, this assumption is made plausible by the extreme thinning of the beam, but it is adopted in general even if the beam is not thin.
An empirical description of a registration apparatus can determine a quantum mechanical
observable similarly as preparation determines a quantum state. Again, more theory is needed for understanding of what are the mathematical entities (the so-called
positive valued operator measures, see
Section 2.2) describing observables and how they are related to registration devices. Each individual registration performed by the apparatus, i.e., registration performed on a single quantum object, then gives some value of the observable. The registration is not considered to be finished without the registration apparatus having given a definite, macroscopic and classical signal. This is the objectification requirement.
A part of registration apparatuses for microsystems are
detectors. At the empirical level, a detector is determined by an arrangement of macroscopic fields and bodies, as well as by the chemical composition of its sensitive matter [
27]. For example, in the experiment [
26] on electron diffraction, the electrons coming from the biprism interference apparatus are absorbed in a scintillation film placed transversally to the beam. An incoming electron is thus transformed into a light signal. The photons are guided by parallel system of fibres to a photo-cathode. The resulting (secondary) electrons are accelerated and lead to a micro-channel plate, which is a system of parallel thin photo-multipliers. Finally, a system of tiny anodes enables to record the time and the transversal position of the small flash of light in the scintillation film.
In this way, each individual electron coming from the biprism is detected at a position (two transversal coordinates determined by the anodes and one longitudinal coordinate determined by the position of the scintillation film). Such triple
of numbers is the result of each registration and the value of the corresponding observable, which is a coarsened and localized position operator
in this case (see
Section 3.2.3.). Also the time of the arrival at each anode can be approximately determined. Thus, each position obtains a certain time.
For our theory, the crucial observation is the following. When an electron that has been prepared by the source and the collimator hits the scintillation film, it is lost as an individual system. Indeed, there is no property that would distinguish it from other electrons in the scintillation matter. Thus, this particular registration is a process inverse to preparation: while the preparation has created a quantum system with certain individuality, the registration entails a loss of the individuality. Our theory of quantum registration in
Section 5 will make precise and generalize this observation.
If we repeat the experiment with individual electrons many times and record the transversal position coordinates, the gradual formation of the electron interference pattern can be observed. The pattern can also be described by some numerical values. For example, the distance of adjacent maxima and the direction of the interference fringes can be such values. Still, the interference pattern is not a result of one but of a whole large set of individual registrations.
In some sense, each electron must be spread out over the whole plane of the scintillation detector after coming from the biprism but the excitation of the molecules in the detector matter happens always only within a tiny well-localized piece of it, which is different for different electrons. (This is what is sometimes called “the collapse of wave function”.) Thus, one can say that the interference pattern must be encoded in each individual electron, even if it is not possible to obtain the property by a single registration. The interference pattern can be considered as an objective property of the individual electrons prepared by the source, the collimator and the biprism interference apparatus. The interference pattern is not a structural property: preparations that differ in the voltage at some stage of the experiment (e.g., the accelerating field in the collimator or the field between the wire and the side electrodes in the biprism interference apparatus, etc.) will give different interference patterns. We call such objective properties dynamical. On the other hand, the hitting position of each individual electron cannot be considered as its objective property. Such an assumption would lead to contradictions with results of other experiments. The position must be regarded as created in the detection process.
It is a double-slit experiment, a special kind of which is described above, that provides a strong motivation for considering an individual quantum particle as an extended object of sorts. Without any mathematical description, it is already clear that such an extended character of electrons could offer an explanation for the stability of some states of electrons orbiting atomic nuclei. Indeed, a point-like electron would necessarily have a time-dependent dipole momentum and lose energy by radiation. However, an extended electron can define a stationary charge current around the nucleus.
Some structural properties can be measured directly by a registration (on individual quantum systems) and their values are real numbers. For example, mass can be measured by a mass spectrometer. Such structural properties can be described by quantum observables (see
Section 2.2.5.). (These observables must commute with all other observables ([
6], IV.8), and can be associated with the so-called superselection rules, see e.g. [
11].) However, there are also structural properties that cannot be directly measured on individual objects similarly to the interference pattern, such as cross sections or branching ratios. They cannot be described by observables.
1.0.3. Realist Model Approach to quantum mechanics
In the previous section, describing specific experiments, we have used certain words that are avoided in careful textbooks of quantum mechanics such as
objective properties or
quantum object or
…An electron object runs through between the wire and both the left and right plate simultaneously and interferes with itself afterwards …
The electron is viewed here as a real object that is extended over the whole width of the biprism apparatus. After this intuitive introduction, we give now a general and systematic account of our realist interpretation.
A realist interpretation of a physical theory is a more subtle question than whether the world exists for itself rather than being just a construction of our mind. This question can always be answered in positive without any danger of falsification. However, every physical theory introduces some general, abstract concepts. For example, Newtonian mechanics works with mass points, their coordinates, momenta and their dynamical trajectories. The truly difficult question is whether such concepts possess any counterparts in the real world. On the one hand, it seems very plausible today that mass points and their sharp trajectories cannot exist and are at most some idealizations. On the other hand, if we are going to understand a real system, such as a snooker ball moving on a table, then we can work with a construction that uses these concepts but is more closely related to the reality. For example, we choose a system of infinitely many mass points forming an elastic body of a spherical shape and calculate the motion of this composite system using Newton’s laws valid for its constituent points. Then, some calculated properties of such a model can be compared with interesting observable properties of the real system. Thus, even if the general concepts of the theory do not describe directly anything existing, a suitable model constructed with the help of the general concepts can account for some aspects of a real system.
Motivated by this observation, we shall divide any physical theory into two parts. First, there is a treasure of successful models. Each model gives an approximative representation of some aspects of a
real object [
28]. Historically, models form a primary and open part of the theory. For example, in Newtonian mechanics, the solar system was carefully observed by Tycho de Brahe and then its model was constructed by Kepler. Apparently, Newton was able to calculate accelerations and doing so for Kepler trajectories, he might discover that they pointed towards the Sun. Perhaps this lead to the Second Law. The hydrogen atom had a similar role in quantum mechanics.
Second, there is a general language part. It contains the mathematical structure of state space, conditions on trajectories in the state space, their symmetries and the form of observables [
8]. It is obtained by generalization from the study of models and is an instrument of further model construction and of model unification. For example, in Newtonian mechanics, the state space is a phase space, the conditions on trajectories is the general structure of Newton’s dynamical equations, symmetries are Galilean transformations and observables are real functions on the phase space.
A model is constructed as a particular subset of trajectories in a particular state space as well as a choice of important observables. For example, to describe the solar system, assumptions such as the number of bodies, their point-like form, their masses, the form of gravitational force and certain class of their trajectories can be made if we want to construct a model. The observed positions of the planets would then match the theoretical trajectories of the model within certain accuracy. Thus, a model consists of a language component on the one hand, and an identifiable-object component on the other. The language component always contains simplifying assumptions, always holds only for some aspects of the associated object and only within some approximation. The approximation that is referred to is bounded from above by the accuracy of performed measurements. This is measurable and can be expressed numerically by statistical variances.
Clearly, the models of a given theory are not predetermined by the general part but obtained in the historical evolution and dependent on observation of real objects. On the one hand, the general part can also be used to construct language components of models that do not have any real counterparts. On the other hand, the model part is steadily evolving and never closed. For example, a satisfactory quantum model of high temperature superconductivity is not yet known. This is why the treasure of successful models is an independent and, in fact, the basic part of any theory.
Such philosophy forms a first step of what we call our Realist Model Approach to quantum mechanics. Thus, the Approach lies somewhat within the recent trend of the philosophy of science that defines a theory as a class of models (see, e.g., [
8,
28,
29,
30,
31]). It can be said that it combines ideas of the constructive realism by Ronald Giere (I enthusiastically adopt Giere’s view that philosophy of science is to be removed from the realm of philosophy and put into the realm of cognitive sciences) with van Fraassen notions of state space and symmetries [
8] as a basis of the general language part. (Van Fraassen also applied his constructive empiricism to quantum mechanics [
32] and, adding some further ideas, arrived at his own, the so-called “modal interpretation” of it. To prevent misunderstanding, it must be stressed that the account and interpretation of quantum mechanics described here is different from van Fraassen’s.) It is important that constructive realism is immune to the usual objections against naive realism. In addition, we add some further importance to the general part by recognizing its unification role. The effort at unification is without any doubts a salient feature of scientists that can be observed at any stage of research. For example, Newton was admired for his unification of such different phenomena as apples falling from trees and the Moon moving in the sky. Today’s endeavor to unify the theories of quantum fields and gravity is a very well observable historical fact. From the point of view of Giere, this bent might, perhaps, be understood as one of cognitive instincts.
The focus on models allows to define the task of quantum-explaining the classical world in the following way. Instead of trying to find a direct relation between the general language parts of, say, quantum and Newtonian mechanics or a universal correspondence between states of Newtonian and quantum mechanics such as Wigner–Weyl–Moyal map [
33], p. 85, one ought to build quantum models of real macroscopic systems and their aspects for which there are models in Newtonian mechanics striving for approximate agreement between the two kinds of models on those aspects for which the Newtonian models are successful. For example, we shall not attempt to obtain from quantum mechanics the sharp trajectories that is a concept of general language part of Newtonian mechanics, but rather try to model the observed fuzzy trajectories of specific classical systems (
Section 4), or to analyse different specific registration apparatuses first and then try to formulate some features common to all (
Section 5).
However, the Realist Model Approach is not so easily applied to quantum mechanics as it is to Newtonian mechanics. A question looms large at the very start: What is a real quantum object? Of course, such object are met “empirically” in preparations and registrations. However, we would like to subscribe to the notion that the language component of a model must ascribe to its real object a sufficient number of objective properties. Objective means that the properties can be ascribed to the object alone. Sufficient means that the dynamics of any object as given by its model is uniquely determined by initial data defined by values of a minimal set of its objective properties. For example, in Newtonian mechanics, the values of coordinates and momenta determine a unique solution of Newton’s dynamical equations.
In Newtonian mechanics, coordinates and momenta are observables, and values of observables can be viewed as objective without any danger of contradictions. Using this analogy, one asks: Can values of observables be viewed as objective properties of quantum systems? As is well known, the answer is negative (see
Section 2.2.4.). If we assume that values of observables are the only properties of quantum systems that are relevant to their reality, then there are no real quantum systems. For rigorous no-go theorems concerning such objective properties see, e.g., Ref. [
6].
Our approach to properties of quantum systems is therefore different from those that can be found in literature. First, we extend the notion of properties to include complex ones in the following sense [
24]:
Their values may be arbitrary mathematical entities (sets, maps between sets, etc.). For example, the Hamiltonian of a closed quantum system involves a relation between energy and some other observables of the system. This relation is an example of such a complex property.
Their values do not need to be directly obtained by individual registrations. For example, to measure a cross-section a whole series of scattering experiments must be done. Thus, their values do not necessarily possess probability distribution but may be equivalent to, or derivable from, probability distributions.
Point 2 is usually not clearly understood and we must make it more precise. A real system of Newton mechanics is sufficiently robust so that we can do many experiments with it and perform many measurements on it without changing it. Moreover, any such system is sufficiently different from other systems anywhere in the world (even two cars from one factory series can be distinguished from each other). Any physical experiment on a given classical system can then be repeated many times and only then the results can be considered reliable. The results are then formulated in statistical terms (e.g., as averages, variances, etc.). One can, therefore, feel that it might be more precise account of what one does generally in physics if one spoke of ensembles of equivalent experiments done on equivalent object systems in terms of equivalent experimental set-ups and of the statistics of these ensembles.
This is of course a well-known idea. We shall apply it consequently to Newtonian mechanics in the theory of classical limit in
Section 4. However, one ought not to forget that each ensemble must consist of some elements. Indeed, to get a statistics, one has to possess a sufficiently large number of different individual results. Hence, these individual elements must always be there independently of how large the ensemble is. Then, we can ask the question: What do the statistical properties of an ensemble tell us about properties of the individual object systems used in each individual experiment? In the classical physics, at least, the answer to this question is considered quite obvious and one interprets the experimental results as properties of the individual objects.
Quantum microsystems are never robust in the above sense. After a single registration, the microsystem is usually lost. Then we can repeat the experiment only if we do it with another system. Here, we can utilize another property of quantum microsystems that is different from classical ones: there is always a huge number of microscopic systems of the same type, which are principally indistinguishable from each other. Thus, we can apply the same preparation together with the same registration many times. In quantum mechanics, the thought set of all such experiments is called ensemble of experiments, and similarly ensembles of prepared systems and of obtained results. The elements of the ensembles are again called individuals. Suppose that each individual result of an ensemble of measurements is a real number. Then we can e.g. calculate an average of the results ensemble. The average can then surely be considered as a property of the ensemble.
However, as in classical physics, one can also understand the average as a property of each individual system of the ensemble. In any case, the fact that a given individual belongs to a given ensemble is a property of the individual. It is a crucial step in our theory of properties that we consider a property of an ensemble as a property of each individual element of the ensemble. In fact, this is the only way of how the logical union or intersection of two properties can be understood. For example, the logical union, , of properties A and B of system is the property, that has either property A or B.
In our theory, we shall use both notions, individual object and ensemble. The notion of system ensemble is defined as usual (see, e.g., [
21], p. 25): it is the thought set of all systems obtained through equivalent preparations.
Returning to objectivity of observables in quantum mechanics, the problem is that a registration of an eigenvalue
a of an observable
of a quantum system
by an apparatus
disturbs the microsystem and that the result of the registration is only created during the registration process. The result of the individual registration cannot thus be assumed to be an objective property of
before the registration. It can however be assumed, as we shall do, that it is an objective property of the composite
after the registration. This is the objectification requirement [
11].
It seems therefore that the objective properties of quantum systems, if there are any, cannot be directly related to individual registrations, as they can in classical theories. (Paradoxically, most of the prejudices that hinder construction of quantum models of classical theories originate in the same classical theories.) However, there are observable properties in quantum mechanics that are different from values of observables [
24]:
Basic Ontological Hypothesis of Quantum Mechanics A sufficient condition for a property to be objective is that its value is uniquely determined by a preparation according to the rules of standard quantum mechanics. The “value” is the value of the mathematical expression that describes the property and it may be more general than just a real number. To observe an objective property, many registrations of one or more observables are necessary.
In fact, the Hypothesis just states explicitly the meaning that is tacitly given to preparation by standard quantum mechanics. More discussion on the meaning of preparation is in [
19,
34]. In any case, prepared properties can be assumed to be possessed by the prepared system without either violating any rule of standard quantum mechanics or contradicting possible results of any registration performed on the prepared system. The relation of registrations to such objective properties is only indirect: an objective property entails limitations on values of observables that will be registered. In many cases, we shall use the Hypothesis as a heuristic principle: it will just help to find some specific properties and then it will be forgotten, that is, an independent assumption will be made that these properties can be objective and each of them will be further studied.
We shall divide objective properties into structural (see
Section 1.0.1.) and
dynamical and describe the dynamical ones mathematically in
Section 2.1.2.. Examples of dynamical properties are a state, the average value and the variance of an observable. We shall define so-called simple objective properties and show first, that there is enough simple objective properties to characterize quantum systems completely (at least from the standpoint of standard quantum mechanics) and second, that the logic of simple properties satisfies Boolean lattice rules. Thus, a reasonable definition of a real object in quantum mechanics can be given (see
Section 2.1.2.).
Often, the Hypothesis meets one of the following two questions. First, how can the Hypothesis be applied to cosmology, when there was nobody there at the Big Bang to perform any state preparation? Second, a state preparation is an action of some human subject; how can it result in an objective property? Both objections result from a too narrow view of preparations (see
Section 1.0.2.). Moreover, the second objection is not much more than a pun. It is not logically impossible that a human manipulation of a system results in an objective property of the system. For example, pushing a snooker ball imparts it a certain momentum and angular momentum that can then be assumed to be objective properties of the ball.
One may wonder how the average of an observable in a state can be objective while the individual registered value of the observable are not because an average seems to be defined by the individual values. However, the average is a property of a prepared state and is, therefore, defined also by the preparation. The results of a huge number of individual registrations must add to their predetermined average. This can be seen very well for averages with small variances. In our theory of classical properties (
Section 4), the explanation of classical realism will be based on the objectivity of averages because some important classical observables will be defined as (quantum) averages of (quantum) observables in a family of specific (quantum) states that will be called
classicality states.
Let us compare our Realist Model Approach with what is usually understood as the Realism of Classical Theories. This is the philosophy that extends some successful features of classical theories, especially Newtonian mechanics, to the whole real world. There are three aspects of the Realism of Classical Theories that are not included in our Realist Model Approach. First, classical physics is deterministic, assuming that every event has a cause, but quantum theory does not tell us the causes of some of its events. Second, each interaction of classical physics is local in the sense that the mutual influence of two interacting systems asymptotically vanishes if the systems are separated by increasing spatial distance (cluster separability). But, in addition to local interactions, quantum theory contains mutual influence that is independent of distance (entanglement, mutual influence between particles of the same type). Third, classical physics requires a causal explanation for every correlation. This can be rigorously expressed by Reichennbach’s condition of
common cause [
35]. The existence of the common cause for some quantum correlation is incompatible with experiments (for discussion, see [
32]).
Our Realist Model Approach just states which ontological hypotheses can reasonably be made under the assumption that quantum mechanics is valid. A few words have to be said on ontological hypotheses. As is well-known, the objective existence of anything cannot be proved (even that of the chair on which I am now sitting, see, e.g., Ref. [
12], where this old philosophical tenet is explained from the point of view of a physicist). Thus, all such statements are only hypotheses, called ontological.
It is clear, however, that a sufficiently specific ontological hypothesis may lead to contradictions with some observations. Exactly that happens if one tries to require objectivity of quantum observables. (More precisely, the existence in question is that of systems with sufficient number of properties defined by values of observables.) Moreover, hypotheses that do not lead to contradictions may be useful. For example, the objective existence of the chair nicely explains why we all agree on its properties. Similarly, the assumption that quantum systems possess certain objective properties will be useful for the quantum theory of classical properties or for a solution of the problem of quantum measurement. The usefulness of ontological hypotheses in the work of experimental physicists has been analysed by Giere [
28], p. 115. The hypothesis in question is the existence of certain protons and neutrons. It explains, and helps to perform, the production, the manipulations, the control and the observations of proton and neutron beams in an experiment at Indiana University Cyclotron Facility. From the point of view of van Fraassen, the ontological hypotheses of the kind used by this paper might perhaps be considered as a part of theoretical models: such a hypothesis may or may not be “empirically adequate”.
The position on ontological hypotheses taken here is, therefore, rather different from what has been called “metaphysical realism” by Hillary Putnam [
36]: “There is exactly one true and complete description of ’the way the world is’ ”.
The Realist Model Approach enables us to characterize the subject of quantum mechanics as follows:
Quantum mechanics studies objective properties of existing microscopic objects.
This can be contrasted with the usual cautious characterization of the subject, as e.g. [
21], p. 13:
...quantum theory is a set of rules allowing the computation of probabilities for the outcomes of tests [registrations] which follow specific preparations.
1.0.4. Probability and information
Let us return to Tonomura experiment. At each individual registration, a definite value of the observable is obtained. Quantum mechanics cannot predict which value it will be, but it can give the probability that the value will be obtained. This is a general situation for any registration. In this way, registrations introduce a specific statistical element into quantum mechanics.
A correct understanding of probability and information is an important part in the conceptual framework of the theory. The discussion whether probability describes objective properties that can be observed in nature or subjective states of the knowledge of some humans has raged since the invention of probability calculus by Jacob Bernoulli and Pierre-Simon Laplace [
37]. The cause of this eternal argument might be that the dispute cannot be decided: probability has both aspects, ontological and epistemic [
24].
Probability is a function of a proposition
A and its value,
, is a measure of the degree of certainty that
A is true. As a function on a Boolean lattice of propositions, it satisfies Cox’s axioms [
37]. Then it becomes a real additive measure on the lattice. Whether a proposition is true or false must be decided by observation, at least in principle. Hence, the probability always concerns objective events, at least indirectly.
As an example, consider the Tonomura experiment. The probability concerns the proposition that the value of observable is individually registered on an electron is . The value of the probability can be verified by studying an ensemble of such registrations. Indeed, if we perform a huge number of such registrations, we obtain an interference pattern that approximately reproduces the smooth probability distribution obtained by calculating the quantum mechanical model of an electron in Tonomura’s apparatus. It is a real interference pattern shown by the apparatus.
We have used the term “ensemble” with the meaning of a statistical ensemble of real events or objects. Here, the objectification requirement is involved. The value obtained in each individual registration is considered as a real property of the system consisting of the electron and the apparatus. Then, the probability concerns both the lack of knowledge of what objectively happens and properties of real systems. We emphasize that is not the probability that the prepared electron possesses value of observable but the probability that a registration will give such a value.
Another question is whether the individual outcomes are in principle predictable from some more detailed initial conditions (the so-called “hidden variables”) on the electron that we do not know. Quantum mechanics does not contain information on any such conditions. It does not deliver these predictions and it would even be incompatible with any ’deeper’ theory that did (see
Section 2.2.4.). We shall, therefore, assume that they are objectively unpredictable.
Let us describe the general framework that is necessary for any application of probability theory. First, there is a system, denote it by . Second, certain definite objective conditions are imposed on the system, e.g., it is prepared as in the example above and observable is registered. In general, there must always be an analogous set of conditions, let us denote it by . To each system subject to condition , possibilities in some range are open. These possibilities are described by a set of propositions that form a Boolean lattice . A probability distribution is a real additive measure on .
In quantum mechanics, is usually constructed with the help of some observable, say. If is a discrete observable, its value set Ω is at most countable, . Then the single-element sets , are atoms of that generate and the probability distribution , , can be calculated from by means of Cox’s axioms. The atoms are called outcomes. If is a continuous observable, then there are no atoms but continuous observables can be considered as idealizations of more realistic discrete observables with well-defined atoms. For example, in Tonomura’s experiment, observable is defined by the photo-multiplier cells in the micro-channel plate, and not only is discrete but also even has a finite number of values.
If condition
is reproducible or it obtains spontaneously sufficiently often (which is mostly the case in physics), anybody can test the value of the probabilities because probability theory enables us to calculate the frequencies of real events starting from any theoretical probability distribution and the frequencies are measurable [
37]. The probability distribution
is therefore an objective property of condition
on
. It is so even in cases when the outcomes can be in principle predicted if occurrence of more detailed conditions is observable and can in principle be known. More precisely, this would depend on whether condition
can be decomposed into other conditions
,
in the following sense. Condition
can be viewed as a logical statement ’
satisfies
’. Let
, where ∨ is the logical union (disjunction),
each be still recognizable and reproducible,
each outcome allowed by is uniquely determined by one of ’s.
It follows that
for all
, where ∧ is logical intersection (conjunction). Even in such a case, condition
itself leaves the system a definite amount of freedom that can be described in all detail by the probability distribution
and it is an objective, verifiable property of
alone. And, if we know only that condition
obtains and that a probability distribution is its objective property, then this probability distribution describes the state of our knowledge, independently of whether the conditions
do exist and we just do not know which of them obtains in each case or not. Examples of these two different situations are given by standard quantum mechanics, which denies the existence of
’s, and the Bohm–de Broglie pilot wave theory, which specifies such
’s.
Of course, there are also cases where some condition,
, say, occurs only once so that a measurement of frequencies is not possible. Then, no probability distribution associated with
can be verified so that our knowledge about
is even more incomplete. However, in some such cases, one can still give a rigorous sense to the question [
37], p. 343 (see also the end of this section): “What is the most probable probability distribution associated with
?” One can then base one’s bets on such a probability distribution. Such a probability distribution can be considered as an objective property of
and again, there is no contradiction between the objective and subjective aspects of probability.
In quantum mechanics, it is also possible to mix preparations in a random way. Suppose we have two preparations,
and
, and can mix them randomly by e.g. mixing the resulting particle beams in certain proportions
and
,
. Then, each particle in the resulting beam is either prepared by
, with probability
, or by
, with probability
. In this way, another kind of statistical element can be introduced into quantum theory. This element will be discussed in
Section 2.1.2..
Condition is defined by the two beams and their mixing so that the ensemble has, say, N particles. Then, can be decomposed in , where is the preparation by and , . We can know and because we know the intensity of the corresponding beams but it is unlikely that we also know whether a given element of the ensemble has been prepared by or .
An important role in probability theory is played by entropy. Entropy is a certain functional of that inherits both objective and subjective aspects from probability. Discussions similar to those about probability spoil the atmosphere about entropy. The existence of a subjective aspect of entropy—the lack of information—seduces people to ask confused questions such as: Can a change of our knowledge about a system change the system energy?
The general definition of entropy as a measure of missing information has been given by Shannon [
38] and its various applications to communication theory are, e.g., described in the book by Pierce [
39]. Our version is:
Definition 1 Let be a probability distribution and let be the atoms of . Then the entropy of is Let us return to the quantum-mechanical example in order to explain which information is concerned. After the choice of preparation and registration devices, we do not know what will be the outcome of a registration but we just know that any outcome
of the registration has probability
. After an individual registration, one particular outcome will be known with certainty. The amount of information gained by the registration is the value of
S given by Equation (
1). For more detail, see Ref. [
21]. Thus, the value of
S measures the lack of information before the registration.
The entropy and the so-called Maximum Entropy Principle (MEP) have become important notions of mathematical probability calculus, see, e.g., Ref. [
37]. (There is also a principle of statistical thermodynamics that carries the same name but ought not to be confused with the mathematical MEP.) The mathematical problem MEP solves can be generally characterized as follows. Let system
, condition
and lattice
with atoms
be given. Let there be more than one set of
’s that appears compatible with
. How the probabilities
are to be assigned so that condition
is properly accounted for without any additional bias? Such
’s yield the maximum of
S as given by (1). MEP clearly follows from the meaning of entropy as a measure of lack of information. We shall use this kind of MEP in
Section 4.2.
Part I
Corrected language of quantum mechanics
The origin of quantum mechanics can be traced back to the study of a few real systems: hydrogen atom and black-body radiation. The resulting successful models used some new concepts and methods that were readily generalized so that they formed a first version of a new theoretical language. This language was then used to construct models of some aspects of further real objects, such as atoms and molecules, solid bodies, etc., and this activity lead in turn to the refinement of the language.
This evolution does not seem to be finished. A large number of real objects, the so-called classical world, have as yet no satisfactory quantum models. Our own attempts [
15] and [
19] at constructing such models have lead to some changes in the quantum language. The first part of this review starting here is an attempt at a systematic formulation of this new language.
The general notions of a theoretical language are imported from some mathematical theory and satisfy the corresponding relations given by the axioms and theorems of the mathematical theory. They are rather abstract and by themselves, they do not possess any direct connection to real physical systems. However, as building blocks of various models that do possess such connections, some of them acquire physical meaning. Such a model-mediated physical interpretation can be postulated for most of the mathematical notions by basic assumptions that will be called rules to distinguish them from the axioms of the mathematical theory. What can be derived from these rules and axioms will be called propositions. We shall however formulate only most important theorems and propositions explicitly as such in order to keep the text smooth.
A state of a quantum system is determined by a preparation while a value of an observable is determined by a registration. The notions of preparation and registration are used in their empirical (see
Section 1.0.2.) meaning first, just to catch the model-mediated significance of mathematical notions, and the quantum mechanical models explaining relevant aspects of preparation and registration processes will be constructed later. The calculation of a state from classical conditions defined by the preparation needs a sophisticated model of the mature quantum mechanics. Similarly, to calculate an observable from classical properties of the registration device, a quantum model must be used. Only in
Section 4 and
Section 5 shall we be able to find the way from the empirical description of preparation and registration to a particular mathematical state or observable.
4. Quantum models of classical properties
There are many classical aspects of real objects that have been successfully modelled by quantum mechanics, such as electrical conductivity or specific heats. These are typically phenomena that occur in systems with very many degrees of freedom so that statistical methods can be used. The statistical methods were invented already before quantum mechanics was born and introduced some elements that could be understood only later by quantum mechanics. For example, the microcanonical or canonical ensemble is, in fact, methods of preparation of thermodynamic systems. Or, theoretical results are given in the form of averages and variances. The modern condensed matter theory around room temperature can, therefore, be included into our theory of classical properties without much change. We would just utilize the objectivity of averages and variances in our interpretation of quantum mechanical results.
However, the Galilean invariance of quantum theory leads to separation of the overall motion from all other degrees of freedom. The motion of mass centre and of the total angular momentum with respect to the mass centre comprises only six degrees of freedom that do not seem to allow statistical methods. Exactly this kind of motion is studied by Newtonian mechanics. Thus, the situation is that there are quantum models of classical thermodynamic properties but none of mechanical properties that would be really satisfactory.
Quantum modelling of non-thermodynamic properties of classical systems encounters two main problems. First, a key feature of Newtonian mechanics (and any other classical theory as well) is that each system objectively has a sharp trajectory. Any fuzziness is just due to incomplete knowledge. In particular, the state of a Newtonian system is described by a point of its phase space, and the system is always in a definite state, i.e., it cannot be at two points of the phase space simultaneously (see also the discussion at the end of
Section 2.2.2.). Second, the system is robust so that measurements can be done on it without changing its properties. For example, the state of a system can be determined or confirmed by a suitable set of measurements on the system.
Thus, any quantum model of a classical system must satisfy the first two conditions of what Leggett has called Principle of Macroscopic Realism [
10]:
A macroscopic system that has available to it two or more distinct macroscopic states is at any given time in a definite one of those states.
It is possible in principle to determine which of these states the system is in without any effect on the state itself or on the subsequent system dynamics.
In trying to model the sharpness of classical states and trajectories, one may be mislead to overestimate the importance of quantum states of minimum uncertainty, which is of coherent states. However, such states are always extremal states, which can be linearly superposed, and quantum mechanics requires that linear superpositions of available states are also an available state. Moreover, measurements of the classical parameters of a coherent state necessarily disturb the value of the parameters.
To solve this problem, one could e.g. assume that some as yet unknown phenomena exist at the macroscopic level that are not compatible with standard quantum mechanics. For example, they may prevent linear superpositions (see, e.g., [
10] and the references therein). However, no such phenomena have been observed.
Another strategy is to assume that the macroscopic realism is only apparent in the sense that there
are linear superpositions of macroscopic states but the corresponding interference phenomena are difficult or impossible to observe. For example, the quantum decoherence theory [
9,
80] works only if certain observables concerning both the environment and the quantum system cannot be measured (see the analysis in [
12,
48]). Another example is the theories based on coarse-grained operators [
21,
81,
82] being measurable but fine grained being not. The third example is the Coleman–Hepp theory [
52,
53,
54,
55,
56] and its modifications [
46,
57,
58]: they are based on some particular theorems that hold only for infinite systems (see the analysis in [
53]) or for asymptotic regions [
58].
However, if we turn from theory to experiment, we may notice that any well-founded scientific observation of classical properties always has a statistical form. A measurement or observation is only viewed as well understood if it is given as an average with a variance. This fact does not by itself contradict the sharp character of the corresponding theory. The usual excuse is that the observation methods are beset with inaccuracy but that improvement of techniques can lead to better and better results approaching the “objective sharp” values arbitrarily closely. In any case, however, the measured classical parameters of real objects are much fuzzier than the minimal quantum uncertainty requires.
Moreover, that popular excuse is clearly incompatible with the assumption that the classical world is only an aspect of a deeper quantum world and that each classical model is nothing but a kind of incomplete description of the underlying quantum system. If we assume such universality of quantum theory, then the statistical character of classical observational results must not only be due to inaccuracy of observational methods but also to genuine uncertainty of quantum origin. This point of view is due to Exner [
16], p. 669, and Born [
17] and will be adopted here as a starting point of our theory of classical properties.
We can formulate this idea in terms of the Realist Model Approach as follows. The language part of classical theories contains the notions of sharp state and trajectory. These are idealized notions that do not possess any counterpart in the real world, but they are useful for model construction.
In this section, we first formulate some general hypotheses that can be applied to both thermodynamic and mechanical properties, introducing thus a unified theory of classical properties: they turn out to be selected objective properties of high-entropy quantum states of macroscopic systems. Next, we show in detail how these ideas are to be applied to Newtonian mechanics, introducing states called ME packets. Then, we construct a quantum model of a classical rigid body. Finally, we modify the well-known model of a simultaneous measurement of position and momentum of a Gaussian wave packet to that of position and momentum of a ME packet.
Thus, our project to construct quantum models of observed classical systems seems to work nicely. What remains open is the question of what is the origin of all the high-entropy states that are observed in such a great abundance around us.
4.1. Modified correspondence principle
The Born–Exner assumption has quite radical consequences, which is only seldom realized. First, the exactly sharp states and trajectories of classical theories are not objective. They do not exist in reality but are only idealizations. What really exist are fuzzy states and trajectories. The objectivity of fuzzy states of classical models is a difficult point to accept and understand. Let us explain it in more detail.
In quantum mechanics, the basis of objectivity of dynamical properties is the objectivity of the conditions that define preparation procedures. In other words, if a property is uniquely determined by a preparation, then it is an objective property. If we look closely, one hindrance to try the same idea in classical theories is the custom always to speak about initial data instead of preparations. An initial datum can be and mostly is a sharp state. The question on how an exactly sharp state can come into being is ignored. This in turn seems justified by the hypothesis that sharp states are objective, that is, they just exist by themselves.
To come away from this self-deception, we accept that preparation procedures play the same basic role in classical physics as in quantum physics. Then, the nature and form of necessary preparation procedures must be specified and the corresponding states described. In this way, the Exner–Born idea leads to a rather radical change of interpretation of classical theories and this will enable us to construct quantum models of all classical aspects of real objects.
An obvious starting point of such constructions is that all classical systems are also quantum systems. Let us now make this more precise. Consider a real physical object (so to speak, independent of any theory). If there is an adequate classical model of some aspects of this system, the system is called classical. Adequate means that some properties of approximately represent important properties of the real system. In addition, the same real object must also be understood in terms of a quantum model . could be richer than so that classical properties of can be identified with some quantum properties of . In particular, these quantum properties ought to be objective. This follows from the fact that all classical properties can be assumed to be objective without any danger of contradictions.
The construction of quantum model
consists of the following points. (1) The composition of
must be defined. (2) The observables that can be measured on
are to be determined. On the one hand, this is a non-trivial problem because there are relatively strong restrictions on what observables of macroscopic systems can be measured (see
Section 3.2.2.). On the other hand, as any quantum observable is measurable only by a classical apparatus, the existence of such apparatuses is tacitly assumed from the very beginning. Quantum model
will thus always depend on some classical elements. This does not mean that classicality has been smuggled in because, in our approach, classical properties are specific quantum ones. (3) A Hamiltonian operator of the system must be set up. (4) Suitable quantum states must be chosen. Finally, the known classical properties of
must be listed and each derived as an objective property of
from the four sets of assumptions above. This is a self-consistent framework for a non-trivial problem.
It follows that there must be some at least approximate relation between classical observables of and quantum observables of as well as between the classical states of and the quantum states of . The following model assumption on such a relation might be viewed as a version of Correspondence Principle, let us call it Modified Correspondence Principle.
Assumption 1- 1.
The state of classical model in a given classical theory is described by a set of n numbers that represent values of some classical observables. The set is not uniquely determined. Let us call any such set state coordinates.
- 2.
We assume that state coordinates can be chosen so that there is a subset of sharp observables of quantum model and a state of such that - 3.
All such states form a subset of . Some of these states satisfy the condition that all properties of can be (at least approximately) obtained from if it is in such states. They are called classicality states of quantum system .
Clearly, Modified Correspondence Principle does not need the assumption that all observables from
commute with each other. Then, even if they themselves are not jointly measurable, their fuzzy values can be (see
Section 4.7). Further, it does not follow that each classical property is an average of a quantum operator. That would be false. We assume only that the classical state coordinates
can be chosen in such a way.
It is important to realize that Modified Correspondence Principle suggests how Principle of Macroscopic Realism is to be understood. For example, macroscopic systems also have extremal states that satisfy Equation (
70). These seem to be macroscopic states available to the system. However, extremal states are readily linearly superposed and any quantum registration that ought to find the parameters of a coherent state (a generalized measurement: positive operator valued measure) would strongly change the state (for a general argument, see Ref. [
11], p. 32). We assume that the validity of Principle of Macroscopic Realism can be achieved if the words “distinct macroscopic states” are replaced with “distinct classicality states”. Let us try to motivate a proposal of what such classicality states might be.
An interesting subset of classical properties of macroscopic system is the thermodynamic ones. They are important for us because quantum models of these properties are available. Existing models based on statistical physics need one non-trivial assumption: the states of sufficiently small macroscopic systems that we observe around us are approximately states of maximum entropy. As it has been discussed in
Section 1.0.4., entropy is an objective property of quantum systems because it is defined by their preparation. Thus, the validity of thermodynamics depends on the preparation conditions, or the origin, of observed macroscopic systems. The averages and variances that result from the models based on the maximum-entropy assumption agree with observations. In particular, they explain why classical states and properties are relatively sharp. Moreover, high entropy states are very far from extremal and linear superposition does not make any sense for them.
The physical foundations of thermodynamics are not yet well-understood but there are many ideas around about the origin of high-entropy states. Their existence might follow partially from logic (Bayesian approach, [
37]) and partially from quantum mechanics (thermodynamic limit, [
83], Vol. 4). Some very interesting models of how maximum entropy quantum states come into being are based on entanglement [
84,
85,
86,
87]). However, just in order to construct quantum models of classical properties, they can be used as one of the main assumptions without really understanding their origin.
We generalize the statistical methods as follows [
18].
Assumption 2 All classicality states are states of high entropy.
Assumption 2 is a heuristic one and it is therefore formulated a little vaguely. It will be made clearer after some examples of its use will be studied in this section. But some brief discussion can be given already now.
Consider first states of macroscopic systems that are at or near absolute zero of temperature. These are approximately or exactly extremal and maximize entropy at the same time. Thus, they are not classicality states but the entropy, though maximal, is not high, either. Second, consider states of macroscopic systems at room temperature that are not at their thermodynamic equilibrium but are close to it. There are many such states, and they and the systems can be described by classical physics to a good approximation. They are not in maximum- but in high-entropy states.
4.2. Maximum entropy assumption in classical mechanics
In this subsection, we follow loosely Ref. [
18]. As explained at the start of this section, the basic notions of the language part of Newtonian mechanics are that of a sharp state—a point of the phase space—and of a sharp trajectory—a curve in the phase space of an isolated system. We accept this language without assuming that the sharp trajectories have any real counterpart in the world because this does not prevent us from building fuzzy models that have a more direct relation to reality.
However, most physicists take the existence of sharp trajectories seriously and try to obtain them from quantum mechanics as exactly as possible. Hence, they focus on quantum states the phase-space picture of which is as sharp as possible. Those are states with minimum uncertainty allowed by quantum mechanics. For one degree of freedom, described by coordinate
and momentum
, the uncertainty is given by the quantity
where
denotes the variance of quantity
, as defined by Equation (
16).
The states with minimum uncertainty
are, however, very special extremal states. Such states do exist for macroscopic systems but are very difficult to prepare, unlike the usual states of macroscopic systems described by classical mechanics. As we explained in
Section 2.1.2., they have a number of properties that are very strange from the point of view of classical theories and they are therefore not what we have called classicality states.
We feel that there is no point in attempts to derive the language part of Newtonian mechanics from that of quantum mechanics. Instead, we propose that the classical limit is to be considered at the level of models. That is, properties of successful Newtonian models are to be obtained from some quantum models under suitable conditions. We assume further that a good model of Newtonian mechanics is necessarily fuzzy and that the fuzziness is determined by the preparation of the system similarly as in quantum mechanics. Let us give some examples.
Consider a gun in a position that is fixed in a reproducible way and that shoots bullets using cartridges of a given provenance. All shots made under these conditions form an ensemble with average trajectory and the trajectory variance that describes objective properties of the ensemble. The Newtonian model of this ensemble is the evolution of a suitable distribution function on the phase space. According to Newtonian mechanics, each individual shot has a sharp trajectory . Each individual shot is also an element of the ensemble and this is a property of the individual that can be considered also as objective, even in Newtonian mechanics.
The existence of this fuzzy property of an individual shot does not contradict the fact that some more precise observations (optical, say) of this one shot can give a different fuzzy structure. Indeed, such an optical measurement method ought to have been studied on other ensembles and already well established itself, which will allow to estimate its error (variance) and hence to understand the result of the measurement as saying that a given, fixed trajectory is an element of a thought ensemble with an average
and variance
, where
and similarly for the momentum part. Still,
must be much larger than the minimum quantum uncertainty
.
The simplest way to construct a fuzzy model is to fix initial averages and variances of coordinates and momenta, , , , , and leave everything else as fuzzy as possible. To calculate the corresponding probability distributions in classical mechanics and the state operators in quantum mechanics, we shall, therefore, apply the maximum entropy principle. The resulting states will be called maximum-entropy packets, abbreviated as ME packets. The averages of coordinates and momenta take over the role of coordinate and momenta in classical mechanics. In any case, these averages represent measurable aspects of these variables. Quantities , , , will also play the role of classical state coordinates defined by Assumption 1. To limit ourselves just to given averages and variances of coordinates and momenta is a great simplification that enables us to obtain interesting results easily. One can imagine, however, more complicated models, where further moments are fixed, or moments of different observables (e.g., mass centre, total momentum, angles and total angular momentum) are fixed.
The variances are not assumed small. How large they are depends on the accuracy of a preparation or of a measurement, as the gun example shows.
In fact, the dynamical evolution of variances is an important indicator of the applicability of the model one is working with. It determines the time intervals within which reasonable predictions are possible. Consider a three-body system that is to model the Sun, Earth and Jupiter in Newtonian mechanics. It turns out that generic trajectories starting as near to each other as, say, the dimension of the irregularities of the Earth surface will diverge from each other by dimensions of the Earth–Sun distance after the time of only about years. This seems to contradict the years of relatively stable Earth motion around the Sun that is born out by observations. The only way out is the existence of a few special trajectories that are much more stable than the generic ones and the fact that bodies following an unstable trajectory have long ago fallen into the Sun or have been ejected from the solar system. By the way, this spontaneous evolution can be considered as a preparation procedure of solar system.
4.3. Classical ME packets
Let us first consider a system
with one degree of freedom and then generalize it to any number of degrees. Let the coordinate be
q and the momentum
p. A state is a distribution function
on the phase space spanned by
q and
p. The function
is dimensionless and normalized by
where
v is an auxiliary phase-space volume to make
ρ dimensionless. The entropy of
can be defined by
The value of entropy will depend on
v but most other results will not. Classical mechanics does not offer any idea of how to fix
v. We shall get its value from quantum mechanics.
4.3.1. Definition and properties
Definition 28 ME packet is the distribution function ρ that maximizes the entropy subject to the conditions:andwhere Q, P, and are given values. We have used the abbreviation
The explicit form of
ρ can be found using the partition-function method as described in Ref. [
37]. The variational principle yields
where
and
,
,
and
are the four Lagrange multipliers corresponding to the four conditions (72) and (73). Hence, the partition function for classical ME packet is
The expressions for
,
,
and
in terms of
Q,
P,
and
can be obtained by solving the equations
and
The result is:
and
Substituting this into Equation (
74), we obtain the distribution function of a one-dimensional ME packet. The generalization to any number of dimensions is:
Theorem 13 The distribution function of the ME packet for a system with given averages and variances , of coordinates and , of momenta, is We observe that all averages obtained from
ρ are independent of
v and that the right-hand side of equation (
78) is a Gaussian distribution in agreement with Jaynes’ conjecture that the maximum entropy principle gives the Gaussian distribution if the only conditions are fixed values of the first two moments.
As
and
approach zero,
ρ becomes a
δ-function and the state becomes sharp. For some quantities this limit is sensible, for others it is not. In particular, the entropy, which can easily be calculated,
diverges to
. This is due to a general difficulty in giving a definition of entropy for a continuous system that would be satisfactory in every respect. What one could do is to divide the phase space into cells of volume
v so that
could not be chosen smaller than
v. Then, the limit
of entropy would make more sense.
The average of any monomial of the form
can be calculated with the help of partition-function method as follows:
where
,
Z is given by Equation (
75) and the values (76) and (77) must be substituted for the Lagrange multipliers after the derivatives are taken.
Observe that this enables to calculate the average of a monomial in several different ways. Each of these ways, however, leads to the same result due the identities
which are satisfied by the partition function.
Assumption 3 ME packet Equation (78) is a part of a satisfactory model for many systems in Newtonian mechanics. 4.3.2. Classical equations of motion
Let us assume that the Hamiltonian of
has the form
where
μ is the mass and
the potential function. The equations of motion are
Inserting (80) for
H, we obtain
The general solution to these equations can be written in the form
where
q and
p being arbitrary initial values. This implies for the time dependence of the averages and variances, if the initial state is an ME packet:
and
In general,
and
will depend not only on initial
Q and
P, but also on
and
.
Let us consider the special case of at most quadratic potential:
where
are constants with suitable dimensions. If
, we have a free particle, if
, it is a particle in a homogeneous force field and if
, it is a harmonic or anti-harmonic oscillator.
In this case, general solution (82) has the form
where
and
. If
, the functions are
where
Only for
, the functions remain bounded. If
, we obtain
The resulting time dependence of averages and variances resulting from Equations (
82), (
72) and (
73) are [
18]
and
For the last term, we have from Equation (
79)
Using Equations (
75), (
76) and (
77), we obtain from Equation (
94)
Similarly,
We observe that, if functions
,
,
and
remain bounded, the variances also remain bounded and the predictions are possible in arbitrary long intervals of time. Otherwise, there will always be only limited time intervals in which the theory can make predictions.
In the case of general potential, the functions (82) can be expanded in products of powers of
q and
p, and the averages of these products will contain powers of the variances. However, as one easily sees form formula (79) and (75),
where
X and
Y are bounded functions. It follows that the dynamical equations for averages coincide, in the limit
, with the exact dynamical equations for
q and
p. It is an idealization that we consider as not realistic, even in principle, but it may still be useful for calculations.
Let us expand a general potential function in powers of
q,
where
are constants of appropriate dimensions. The Hamilton equations can be used to calculate all time derivatives at
. First, we have
This equation can be used to calculate all derivatives of
q in terms of those of
p:
A simple iterative procedure gives:
and
where
is the rest term that is due to all powers in (98) that are not smaller than
k (the rests symbolize different expressions in different equations). The purpose of having time derivatives up to the fourth order is to see better the difference to quantum corrections that will be calculated in
Section 4.4.3..
Taking the average of both sides of Equations (
100)–(
103), and using Equations (
79), (
75)–(
77), we obtain
and
We can see, that the limit
in Equations (
104)–(
107) lead to equations that coincide with Equations (
100)–(
103) if
as promised.
4.4. Quantum ME packets
Let us now turn to quantum mechanics and try to solve an analogous problem.
Definition 29 Let the quantum model of system has spin 0, position and momentum . State that maximizes von Neumann entropy (see Section 2.2.2.)under the conditionswhere Q, P, and are given numbers, is called quantum ME packet. 4.4.1. Calculation of the state operator
To solve the mathematical problem, we use the method of Lagrange multipliers as in the classical case. Thus, the following equation results:
The differentials of the terms that are linear in
ρ are simple to calculate:
Although not all elements of the matrix
are independent (it is a Hermitian matrix), we can proceed as if they were because the matrix
is to be also Hermitian. The only problem is to calculate
. We have the following
Proof Let
be a unitary matrix that diagonalizes
,
where
is a diagonal matrix with elements
. Then
. Correction to
if
can be calculated by the first-order formula of the stationary perturbation theory. This theory is usually applied to Hamiltonians but it holds for any perturbed Hermitian operator. Moreover, the formula is exact for infinitesimal perturbations. Thus,
In this way, we obtain
QED.
With the help of Lemma 1, Equation (
111) becomes
so that we have
The first two terms in the exponent determine the normalization constant
because they commute with the rest of the exponent and are independent of the dynamical variables. Taking the trace of Equation (
113), we obtain
where
Z is the partition function
Thus, the state operator has the form
At this stage, the quantum theory begins to differ from the classical one. It turns out that, for the case of non-commuting operators in the exponent of the partition function, formula (79) is not valid in general. We can only show that it holds for the first derivatives. To this aim, we prove the following
Lemma 2 Let and be Hermitian matrices. Then Proof We express the exponential function as a series and then use the invariance of trace with respect to any cyclic permutation of its argument.
QED.
The proof of Lemma 2 shows why formula (79) is not valid for higher derivatives than the first in the quantum case: the operator
B does not commute with
and cannot be shifted from its position to the first position in product
For the first derivative, it can be brought there by a suitable cyclic permutation. However, each commutator
is proportional to ℏ. Hence, formula (79) with higher derivatives is the leading term in the expansion of averages in powers of ℏ.
Together with Equation (
114), Lemma 2 implies the formulae:
and
The values of the multipliers can be calculated from Equations (
117) and (
118), if the form of the partition function is known.
Variational methods can find locally extremal values that are not necessarily maxima. We can however prove that our state operator maximizes entropy. The proof is based on the generalized Gibbs’ inequality,
for all pairs
of state operators (for proof of the inequality, see [
21], p. 264). The proof of maximality is then analogous to the “classical” proof (see, e.g., [
37], p. 357). The first proof of maximality in the quantum case was given by von Neumann [
77].
The state operator (115) can be inserted in the formula (108) to give the value of the maximum entropy,
This, together with Equations (
117) and (
118), can be considered as the Legendre transformation from the function
to the function
.
4.4.2. Diagonal representation
The exponent in Equation (
115) can be written in the form
where
This is an operator acting on the Hilbert space of our system.
has the form of the Hamiltonian of a harmonic oscillator with the coordinate
and momentum
that satisfy the commutation relation
. The oscillator has mass
M and frequency Ω,
(The operator
must not be confused with the Hamiltonian
of our system, which can be arbitrary.) The normalized eigenstates
of the operator form a basis in the Hilbert space of our system defining the so-called
diagonal representation and its eigenvalues are
. As usual, we introduce operator
such that
To calculate
Z in the diagonal representation is easy:
Hence, the partition function for the quantum ME-packets is
Now, we can express the Lagrange multipliers in terms of the averages and variances. Equations (
117) and (
118) yield
and
where
ν (
71).
From Equations (
119), (
130) and (
131), we obtain the entropy:
Thus,
S depends on
Q,
P,
,
only via
ν. We have
so that
S is an increasing function of
ν. Near
,
Asymptotically (
),
In the classical region,
,
.
It is clear that the choice of Q and P cannot influence the entropy. The independence of
S from
Q and
P does not contradict the Legendre transformation properties. Indeed, usually, one would have
but here
which is zero.
The resulting state operator, generalized to n degrees of freedom, is described by the following
Theorem 14 The state operator of the ME packet of a system with given averages and variances , of coordinates and , of momenta, iswhereand Strictly speaking, the state operator (133) is not a Gaussian distribution. Thus, it seems to be either a counterexample to, or a generalization of, Jaynes’ hypothesis.
Assumption 4 The quantum model corresponding to the classical model described by Assumption 3 is the ME packet (133).
Let us study the properties of quantum ME packets. In the diagonal representation, we have for
:
We easily obtain for
that
Hence,
and the state
becomes
. In general, states
depend on
ν. The state vector
expressed as a function of
Q,
P,
and
ν is given, for any
ν, by
This is a Gaussian wave packet that corresponds to other values of variances than the original ME packet but has the minimum uncertainty. For
, it remains regular and the projection
becomes the state operator of the original ME packet. Hence, Gaussian wave packets are special cases of quantum ME packets.
The diagonal representation offers a method for calculating averages of coordinates and momenta products that replaces the partition function way. Let us denote such a product X. We have
To calculate
, we use Equations (
124), (
125), (
122), (
123), (
130) and (
131) to obtain
By substituting these relations into
and using the commutation relations
, we obtain
where
and where, in each monomial of the polynomial
, the number of
-factors is different from the number of
-factors. Thus,
In Equation (
139), there are, therefore, sums
With Equation (
137), this becomes
where
We easily obtain
The desired average value is then given by
The calculation of the polynomial
for a given
and the evaluation of the right-hand side of Equation (
140) are the two steps of the promised method.
4.4.3. Quantum equations of motion
Let the Hamiltonian of
be
and the unitary evolution group be
. The dynamics in the Schrödinger picture leads to the time dependence of
:
Substituting for
from Equation (
133) and using a well-known property of exponential functions, we obtain
In the Heisenberg picture,
remains constant, while
and
are time dependent and satisfy the equations
They are solved by
where
and
are the initial operators,
and
. The resulting operators can be written in the form of operator functions analogous to classical expressions (82) so that Equations (
84) and (
85) can again be used.
The example with potential function (86) is solvable in quantum theory, too, and we can use it for comparison with the classical dynamics as well as for a better understanding of the ME packet dynamics. Equation (
142) has then the solutions given by (87) and (88) with functions
and
given by (89) and (90) or (91) and (92). The calculation of the averages and variances is analogous to the classical one and we obtain Equations (
93) and (
95) again with the difference that the term
on the right hand side of (94) is now replaced by
.
To calculate
, we use the method introduced in the previous section. We have
hence,
, and
The result is again Equation (
95). Similarly for
, the results are given by Equations (
96) and (
97).
We have shown that the averages and variances of quantum ME packets have exactly the same time evolution as those of classical ME packets in the special case of at-most-quadratic potentials. From Equations (
95) and (
97) we can also see an interesting fact. On the one hand, both variances must increase near
. On the other hand, the entropy must stay constant because the evolution of the quantum state is unitary. As the relation between entropy and
ν is fixed for ME packets, the ME packet form is not preserved by the evolution (the entropy ceases to be maximal). This is similar for Gaussian-packet form or for coherent-state form.
For general potentials, there are two types of corrections to the dynamics of the averages: terms containing the variances and terms containing ℏ. To obtain these corrections, let us calculate time derivatives for the quantum analogue of Hamiltonian (80) with potential (98). The Heisenberg-picture equations of motion give again
so that Equation (
99) is valid. The other equation,
can be applied iteratively as in the classical case so that all time derivatives of
can be obtained. Thus,
and
This differs from the classical equation only by factor ordering. We can use the commutator
to simplify the last term,
Similarly,
and
Next, we calculate quantum averages with the help of Equation (
140). The quantum averages of the monomials that are linear in one of the variables
or
can differ from their classical counterparts only by terms that are of the first order in
and purely imaginary. For example,
or
These corrections clearly cancel for all symmetric factor orderings. The first term in which a second-order correction occurs is
and we obtain for it:
The Equations (
143)–(
146) do not contain any such terms and so their averages coincide exactly with the classical Equations (
104)–(
107). The terms
with different factor orderings occur in the fifth time derivative of
and have the form
The average of the resulting term in the fifth time derivative of
is
If we express ℏ as
, we can write the last two terms in the parentheses as
A similar term appears in the third time derivative of
, if we allow
in the expansion (98):
which contributes to
by
Again, the correction is of the second order in
.
We can conclude. The quantum equations begin to differ from the classical ones only for the higher order terms in
V or in the higher time derivatives and the correction is of the second order in
. This seems to be very satisfactory: our quantum model reproduces the classical dynamic very well. Moreover, Equation (
138) shows that Gaussian wave packets are special cases of ME packets with
. Thus, they approximate classical trajectories less accurately than ME packets with large
ν. Of course, these results have as yet been shown only for the first four time derivatives. It would be nice if a general theorem could be proved.
4.5. Classical limit
Let us now look to see if our equations give some support to the statement that is the classical regime.
The quantum partition function (129) differs from its classical counterpart (75) by the denominator
. If
we can write
The leading term in the partition function then is
where
. Comparing this with Equation (
75) shows that the two expressions are identical if we set
We can interpret this by saying that quantum mechanics gives us the value of
v. Next, we have to express condition (147) in terms of the averages and variances. Equations (
130) and (
131) imply
Hence, condition (147) is equivalent to
The result can be formulated as follows. Classical mechanics allows not only sharp, but also fuzzy trajectories and the comparison of some classical and quantum fuzzy trajectories shows a very good match. The fuzzy states chosen here are the so-called ME packets. Their fuzziness is described by the quantity
. The entropy of an ME packet depends only on
ν and is an increasing function of it. The time evolution of classical and quantum ME packets with the same initial values of averages and variances defines the averages as time functions. The larger
ν is, the better the quantum and the classical evolutions of average values have been shown to agree for the first four terms in the expansion in powers of time. Thus, the classical regime is neither
(absolutely sharp trajectory) nor
(minimum quantum uncertainty). This is the most important result of Ref. [
18]. The time functions coincide for the two theories in the limit
. Hence, in our approach, this is the classical limit. This is just the opposite to the usual assumption that the classical limit must yield the variances as small as possible. Of course,
ν can be very large and still compatible with classically negligible variances.
One also often requires that commutators of observables vanish in classical limit. This is however only motivated by the assumption that all basic quantum properties are single values of observables. Within our interpretation, this assumption is replaced by the following claim: If classical observables are related to quantum operators then only in such a way that they are average values of the operators in classicality states. Then, first, all such averages are defined by a preparation and do exist simultaneously, independently of whether the operators commute or not. For example,
Q and
P are such simultaneously existing variables for ME packets. Second, a joint measurement of fuzzy values of non-commuting observables is possible. This will be explained in
Section 4.7.
It might be helpful to emphasize that construction of models of Newtonian mechanics and the so-called semi-classical or WKB approximation to quantum mechanics are two different things. Indeed, the semi-classical approximation is a mathematical method, usually defined as the expansion in powers of
h in some quantum expressions [
21], to calculate approximately correct values of quantum expressions in suitable applications. Equations resulting from
may be similar to the corresponding classical equations. In fact, limit
also results from
if the variances are kept constant. The suitable applications can be more general than the above construction of models in that they, e.g., do not necessarily concern fuzzy trajectories and macroscopic systems.
4.6. A model of classical rigid body
To show how the above theory of classical properties works, we construct a one dimensional model of a free solid body. The restrictions to one dimension and absence of external forces enable us to calculate everything explicitly—the model is completely solvable. The real object is a thin solid rod of mass
M and length
L. Its classical model
is a one-dimensional continuum of the same mass and length, with mass density
, internal energy
E, centre of mass
X and total momentum
P. The classical state coordinates (see
Section 4.1) are
M,
L,
X,
P and
E.
The construction of its quantum model
entails that, first, the structural properties of the system must be defined, second, some assumptions on the state of the system must be done; third, the quantum objective properties must be found that correspond to the classical properties
M,
L,
E,
X and
P. Large parts of this section follow reference [
15].
4.6.1. Composition, Hamiltonian and spectrum
Assumption 5 is an isolated linear chain of N identical particles of mass μ distributed along the x-axis with the quantum Hamiltonianinvolving only nearest-neighbor elastic forces. Here operator is the position, operator the momentum of the n-th particle, κ the oscillator strength and ξ the equilibrium interparticle distance. The parameters
N,
μ,
κ and
ξ are structural properties (determining the Hamiltonian of a closed system, see
Section 2.3.2.).
This kind of chain seems to be different from most chains that are studied in literature: the positions of the chain particles are dynamical variables so that the chain can move as a whole and the invariance with respect to Galilean group is achieved. However, the chain can still be solved by methods that are described in [
88].
First, we find the variables
and
that diagonalize the Hamiltonian and thus define the so-called normal modes. The transformation is
and
where the mode index
m runs through
and
is an orthogonal matrix; for even
m,
while for odd
m,
and the normalization factors are given by
To show that
and
do represent normal modes, we substitute Equations (
150) and (
151) into (149) and obtain, after some calculation,
which is indeed diagonal. The mode frequencies are
Consider the terms with
. We have
, and
. Hence,
so that
where
is the centre-of-mass coordinate of the chain and
is its total momentum. The “zero” terms in the Hamiltonian then reduce to
with
. Thus, the “zero mode” describes a straight, uniform motion of the chain as a whole. The fact that the centre of mass degrees of freedom decouple from other (internal) ones is a consequence of Galilean invariance.
The other modes are harmonic oscillators called “phonons” with eigenfrequencies
,
. The energy of the phonons,
is the internal energy of our system and its spectrum is built from the mode frequencies by the formula
where
is an
-tuple of non-negative integers—phonon occupation numbers.
Let us define the operator describing the mass by
and the length of the body by
We assume that the second term in the expression for the mass can be safely neglected in the non-relativistic regime in which we are working. The length can be expressed in terms of modes
using Equation (
150),
The differences on the right-hand side are non-zero only for odd values of
m, and equal then to
. We easily find, using Equations (
153) and (
154):
4.6.2. Maximum-entropy assumption
The next point is the choice of classicality states. We write the Hilbert space of
as
where
is constructed from the wave functions
(see
Section 2.3.1.) and
has the phonon eigenstates as a basis.
Assumption 6 The classicality states have the formInternal state maximizes the entropy under the condition of fixed average of the internal energy,The external state is the ME packet for given averages , , and . Let us first focus on
. It is the state of thermodynamic equilibrium or the Gibbs state, which we denote by
(see, e.g., [
37]).
The maximum of entropy does not represent an additional condition but rather the absence of any, see
Section 1.0.4.. This is, of course, also a condition, and its validity in overwhelming number of real cases is an interesting problem. For
, it must have to do with the preparation (not by physicists but by nature). Physically, the thermodynamic equilibrium can settle down spontaneously starting from an arbitrary state only if some weak but non-zero interaction exists both between the phonons and between the rod and the environment. We assume that this can be arranged so that the interaction can be neglected in the calculations of the present section.
The internal energy has itself a very small relative variance in the Gibbs state if N is large. This explains why it appears to be sharp. All other classical internal properties will turn out to be functions of the classical internal energy. Hence, for the internal degrees of freedom, E forms itself a complete set of state coordinates introduced in Assumption 1. The properties of internal energy are well known and we shall not repeat the calculations here.
4.6.3. The length of the body
The mathematics associated with the maximum entropy principle is variational calculus. The condition of fixed average energy is included with the help of Lagrange multiplier denoted by λ. It becomes a function for the resulting state. As it is well known, has to do with temperature.
The phonons of one species are excitation levels of a harmonic oscillator, so we have
where
is the annihilation operator for the
m-th species. The diagonal matrix elements between the energy eigenstates
that we shall need then are
For our system, the phonons of each species form statistically independent subsystems, hence the average of an operator concerning only one species in the Gibbs state
of the total system equals the average in the Gibbs state for the one species. Such a Gibbs state operator for the
m-th species has the form
where
and
is the partition function for the
m-th species
The average length is obtained using Equation (
160),
It is a function of objective properties
N,
ξ and
E.
Equation (
160) is an important result. It shows that contributions to the length are more or less evenly distributed over all odd modes. Such a distribution leads to a very small variance of
in Gibbs states. A lengthy calculation [
15] gives for large
NThus, the small relative variance for large N does not need to be assumed from the start. The only assumptions are values of some structural properties and that an average value of energy is fixed. We have obtained even more information, viz. the internal-energy dependence of the length (in this model, the dependence is trivial). This is an objective relation that can be in principle tested by measurements.
Similar results can be obtained for further thermodynamic properties such as specific heat, elasticity coefficient, etc. (If we extend the classical model so that it contains the elasticity coefficient, we could calculate the coefficient for an extended quantum model, in which the rod would be placed into a non-homogeneous “gravitational” field described by, say, a quadratic potential. This would again give a solvable model.) All these quantities are well known to have small variances in Gibbs states. The reason is that the contributions to these quantities are evenly distributed over the normal modes and the modes are mechanically and statistically independent.
4.6.4. The bulk motion
The mechanical properties of the system are the centre of mass and the total momentum. The contributions to them are evenly distributed over all atoms, not modes: the bulk motion is mechanically and statistically independent of all other modes and so its variances will not be small in Gibbs states defined by a fixed average of the total energy. Still, generalized statistical methods of
Section 4.2,
Section 4.3 and
Section 4.4 can be applied to it. This is done in the present subsection.
First, we assume that the real rod we are modelling cannot possess a sharp trajectory. Thus, satisfactory models of it can be ME packets in both Newtonian and quantum mechanics. Then, according to Assumption 6 and Theorem 13, the external state of the classical model can be chosen as
(For the definition of
v, see
Section 4.3.) Similarly, Theorem 14 implies that the external state of the quantum model can be chosen as
where
and
The Hamiltonian for the bulk motion of both models is given by Equation (
156). Thus, as explained in
Section 4.4.3., the quantum trajectory coincides with the classical one exactly. (Recall that trajectory has been defined as the time dependence of averages and variances.)
Hopefully, this simple rod example has sufficiently illustrated how our idea of model construction works in the case of classical properties and we can finish the comparison of classical and quantum models here.
4.7. Joint measurement of position nd momentum
The existence of an observable that represents a joint measurement of position and momentum plays some role in the theory of classical properties. To show it, we generalize the construction of such an observable for a simplified model that was first proposed in Ref. [
89]. We follow Ref. [
24]. The model system
is a free one-dimensional spin-zero particle with position
and momentum
. The Hilbert space is
and the operators are defined by equations analogous to (31) and (32).
Operators
and
have an invariant common domain and their commutator is easily calculated to be
Hence, the joint measurement may be a problem.
The general construction of a non-trivial POV measure for system introduces another system, ancilla, that forms a composite system with . Let our ancilla be a similar particle with position and momentum . We work in Q-representation so that the Hilbert space of the composite system is , which can be identified with . Then, we have wave functions and integral operators with kernels of the form .
The dynamical variables
and
of the composite system
commute and can therefore be measured jointly. The value space of PV observable
is
with coordinates
a and
b (see end of
Section 2.2.3.).
The next step is to smear
to obtain a realistic POV measure
, where
k and
l are integers. Let us divide the
plane into disjoint rectangular cells
covering the entire plane. Each cell is centred at
,
,
and
is its area. Then,
The cells can be arbitrarily small.
The probability to obtain the outcome
in state
of the composite system is
We assume that the composite system
is prepared in a factorized state
and express the probability (168) in terms of the state
. The action of the projections
and
on vector states of the form
is
where
and
where
The trace (168) can be calculated in several steps as follows. First,
Second, introduce new integration variables
q,
a,
p,
b,
,
,
Third, if the cells are sufficiently small, the integrands do not change appreciably inside the integration intervals of
a and
b so that they can be approximated as follows:
In this way, we obtain
But, fourth, the factor containing the integral over
p is a
δ-function,
Thus, we obtain,
Fifth, we use Equation (
169),
with the result:
Proposition 13 The probability of the outcome to be found in the factorized state (169) is given approximately bywhereis the state first shifted by and then boosted by . The approximation improves if the cells are smaller. If the cells are sufficiently small, we have and . In this way, the coordinate of is shifted by while the inverted momentum is shifted by with respect to those of .
Equation (
170) shows that there is an ‘effective’ POV measure
for system
defined by
that yields the probability of the outcome
k of the above registration considered as a registration performed on
.
The state
is completely arbitrary. To construct a useful quantity, one usually chooses a vector state in the form of a Gaussian wave packet (see, e.g., Ref. [
21], p. 418),
Easy calculation yields
so that
is a state of minimum uncertainty. If we shift
by
and then boost it by
, we obtain
which is the Gaussian packet concentrated at
,
and
As the Gaussian wave packet is uniquely determined by its averages and variances, we can interpret the observable
as giving the probability that the corresponding registration applied to state
detects the state of
with the averages and variances given by Equations (
171) and (
172).
Equation (
170) is a general formula valid for an arbitrary state
of the ancilla. We have chosen
to be a Gaussian wave packet, which has
. However,
can also be chosen as a quantum ME packet with
,
and
and
allowing arbitrary large
ν. The resulting observable represents the detection of a shifted and boosted ME packet.
6. Conclusions
Our careful study of difficulties that are met along the way from quantum mechanics to classical world has lead to a new understanding of quantum mechanics in a number of aspects. The main points might be summarized as follows.
A value of an observable of system cannot be considered as a property of but only as an indirect piece of information on such properties. It is well known that each such value is only formed in the process of interaction between system and a suitable registration apparatus. We have looked, therefore, for another kind of observable properties that could be ascribed to quantum systems and we have found them among those that are uniquely determined by preparations. This simple observation has been developed into a systematic realist interpretation, the so-called Realist Model Approach to quantum mechanics. Thus, the myth of quantum mechanics disproving realism has been shown to be unfounded.
There are two kinds of objective properties: structural, which are common to the whole class of indistinguishable systems such as mass, charge and spin, and dynamical, which are different for different dynamical situations such as states and averages of observables. The space of quantum states is convex and exhibits a rich face structure. Two kinds of states are distinguished: the indecomposable ones, which are analogous to points in phase space in Newtonian mechanics, and the decomposable ones, which are analogous to probability distributions in Newtonian mechanics. The opposites “decomposable–indecomposable” is different from “pure–mixed”.
We have also attempted to make our interpretation compatible with a full-fledged and self-consistent realist philosophy, namely the so-called Constructive Realism as introduced by Ronald Giere. This is not a naive realism so that it is immune to usual arguments against realism. (Naive realism is the view that the world is as we perceive it.) A practical aspect of the realist interpretation is not only that the emergence of an objective classical world from quantum mechanics is not hindered by possible non-objective character of the latter, but also that it provides explicit help and guidance in constructing models of classical world.
The aim of our theory of classical properties is a unified approach to all classical theories, such as Newtonian mechanics, thermodynamics or Maxwellian electrodynamics. The starting point is that the main assumption of classical theories, viz. the existence of absolutely sharp trajectories, does not correspond to reality but is only a practical and productive idealization. Indeed, any classical measurement is much fuzzier than the minimum quantum uncertainty. Thus, we can change the whole aim of semi-classical approximation: what is to be approximated by quantum models are reasonably fuzzy classical trajectories, not sharp ones.
This allows to formulate a so-called Modified Correspondence Principle, which specifies which quantum observables can correspond to important classical ones for a general classical system as well as what is the form of the correspondence: the classical property is the average of the corresponding quantum observable in a particular kind of state, the so-called classicality one.
The second main hypothesis of our theory is that classicality states are some quantum states of high entropy. (It follows, in particular, that coherent states are not classicality states.) This principle is already in use in thermodynamics. We have shown how it can be applied to classical mechanics by introducing a new class of states, the so-called ME packets. They maximize von Neumann entropy for fixed averages and variances of positions and momenta. Gaussian wave packets are a special case of ME packets for the value of maximum entropy equal to zero. ME packets approximate classical trajectories better when their entropy is higher. As yet, only thermodynamics and Newtonian mechanics could be unified in this way, but Maxwellian electrodynamics is hoped to allow an analogous approach. In this way, we arrive at the natural conclusion that quantum mechanics can be much less fuzzy than any part of classical reality ever is.
We also stress that macrosystems, as well as generally large composite systems, have much smaller number of observables than one would expect according to standard quantum mechanics. There are two reasons why observables concerning single constituents of such a system can be measured only in exceptional cases. On the one hand, the constituents may be elements of a large family of identical systems within the macroscopic system from which they are not separated by preparation and do not, therefore, possess any really measurable observables of their own. On the other hand, they are neither individually manipulable by fields and shields nor registrable by detectors and can only be measured by some measurements that use ancillas. The differences between macroscopic and microscopic systems are thus not due to inapplicability of quantum mechanics to macroscopic systems. Just the opposite is true: they result from strict and careful application of standard quantum mechanics to macroscopic systems.
One of our most important observations is that none of the quantum observables that are introduced in textbooks is measurable because the form of the corresponding operators implies that their measurement must be disturbed by all other systems of the same type in the environment. Only some sufficiently local kind of observables could be measured and only if the microsystems to be registered are sufficiently separated from the set of identical microsystems. Starting form this point, a new quantum theory of observables has been constructed and it is rather different from the standard one.
These considerations also lead to an important condition on preparation procedures: they must give the prepared microsystem a non-trivial separation status. Only then, it can be viewed as an individual system that can be dealt with as if its entanglement with other identical particles does not exist. This explains why quantum mechanics is viable at all. In particular, the standard rule for composition of identical microsystems must be weakened. This is justified by the idea of cluster separability. In addition, a preparation must separate the microsystem from all other microsystems, even of a different kind, so that it can be individually manipulated by external fields or matter screens and registered by detectors.
Not only the notion of preparation has been changed, but also registrations have been given a more specific form than is usually assumed. The necessity to distinguish systematically between ancillas and detectors has been justified. The interaction of the registered system with an ancilla is not considered as the whole registration. The system or the ancilla, or both, has to be further registered by a detector. Any apparatus that is to register a microsystem must therefore contain a detector and what is read off the apparatus is a classical signal from the detector rather than a value of the observable that is usually called “pointer”. Thus, our theory of classical properties finds an application here. We assume that each detector contains an active volume of sensitive matter with which the registered microsystem becomes unified and in which it looses its separation status.
The next important assumption is that standard quantum mechanics does not provide true information about processes, in which the separation status of microsystems changes. Preparations and registrations belong to such processes. Blind application of standard rules to such processes leads to contradictions with experimental evidence. This justifies adding new rules to quantum mechanics that govern changes of separation status. We have formulated such rules and shown that they form a logically coherent whole with other rules of quantum mechanics.
A substantial progress has been achieved in the theory of wave function collapse, more precisely, of the state reduction. This is considered to be a real physical process. A reason, or justification, for why the state reduction takes place has been found in the loss of separation status of the registered quantum microsystem. Our theory replaces the collapse by a more radical transformation, a change in microsystem kinematic description. The loss of separation status accompanied by a dissipation can be considered as a kind of disappearance of a registered object during its registration. The state is degraded, because the system is more or less lost. This justification comes, so to speak, from outside of the measurement theory.
Moreover, our theory leads to sharper specification of where and when the reduction occurs than that given by the von Neumann theory of the collapse. The place is the sensitive volume of the detector and the time is that of the detector signal. However, our theory gives only the final change from the state at some suitable time resulting from the standard unitary evolution to the reduced state , and the question of the detailed time evolution , or even the existence of such evolution, is left open. This can be compared with the quantum decoherence theories or with the dynamical reduction theories. If the exact state of the environment and the exact Hamiltonian of its interaction with the registered system plus apparatus are known, the decoherence theory would give the evolution in all detail (of course, the desired end of the evolution, a decomposable state, cannot be obtained by the decoherence theory alone). Similarly, if the two parameters of the dynamical collapse theory are chosen, again such an evolution can be calculated. We hope that future improvements in experimental techniques will allow to address the question of detailed evolution.