Direct Parameter Estimations from Machine Learning-Enhanced Quantum State Tomography

Hsieh, Hsien-Yi; Ning, Jingyu; Chen, Yi-Ru; Wu, Hsun-Chung; Chen, Hua Li; Wu, Chien-Ming; Lee, Ray-Kuang

doi:10.3390/sym14050874

Open AccessArticle

Direct Parameter Estimations from Machine Learning-Enhanced Quantum State Tomography

by

Hsien-Yi Hsieh

¹,

Jingyu Ning

¹,

Yi-Ru Chen

¹

,

Hsun-Chung Wu

¹,

Hua Li Chen

²,

Chien-Ming Wu

¹

and

Ray-Kuang Lee

^1,2,3,4,*

¹

Institute of Photonics Technologies, National Tsing Hua University, Hsinchu 30013, Taiwan

²

Department of Physics, National Tsing Hua University, Hsinchu 30013, Taiwan

³

Physics Division, National Center for Theoretical Sciences, Taipei 10617, Taiwan

⁴

Center for Quantum Technology, Hsinchu 30013, Taiwan

^*

Author to whom correspondence should be addressed.

Symmetry 2022, 14(5), 874; https://doi.org/10.3390/sym14050874

Submission received: 31 March 2022 / Revised: 17 April 2022 / Accepted: 18 April 2022 / Published: 25 April 2022

(This article belongs to the Special Issue Quantum Optimization & Machine Learning)

Download

Browse Figures

Versions Notes

Abstract

:

With the power to find the best fit to arbitrarily complicated symmetry, machine-learning (ML)-enhanced quantum state tomography (QST) has demonstrated its advantages in extracting complete information about the quantum states. Instead of using the reconstruction model in training a truncated density matrix, we develop a high-performance, lightweight, and easy-to-install supervised characteristic model by generating the target parameters directly. Such a characteristic model-based ML-QST can avoid the problem of dealing with a large Hilbert space, but cab keep feature extractions with high precision, capturing the underlying symmetry in data. With the experimentally measured data generated from the balanced homodyne detectors, we compare the degradation information about quantum noise squeezed states predicted by the reconstruction and characteristic models; both are in agreement with the empirically fitting curves obtained from the covariance method. Such a ML-QST with direct parameter estimations illustrates a crucial diagnostic toolbox for applications with squeezed states, from quantum information process, quantum metrology, advanced gravitational wave detectors, to macroscopic quantum state generation.

Keywords:

quantum machine-learning; quantum state tomography

1. Introduction

Due to unavoidable coupling from the noisy environment, the capability to precisely characterize the quantum features in a large Hilbert space is needed. In general, the reconstruction is not in the quantum state, but the corresponding density matrix as the degradation transforms the target quantum state into a mixed state. For continuous variables with infinite dimensions, by utilizing quantum homodyne measurements, quantum state tomography (QST) has provided us with a useful tool for reconstructing quantum states [1,2]. Nowadays, QST has been successfully implemented as a crucial diagnostic toolbox for many quantum systems, including quantum optics [3,4], ultracold atoms [5,6], ions [7,8], and superconducting circuit-QED devices [9].

By estimating the closest probability distribution to the data, the maximum likelihood estimation (MLE) method is one of the most popular methods in reconstructing arbitrary quantum states [10]. However, MLE suffers from the overestimation problem as the required amount of measurements to reconstruct the quantum state exponentially increases with the number of involved modes. To overcome the overestimation in MLE, by assuming some physical restrictions imposed upon the state in question, several alternative algorithms are proposed, such as permutationally invariant tomography [11], quantum compressed sensing [12], tensor networks [13,14], generative models [15], and restricted Boltzmann machine [16]. Instead, with the capability to find the best fit to arbitrarily complicated symmetry with a limited number of parameters available, machine-learning (ML) enhanced QST was implemented experimentally, demonstrating a fast, robust, and precise QST for continuous variables [16,17,18,19].

However, in dealing with continuous variables, even truncating the Hilbert space into a finite dimension, a very large amount of data are still needed in reconstructing a truncated density matrix. In this work, instead of training the machine on the reconstruction model, alternatively, we develop a characteristic model-based ML-QST by skipping the training on the truncated density matrix. Such a characteristic model-based ML-QST can avoid the problem of dealing with large Hilbert space but keep feature extraction with high precision. With the prior knowledge of the experimentally measured data generated from the balanced homodyne detectors, the direct parameter estimations, including the average photon numbers in the pure squeezed states, squeezed thermal states, and thermal reservoirs, agree with those acquired from the reconstruction model. Compared to the empirically fitting curves obtained from the covariance matrix, our characteristic model-based ML-QST also reveals all the degradation information about quantum noise squeezed states, indicating the loss and phase noises in the measured anti-squeezing. With the ability to instantly monitor quantum states, as well as to make feedback control possible, our experimental implementations illustrate a crucial diagnostic toolbox for all the possible applications with squeezed states. Based on the direct parameter estimations from this ML-QST, applications to the advanced gravitational wave detectors, quantum metrology, macroscopic quantum state generation, and quantum information process can be readily realized.

The paper is organized as follows: in Section 2, we introduce the supervised machine learning-enhanced quantum state tomography based on the convolutional neural network (CNN). Then, the implementations of the reconstruction model and characteristic model are illustrated in Section 2.1 and Section 2.2, respectively. The comparisons on the predicted average photon numbers, as well as the squeezing-anti-squeezing curve to the experimental fittings, are demonstrated in Section 3, validating the feature extraction from our direct parameter estimations. Finally, we summarize this work with some perspectives in Section 4.

2. Supervised Machine Learning-Enhanced Quantum State Tomography

When applying MLE to reconstruct the target quantum state, the data acquisition is performed by the balanced homodyne detectors based on the covariance method or nullifiers [20,21,22]. However, in order to estimate the probability distribution function in different quadratures, at least three measurements must be performed at a fixed local oscillator (LO) phase. To reduce unwanted uncertainty in the fixed quadrature, therefore, a precise phase locking for the LO phase is also needed. However, in the homodyne experiments, the repeatability of the PZT drifts, owing to the airflow and the temperature difference, resulting in introducing additional (phase) noises into the measuring system. Moreover, the validation of this method relies on the Gaussian properties of reconstructed states [23]. Nevertheless, information about unmeasured LO phases is missing due to the limitations of the selected measurements.

Instead, by scanning the LO phase from 0 to

2 π

, referred to as a single-scan measurement of quadrature sequence data,

X_{θ}

, our homodyne measurements contain all the information at different LO phases [24]. Intrinsically, the phase noise automatically is counted in our ML-QST [19]. A fast QST is possible with such a single-scan measurement by just varying the LO phase. Here, the quadrature sequence data

X_{θ}

shares the similarity to the sound (voice) pattern in a time series [25]. With prior knowledge of the squeezed states, a supervised ML with CNN configuration is introduced in this work.

As illustrated in Figure 1, by feeding noisy data of a quadrature sequence acquired by quantum homodyne tomography into 17 convolutional layers, we take advantage of good generalizability in applying CNN [26]. In our one-dimensional (1D)-CNN kernel, there are five convolution blocks used, each of which contains two convolution layers (filters) in different sizes. In order to tackle the gradient vanishing problem, which commonly happens in the deep CNN when the number of convolution layers increases, some shortcuts are also introduced among the convolution blocks [27]. Nevertheless, after flattening the 1D-CNN kernel, we either apply extra fully connected layers to reconstruct the truncated density matrix (coined as the reconstructed model) or predict physical parameters directly (coined as the characteristic model). Below, the details and differences in the reconstruction model and characteristic modes are described.

2.1. Reconstruction Model

The target of implementing the reconstruction model is to predict the truncated density matrix. In the quantum noise squeezing experiments, we have three families of possible states, i.e., pure squeezed state

ρ^{s q}

, squeezed thermal states

ρ_{t h}^{s q}

, and thermal states

ρ_{t h}

[28,29,30,31]. These three families can be described uniformly by a generic formula for squeezed thermal states:

\begin{matrix} \hat{ρ} = \hat{S} (r, θ) {\hat{ρ}}_{t h} (n_{t h}) {\hat{S}}^{†} (r, θ) . \end{matrix}

(1)

As shown in Equation (1), we have three characteristic parameters, r,

θ

, and

n_{t h}

, corresponding to the squeezing ratio, squeezing angle, and the average photon number, respectively. Here,

\hat{S} (r, θ) = \exp [\frac{1}{2} (ξ^{*} {\hat{a}}^{2} - ξ {\hat{a}}^{† 2})]

denotes the squeezing transformation, with

ξ \equiv r \exp (i θ)

;

r \in [0, \infty]

and

θ \in [0, 2 π]

.

One can see that when

r = 0

, Equation (1) describes the thermal states with the average photon number

n_{t h}

, reflecting the corresponding temperature in the thermal reservoir, i.e.,

{\bar{n}}^{- 1} = \exp [ℏ ω / k_{B} T] - 1

. However, when

n_{t h} = 0

, Equation (1) gives the pure squeezed vacuum state, characterized by its squeezing ratio r and the squeezing angle

θ

. In training the machine, a uniform sampling with different physical parameters

(r, θ, n_{t h})

is applied for generating the simulated quadrature sequence.

The task of our reconstruction model can be formulated as mapping the estimated function to a truncated density matrix, i.e.,

f_{est} : X_{θ} \to {\hat{ρ}}_{m \times m}

. Here, m denotes the dimension of our truncated Hilbert space in the number state basis. To avoid non-physical states, we impose the positive semi-definite constraint into the predicted density matrix. An auxiliary (lower triangular) matrix is introduced before generating the predicted factorized density matrix through the Cholesky decomposition, i.e.,

{\hat{ρ}}_{m \times m} \equiv L_{m \times m} L_{m \times m}^{*}

. The training set for the quadrature data

{X_{θ}^{j}}

is the set formed by:

\begin{matrix} \{X_{θ}^{j}, L_{m \times m}^{j} | \dim (X_{θ}^{j}) = 4096, θ \in [0, 2 π], j = 1, 2, 3 \dots N\} . \end{matrix}

(2)

where N is the number of the training set, and

\dim (X_{θ}^{j}) = 4096

is chosen for the number of sampling data in a quadrature sequence. Our target is training the machine to learn the function

f_{est}

, which can be mapped from

X_{θ}

to

L_{m \times m}

. This estimation function can be approximated by a deep neural network which is parametrized by trainable weight variables

W^{l}

, with l corresponding to the l-th layer in the deep neural network, i.e.,

f_{est} \sim f^{l} (\dots f^{2} (f^{1} (X_{θ}, W^{1}), W^{2}) \dots W^{l})

. The training process is to minimize the mean squared error (MSE), while the optimizer used for training is Adam, which is a well-adopted optimization method used to find the minimum cost function for a neural network.

We take the batch size as 32 in the training process. In this setting, the network is trained with 70 epochs to decrease the loss (MSE) up to

5 \times 10^{- 6}

. Moreover, the normalization is also applied during the training process in order to ensure that the trace of the output density matrix is kept as 1. Furthermore, to improve the performance in feature extraction and to reduce the number of parameters, the dense connection is also introduced in our 1D-CNN kernel [32]. This makes the our 1D-CNN model more efficient and lightweight. Finally, as the schematic shown in Figure 1, after flattening, the predicted matrices are used to reconstruct the density matrices in truncation.

By considering our quantum optics experiments with the maximum squeezing level up to 10 dB and the maximum anti-squeezing level up to 20 dB, we keep the sum in the probability up to

0.9999

by truncating the photon number to

m = 35

. More than one million datasets (exactly,

N =

1,200,000) are fed into our machine with all possible combinations of pure squeezed states, squeezed thermal states, and thermal states in a variety of squeezing levels, quadrature angles, and reservoir temperatures. All the training is carried out with the Python package tensorflow.keras performed using a GPU (Nvidia Titan RTX). Typically, in less than one hour, the execution time for our well-trained machine learning-enhanced QST takes an average cost time

38.1

milliseconds (by averaging 100 times) in a standard GPU server.

Regarding the hyper-parameters (filter sizes of each layer), in Table 1, we provide information about the architecture and parameters used in our 1D-CNN kernel. The parameters in this table correspond to the kernel size and channel length. For example,

[4, 96]

means that kernel size

= 4

and channel length

= 96

. Here, the size of our density matrix is

35 \times 35

. There are also convolutions in shortcuts with dense connections [32].

2.2. Characteristic Model

In general, the supervised ML is performing a regression task, predicting a truncated density matrix for the quantum state tomography. However, as shown in Equation (1), the target mixed state is just a linear combination of three families composed of pure squeezed states, squeezed thermal states, and thermal states. These physical states can be basically described by a few simple physical parameters. In addition to reconstructing the density matrix, one can also train a machine to predict parameters directly, coined as a characteristic model.

In the quantum noise squeezing experiments, the parameter set defined by

(r, θ, n_{t h})

should provide enough information in the output measurements, which are the measured squeezing level (SQZ) and the anti-squeezing level (ASQZ). This characteristic model can help us to avoid the problem that occurs in dealing with high-dimensional Hilbert space. Compared to the reconstruction model, now, the task of our supervised estimation is mapping the estimated function to the physical parameters directly, i.e.,

f_{est} : X_{θ} \to (r, θ, n_{t h})

.

As marked in Figure 1 with the shadowed background, we can directly generate these three physical parameters, without bothering additional extra fully connected layers. In this characteristic model, after the convolution kernel completes the feature extraction, we do not need to apply the fully connected layers, but just perform a linear transformation to predict the characteristic values of quantum states. In addition, in the characteristic model, we take the batch size as 32 in the training process. By this setting, the network is trained with 30 epochs to guarantee that the error (MSE) is no larger than

0.03

.

The advantages of applying this characteristic model come from the absence in dealing with any post-processing. Of course, one can calculate these physical parameters with the help of the reconstructed density matrix. However, as fewer model parameters (architecture size) are involved, we also avoid the possible errors caused due to the truncation in the density matrix. In the following, we will demonstrate the implementation of this characteristic model-based ML in the laboratory, by directly and quickly inferring the value of

(r, θ, n_{t h})

in the quantum noise squeezing experiments.

3. Comparison between the Reconstruction and Characteristic Models

As reported in Ref. [19], our quantum noise squeezed states are generated through an optical parametric oscillator cavity with a periodically poled KTiOPO

_{4}

(PPKTP) inside. Experimentally, operated below the threshold at the wavelength of 1064 nm, the quantum homodyne tomography is performed by collecting quadrature sequence with the spectrum analyzer at

2.5

MHz with 100,001 data points, 100 kHz RBW (resolution bandwidth), and 100 Hz VBW (video bandwidth). The phase of LO is scanned with a 1 Hz triangle wavefunction. While the pump power increases to 70 mW, the measured noise levels for squeezing (SQZ) and anti-squeezing (ASQZ) in decibel (dB) are

8.37

and

17.00

, respectively. In training the reconstruction model, a “uniform distribution” is used to sample the value of LO angle. Here, we feed 4096 sampling points from the experimental datasets (5,000,000 data points). Our well-trained reconstruction model-based ML-QST has demonstrated its advantage in keeping the fidelity in the predicted density matrix as high as

0.99

[19].

Now, to verify the physical parameter estimation with the characteristic model, in Figure 2, we compare the predicted average photon number, as a function of pump power, between (a) the characteristic model and (b) the reconstruction model. As the pump power increases, both the characteristic and reconstruction models give great agreement in predicting the three curves of average photon numbers for the measured data

{〈 n 〉}_{total}

, the pure squeezed state

{〈 n 〉}_{sq}

, and non-pure components

{〈 n 〉}_{other}

, denoted as (para est) for the parameter estimation and (dmtx) for the density matrix in Figure 2a and Figure 2b, respectively. Besides the tendency of monochromatic increment in these three curves, both models also reveal the cross-over between the pure squeezed states and non-pure components, as shown in blue and green colors. This cross-over indicates that the non-pure components become dominant parts at a higher pump power, which degrades the quantum noises, resulting in ASQZ being larger than SQZ.

We want to remark that unlike the reconstruction model, the physical parameters are predicted directly from the characteristic model without any post-data processing. However, in the reconstruction model, the singular value decomposition is applied first to the predicted density matrix (dmtx). Then, only with the obtained coefficient

σ

for the pure squeezed state, the weighting in the non-pure components

(1 - σ)

can be known. However, as one can see in Figure 2, when the pump power is larger than 40 mW, a larger discrepancy between the characteristic and reconstruction models appears. It is known that when the pump power increases, many additional effects may cause degradation. Without any prior knowledge on these additional effects, such as the heating in crystals, shift of resonance frequency, and/or other nonlinear mechanisms, the parameter estimations (para est) over-estimate the predicted average photon numbers, resulting in yielding a large number than the predicted density matrix (dmtx).

As a crucial diagnostic toolbox for practical applications, we also compare our ML-QST, both on the reconstruction and characteristic models, with the experimental fitting curves on the degradation in squeezed states. In the experiments, the degradation in quantum noise squeezing is typically described by the squeezing versus anti-squeezing curve, as shown in Figure 3.

For the ideal case, without any degradation, the squeezing and anti-squeezing levels should be the same, located along the black line in Figure 3. However, the measured squeezing level is limited due to the phase noise and loss mechanisms coupled with the environment and surrounding vacuum. Empirically, to estimate the loss and phase noises, not a single set of quadrature data, but a series of sets of quadrature data must be performed in order to have accurate fitting parameters for exp-fitting (co-variance fitting). The measured squeezing

V^{SQZ}

and anti-squeezing

V^{ASQZ}

levels can be modeled by taking the optical loss (denoted as L) and phase noise (denoted as

θ

) into account:

\begin{matrix} V^{SQZ} = (1 - L) [V_{i d}^{SQZ} \times \cos^{2} θ + V_{i d}^{ASQZ} \times \sin^{2} θ] + L, \end{matrix}

(3)

\begin{matrix} V^{ASQZ} = (1 - L) [V_{i d}^{ASQZ} \times \cos^{2} θ + V_{i d}^{SQZ} \times \sin^{2} θ] + L, \end{matrix}

(4)

where

V_{i d}^{SQZ}

and

V_{i d}^{ASQZ}

are the squeezing and anti-squeezing levels in the ideal case, respectively. As shown in Figure 3, the optimal fitting curve obtained by the orthogonal distance regression is depicted in green, along with the corresponding standard deviation (one-sigma variance) shown by the shadowed region. As we show in Figure 3, an accurate EXP-fitting can only be obtained by performing many (in our illustration, 12) different pump power levels.

Moreover, the success of EXP-fitting relies on the common belief that the loss and phase noises can be estimated as long as the system is stable. Nevertheless, as we illustrated, such a common belief is only valid at a low degree of squeezing (less than 5 dB). On the contrary, when the pump power increases, many additional effects occur, such as the shift of resonance frequency due to the heating in crystals, and/or other nonlinear mechanisms, resulting in the increment in loss [19].

On the contrary, only with a single-scan measurement, our ML-QST based on the reconstruction model (dmtx) and characteristic model (para est) both give agreement to experimental data, depicted in blue-dashed and red-dotted curves, respectively, in our figures. The curves shown in Figure 3 clearly demonstrate that our well-trained ML-QST can extract the degradation information in quantum states not only very precisely, but also very fast. Compared to the time-consuming MLE, our methodology paves the road toward a real-time and online QST [33,34]. For example, this machine learning-enhanced QST has also been applied to the reconstruction of Wigner current [33], which can be achieved with this methodology in a more efficient manner than traditional methods.

4. Conclusions

In summary, we develop a characteristic model to directly predict physical parameters in a 1D-CNN configuration, without dealing with density matrix in a higher dimensional Hilbert space. Based on the prior knowledge about target quantum states, the predicted physical parameters obtained by our characteristic model are as good as those generated by a reconstruction model. Through the validation with the experimentally measured data acquired from the balanced homodyne detectors, agreement to the empirically fitting curves obtained from the covariance method is clearly demonstrated. Such a characteristic model-based ML-QST can be easily installed on edge devices such as FPGA as an in-line diagnostic toolbox for all the possible applications with squeezed states. The implementations of ML-QST should be ready for quantum information processing, macroscopic quantum state generation, quantum metrology, and advanced gravitational wave detectors.

Author Contributions

Conceptualization, H.-Y.H. and J.N.; methodology, H.-Y.H., J.N. and Y.-R.C.; software, H.-Y.H. and J.N.; validation, H.-Y.H., J.N. and Y.-R.C.; formal analysis, H.-Y.H., J.N. and Y.-R.C.; investigation, H.-C.W., H.L.C. and C.-M.W.; resources, C.-M.W. and R.-K.L.; data curation, Y.-R.C., H.-C.W. and H.L.C.; writing—original draft preparation, H.-Y.H.; writing—review and editing, C.-M.W. and R.-K.L.; visualization, H.-Y.H., J.N. and Y.-R.C.; supervision, C.-M.W.; project administration, C.-M.W. and R.-K.L.; funding acquisition, R.-K.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work is partially supported by the Ministry of Science and Technology, Taiwan, under grants MOST 110-2123-M-007-002, 110-2627-M-008-001, the International Technology Center Indo-Pacific (ITC IPAC) and Army Research Office, under Contract No. FA5209-21-P-0158, and the Collaborative research program of the Institute for Cosmic Ray Research (ICRR), the University of Tokyo.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Hradil, Z. Quantum-state estimation. Phys. Rev. A 1997, 55, R1561. [Google Scholar] [CrossRef] [Green Version]
Lvovsky, A.I.; Raymer, M.G. Continuous-variable optical quantum-state tomography. Rev. Mod. Phys. 2009, 81, 299. [Google Scholar] [CrossRef]
Leonhardt, U. Measuring the Quantum State of Light; Cambridge University Press: Cambridge, UK, 1997. [Google Scholar]
Andersen, U.L.; Neergaard-Nielsen, J.S.; van Loock, P.; Furusawa, A. Hybrid discrete- and continuous-variable quantum information. Nat. Phys. 2015, 11, 713. [Google Scholar] [CrossRef] [Green Version]
Barredo, D.; de Leseleuc, S.; Lienhard, V.; Lahaye, T.; Browaeys, A. An atom-by-atom assembler of defect-free arbitrary two-dimensional atomic arrays. Science 2016, 354, 1021. [Google Scholar] [CrossRef] [Green Version]
Endres, M.; Bernien, H.; Keesling, A.; Levine, H.; Anschuetz, E.R.; Krajenbrink, A.; Senko, C.; Vuletic, V.; Greiner, M.; Lukin, M.D. Atom-by-atom assembly of defect-free one-dimensional cold atom arrays. Science 2016, 354, 1024. [Google Scholar] [CrossRef] [Green Version]
Zhang, J.; Pagano, G.; Hess, P.W.; Kyprianidis, A.; Becker, P.; Kaplan, H.; Gorshkov, A.V.; Gong, Z.-X.; Monroe, C. Observation of a many-body dynamical phase transition with a 53-qubit quantum simulator. Nature 2017, 551, 601. [Google Scholar] [CrossRef]
Friis, N.; Marty, O.; Maier, C.; Hempel, C.; Holzäpfel, M.; Jurcevic, P.; Plenio, M.B.; Huber, M.; Roos, C.; Blatt, R.; et al. Observation of entangled states of a fully controlled 20-qubit system. Phys. Rev. X 2018, 8, 021012. [Google Scholar] [CrossRef] [Green Version]
Vlastakis, B.; Kirchmair, G.; Leghtas, Z.; Nigg, S.E.; Frunzio, L.; Girvin, S.M.; Mirrahimi, M.; Devoret, M.H.; Schoelkopf, R.J. Deterministically encoding quantum information using 100- photon Schrödinger cat states. Science 2013, 342, 607. [Google Scholar] [CrossRef]
Lvovsky, A.I. Iterative maximum-likelihood reconstruction in quantum homodyne tomog-raphy. J. Opt. B Quant. Semiclass. Opt. 2004, 6, S556. [Google Scholar] [CrossRef]
Tóth, G.; Wieczorek, W.; Gross, D.; Krischek, R.; Schwemmer, C.; Weinfurter, H. Per-mutationally invariant quantum tomography. Phys. Rev. Lett. 2010, 105, 250403. [Google Scholar] [CrossRef] [Green Version]
Gross, D.; Liu, Y.-K.; Flammia, S.T.; Becker, S.; Eisert, J. Quantum state tomography via compressed sensing. Phys. Rev. Lett. 2010, 105, 150401. [Google Scholar] [CrossRef] [Green Version]
Cramer, M.; Plenio, M.B.; Flammia, S.T.; Gross, D.; Bartlett, S.D.; Somma, R.; Lan-don-Cardinal, O.; Poulin, D.; Liu, Y.-K. Efficient quantum state tomography. Nat. Commun. 2010, 1, 149. [Google Scholar] [CrossRef] [Green Version]
Lanyon, B.P.; Maier, C.; Holzäpfel, M.; Baumgratz, T.; Hempel, C.; Jurcevic, P.; Dhand, I.; Buyskikh, A.S.; Daley, A.J.; Cramer, M.; et al. Efficient tomography of a quantum many-body system. Nat. Phys. 2017, 13, 1158. [Google Scholar] [CrossRef]
Carrasquilla, J.; Torlai, G.; Melko, R.G.; Aolita, L. Reconstructing quantum states with generative models. Nat. Mach. Intell. 2019, 1, 155. [Google Scholar] [CrossRef]
Tiunov, E.S.; Tiunova, V.V.; Ulanov, A.E.; Lvovsky, A.I.; Fedorov, A.K. Experimental quantum homodyne tomography via machine learning. Optica 2020, 7, 448. [Google Scholar] [CrossRef] [Green Version]
Biamonte, J.; Wittek, P.; Pancotti, N.; Rebentrost, P.; Wiebe, N.; Lloyd, S. Quantum ma-chine learning. Nature 2017, 549, 195. [Google Scholar] [CrossRef]
Lohani, S.; Kirby, B.T.; Brodsky, M.; Danaci, O.; Glasser, R.T. Machine learning assisted quantum state estimation. Mach. Learn. Sci. Technol. 2020, 1, 035007. [Google Scholar] [CrossRef]
Hsieh, H.-Y.; Chen, Y.-R.; Wu, H.-C.; Chen, H.L.; Ning, J.; Huang, Y.-C.; Wu, C.-M.; Lee, R.-K. Extract the Degradation Information in Squeezed States with Machine Learning. Phys. Rev. Lett. 2022, 128, 073604. [Google Scholar] [CrossRef]
Hyllus, P.; Eisert, J. Optimal entanglement witnesses for continuous-variable systems. New J. Phys. 2006, 8, 51. [Google Scholar] [CrossRef]
Pfister, O. Continuous-variable quantum computing in the quantum optical frequency comb. J. Phys. B At. Mol. Opt. Phys. 2020, 53, 012001. [Google Scholar] [CrossRef]
Fabre, C.; Treps, N. Modes and states in quantum optics. Rev. Mod. Phys. 2020, 92, 035005. [Google Scholar] [CrossRef]
Ogawa, H.; Ohdan, H.; Miyata, K.; Taguchi, M.; Makino, K.; Yonezawa, H.; Yoshikawa, J.I.; Furusawa, A. Real-Time Quadrature Measurement of a Single-Photon Wave Packet with Continuous Tem-poral-Mode Matching. Phys. Rev. Lett. 2016, 116, 233602. [Google Scholar] [CrossRef] [Green Version]
Silva, J.L.E.; Glancy, S.; Vasconcelos, H.M. Quadrature histograms in maximum-likelihood quantum state tomography. Phys. Rev. A 2018, 98, 022325. [Google Scholar] [CrossRef]
Abdoli, S.; Cardinal, P.; Koerich, A.L. End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network. Expert Syst. Appl. 2019, 136, 252. [Google Scholar] [CrossRef] [Green Version]
Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. In Proceedings of the Advances in Neural Information Processing Systems (NIPS), Lake Tahoe, NV, USA, 3–8 December 2012. [Google Scholar]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
Agarwal, G.S. Wigner-function Description of Quantum Noise in Interferometers. J. Mod. Opt. 1987, 34, 909. [Google Scholar] [CrossRef]
Chaturvedi, S.; Srinivasan, V. Photon-number distributions for fields with Gaussian Wigner functions. Phys. Rev. A 1989, 40, 6095. [Google Scholar] [CrossRef]
Lütkenhaus, N.; Barnett, S.M. Nonclassical effects in phase space. Phys. Rev. A 1995, 51, 3340. [Google Scholar] [CrossRef]
Seifoory, H.; Doutre, S.; Dignam, M.M.; Sipe, J.E. Squeezed thermal states: The result of para-metric down conversion in lossy cavities. J. Opt. Soc. Am. B 2017, 34, 1587–1596. [Google Scholar] [CrossRef]
Huang, G.; Liu, Z.; van der Maaten, L.; Weinberger, K.Q. Densely Connected Convolutional Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 4700–4708. [Google Scholar]
Chen, Y.-R.; Hsieh, H.-Y.; Ning, J.; Wu, H.-C.; Chen, H.L.; Chuang, Y.-L.; Yang, P.; Steuernagel, O.; Wu, C.-M.; Lee, R.-K. Experimental reconstruction of Wigner distribution currents in quantum phase space. arXiv 2021, arXiv:2111.08285. [Google Scholar]
Youssry, A.; Ferrie, C.; Tomamichel, M. Efficient online quantum state estimation using a ma-trix-exponentiated gradient method. New. J. Phys. 2019, 21, 033006. [Google Scholar] [CrossRef]

Figure 1. Demonstration of direct parameter estimations with machine learning. Here, in a single-scan measurement of the LO phase from 0 to

2 π

, the noisy data of quadrature sequence obtained by quantum homodyne tomography are fed to the convolutional layers, denoted as a 1D-CNN kernel. Then, after flattening, either the density matrix is reconstructed through extra fully connected layers (the reconstructed model), or the physical parameters are predicted directly (the characteristic model, marked with the shadowed background). The function of neurons marked as

(1, 2, 3)

denotes inferring the value of

(r, θ, n_{t h})

directly.

Figure 1. Demonstration of direct parameter estimations with machine learning. Here, in a single-scan measurement of the LO phase from 0 to

2 π

, the noisy data of quadrature sequence obtained by quantum homodyne tomography are fed to the convolutional layers, denoted as a 1D-CNN kernel. Then, after flattening, either the density matrix is reconstructed through extra fully connected layers (the reconstructed model), or the physical parameters are predicted directly (the characteristic model, marked with the shadowed background). The function of neurons marked as

(1, 2, 3)

denotes inferring the value of

(r, θ, n_{t h})

directly.

Figure 2. The comparisons on the predicted average photon number, as a function of pump power, between (a) the characteristic model and (b) the reconstruction model. In the characteristic model, we directly generate the parameter estimations (para est) for the average photon number in the measured data

{〈 n 〉}_{total}

in red, the pure squeezed state

{〈 n 〉}_{sq}

in green, and non-pure components

{〈 n 〉}_{other}

in blue. However, in the reconstruction model, the singular value decomposition is applied first to the predicted density matrix (dmtx), revealing the coefficient

σ

for the pure squeezed state and

(1 - σ)

for non-pure components.

Figure 2. The comparisons on the predicted average photon number, as a function of pump power, between (a) the characteristic model and (b) the reconstruction model. In the characteristic model, we directly generate the parameter estimations (para est) for the average photon number in the measured data

{〈 n 〉}_{total}

in red, the pure squeezed state

{〈 n 〉}_{sq}

in green, and non-pure components

{〈 n 〉}_{other}

in blue. However, in the reconstruction model, the singular value decomposition is applied first to the predicted density matrix (dmtx), revealing the coefficient

σ

for the pure squeezed state and

(1 - σ)

for non-pure components.

Figure 3. Degradation in squeezed states, i.e., squeezing level versus anti-squeezing level. For the ideal case, the squeezing and anti-squeezing levels should locate along the black line (ideal). As shown with the typical experimental data, marked with green dots, there exists a discrepancy between the measured squeezing and anti-squeezing levels. Based on Equations (3) and (4), by taking the loss and phase noise into account, the optimal fitting curve is depicted in green (exp), with the corresponding standard deviation depicted by the shadowed region. Moreover, our ML-QST based on the reconstruction model (dmtx) and characteristic model (para est) both give agreement to experimental data, depicted in blue-dashed and red-dotted curves, respectively.

Table 1. Hyper-parameters (filter sizes of each layer) used in our 1D CNN.

Layer Name	Parameters
Conv_1d_layer1	[4, 96]
Conv_1d_block_a	[4, 96]
	[4, 96]
Transition Layer 1: Conv_1d	[1, 48] (stride = 4)
Conv_1d_block_b1	[4, 64]
	[4, 64]
Conv_1d_block_b2	[4, 64]
	[4, 64]
Transition Layer 2: Conv_1d	[1, 64] (stride = 4)
Conv_1d_block_c1	[4, 128]
	[4, 128]
Conv_1d_block_c2	[4, 128]
	[4, 128]
Transition Layer 3: Conv_1d	[1, 96] (stride = 4)
Conv_1d_layer4	[4, 96] (stride = 2)
Conv_1d_layer5	[2, 128] (stride = 2)
Conv_1d_layer6	[2, 48] (stride = 2)

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hsieh, H.-Y.; Ning, J.; Chen, Y.-R.; Wu, H.-C.; Chen, H.L.; Wu, C.-M.; Lee, R.-K. Direct Parameter Estimations from Machine Learning-Enhanced Quantum State Tomography. Symmetry 2022, 14, 874. https://doi.org/10.3390/sym14050874

AMA Style

Hsieh H-Y, Ning J, Chen Y-R, Wu H-C, Chen HL, Wu C-M, Lee R-K. Direct Parameter Estimations from Machine Learning-Enhanced Quantum State Tomography. Symmetry. 2022; 14(5):874. https://doi.org/10.3390/sym14050874

Chicago/Turabian Style

Hsieh, Hsien-Yi, Jingyu Ning, Yi-Ru Chen, Hsun-Chung Wu, Hua Li Chen, Chien-Ming Wu, and Ray-Kuang Lee. 2022. "Direct Parameter Estimations from Machine Learning-Enhanced Quantum State Tomography" Symmetry 14, no. 5: 874. https://doi.org/10.3390/sym14050874

APA Style

Hsieh, H. -Y., Ning, J., Chen, Y. -R., Wu, H. -C., Chen, H. L., Wu, C. -M., & Lee, R. -K. (2022). Direct Parameter Estimations from Machine Learning-Enhanced Quantum State Tomography. Symmetry, 14(5), 874. https://doi.org/10.3390/sym14050874

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Direct Parameter Estimations from Machine Learning-Enhanced Quantum State Tomography

Abstract

1. Introduction

2. Supervised Machine Learning-Enhanced Quantum State Tomography

2.1. Reconstruction Model

2.2. Characteristic Model

3. Comparison between the Reconstruction and Characteristic Models

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI