1. Introduction
In recent years, in accordance with Moore’s law [
1], the computing ability of electronic computers has exponentially increased. However, the growth in power of CPUs has plateaued in over the last few years due to various constraints, prompting the search for alternative methods to boost computational performance. In 1982, Richard Feynman, an American theoretical physicist, introduced the concept of quantum computing. This innovative model leverages quantum mechanics principles like superposition and entanglement to enhance data storage, processing, and transmission capabilities far beyond those of traditional computers [
2]. The potential of quantum computing was further underscored by Peter Shor’s introduction of a quantum algorithm for prime number factorization in 1994 [
3], and by Lov Grover’s quantum search algorithm in 1996 [
4].
As the field of digital image processing evolves, it faces the challenge of handling an ever-growing volume and complexity of images, propelled by advances in pattern recognition, image understanding, and the development of sophisticated image sensors. Traditional image processing algorithms, foundational to numerous applications within information science, are inherently parallel in nature, demanding extensive computational resources for execution. The surge in image quantity and resolution has rendered these classical algorithms increasingly time-consuming and hardware-intensive. In response to these challenges, the integration of quantum computing into image processing emerges as a promising solution. Quantum computing utilizes qubits for data storage and leverages the properties of quantum physics, such as superposition and entanglement, to offer unparalleled parallel processing capabilities. This shift in paradigm provides a significant improvement in efficiency for tasks related to image processing.
The quantum approach to image processing significantly reduces the computational complexity associated with storing and manipulating large sets of image data. While a classical computer requires exponential resources O(n × 2
n) to store sequences of n-bit length, a quantum computer can achieve this with linear complexity O(n) [
5]. Moreover, operations that are inherently sequential and resource-intensive on classical computers, such as bitwise inversion, can be executed more efficiently on quantum computers. This is due to the quantum computer’s ability to perform operations on entangled qubits in parallel, dramatically reducing the time and resources needed for complex image processing tasks. This innovative method of leveraging quantum computing for image processing not only accelerates classical algorithms but also paves the way for the development of novel quantum image processing algorithms. These improvements have the potential to completely transform the field by greatly decreasing time it requires to analyze data and the amount of hardware needed. This will allow for the development of more advanced image processing applications that need a lot of resources. Hence, how can we use the quantum computing technique for image processing is crucial for surpassing the constraints of conventional computational techniques. This advancement presents a novel opportunity to efficiently and effectively process digital images.
To process images in quantum state, we need to follow three steps as
Figure 1, (i) prepare the image and store it into quantum state, (ii) process quantum image, (iii) processed digital image from quantum state. The quantum image compression and encryption techniques lie in the preparation of the image into quantum state. Similar to the traditional digital image compression, the quantum image compression methods have lossy and lossless compression.
The early 21st century witnessed several pivotal advancements aimed at enhancing the efficiency and quality of image compression techniques. In 2002, Lewis et al. introduced a method that utilized a two-dimensional orthogonal wavelet transform for compressing digital images. This innovative approach enabled the decomposition of images into coefficients that are localized both spatially and spectrally, offering a nuanced balance between preserving image quality and achieving substantial compression [
6]. Another noteworthy development came in the form of an advanced bit plane coding strategy specifically designed for quantizing discrete cosine transform (DCT) coefficients [
7]. This technique was lauded for its ability to deliver superior decoding quality compared to the JPEG2000 standard [
8], which was the benchmark at the time. Kouda et al. introduced a hierarchical quantum neural network-based image compression scheme, assessing the utility of large quantum neural networks in tackling complex image compression scenarios [
9]. This approach underscored the potential of quantum computing to revolutionize traditional practices by offering novel solutions that could outperform conventional algorithms in both efficiency and effectiveness. A significant challenge in image compression has always been the time-consuming nature of traditional image coding methods. To address this issue, Yang R. introduced a cutting-edge algorithm that employed a quantum BP (backpropagation) network for image compression [
10]. This method not only accelerated the encoding process but also enhanced the quality of the reconstructed images, showcasing the synergy between quantum computing and neural network methodologies in improving computational processes. In 2016, Yuen et al. unveiled an algorithm that combined discrete cosine transform (DCT) with the Secure Hashing Algorithm (SHA-1) for both compressing and encrypting images [
11]. This dual-purpose algorithm highlighted the growing need for secure and efficient image processing techniques in an increasingly digital and interconnected world. In the same year, based on hyper-chaotic system Zhou et al. proposed an image encryption-compression scheme [
12]. The same authors also published image encryption and compression scheme based on Mellin transform and compressive sensing [
13]. In 2018, an image compression–encryption algorithms by combining hyper-chaotic system with discrete fractional random transform was introduced by Gong et al. [
14].
While the above work is on the traditional images, as the field of quantum computing is advancing rapidly many of these classical techniques have been expanded to encompass the quantum realm. In this paper, we will focus on the quantum image processing and will discuss about the recent advancements in the field of quantum image compression. We start in
Section 2 with a brief introduction to quantum computing. Readers already familiar with these fundamental concepts can skip
Section 2 and proceed directly to
Section 3.
2. Brief Introduction to Quantum Computing
2.1. Vector
Quantum states are mathematically expressed as vectors in a complex vector space called Hilbert space. Hilbert space is a fundamental framework in quantum mechanics because it can effectively capture the probabilistic and superpositional characteristics of quantum systems. The nomenclature used to represent vectors in Hilbert space is a distinctive and sophisticated formalism, generally known as Dirac notation, or more informally, the “bra-ket” notation [
15].
In Dirac notation, a vector (representing a quantum state) in Hilbert space is symbolized by a ket, denoted as ∣ψ⟩, where ∣⋅⟩ signifies the ket and ψ is a label that identifies the specific quantum state. The ket is a column vector that encompasses all the essential information required to completely explain the quantum state within the mathematical framework of quantum mechanics. Complementary to the ket is the bra, denoted as ⟨ϕ∣, where ⟨⋅∣ represents the bra, and ϕ is a label for the vector. The bra is essentially the conjugate transpose of the ket. In more concrete terms, if the ket represents a column vector, then the bra represents a row vector, with its complex elements conjugated. The usage of this bra vector is essential in the construction of quantum mechanical algorithms, particularly in the computation of probabilities and expectation values, which are key aspects of quantum mechanics.
The implementation of bra-ket notation brought about a significant transformation in the mathematical handling of quantum mechanics, providing a potent and intuitive mechanism for managing the abstract concepts essential to the theory. It simplifies the depiction of quantum processes, such as measurements and transformations, and offers a standardized framework for discussing and evaluating quantum states. It also enables the succinct definition of quantum mechanical processes, such as unitary transformations and observables. In summary, the bra-ket notation encapsulates the abstract and counterintuitive nature of quantum mechanics in a mathematically rigorous yet accessible language, enabling the exploration and exploitation of quantum phenomena for computational purposes.
2.2. Tensor Products
The tensor product is a mathematical operation that combines vector spaces to create a bigger vector space.
Assume, we have a scaler α. |v⟩ is an element in V space and |w⟩ is an element in W space. Then we can write:
Now if we have two elements, |v
1⟩, |v
2⟩ in V space and an element |w⟩ in W space
Similarly, |v⟩ in V space and |w
1⟩ and |w
2⟩ in W space
We can find the tensor product of two matrices
X (dimension
m ×
n) and
Y (dimension i × j) as
2.3. Quantum Bit
In a classical computer bit is the core component, which functions inside a binary system, alternating between two distinct states: 0 and 1. The binary system serves as the foundation for classical computing architectures, allowing for the representation, manipulation, and retention of data. Quantum computing, in contrast, presents a sophisticated and intricate alternative to the classical bit, known as the quantum bit or qubit. Qubits are fundamental units that encapsulate the laws of quantum physics, forming the essential foundation for both the theoretical and practical aspects of quantum computing. Qubits, unlike traditional bits, exist inside a mathematical domain that is more flexible and abstract, rather than being limited to the binary certainties of 0 and 1. This abstraction enables the conceptualization and advancement of quantum computing theory without being limited by the physical implementation in specific hardware platforms. Qubits, being very versatile mathematical entities, allow for extensive study of the potential of quantum computing, without being restricted by the limits of physical systems.
A qubit is characterized by its capacity to exist in states that extend beyond the binary values of 0 and 1. The ability to simultaneously exist in several states is demonstrated by the phenomenon called quantum superposition, in which a qubit occupies a state that is a combination of |0⟩ and |1⟩. Mathematically, the superposed state can be represented as:
where α and β denote the probability amplitudes. The amplitudes represent the probability of the qubit collapsing into either the |0⟩ or |1⟩ state when measured. A qubit’s state is represented as a vector in a two-dimensional complex vector space, with |0⟩ and |1⟩ being the basis states used for computation. The basis states constitute an orthonormal basis set, serving as a structured framework for the definition and manipulation of qubits.
The superposition principle grants qubits the ability to exist in several states simultaneously, which is in striking contrast to the binary restriction of conventional bits. Quantum computers have the ability to process and interpret data in ways that are fundamentally distinct from traditional computing methods due to their multi-state nature. When a measurement is performed on a superposed qubit state ∣ψ⟩, it collapses into one of its component states, either |0⟩ or |1⟩. The probability of each event is defined by the square of the associated probability amplitude (|α|
2 for |0⟩ and |β|
2 for |1⟩). The stochastic character of qubit measurement forms the basis for the quantum mechanical phenomena that quantum algorithms utilize for purposes such as encryption, search optimization, and simulation of quantum systems. Also,
In quantum computing, |0⟩ is expressed as:
In quantum computing, |1⟩ is expressed as:
Let us consider a pair of qubits. From a classical perspective, a pair of bits has the capability to represent four unique values (00, 01, 10, 11) simultaneously. However, within a quantum system, two qubits have the ability to simultaneously exist in a superposition of all four of these states. Consequently, the two-qubit system can be characterized by a state vector that encompasses this superposition, denoting a quantum state that is a linear combination of its four fundamental states: |00⟩, |01⟩, |10⟩, and |11⟩. So, the quantum state for two qubit system can be written as:
where
These four states can be represented as:
2.3.1. Qubit Measurements and Unit Circle Theory
The basic states of a qubit in quantum computing are represented by the states |0⟩ and |1⟩ in quantum physics. In a two-dimensional coordinate system, where the state |0⟩ aligns with the X-axis and the state |1⟩ aligns with the Y-axis, these states can be visually depicted. Every state has a basis vector: the vector for |0⟩ is [1 0]T, indicating that it is a unit vector along the X-axis; the vector for |1⟩ is [0 1]T, indicating that it is a unit vector along the Y-axis.
We can take into consideration additional vectors that create different angles with the X-axis in order to investigate the idea of superposition. For example, the vector [1/√2 1/√2]T can be used to represent a vector that forms a 45-degree angle with the X-axis. According to this vector, there is an equal chance that this qubit will be measured and found in the states of |0⟩ or |1⟩. This indicates that the quantum state is an equal superposition of |0⟩ and |1⟩. In addition, another vector that forms a 60-degree angle with the X-axis can be represented by the column vector [1/2 √(3/2)]. This vector represents a quantum state that is not an equal superposition of |0⟩ and |1⟩. Instead, it has distinct probabilities for being observed in each state, with a greater likelihood for the state |1⟩ due to the larger coefficient in the vector representation.
Thus, a qubit can be mathematically described as a unit vector within a two-dimensional complex vector space (
Figure 2). When we apply this principle to the geometric model known as the “Bloch sphere”, the state |0⟩ correlates to the X-axis, whereas the state |1⟩ aligns with the Y-axis on this sphere. It is crucial to emphasize that any point on the surface of the Bloch sphere represents a qubit in a state of superposition, which is a weighted combination of the states |0⟩ and |1⟩. In the practice of quantum measurement, two primary approaches are utilized. The first is the measurement in the standard basis, also known as the computational basis, which corresponds precisely to the previously mentioned states |0⟩ and |1⟩. The second methodology incorporates measurements taken on an arbitrary basis, enabling the evaluation of the qubit’s state across various Bloch sphere orientations. The selection of these arbitrary bases is not obligatory and can be chosen to accommodate particular quantum computing tasks or algorithms, thereby offering a versatile structure for the assessment and application of quantum states.
2.3.2. Measuring on Standard Basis
Let us assume, state |S⟩ has an angle
θ with |0⟩ state in X axis. The figure is drawn in a 2D real space (
Figure 3), and all of its amplitudes are real.
From the figure, the state |0⟩ has the probability cos2 θ and state |1⟩ has the probability sin2 θ, which can be written as cos2 (π/2 − θ). Thus, depending on the stated above probabilities, the state S is projected onto either the |0⟩ state or |1⟩ state.
2.3.3. Measuring on Arbitrary Basis
In this case, measurement is completed on any orthogonal basis rather than onto |0⟩ and |1⟩ basis. From
Figure 4, state |S⟩ is measure using |
u⟩ and |
u′⟩ basis. Here |
u⟩ has the probability cos
2 θ and |
u′⟩ has the probability sin
2 θ. The amplitude of |
u⟩ and |
u′⟩ given by:
These concepts will come in handy when we will talk about the FRQI (flexible representation of quantum images) representation of quantum images where rotation operation will be required.
2.4. Circuit and Gates
Logic gates are essential components in traditional digital circuits, responsible for manipulating and transforming information. They serve as the fundamental building blocks for complicated computing functions. Similarly, quantum circuits utilize a distinct set of logic gates that are specifically intended to function based on the principles of quantum mechanics. Quantum logic gates enable the manipulation of quantum information by applying unitary transformations to quantum states, allowing for the execution of logical operations. The mathematical description of these changes is commonly conveyed by matrices, which accurately capture the specific operation being applied to the quantum state.
One significant category of quantum logic gates is the single quantum gate (
Figure 5). As the name implies, it only requires the participation of one qubit to perform its action. In contrast, multi-qubit gates are capable of performing quantum operations and interactions that are more intricate, as they involve two or more qubits. Quantum circuits employ horizontal lines to represent qubits in their graphical depiction. The lines depicted in the schematic of a quantum circuit, commonly known as wires, represent the pathway through which quantum information travels. The symbol “U” is used to represent a single quantum gate on these wires. This symbol indicates the unitary operation that the gate performs on the qubit it interacts with. When |ψ⟩ state is used as an input to this gate it gives U|ψ⟩ as an output. Unitary transformations play a crucial role in the functioning of quantum gates. These transformations are invertible and maintain the norm of the quantum state, a necessity for quantum operations based on the principles of quantum physics. The utilization of a matrix representation for a quantum gate offers a potent means of comprehending and formulating quantum algorithms, as it enables the accurate computation of the gate’s impact on a certain quantum state.
The single quantum gate can be expressed in a matrix form:
It is impossible to overstate the significance of single quantum gates in quantum computation. Although they are the most basic form of quantum gates, single quantum gate executes critical operations that are indispensable for quantum computation, including the initialization, manipulation, and preparation of measurements of qubits. The Pauli gates (X, Y, Z), which alter the state of a qubit in multiple dimensions, and the Hadamard gate, which generates superposition states, are both instances of single quantum gates. These gates function as the quantum equivalents of classical logic gates such as NOT and XOR, albeit within a domain where quantum states can be superimposed and entangled.
Applying the Hadamard gate to |0⟩ state or |1⟩ state:
So, by using the Hadamard H gate, the state |0⟩ and |1⟩ can be convert into a superposition state. The new state is known as |+⟩ state and |−⟩ state, respectively.
The Pauli-X gate has the ability to change the state of a single qubit. That is why this gate is also called bit-flip or Not gate.
Table 1 shows some common single quantum gates.
Within the sophisticated realm of quantum computing, in addition to the simplicity and grace of individual quantum gates, there exists a more intricate category of quantum gates that require the participation of several qubits for their functioning. Multi-qubit quantum gates enhance the complexity and functionality of quantum circuits, allowing for a wider range of computational operations that exploit the distinct characteristics of quantum mechanics, such as entanglement and superposition, to perform tasks that are beyond the capabilities of single-qubit gates.
One prominent example of multi-qubit gates is the Controlled-NOT (CNOT) gate, which represents the idea of quantum control dynamics. The CNOT gate functions by manipulating two qubits, with one qubit acting as the control and the other as the target.
The CNOT gate operates by flipping the state of the target qubit, changing it from |0⟩ to |1⟩ or vice versa, only when the control qubit is in the state |1⟩ as shown in the Equations (27)–(30). The conditional operation described here is the quantum equivalent of the conventional XOR gate. It showcases how quantum computing may imitate and expand upon traditional logic operations within a quantum context.
Expanding on the concept of conditional operations, the quantum computing also has the 0-Controlled-NOT (0CNOT) gate. This gate inverts the target qubit only when the control qubit is in the state |0⟩. This gate expands the spectrum of quantum logic operations.
The Toffoli gate, also referred to as the CCNOT gate, is a significant expansion of the CNOT gate. It involves the use of two control qubits and one target qubit. The Toffoli gate performs a bit-flip operation on the target qubit exclusively when both control qubits are in the state |1⟩. The significance of this gate in quantum computing lies in its ability to facilitate reversible computation, which is essential for the development of universal quantum computers. Also, as we explore farther into the domain of multi-qubit operations, the idea of scalability becomes apparent with the introduction of the n-CNOT gate. The gate expands the concept of conditional operation to n control qubits, providing a flexible method for coordinating intricate quantum processes that may be customized to meet the specific needs of advanced quantum algorithms.
Another important example of multi-qubit gate is the Swap gate. It facilitates the exchange of states between two qubits. The function of this gate is crucial in quantum algorithms as it allows for the reorganization of qubit states without impacting the overall quantum state of the system. This facilitates operations that necessitate particular qubit configurations.
Table 2 shows some common multiple quantum gates.
Having a basic understanding of these gates and how these work is very important because these gates are useful to build a quantum circuit. For instance, if we are working with the FRQI representation of quantum images, we need to understand the use of the Hadamard gate, Identity gate, Pauli-X gate, CNOT gate [
16], etc. Another popular quantum image representation technique is NEQR (short for Novel Enhanced Quantum Representation) in which an image can be defined as:
To implement this, we need to use Hadamard gate, Toffoli gate, Swap gate [
17], etc. We discuss this technique in detail in the next section. So, before jumping into the image processing part, knowing the basic concept of these gates are important because these will be required to represent the images in quantum state.
3. Quantum Image Representations
Before we delve into quantum image compression, we need to understand how an image can be represented in quantum state, since a number of quantum image representations have been proposed. The first attempt to represent an image in quantum system was proposed after introducing Qubit Lattice in 2003 [
18]. Then in 2005 quantum superposition was introduced in Real Ket [
19] to represent image. In 2010, Venegas et al. proposed entangled image which used quantum entanglement [
20]. Le et al. also published their work FRQI or flexible representation of quantum images [
16] in the same year. Here, they utilized an n-qubit sequence to represent the coordinate information. To store the color information of the image they used angle. An image in FRQI model can be represented as follows [
16]:
Here, |i⟩ (=0, 1, …, 22n − 1) are 22n computational basis quantum states and θ = (θ0, θ1, …, θ22n − 1) is the vector of angles encoding colors. Here the coordinate information is represented with |i⟩ and the grey scale information is represented using cosθi |0⟩ + sinθi |1⟩. This model can represent the greyscale information as well as the coordinate system of an image in quantum state properly.
This FRQI model was extended further in the following year and a new model called Multi-Channel Representation for Quantum Image (MCRQI) [
21] was proposed, which also consider the RGB space. This model represents images as follows:
As we can see, in MCRQI to store RGB channels and opacity, three qubits are required. Here, θRi, θGi, θBi vectors represent the RGB colors and θαi represents the channels.
In FRQI scheme while encoding the image pixels, normalized superposition state is utilized, allowing simultaneous operations on all pixels, thereby addressing the need for real-time processing in image applications. A number of algorithms have been introduced based on this principle. However, FRQI’s restriction to one qubit per pixel for grayscale information makes certain complex color operations challenging.
The NEQR model for digital images representation, was introduced in 2013 by Zhang et al. [
17], uses entangled qubit sequences to encode an image’s grayscale and spatial information in a quantum superposition. This method converts grayscale values into binary using q qubits, improving simplicity and accessibility. The binary-encoded grayscale data are stored in a q-qubit sequence, whereas the coordinates for a 2
n-by-2
n pixel image are retained in a 2n-qubit sequence. To better illustrate the algorithm, let us consider a 2-by-2 image as shown in
Figure 6. In order to apply NEQR to an image, first q + 2n qubits needs to be initialized to the |0⟩ state. This is followed by applying identity (I) gates and Hadamard (H) gates to this initial state. Subsequently, the grayscale values for all pixels are established using 2n-CNOT gates. So, in the figure, to store the coordinate information H-gate needs to be applied to two of the ten |0⟩ qubits. Then, 2-CNOT gates need to be used to store the grayscale information. The quantum circuit for the NEQR preparation is shown in
Figure 7.
The NEQR model is a substantial improvement over the FRQI model. NEQR requires more qubits than FRQI but solves the problem of reliably measuring grayscale information with a limited number of measurements. NEQR simplifies color operations, giving it a more practical and often useful framework in quantum image processing. However, both NEQR and FRQI have a constraint in that they are designed to store images that are strictly square because their horizontal and vertical coordinate lengths are equal. This limitation presents an issue for depicting images that are not square or rectangular, which are prevalent. To address this issue, the improved novel enhanced quantum representation (INEQR) [
22] was introduced in 2015 by Jiang et al. INEQR allows for storing and processing rectangular images by supporting uneven horizontal and vertical coordinates. This improvement broadens the applicability of quantum image representations, making them more suitable for real-world scenarios where images may not have square dimensions.
GQIR or Generalized Quantum Image Representation was presented to represent non-square, rectangular images of arbitrary dimension by utilizing logarithmic coordinates [
23]. Despite its versatility, GQIR adds redundancy to the representation process. The Novel Quantum Representation for Log-Polar Images (QUALPI) offers a framework for image representation in polar coordinates, diverging from conventional Cartesian-based methods [
24]. In 2014, Li et al. introduced a new encoding approach for multi-dimensional color images called the n-qubit normal arbitrary superposition state (NASS). This method creatively encodes grayscale values using quantum states’ angles and assigns certain states to represent different dimensions, enabling the compression of multi-dimensional color images on a quantum computer [
25]. In 2016, the FRQI method was improved, and a new model called the FRQCI or Flexible Representation for Quantum Color Image, which improved the management of color in quantum image representations [
26]. Sang et al. developed the Novel Quantum Representation of Color Digital Images (NCQI) by integrating enhancements from MCRQI and NEQR within a similar period [
27]. The new model modifies the qubits in NEQR from q to 3q to symbolize the RGB color channels, making color operations, such as intricate color transformations, much simpler to perform. Yet, a downside of the NCQI paradigm is the heightened need for qubits.
In 2018, Liu et al. introduced an Optimized Quantum Representation for Color Digital Images (OCQR) to tackle this problem [
28]. OCQR requires fewer qubits, about one-third of what NCQI uses, to hold pixel values, while having a similar time complexity for preparing quantum pictures. OCQR optimizes computational resources by minimizing qubit usage and improves the efficiency of specific color changes. In 2017, the NEQR model was expanded to include Red–Green–Blue (RGB) color schemes by creating the Quantum Multi-Channel RGB Representation (QMCR) [
29]. Although this new method demands more qubits than the MCRQI model, it streamlines the picture preparation process and allows for accurate image retrieval. In the same year, Jiang et al. introduced a new framework for three-dimensional imaging in the quantum realm, called the quantum point cloud [
30]. This novel approach expands quantum image processing to 3D visual data, providing new opportunities for manipulating and analyzing digital images in quantum computing.
In the following year, BRQI (Bitplane Representation of Quantum Images) was published by Li et al. [
31], which allows for altering color complements, reversing, and translating bitplanes within the BRQI framework. This BRQI method divides the grayscale values into eight different binary bits, converting a grayscale image into eight distinct bit planes. It requires three qubits to express the bitplane index, while n qubits are assigned to encode the spatial coordinates of the image. Moreover, Wang et al. in 2019, published a model where a bitplane is used to represent color digital images. They named this model QRCI [
32]. An improved FRQI model called FRQCI was proposed by Li et al. [
33]. Interestingly, in this model they talked about some image processing operators for pixel coordinate information and color representation. Khan explored FRQI and NEQR model further and came up with an improved flexible representation of quantum images (IFRQI) [
34]. In this model, every pair of bits was represented using angle, enabling single qubit to store information equivalent to 2-bit grayscale values. This method significantly enhances the precision in retrieving the original image data. In the following year Grigoryan et al. published a new algorithm to store the images in quantum state by using Fourier transform representation [
35]. In the same year, Wang et al. came up with a method called DQRCI (double quantum color images encryption scheme) in which two color images are stored into quantum superposition state simultaneously [
36].