The actual information capacity of holograms is so high that, in reaching for the goals set for creating holographic television, a number of stumbling blocks have been encountered, such as transmitting holographic information via available communication channels. Direct assessment of the amount of information when transmitting 3D signals frame by frame shows that one such holographic channel covers the whole radio frequency region, including all radio and TV channels. Only fiber optic channels of telecommunication and direct transmission systems with laser beams may theoretically be capable of transmitting such large volumes of information that dynamically vary from frame to frame. However, first, such broadband waveguide channels are limited by both their hardwired network or direct view distance, and second, by the fact that they are still in the pre-design stage.
This paper shows the possibility of transmitting full-fledged 3D holographic information via conventional communication channels by transmitting two two-dimensional signals, representing the 2D texture of the surface of a 3D object and a 2D map of its surface (mask). The introduction (
Section 1.1) discusses the need to find ways to compress highly informative holographic information and presents an overview of modern works close to this topic (
Section 1.2) and a methodology for describing holographic processes in the language of spatial frequencies (
Section 1.3). The results of numerical analysis of spectral texture and mask properties are demonstrated (
Section 2.1) using a case of a real 3D human portrait image. A hologram-creating technique is also described (
Section 2.2). A hologram case is used to explain the reason for inefficiency of direct mathematical coding of hologram structure and the problem of observing parallax under the frame of the applied mathematical model using the spatial frequencies language (
Section 2.3). The advantages of transmitting holographic information about a 3D object with two 2D signals are also highlighted (
Section 2.4).
Section 2.5 presents the results of direct transmission of such signals over a conventional communication channel with a frame frequency of TV, which ensures the creation of a hologram at the receiving end of the channel and the restoration of a 3D image from it. In the last section (
Section 2.6), some applications are considered simply, without unnecessary mathematical transformations, arising from the technology under consideration, including in the field of hyperspectral images and multiplexing of 3D images. Finally, in
Section 3, the main results of the work are formulated.
1.1. Density of Recorded Holographic Information
Practically right after the first manuscripts on holography were published [
1], researchers started focusing on the potential such holograms have for storing information. Research studies [
2,
3] described the first optical circuits in dedicated holographic storage devices based on the fact that a hologram writes down practically all of the data on the monochromatic wavefront that is used for recording information. An image restored with a hologram has transverse resolution
h, as in any traditional system of image transfer, which is defined with the so-called Rayleigh criterion [
4], equal to the Airy disk diameter Equation (1):
where λ is the emission wavelength,
D is the transverse size of the hologram, and
f is the distance to a point of the restored image. Owing to the high resolution, the order of magnitude of the amount of data
N that can be written to such a hologram [
2,
3] can be assessed with Equation (2). Further practical studies [
5,
6,
7,
8,
9,
10,
11,
12,
13] made significant allowances which were related, on the one hand, to the constraints on data density in a holographic storage device and, on the other hand, to finding ways to increase recording density. Potential data recording capacity
n in a hologram can be estimated approximately according to Equation (2), based on [
5,
6,
7,
8,
9,
10], from which it can be defined as the number of the points the size of which is defined by the Airy disk diameter Equation (2):
where Ω =
D/
f is relative aperture of the imaging system, in this case a hologram, and
S is its area. It follows that when Ω = 1, which is typical of a good optical imaging system, the data recording density in a hologram corresponds to approximately one bit over the area equal to λ
2, which in the visible range corresponds to a spatial frequency of two to three lines per µm.
For pictorial holography of A4 sized (210 × 297 mm) portraits of real people, which can be compared to a 14” TV screen, when λ = 0.63 µm, the number of points in a hologram reaches n = 1.6 × 1011, which in binary recording is equal to the same amount of data measured in bits. That is approximately n = 2 × 1010 bytes ≈ 20 GB and is equivalent to the amount of information contained in a 60 min Full HD movie. Transmission of such arrays with a refresh rate of 25 Hz needs channels with throughout capacity C = 5 × 1011 B/s.
The bandwidth in this case is more than 100 GHz, which would more than cover the radio band available now. Actually, some of the first holographic TV developers [
14] wrote: “Due to high specific density of holographic information, it is necessary to register and transmit a great number of discrete elements of holograms via communication channels, and their resolution and processing rate exceed the operational capabilities of existing TV devices and standard telecommunication channels.” In this regard, Yu. N. Denisyuk, one of the founders of holography [
15], pointed out that we do not yet know the fundamental principles of holography required for creating new types of 3D films and artificial intelligence.
Large volumes of holographic information, which are undoubtedly a great advantage in other spheres, in the domain of holographic TV and augmented reality play a dirty trick on developers, and this has come to be one of the critical deterrents to transmitting holographic information via telecommunication channels. The solution to this challenge would be to either reduce the size of the holographic image significantly (which is hardly compatible with the objectives of augmented reality) or compress the holographic data in a way that the volume would not exceed reasonable limits (i.e., the capability of state-of-the-art 3G and 4G communication channels). The paper focuses on solving this problem.
1.2. Overview of Modern Works
The complexity of the task postponed the implementation of a full-fledged 3D holographic TV and a number of 3D augmented reality tasks. Stereo image broadcasting systems that do not require a large volume of transmitted information occupy only two times the frequency band of the communication channel. However, they were rejected. Relatively recently (in 2016), the TV companies Samsung and LG [
16] phased out stereo TVs. One reason is the inconvenience of viewers having to use glasses. Sometimes, helmet-mounted TV stereo attachments are mistakenly called holographic monitors [
17]; they are also inconvenient to use.
Another relatively economical method of transmitting volume to TV is Pleno technology, well developed by the Joint Photographic Experts Group (JPEG) [
17]. In a traditional form, such as with the use of a microlens plate, it is poorly suited for presenting a classic 3D TV signal, since it does not provide reproduction of a large image volume. It has proven itself only as a method of adjusting the focus for non-sharp photo and video shooting.
The transmission of information of 3D objects using their representation in holographic form, when a dynamic 3D image (at least 25 frames per second) is formed directly in the space in front of the monitor, where the observer is located, could be the start of a new era of television and augmented reality.
The decision to present holographic content by the Joint Photographic Experts Group has not yet been made, although many options are being considered, and open databases of holographic content have been created to test methods proposed by various researchers for testing 3D images, compressing the information stored in them and reproducing them, preferably without loss, or at least with low loss, as in classic JPEG encoding. All of these examples of open holographic content have low spatial resolution of the reconstructed image and are difficult during digital processing with the 25 frames per second frame rate required for TV and 3D augmented reality. However, these examples do not provide a method for obtaining them. Additionally, it can be reasonably assumed that none were transmitted over a conventional radio channel with a frequency of at least 25 frames per second.
The same group of respected experts has planned work toward the creation of JPEG Pleno, developing a standard that will facilitate the capture and representation of light fields in model of point cloud, similarly to the research in this direction [
18] and the works in holographic image modes [
17]. Work is actively underway toward saving computer resources when compressing holographic images in JPEG format using deep learning [
19] and other methods of saving computer resources when calculating optical transformations [
20].
However, the problem of transmitting a holographic signal over a conventional communication channel was not directly discussed. Basically, all researchers agree on one thing: for the current level of computing technology, its performance is still insufficient for processing, and, as can be understood, for transmitting large amounts of holographic data [
21,
22,
23].
The technology for 3D object hologram synthesis closest to that described in this paper is in [
24], in which a hologram is also formed by a pair of frames with mask + texture, although in a different way, by convolution with the impulse response of the system. This method consists of two stages: obtaining an impulse response and then convolving with it. We will use another method based on the Fresnel transform, which is carried out in one stage of calculations. In addition, questions about the transmission of holographic information through communication channels, the properties of multiplexing, and hyperspectral capabilities are not addressed in [
24].
It is noteworthy that, today, the main efforts of developers are focused on mathematical methods of processing received holographic signals, often divorced from physical processes. This consists of polygonal and voxel graphics, born from the idea of a point cloud, only in the first case an elementary object from a point is modified into a triangle, a network of which covers the indivisible surface of a 3D object [
25,
26,
27], and in the second into cubes, an array of which creates a 3D picture of a three-dimensional, dynamically changing object, with the possibility of expanding it, such as the creation of splashes from a moving and developing wave [
28,
29,
30].
Such technologies require significant computing resources. For example, the complexity of N-point cloud technology increases proportionally to N3, polygonal to N9, and voxel even more, which is a serious obstacle to their practical implementation. They are mathematical models of real space, far from the physical essence of transmitted signals.
In our opinion, relatively little attention has been paid to the physical process of registering 3D holographic information more precisely to the object-oriented computational process of creating holograms. The rapidly increasing speed of processing any information has created the illusion that soon polygonal arrays and even voxel arrays will be processed as easily as point clouds (although the latter is not yet suitable for holographic TV in terms of processing speed). However, the actual transmission of such arrays over a distance requires new communication lines, such as an optical cable, over which the spectrum of the transmitted signal exceeds the radio range by orders of magnitude. Direct transmission of a full-fledged 3D holographic signal, a point cloud, or a polygonal and voxel signal via conventional radio communication channels is impossible in principle, since one such transmission occupies the entire radio range available to mankind.
In this paper, a more physical approach is considered, which is a method of photographically registering the surface texture of a 3D object and a map of its surface depths, which comprise the initial information for generating a full-fledged hologram at the receiving end of the communication channel. This method is somewhat similar to the point cloud method, but it is simple and computationally much less laborious. It is suitable for dynamic reproduction of holograms reconstructed at the receiving end of the communication channel, and the field of view is quite large.
Most importantly, it allows us to transmit the necessary 3D holographic information over a conventional radio channel and create a 3D hologram after receiving the signal at the receiving end of the communication channel. Informationally, it is similar to the well-known SSB (single-sideband modulation) method, but it has some advantages (for example, in resolution). An analysis of the comparative spatial-frequency characteristics of the 3D image transmission methods of texture + mask and SSB speaks in favor of the first method.
Using holographic images when creating virtual 3D reality is the most promising, since they are basically adequate for the physical structure of the light field of the holographic object. Close to this is the method of transmitting 3D information considered in this paper using two two-dimensional pictures: the surface texture of the 3D object of holography and a depth map of the surface (mask) of the transmitted 3D image. In this method, a hologram is created already at the receiving end of the communication channel, which significantly reduces the amount of information transmitted. Such a hologram is a full-fledged analogue of the classical one and can restore not only imaginary 3D images observed behind the monitor screen but also real ones observed in front of the screen, directly in the volume of space where the observer is located. It is this advantage that allows us to create 3D virtual reality. A related effect is the ability to record and economically transmit hyperspectral images over the communication channel, expanding the range of the visible spectrum.
This paper is devoted to the study of the features of one method of representing 3D signals and transmitting them over a conventional, non-broadband communication channel, but allowing the synthesis of a full-fledged hologram at the receiving end, restoring a 3D image of a holographic object with the high resolution of a Full HD or even 4K frame.
1.3. Amplitude and Phase Modulation in Holography
To discuss the options for transmitting 3D holographic information, it is convenient to use the terminology of the spatial frequency spectra of holograms. Since the material realization of synthesized digital volumetric reflective holograms according to the scheme of Yu. N. Denisyuk [
1] with the current state of technology is not possible, we will consider, without limiting the generality, the classical scheme for recording transmitting analogue holograms according the scheme of E. N. Leith and J. Upatnieks [
31], presented in
Figure 1. In terms of the linear response of holographic material, the structure of interference fringes is formed with firing object and reference beams Equation (3):
where
U1(
x1,
y1) and
R1(
x1,
y1) are complex amplitudes of the electromagnetic light field in the hologram recording plane (
x1,
y1), which in scalar approximation of diffraction theory represent object and reference beams, respectively. When recording such a fringe pattern onto high-resolution photosensitive material under the effect of
I(
x1,
y1), photoresponse ΔΨ(
x1,
y1) takes place, which is variable in the hologram field. In terms of the amplitude response of the holographic material, it is a material transmission variance τ(
x1,
y1), while for a phase response it is an alteration in eikonal Δφ(
x1,
y1), which occurs due to either a change of local thickness in the hologram field Δ
l(
x1,
y1) or the refraction index varying over the hologram field Δ
n(
x1,
y1), or combination of all three mechanisms in the ratio defined by the physical and chemical properties of the holographic material itself [
32].
Further, for clarity and to avoid limiting the general nature of the basic conclusions in this paper, we take I(x1, y1) << I0 over the whole hologram field, with a photoresponse ΔΨ(x1, y1) as an amplitude coefficient of transmission τ(x1, y1).
The structure of such a hologram is written as Equation (4), where the multiplier prior to cosine constitutes the visibility of interference fringe
V;
Ia and
Ir represent the intensity of the object and reference beams, respectively; and under the cosine there is local phase difference of φ
a (object) and φ
r (reference beams).
Actually, the two phases change in space. In the reference wave, this often occurs just in linear fashion, when it is flat and falls at an angle to a normal line. In the object wave, it is formed with a Fresnel image from the signal formed by an object. The spatial spectrum of such a hologram is formed on the basis of the fundamental wave, which has the meaning of carrier spatial frequency ω =
k∙sin(θ
r) equal to the average rate of change of phase difference in the space Equation (4), where θ
r is the reference beam angle to the optical axis. Deviation caused by the shape of a holographic object also has an impact on the spatial frequency of a holographic grating. The modulation depth of the spatial spectrum is defined by the visibility of fringe pattern
V. The carrier frequency Equation (5) defines the rotation angle of a reconstruction beam tilt when it comes to a hologram:
while its deviation Ω is the diffraction spectrum of waves which form the image of the object recorded in a hologram Equation (6):
The hologram forms spatial harmonics of carrier frequency ω with wave-number vector
k and lateral harmonics ω ± Ω with wave-number vectors
k ± Δ
k, similar to the phase-modulated signal known in radio engineering [
33].
They generate diffraction of the reconstructed wave into angles ± θ
x1, ± θ
y1 with respective periods
dx1 and
dy1. Likewise, deviation of spatial frequencies of the object wave generates gratings with periods from
dx1min to
dx1max and from
dy1min to
dy1max Equation (7):
Therefore, when structure τ(
x1,
y1) is illuminated with a reconstruction beam, the radiation restored with a hologram has the key spatial harmonic with period
dx1, which defines the rotation angle of the beam, as well as the two lateral harmonics with periods from
dx1min to
dx1max, the spectrum of which forms the object image in the plus first and minus first orders. A tilt angle of reference wave θ
x1 is chosen considering the need to separate beams of zero and minus first order in space to have them at distance
z from the hologram (angle θ
y1 is often chosen to be zero for convenience), while the angles limited by Δθ
x1 and Δθ
y1 are defined by the spatial spectrum of the object wave. In a somewhat simplified one-dimensional case for flat uniformity in terms of the sectional area wave, the holographic signal intensity range Equation (4) is similar to the one-tone signal spectrum Equation (8):
This represents the zeroth harmonic, which restores beam RV, on which it generates the zeroth order of refraction with ω = kxsin(θV), where k = 2π/λ, θV is the restoring beam tilt angle to an optic axis, and two more beams, plus and minus first orders, responsible for restoring the actual image on the spatial frequency one ω = kx[sin(θV) − sin(θx1)] and a inverted image from the frequency one ω = kx[sin(θV) + sin(θx1)].
It is obvious that the spectrum Equation (8) is similar to the range of frequency-modulated radio signals, where the desired signal with spatial frequency Ω widens the range of carrier frequency ω to the right and left along the spatial frequency axis. In addition, Equation (4) also shows some amplitude modulation. It becomes apparent in modulating the interference fringes with visibility coefficient V, which spatially widens the area of the zeroth and plus/minus first order harmonics and depends on distribution of object and reference signal intensities over the hologram field. Therefore, although there are some differences, in general Equation (4) shows that there are amplitude and phase modulations in the spectrum of the recorded hologram similar to amplitude and phase modulation in radio engineering.
Actually, the carrier spatial frequency ω is high. It can be compared to a reciprocal wavelength of the record, and its period is equal to only a few micrometers, while the object spectrum ±Ω is defined by its degree of complexity. Even in high resolution (e.g., Full HD standard), the greatest possible spatial frequency is about 2000 lines over a screen area, and with a 36” screen (~0.9 m), it would be about three lines per mm. The difference of almost three orders of magnitude compared with the estimations in Equation (2) enable us to rely on high compression of holographic information when encoding it in one sideband (SSB method) [
34]. The conventional method of horizontal and vertical deflection for transferring frame data point by point includes transferring all spatial frequencies of a hologram. With regard to redundancy during direct encoding of a holographic image, when transferring spatial frequencies where all the frequencies are encoded, including those that do not have information about the object for forming a hologram located between ω = Ω and ω =
kxsin(θ
x1) + Ω is not justified, though conventionally this method can be used to transfer holographic information [
14,
17,
35]. The greater the difference between the carrier frequency ω =
kxsin(θ
x1) and its deviation Ω =
kxsin(Δθ
x1) caused by the diffraction-limited beam divergence of the object wave, the more effective the information compression will be.
Consequently, similar to [
34], it is possible to transmit not the whole phase-modulated signal with the spectrum Equation (8) representing the whole hologram, but only the component that corresponds to the single sideband (SSB) with the spatial frequencies Ω =
kxsin(Δθ
x1), bearing some information about the texture of the object for forming a hologram and its surface map, which is referred to as a mask in this paper.
For this, it is enough to have a photograph of the texture and the surface map taken with the help of any method (mask). For simplicity, we did not use the cutting-edge ToF method. Although it may have bright prospects, we chose a simpler method of structured light [
36]. Significantly reducing the volume of transferred information and keeping things comparatively simple, the structured light method allows the opportunity to record information about a living 3D object in real time, to transfer pairs of frames via a conventional radio channel, and restore a 3D image at the receiving end of the telecommunication channel by synthesizing a hologram according to the received information and digital input of a spatial frequency into it.
In order to restore an image at the receiving end of the transmitting channel, it is required to re-input the carrier frequency. Since there is no simple method for adding a carrier frequency to a two-dimensional holographic signal yet, it is suggested in the patent [
37] to synthesize a hologram by a numerical method at the receiving end of the communication channel using the accepted mask and texture.
Although it is shown above that transferring 3D information about a holographic object in the form of texture + mask is similar to transferring a spatial spectrum, these two methods have significant differences, such as in the resolution of the reconstructed image, which is shown below in the computational simulation. The spatial-spectral representation is simply convenient for understanding the processes occurring during the recording of holographic information, its transmission over the communication channel, and the restoration of a 3D image at the receiving end of the communication channel.