4.1. Experimental Setup
In order to verify the efficiency and accuracy of the proposed method, its performance is evaluated against a number of UIE methods. We conducted several experiments on many real degraded underwater images, and results are analyzed both qualitatively and quantitatively. The proposed method is compared with different underwater image enhancement methods, including DCP [
26], UDCP [
19], IBLA [
27], CEIV [
40], RCP [
21], HLP [
30], and UHLP [
3]. As in the proposed method, the transmission enhancement module is added, so it is worthy to compare it with various transmission refinement methods. Therefore, comparison of transmission map refinement is carried out with methods including guided filter [
31], mutual structure filtering [
32], soft matting [
26], and SD filtering [
25]. The comparative analysis is carried out by using the benchmark datasets named Stereo Quantitative Underwater Image Dataset (SQUID) [
51]. SQUID consists of four underwater image datasets: (i) Katzaa, (ii) Michmoret, (iii) Nachsholis, and (iv) Satil. These datasets contain overall 57 stereo pairs from two different water bodies, namely the Red Sea and the Mediterranean Sea. The ground truth validation relies not only on color but also on the distances of objects from the camera to quantitatively evaluate the transmission maps from a single image.
To compare the results quantitatively, there are a few widely used no-reference quality metrics, namely, Perception-based Image Quality Evaluator (PIQE) [
52], Naturalness Image Quality Evaluator(NIQE) [
53], and the Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE) [
54]; however, these measures have limitations when it comes to underwater images [
55]. Therefore, we used more realistic evaluation measures proposed by [
3] for both transmission map and color restoration from a new quantitative dataset. From (Equation (
2)), the transmission map is a function of both the distance
and the unknown attenuation coefficient
. By considering
, the distances or depth map can be estimated from the estimated transmission map
by taking the log on both sides of Equation (
2). Thus the estimated depth map
can be expressed as
As a result, the improvements in the transmission maps can be compared quantitatively. As suggested in [
3], the Pearson correlation coefficient
between the true depth map
and estimated depth map
through the refined transmission map
can be computed by using the following expression:
where
is the covariance operator, and
are the standard deviations from the true depth map
and the estimated depth map
, respectively. Similarly, for the evaluation of restored color image
, the average angular error
[
56] measure is utilized. The
between gray-scale patches and a pure gray color in RGB space, for each chart, is defined as:
where
is the
patch from the restored image
and
marks the coordinates of the gray-scale patches. The angle is calculated by taking the cosine inverse of the dot product between the intensity values of each pixel in the gray-scale patch of the output image and the pure gray color vector
, divided by the product of their magnitudes and
.
4.2. Analysis
The proposed regularization method involves a number of parameters to recover (Equation (
19)) the degraded input image. These parameters with the values are
,
,
, and
are used. These values were determined empirically. We used the same parameters throughout our experiments to test the underwater images. Now, we will analyze the results of a few images selected randomly from the four datasets [
3] both qualitatively and quantitatively.
First, we selected 8 stereo pairs from the four benchmark datasets. Then two stereo pairs are chosen from each dataset, namely Katzaa, Michmoret, Nachsholis, and Satil, respectively, which can be seen in the first row of
Figure 4. Afterward, the initial transmission maps are computed for each image (second row), where black and white colors represent the variation in transmission values. It is apparent that the edges of the objects are fused with the background, and the variation in the closed-depth regions is not smooth. If the images are recovered based on the wrong initial transmission maps, i.e., by replacing the
in the (Equation (
19)), the resulting images will be of poor quality (third row). Therefore, we come up with a robust non-convex regularizer in order to improve the initial transmission maps (fourth row). The output of the robust regulator (RR) is much better than the initial transmission maps, where the objects are not blended with the background. Also, the edges of the objects are much smoother, which shows a consistent result when it comes to the structure in the spatial domain. Finally, the input images are recovered using the (Equation (
19)) and the refined transmission map
. The resultant restored images (last row) are more natural in terms of visibility compared to the output of initial transmission maps. The details of the images are well preserved with no blocking artifacts, which shows that the proposed method is helpful in improving the initial transmission maps.
Table 1 shows the quantitative analysis of the transmission maps and their outputs using the images from
Figure 4. In this case, we used one measure, i.e., the Pearson correlation coefficient
, for the evaluation of transmission maps (Equation (
21)), and one measure, i.e., average angular psi
, for the evaluation of restored colors (Equation (
22)). For the first measure, the range of desired values is between −1 and 1, where values closer to 1 indicate a better estimation of the transmission maps, while values closer to 0 indicate a poorer estimation. For the second measure, the lower angle values represent a perfect color restoration. It is important to note that the
values are computed on the neutral patches without using any global contrast adjustment. It is apparent that the computed values for
on the refined transmission maps consistently increased (third col.) for each input image. Similarly, the results for the average angular error
on the output images restored using the refined transmission maps give a lower error (fifth col.) compared to that of the initial
values (fourth col.). This clearly indicates that refining the initial transmission maps using the proposed robust regularizer results in better restoration of quality images.
Now, we will look into the performance of the proposed regularization method presented in
Figure 5. In this case, we computed initial transmission maps using the underwater haze line prior (UHLP) [
3] by selecting one image from each dataset (Input). Next, initial transmission maps are refined using various state-of-the-art transmission refinement methods, namely guided filter [
31], mutual structure filtering [
32], soft matting [
26], and SD filtering [
25], respectively, along with our method RR (last column). It is clear that all methods are adequate for improving the initial transmission maps, and each of them (from GF to SDF) has issues like quick variations in the nearby depth regions, and the object edges are fused with the background.
We also computed output for each refinement method
Figure 6 to see the visibility of objects. Despite some limitations, all methods are sufficient for improving the initial transmission maps. However, the color variations in the edges from the foreground to the background are abrupt from guided filter to SD filtering. Mutual structure filtering and SDF both increased visibility, but the objects have become over-saturated. Also, the visibility of soft matting is poor, and object texture details are lost. On the other hand, it is interesting to note that GF surpassed the other methods, but our methodology, which provides natural results that are free from the above-mentioned artifacts, outperformed GF and other state-of-the-art improvement methods significantly.
Table 2 and
Table 3 present the results of our quantitative analysis on various transmission refinement methods, which includes the initial transmission values and their corresponding color evaluation using the unique measures
(Equation (
21)) and
(Equation (
22)). In
Table 2, we compare the initial transmission values against the refinement methods guided filter, mutual structure filtering, SM, and SDF. Meanwhile,
Table 3 presents the comparison of the restored images from the initial transmission map and refined map using the aforementioned methods, with the images from the Katzaa, Michmoret, Nachsholism, and Satil databases. Overall, these results suggest that our proposed method is a promising transmission refinement approach, and its effectiveness for color correction may depend on the characteristics of the input data.
In another simulation, we conducted a quantitative analysis to assess the performance of our proposed regularization method when applied to various state-of-the-art methods.
Figure 7 presents the quantitative analysis of the images from four different datasets. The
and
values were calculated for each method by replacing their refinement techniques with our robust regularization method. In this analysis, we evaluated several well-known methods for calculating initial transmission matrices: DCP, UDCP, HLP, and UHLP. The graph on the left side shows the improvement in initial transmission matrices, as measured by the Pearson correlation coefficient
. The first bar in each method represents the initial
values, and the subsequent bar of the same color indicates the improvement in
values after using our method. It can be observed that our method was effective at improving the initial transmission matrices in all of the selected methods. Similarly, the graph on the right side demonstrates the improvement in color values, represented by
. The color coding is the same as in the first graph. From this graph, it is clear that our method is able to reduce the average angular error
in all methods. In our experiments, we also observed that our method improved the transmission of UDCP for a variety of images. However, there are some cases where the method fails to meet the assumptions of the model or encounters a low signal, leading to failure.
4.3. Comparison
In this section, we evaluate the performance of the state-of-the-art (SOTA) and proposed methods through both qualitative and quantitative analyses. To compare the results of image restoration, we randomly select a subset of images from four datasets and present them in
Figure 8. The first column shows the input images, and the remaining columns depict the output of each method, including our proposed method. The results for IBLA [
27], UDCP [
19], CEIV [
40], HLP [
30], and UHLP [
3] were generated using code released by the respective authors.
A qualitative comparison reveals that IBLA [
27] effectively restores colors, even though with a slight tendency towards oversaturation across different distances. The method proposed by UDCP [
19], which relies on the dark channel prior, was not able to consistently correct colors in the images. Similarly, the method proposed by CEIV [
40] also failed to consistently correct colors and optimally preserve information in the images. The method proposed by HLP [
30] was not effective in removing haze and contrast from the images, whereas the new method UHLP [
3], proposed by the same author for handling different types of underwater images, is notably better at restoring colors. However, it should be noted that sharp structural edges are not well-preserved in this method. By examining the dehazed results in the last row, it can be observed that our proposed method not only effectively minimizes the haze but also restores the natural appearance of the scenes. Furthermore, the proposed regularizer with structural priors effectively preserves sharp structural edges and depth discontinuities.
In addition to the qualitative analysis, we also present a quantitative comparison in
Table 4 using the images shown in
Figure 8. The corresponding image number is mentioned in the first column. To facilitate the understanding of the evaluation results, we first compute the average angular error
, followed by the output of different underwater image enhancement methods. It should be noted that contrast stretching often outperforms most methods, especially on closer charts. We compared our method to others by using the same code and just changing the size of the images. However, in our case, we do not use any contrast stretching, and it is apparent that our proposed method outperforms various SOTA methods.
Figure 9 illustrates the ground-truth distance and transmission maps for the images presented in
Figure 8. The true distance is displayed in meters and is color-coded, with blue indicating missing distance values due to occlusions and unmatched pixels between the images. The selected images are from the right camera, which results in many missing distance values on the right border of the image, as these regions were not visible in the left camera.
To compare the proposed method with state-of-the-art (SOTA) methods and showcase the diversity, a variety of methods were selected from
Table 5, as well as other popular algorithms well-known for computing transmission maps, including DCP [
26], UDCP [
19], CEIV [
40], IBLA [
27], RCP [
21], UHLP [
3], and the proposed method. These transmission maps are estimated using a single image and are evaluated quantitatively based on the true distance map of each scene. The results indicate that the proposed method effectively refines the transmission by taking into account the structures of the edges and depth, resulting in a much smoother outcome. In contrast, methods such as UHLP that use the refinement method outlined in
Figure 8 do not perform well in transferring structural information from guidance to transmission maps. Methods like DCP and RCP fail to refine the transmission, and the reasons for this failure are discussed in the quantitative evaluation below. Moreover, the proposed method is also robust to missing distance values issue, as it can handle occlusions and unmatched pixels between the images.
The quantitative comparison evaluated the performance of various methods for estimating transmission maps from a single image using the Pearson correlation coefficient
as the evaluation metric. The results summarized in
Table 5 indicate that the proposed method yields more accurate transmission maps compared to other methods. Some methods, such as UDCP [
19] and CEIV [
40], do not prioritize physically valid transmission maps, which may be due to their use of high values in the veiling light region to suppress noise. Most of the other methods show far-off transmission values, with the exception of IBLA [
27], which performs slightly better on average but with the drawback of a highly color-biased scene correction. The proposed method outperforms the state-of-the-art methods without any contrast adjustments. Additionally, it is also worth noting that methods such as UDCP [
19] and CEIV [
40] may prioritize different aspects such as noise reduction, while the proposed method focuses on physically valid transmission maps.