1. Introduction
The near-infrared (NIR) is one of the regions closest in wavelength to the radiation detectable by the human eye. Unlike human eyes, sensors based on silicon (
SiO) are sensitive to NIR up to 1100 nm, limited by the cut-off value of silicon. Due to the proximity of NIR to visible radiation, NIR images share many properties with visible images. However, surface reflection in the NIR bands is material dependent. For instance, most dyes and pigments used for material colorization are somewhat transparent to NIR. This means that the difference in the NIR intensities is not only due to the particular color of the material, but also to the absorption and reflectance of dyes. Therefore, the NIR intensity provides the useful information pertinent to material classes rather than the color of that object [
1].
Recently, there have been several attempts to use NIR band information. In remote sensing applications [
2,
3], the multispectral images observed in a variety of spectrum bands have been used where both the visible and NIR bands are included. As each spectral band provided different kinds of information, the spectral bands were selectively used in the observation of the multispectral images. In surveillance cameras [
4] and night vision cameras [
5], the NIR band is used especially under low lighting conditions or invisible NIR lighting conditions. The NIR band is also used in biometric [
6], face matching [
7] and face recognition [
8] applications, which have been studied based on the intrinsic reflectivity of the skin or eyes under NIR illumination. Since the reflection in NIR is material dependent, it is also used in material classification [
1] and illuminant estimation [
9]. NIR images can be used in image enhancement applications, such as image dehazing [
10].
To develop an NIR image acquisition system, Kise
et al. designed a three-band spectral imaging system composed of multiple cameras with a beam splitter [
11]. This imaging system has been used to acquire multispectral images in user-selected spectral bands simultaneously by utilizing three interchangeable optical filters and various optical components. Similarly, Matsui
et al. implemented a multispectral imaging system, where two infrared cut-off filter (IRCF)-removed cameras were used to capture the color and NIR images independently [
12]. In this system, the IRCF-removed cameras were perpendicularly aligned, and the IRCF was used as a light splitter for the visible and NIR bands. By managing the shutter of two cameras with a single controller, each spectral band image pair was acquired, simultaneously. However, this imaging system requires a large space to attach two or more cameras and to perform the alignment process. Due to the lack of portability of these devices, multi camera-based imaging systems are not suitable for practical outdoor environments. C. Fredembach [
13] suggests another approach in which an IRCF-removed single camera with multiple optical band pass filters can achieve smaller sizes than multi-camera systems. On the other hand, this imaging system requires too much time to change the optical filters. Because of this weakness, some artifacts, like motion blur and registration problems, can occur during the image acquisition process.
As an alternative approach, an IRCF-removed color filter array (CFA) image sensor, such as a Bayer image sensor without an IRCF, can be used [
13]. By using a single digital camera without an IRCF, the spectral information of the visible bands and that of the NIR bands can be acquired at the same time.
Figure 1 shows a conventional camera system approach with an IRCF and a spectral sensitivity of a complementary metal-oxide semiconductor (CMOS) imager integrated with traditional RGB Bayer filters. By removing the IRCF, the NIR contribution to the RGB channel can reach the CMOS imager. This additional NIR information can be used to allow for invisible monitoring in surveillance applications.
On the other hand, mixing color and NIR signals at the pixel level can result in extreme color desaturation if the illumination contains sufficient amounts of NIR. Although it may be possible to overcome the unwanted NIR contribution to the RGB color channel through the signal processing technique, it is hard to estimate the NIR spectral energy in each RGB color channel, because there is no way to detect the NIR band spectral characteristics.
As an improved system based on a single image sensor, an imaging system based on the multispectral filter array (MSFA), which simultaneously obtains visible and NIR band images, can be considered [
14]. A pixel configuration of the RGB filters and another NIR pass filter, which transmits NIR light only, is shown in
Figure 2. In the following descriptions, we refer to the four channels as RGBN channels, where RGB represents the red, green and blue channels and N represents the additional channel for the NIR band.
Because these sensors based on the RGBN filter array need to acquire invisible range information, removing IRCF is necessary. Without IRCF, RGB and NIR signals can be obtained simultaneously. Because of this advantage, imaging systems based on MSFA sensors can be applied to a wide variety of applications. Under certain circumstances, especially low lighting conditions, this system can obtain wide spectral information simultaneously. Furthermore, by applying fusion technology that uses NIR band information, gaining additional sensitivity to colors that do not deviate considerably from the human visual system is possible [
15].
However, without IRCF, the additional NIR component penetrates through the color filter to each R, G, B pixel. The unwanted NIR interference distorts the color information of each R, G, B color channel.
Figure 3 is an example of an imaging system based on the MSFA image sensor. Many researchers studied the interpolation method, such as [
15,
16,
17], to make a full resolution image in each RGBN channel. Since the input RGB signals contain NIR, natural RGB color information needs to be calculated by subtracting an NIR band component from the input RGB signals that have been deteriorated with NIR interference. During the process, the NIR channel information in the N pixel can be used to remove the unnecessary NIR contribution to the RGB channel. After restoring the color information of the RGB channel from the input signal received through the MSFA image sensors without an IRCF, a fusion method can be applied to generate the new blended images, which have not only natural color information, but also additional NIR spectral information. To take advantage of this benefit, it is necessary to restore natural color. As a result, the IRCF can be removed day and night with color restoration process.
In recent studies, researchers have proposed CFA for one-shot RGB and NIR capture in NIR imaging. However, the studies do not consider color restoration [
16,
17]. Although [
18] addresses both crosstalk and demosaicing, it assumes crosstalk between the green and NIR channels only. Chen
et al. proposed a color correction pipeline [
15], which is able to apply only specific NIR illumination. The color correction method in [
15] does not guarantee successful color correction results if the illumination spectrum is widely distributed in an NIR range. Furthermore, in [
19], NIR restoration was proposed; however, the method does not consider crosstalk in a visible range in an NIR channel. The IR removal method proposed by Martinello
et al. considers the crosstalk happening in the near IR range of 650 nm to 810 nm. This method assumes that the contribution from the wavelengths in the visible range (
λ < 650 nm) to the IR channel can be ignored. On the contrary, the proposed color restoration method divides the visible and NIR bands to estimate the color correction matrix. In the visible band, the crosstalk in the N channel is estimated by using the linear regression of RGB channels by using the N channel decomposition matrix. By removing the estimated crosstalk in the N channel, the N channel information in the NIR band is obtained. The N channel information in the NIR band is used to estimate the NIR contribution to the RGB channels by using the RGB channel decomposition matrix. In this way, the proposed method copes with the different spectral responses of the visible and invisible bands, respectively. Furthermore, the proposed method considers the crosstalk happening in the near IR range from 650 nm to 1100 nm.
We proposed a brief idea to restore color information with an RGBN sensor [
20]. However, since we focused only on color restoration under generally bright illumination environments, our previous work did not have good performance in low light conditions. In this paper, we proposes a color restoration method that removes the NIR component in each RGB color channel with an imaging system based on the IRCF-removed MSFA image sensor. To investigate the color restoration method for various illumination environments, we analyze the change of the chromaticity feature obtained by the additional NIR. In addition, the color restoration method for the low lighting condition based on the spectral energy distribution analysis is proposed. Since color degradation caused by IRCF removal is a huge limitation, the NIR contribution to each RGB color channel needs to be eliminated. To remove unwanted NIR components in each RGB channel, the color restoration model was subdivided into two parts of the spectral estimation and the spectral decomposition process.
The remainder of this paper is organized as follows: In
Section 2, we discuss the problem that arises when a color image is acquired with the IRCF-removed MSFA image sensor. In
Section 3, we analyze the color model of an IRCF-removed MSFA image sensor. In
Section 4, we outline our proposed color restoration method with spectral estimation and spectral decomposition. In
Section 5, we present our results and compare our solution to another state-of-the-art method. In
Section 6, we provide a conclusion.
2. Color Degradation
To analyze the change of the chromaticity feature by the additional NIR, the RGB color space was converted to the HSI color space, as in [
21]:
where
represents the minimum value among three values.
H,
S and
I represent the hue, saturation and intensity, respectively.
In
Figure 4, the NIR band is divided into two sub-bands: we define these sub-bands as a chromatic NIR band (700 nm∼800 nm) and an achromatic NIR band (800 nm∼1100 nm), respectively.
Figure 5 shows that the responses of the achromatic NIR bands are identical. To obtain achromatic NIR band information, we used an NIR band pass filter that passes a specific wavelength (800 nm∼1100 nm). The distribution of 96 color patch values in the Gretag color checker SG shows a linear response in the achromatic NIR band with respect to the NIR channel. Based on this, we define these responses as a constant at each pixel, such as
. The
,
and
represent the achromatic colors of the image sensor beyond an 800-nm wavelength in each channel. As a result, the RGB intensities at a pixel position are represented as:
where
represent the chromatic colors of the image sensor under an 800-nm wavelength.
With the RGB color values with offset
δ, the intensity of the observed color is defined as follows:
where
represents the intensity of the chromatic spectral band of the image sensor. The intensity of the IRCF-removed MSFA image sensor is changed by the amount of the offset value. The hue value in Equation (
1) is redefined as:
Because the achromatic offset value
δ is removed during subtraction, an identical offset on the RGB channels could not change the hue value. Finally, the saturation value is described as:
where
represents the saturation of the chromatic spectral band of the image sensor and
. Since the range of
is
, the saturation of the image obtained by the IRCF-removed MSFA image sensor is degraded and becomes smaller than the image obtained by the chromatic spectral band of the image sensor.
Figure 6 describes how NIR affects the RGB color images. The illuminance was 200 lx, and the exposure time was 0.03 s. When objects are illuminated by an incandescent lamp, an image sensor with an IRCF obtains a yellowish hue due to the low color temperature of the illuminance. After performing a white balance technique from the grey color patch, a white-balanced color image was obtained as shown in
Figure 6b. On the other hand, due to the additive NIR intensities included in the RGB channels,
Figure 6c appears brighter than
Figure 6a, and low color saturation was observed in
Figure 6d.
To correct desaturated color from the input image acquired by the MSFA image sensor, several conventional methods can be considered, as described in [
22,
23]. A straightforward method is to train the matrix to reproduce a set of known reference colors. Given the observed color vector
and the visible band color vector with canonical illuminance
, the color correction method is represented in a matrix form:
where Φ is a matrix whose component corresponds to the ratio between the canonical and the current illuminance value of each channel. The illuminant color estimation was performed under unknown lighting conditions where pre-knowledge based approaches, such as gamut mapping [
24] or the color correlation framework [
25], were used.
However, color degradation caused by IRCF removal is not considered a multiplicative process, but an additive process. Applying a conventional color correction approach to the RGBN images yielded poor results, because it did not sufficiently remove the NIR contributions to the RGB channels. The higher the energy in the NIR band relative to that in the visible band, the higher the color errors caused by NIR contributions to the RGB signals. As a result, the conventional color correction method restored visible band color in a limited way. Although each color was obtained under the same illuminant conditions with and without an IRCF, respectively, the mixture of the exclusive NIR band intensity to the visible band intensity resulted in severe color distortion.
Figure 7 shows the result of the conventional color correction method for an MSFA image. In
Figure 7c, the color correction matrix worked well for colors in the color chart with low reflectance in the NIR band. However, despite the fact that the colors of the black paper and velvet paper were the same in the visible band, the conventional color correction method could not restore the black color with high reflectance in the NIR band (such as fabric substance).
3. Color Model of an IRCF-Removed MSFA Image Sensor
A color image observed by a CMOS image sensor can be modeled as a spectral combination of three major components: illuminant spectra E(
λ), sensor function
and the surface spectra S(
λ). The color image formation model in the visible band for channel
k was defined as [
26]:
where
represents the spectral range of the visible band between 400 nm and 700 nm. Since an IRCF-removed MSFA image sensor can acquire the additional NIR band spectral energy beyond a 700-nm wavelength, the range of these three major components in Equation (
7) had to be expanded to the NIR band. The observed camera response for channel
k when using the IRCF-removed MSFA image sensor is represented by the color image formation model
[
19] from Equation (
7):
where
represents the NIR band beyond 700 nm.
and
represent the camera response for channel
k by using the IRCF-removed MSFA image sensor in the visible band and the NIR band, respectively. For an image sensor with RGBN filters, the intensities at each pixel position are represented as,
In Equation (
9), each pixel contained additional NIR band information. Since this additional information can be helpful to increase the sensitivity of the sensor, this feature can be useful under low light condition. However, mixing color and NIR intensities can result in color degradation if the illumination contains high amounts of NIR. To restore the RGB channels corrupted by NIR band spectral energy, the additional NIR band components (
,
,
) in the RGB channels have to be removed:
Since the spectral response function of the RGBN filter is not defined only in the NIR band, we used a signal processing approach to estimate the NIR band response. To decompose the spectral information of the RGBN channel, the unknown value
or
must be estimated. To cope with the different characteristics of the correlation in the visible band, as well as the NIR band, we set the correlation model in each sub-band, separately. In the visible band, the RGB channel filters show different peak spectral responses, while the N channel filter covered all spectral ranges without outstanding peaks. As a result, the N channel filter response function is modeled as a linear combination of the others:
where
,
and
represent the coefficients that show cross-correlation in the visible band. Since the spectral response of the N channel in the visible band covers a wide spectral range without an outstanding peak, those coefficients are constrained to be constant in terms of the wavelength [
27]. Using the constrained weights, the intensities of the N channel in the visible band are approximated as follows:
where
and
represent the visible band cross-correlation coefficients obtained by the linear transformation model:
where
is a one by three matrix describing the mapping between the RGB to N channel values. The transformation
is obtained by solving the following minimization function:
where
and
are matrices whose components are the NIR and the RGB components. Each cross-correlation coefficient could have been of any arbitrary form determined by the illuminance change and the spectral response of the sensor. As a result, the function
ω depends not on the spectrum
λ itself, but on the spectral response of the illuminance and the sensor.
Figure 8 represents the comparison between the optical filtered N channel image in visible band and estimated N channel image in the visible band by using Equation (
12).
In the NIR band, the cross-correlation is derived more intuitively, since the RGBN filters are all pass filters where the filter responses are highly correlated in the NIR spectral range. Since there is an energy difference between the two spectral ranges in the N filter response, the cross-correlation coefficients in Equation (
12) have to be modified. To cope with the different energy ratios in the visible and the NIR bands, the response of the N channel in the NIR band is:
where
is the inter-spectral correlation coefficient that considers the visible band to the NIR band energy balance.
Figure 9 represents the comparison between the optical filtered N channel image in the NIR band and the estimated N channel image in the NIR band by using Equation (
15).
4. Proposed Methods
The purpose of the proposed method is to restore the original color in the visible bands from the mixed wide band signal. However, the color restoration in the spectral domain is an underdetermined problem, as described in Equation (
9). Since MSFA image sensors have additional pixels whose intensity was represented in Equation (
9), we redefined this underdetermined problem with eight unknown spectral values.
From Equation (8), the observed intensity vectors of the multispectral images are represented as . To focus on the color restoration at each pixel position, we assumed that the spatially-subsampled MSFA image was already interpolated. As a result, there are four different intensities at each RGBN pixel position.
In
Figure 4, the spectral response of each channel is described with the corresponding RGB and N values. The energy of the NIR band is obtained by the RGB color filters, as well as the N filter. Similarly, a large amount of the energy in the visible band is obtained by the N channel. By considering the observed multispectral intensity vector
, the spectral correlation between the channels in the visible band and the NIR band resulted in a mixture of exclusive responses in each channel, as represented in Equation (
9).
From the sub-spectral band intensity mixture model, the color restoration problem is defined to find the unknown visible band intensity values from the observed intensity values R, G, B and N, which contained the unknown NIR band intensity values and the unknown visible intensity values.
4.1. Color Restoration Based on Spectral Decomposition
When we spectrally decompose the N channel to the visible and NIR bands, the given N channel is represented by the RGB channel intensities in the visible and NIR bands from Equation (
12) and Equation (
15):
In Equation (
16), the observed N channel is described with unknown RGB values in the visible bands and the NIR bands. Therefore, the decomposed N channel is obtained indirectly from Equation (
16). Corresponding to the spectral response of the N channel, we define the artificial N channel
made by using the observed RGB channels and the visible band cross-correlation coefficients in Equation (
12):
Since the visible band cross-correlation coefficients are designed to fit the N channel in the visible band, the estimated
value resembles the N channel filter responses in the visible band, but not in the NIR band. By using the energy difference between
N and
in the NIR band, the observed N channel is decomposed into the two bands by subtracting the original
N channel in Equation (
16) and the artificial N channel
in Equation (
17):
where
is a scaling factor and
represents the artificial N channel in the NIR band from Equation (
15). Based on Equation (
18), we decompose the spectral response of the N channel into two different channels, the visible band and the NIR band. The N channel information in the NIR band is recovered from the N channel that contained the energy of the entire spectrum of the MSFA image sensor. As a result, the decomposed N channel intensities in the NIR band and the RGB channel intensities in the NIR band are estimated from the result of Equation (
18).
Figure 10 shows the relationship of the RGB channel intensities and the N channel intensity of 96 color patches of the Gretag color checker SG in the NIR band. As described in
Figure 10, they are asymptotically linear in the NIR band. From this linear correlation, the decomposed RGB channel in the NIR band is defined as follows:
where
,
and
represent the coefficients of the linear correlations between the RGB channels and the N channel in the NIR band. From the equation, the intensities of the RGB channel in the NIR band are estimated, and this color restoration model was processed with a single matrix transformation of:
where
is:
where
is the N channel decomposition matrix,
is the RGB channel decomposition matrix and
is a 3 × 4 matrix of zeros with ones along the leading diagonal. The N channel decomposition matrix
is defined as:
and the RGB channel decomposition matrix is defined as:
Based on Equation (
21), the unified matrix
M is:
where
is a scaling factor in Equation (
18),
,
,
are the coefficients for the linear combination in Equation (
11) and
,
and
are the coefficients that represent the linear correlation between the RGB channels and the N channel in the NIR band in Equation (
19). Because Equation (
24) is a combination of cascaded linear decomposition matrices
and
, the proposed color correction matrix is more flexible than the simple 3 × 4 linear color correction model. Further, because the sensor response function over the entire band is nonlinear, color correction error is inevitable when the linear color correction method is employed. Moreover, there is an energy difference between the visible and NIR bands. The spectral response of the local spectral band can be approximated to a linear model. On the basis of linear model approximation of each local spectral band, the proposed method separates the visible and NIR bands to estimate the color correction matrix and, thereby, obtain a more accurate estimation of the NIR interference in each RGB channel. Using
, the proposed method decomposes the N channel to the visible and NIR bands and uses the NIR band information obtained from
to estimate the NIR contribution in the RGB channels. The correlation between the RGB and N channels in the NIR band is estimated using
. Because the proposed method separates the visible and NIR bands to estimate the color correction matrix (CCM), it is possible to estimate the correlation between RGB and NIR in various illumination environments.
Figure 11 shows the experimental results obtained under an incandescent lamp with 300 lx illumination. Because the incandescent lamp emits an amount of spectral energy in the NIR band, we selected this lamp to show the advantage of the proposed method. By comparing
Figure 11b and
Figure 11c, the level of restoration of the overall colors of each color patch can be ascertained. In
Figure 11a, which is the target optical filtered image, it can be seen that some color patches are slightly different. To investigate the color restoration accuracy, we calculated angular error.
Table 1 shows the average angular error. From
Table 1, it is clear that the proposed method restores color better than the linear 3 × 4 color correction method.
4.2. Low Light Conditions
Because of the additional NIR band information, an IRCF-removed MSFA image sensor has advantages in low visible light conditions. From the perspective of color restoration, however, there is no advantage, since the unnecessary NIR interference to the RGB color channel does not have any visible band color information.
Figure 12 represents the spectral energy distribution of an incandescent lamp with a variety of illuminance values. The correlated color temperature of the lamp is 3000 K. As illuminance decreased, the overall intensities of spectral energy decreased, too. In addition, the energy ratio between the visible band and the NIR band varied as the illuminance decreased.
Table 2 shows that decreasing illuminance increases the portion of the NIR band spectral energy under incandescent light. The numbers in Columns 2 and 3 represent the summation of the spectrum values in
Figure 12. This implies that 60% of the unwanted NIR contributions in each RGB channel must be removed to obtain a natural color image under an incandescent lamp with 10 lx. Because the NIR contribution is greater than the color information in each RGB channel, it is important to estimate the NIR band spectral information precisely to prevent false color generation.
4.3. Two-Step Color Restoration
In general lighting situations, the proposed color restoration method based on Equation (
24) can decompose the NIR contribution in each RGB channel. However, as mentioned in
Section 4.2, the spectral energy distribution changed under low lighting conditions. Furthermore, the ratio between the visible band and the NIR band changed. Therefore, the estimation of the N channel in the NIR band is more important under low lighting conditions. The color restoration model in Equation (
24) is based on the assumption that the spectral response of the MSFA sensor in the NIR band correlated with the spectral linearity between the RGB and N channels. However, in the 700 nm to 800 nm spectral range, there was a lack of linear correlation between the channels, except for between the R and N channels. If the spectral energy distribution of the light source shows strong energy between this nonlinear range, the spectral decomposition error of the result will increase. Because the visible band information is smaller than the NIR band under low lighting conditions, the spectral decomposition error can produce a false color result.
To overcome this spectral nonlinearity problem, we used a two-step color restoration method that divides the spectral range into two parts and removes the NIR band information sequentially.
Figure 4 represents the two-step color restoration process. In the first step, the intensities of the RGB channel in the NIR band with a spectral wavelength range greater than 800 nm were decomposed using the B channel. In
Figure 13, the ratio between the B channel and the N channel of 96 color patches of the Gretag color checker SG is represented. Since the visible band information of the B channel is quite small under low lighting conditions, there is a strong correlation between the B channel and the N channel whose wavelength is above 800 nm, as described in
Figure 13.
The N channel whose wavelength is beyond 800 nm was approximated from the B channel as follows:
where
γ is the correlation coefficient between the B channel and the N channel above 800 nm.
Figure 14 represents the result of Equation (
25).
Figure 14a is the image obtained with the optical filter, and
Figure 14b is the result of the proposed method after the first step of color restoration. By comparing (a) to (b), the overall colors of the entire image were similar.
After the first step, the remaining NIR intensities in the RGB channel were removed through the spectral decomposition method as proposed in Equation (
24). Based on Equation (
20), the two-step color restoration model can be processed with a matrix equation as follows:
where
P is defined as:
The , and values represent the correlation coefficient between the B channel and the N channel whose wavelength was above 800 nm. The proposed two-step color restoration method was applied to estimate the NIR component of the image obtained under particular illumination situations, such as low light conditions, especially the illuminance of an incandescent lamp under 5 lx. In this paper, we use the proposed method with a two-step color restoration with Equation (26) when the illuminance of the light source is under 5 lx.
From
Section 2, the achromatic NIR component
δ did not affect the hue and saturation value of the images. The achromatic NIR component is not an important part of restoring the color component. Therefore, we estimated the spectral information of the chromatic NIR band precisely after removing the achromatic NIR component
δ.
Figure 15 represents the result of the proposed method under an incandescent lamp with 5 lx.
Figure 15a is the input image, the color of which is desaturated by additional NIR, and
Figure 15b is the optical filtered visible band image.
Figure 15c is the result obtained using the proposed method in Equation (
24) as given in
Section 4.1, and
Figure 15d is the result that was obtained using the two-step color restoration described in
Section 4.3. By comparing
Figure 15c to
Figure 15d, the overall color of
Figure 15c is yellow-shifted, especially in red color patches. Since the spectral energy distribution changed under low lighting conditions, the unified color restoration model
in Equation (
24) was limited in explaining the complicated nonlinear transformation. After removing the achromatic NIR band information, the only concern was the chromatic NIR band used to restore the color information. Since the unified color restoration model
handled the chromatic NIR band information, the color was successfully restored as represented in
Figure 15d.
5. Experimental Results
The proposed color restoration method was tested with images captured under different standard illuminations: sunlight, incandescent lamp, sodium lamp and fluorescent lamp. Since the spectrum of these light sources was spread over a wide range, we used these lights as the target illuminance values as represented by
Figure 16.
As the training set for the correlation coefficients, we used 96 standard colors of the Gretag color checker SG. Because the color samples were distributed widely, these colors were used for the training set. The input multispectral image was obtained by a camera system with an RGBN image sensor without IRCF, and we used a target visible band image with an IRCF as a reference image. The 96 patches were manually segmented, and we used the average RGB of each patch. The resulting average RGB values in the input image and the reference image were used to derive a set of color restoration models in Equation (
24). We also measured the XYZ of each of the 96 patches using a spectrophotometer. If an illuminance value was less than 5 lx, we used an additional optical filter that passes wavelengths beyond 800 nm to derive a set of color restoration models in Equation (27). After setting a color restoration model, the proposed method was applied to an input multispectral image without IRCF. As mentioned in
Section 4.3, we used two-step color restoration when the illuminance of the light source was darker than 5 lx. In our experiment, we measured the illumination level using an illuminometer. In practical situations, the light sensor commonly used to turn on the flash light or changing to night shot mode must be installed to measure the luminance level of the illumination. The light sensor performs the simple role of determining whether the luminance level corresponds to dark or bright. When the illuminance of the light source is brighter than 5 lx, we used the color restoration model in Equation (
24).
As an error criterion, the angular error was calculated. Considering the
Z color sample entities in the training set, the angular error for the
z-th color was defined as:
where
is the angular error between the target color vector
and the color restoration result
. ‘·’ represents the inner product of two vectors, and
represents the magnitude of the vector
. In addition, we measured the color difference ΔE of each color sample in the CIELAB color space defined by:
We regarded the average of ΔE as the color correction error. To convert RGB to the CIELAB color space, the RGB signals were transformed to CIE tristimulus values by using a spectrophotometer with a standard illuminant, after which the CIELAB equation was applied [
28]. The tristimulus values of the illuminant were A, F and D65 with respect to the incandescent lamp, fluorescent lamp and sunlight, respectively. We used a visible band image with IRCF as a reference image that was used to compare to the input image and the result image. As comparative methods for the proposed color restoration algorithm, we implemented the least squares-based color correction method [
29] and the N-to-sRGB mapping color correction method based on root-polynomial mapping [
30].
Figure 17 depicts the experimental results under a fluorescent lamp with 350-lx illumination. Since the fluorescent lamp did not emit NIR, the input image in
Figure 17a and the optical filtered image in
Figure 17b were almost similar. Our proposed method preserved the color of the input image (
Figure 17f) and the other color correction methods (
Figure 17c to
Figure 17e) because of the absence of NIR color distortion in the input image.
Figure 18 shows the experimental result under sunlight, which has a wide range of spectral distribution and abundant visible band information. In this case, it was sufficient to restore color using the proposed method in Equation (
24). Comparing
Figure 18b and
Figure 18c to
Figure 18f, the resulting image of the proposed method restored the distorted color well, especially the materials with high reflectance in the NIR band. The root-polynomial mapping method in
Figure 18e restored the overall colors of each color patch and black materials well. The comparison of
Figure 18b,e shows that the saturation is slightly high. Since sunlight has plenty of spectral energy in visible bands, the root-polynomial mapping restores color information as well as the proposed method. To investigate color restoration accuracy, each method was compared in
Table 3 and
Table 4.
Another set was tested under an incandescent lamp, which emits much spectral energy in the NIR band.
Figure 19a represents the multispectral image obtained under the incandescent lamp. The color channels were white balanced without considering the color degradation caused by the additional NIR; therefore, the overall colors of the image show low saturation and blue hue over much of the NIR band.
Figure 19c shows the result of the conventional color correction method. When comparing
Figure 19c to
Figure 19b, the overall colors of each color patch and object were close to the target image. The comparison of
Figure 19d to
Figure 19f shows that the overall colors of each color patch and object were close to the optical-filtered visible band image (
Figure 19b). However, the color of the objects with high reflectance in the NIR band, such as fabric, leaf, and so on, was slightly different. This means that the accuracy of the NIR estimation was different.
Figure 19f is much closer to the visible color in
Figure 19b because the proposed method separates the visible and NIR bands to estimate the color correction matrix and, thereby, obtains a more accurate estimation of the NIR interference in each RGB channel. The black colors of the fabric patch in the upper side of the image, as well as the doll’s cap and clothes were restored to their original colors successfully.
As discussed in
Section 4.3, the proposed two-step color restoration method is useful under particular illumination.
Figure 20 represents a comparison with and without two-step color restoration under an incandescent lamp at 1 lx. Since the visible band information was less than that of the NIR band in low lighting conditions, the spectral estimation error increased. As a result,
Figure 20c shows a yellow image compared to
Figure 20b. With the proposed two-step color restoration method, the color of the image was successfully restored, as shown in
Figure 20d. Based on this result, we tested the proposed method under low lighting situations.
Figure 21 represents the experimental results under an incandescent lamp at 1 lx. This illumination emits plenty of spectral energy in the NIR band. In
Figure 16, the spectrum distribution of the incandescent lamp is spread evenly over a wide range. In low lighting conditions, the lack of visible band information makes the overall saturation of the images low.
Figure 21c shows that the 3 × 3 CCM-based method could not restore the overall color of the input image (
Figure 21a). By comparing
Figure 21d to
Figure 21f, the overall colors of each color patch and object were close to the optical-filtered visible band image (b). However, the colors of black materials were not restored correctly in
Figure 21d,e. Since the spectral energy of the incandescent lamp under 550 nm and the MSFA sensor response in the blue channel were low, blue information is lacking in the black area. As a result, the blue intensity was boosted during the process of color constancy. Both root-polynomial mapping and our proposed color restoration method are based on least-square linear mapping; therefore, a large amount of NIR spectral energy in low-lighting condition (see
Section 4.2) must be considered. Compared to
Figure 21d,e,
Figure 21f shows that the proposed method restored colors satisfactorily for both the patches and for materials with high NIR component.
Figure 22 represents the experimental results under a sodium lamp at 1 lx.
Figure 22c shows that the 3×3 CCM-based method could not restore the overall color of the input image (
Figure 22a). The spectrum distribution of the sodium lamp is concentrated at a particular wavelength at 830 nm, as shown in
Figure 16. In this case, the sensor spectral response of the local spectral band can be approximated to a linear model. For this reason, the experimental results in
Figure 22d to
Figure 22f show high restoration performance visually. To investigate the color restoration accuracy, each method was compared in
Table 3 and
Table 4.
Table 3 and
Table 4 show the average angular error and the color difference with a variety of light sources. The performance of the proposed method was confirmed visually for materials with high reflectance in the NIR band. However, the performance of the proposed method for various colors in the color chart and substances had to be measured.
Table 3 shows the amount of angular error, where our proposed method outperformed other methods. Since the color of the input image was severely distorted, the angular error between the input image and the optical filtered image was significantly high. After the application of color correction methods, the average angular errors were reduced, and the performance of the proposed method was better than that of the conventional methods. Similarly, the color difference in
Table 4 shows that the color correction results obtained with the proposed method were better compared to the another methods.
In addition, to calculate the gain advantage provided with NIR information, we measured the intensities of the image obtained in various illuminations with or without IRCF.
Figure 23 represents the sensitivity boosting provided by the NIR information. To measure the additional intensities, the image is divided into 16 sections. After that, the intensities are averaged in each section. As shown in
Table 5 and
Figure 23, the sensitivity was boosted by 10 dB without IRCF under an incandescent lamp. On the contrary, because the fluorescent lamp does not emit an NIR component, there is no gain advantage.