1. Introduction
Against the background of extensive urbanization, cities and conurbations experience rapid expansions, especially in developing countries. Vast areas of rural land are changed into impervious surfaces such as buildings and roads, which generally absorb and re-radiate solar radiations effectively [
1,
2,
3,
4,
5]. These easily-stored solar radiations and artificial heat generated from productions, transportations, and civilian life lead to severe urban heat islands (UHI) [
6,
7,
8,
9]. This phenomenon, as a chief corollary of urbanization, critically affects not only the urban environment and ecology but also human health [
10,
11,
12]. Consequently, how to mitigate these impact of the UHI effect has become one of the hot research issues since the inevitability of urbanization [
13].
Research has been proliferating on the analysis of the patterns of UHIs, their influences, and corresponding adaptive strategies [
14,
15,
16,
17]. These studies are usually based on the relationship between land surface temperature (LST) and land cover (LC) [
18,
19,
20], which play crucial roles in describing the urban thermal environment. The outcomes generally confirm that LST is negatively correlated with the abundance of vegetation while it is positively correlated with the percentage of impervious surfaces area [
21,
22]. Some studies also pay attention to the difference of regions, seasons, and even moments during a day. Tarawally et al. [
23] compared the responses of LST to LC changes in the long term between the coastal and inland cities. According to Roth et al., for some American cities, the difference between urban and rural LSTs in the daytime was larger than that at night [
24]. Furthermore, to reduce interference from other factors and assess SUHI quantitively, some researchers provided and examined evaluative indexes [
25,
26]. Pongracz et la. [
27] found that the surface urban heat island intensity (SUHII) in nine central Europe cities demonstrated appreciable monthly variations.
Most of the researches on SUHI have been carried out by using thermal infrared (TIR) satellite remote sensing techniques. It is evident that the copious data accumulated from satellites over the decades suffice to enable the accurate determination of temporal and spatial change patterns of LC, LST and SUHI on both the macro- and meso-scales [
28,
29,
30]. However, the scale and spatial resolution of imageries may occasionally determine the conclusions for studies. Luan et al. [
31] analyzed the urban green land in Beijing with Landsat TM5 data with 120 m spatial resolution and showed that the patches’ perimeter, area, shape index, and fractional vegetation cover had no significant correlation with their cooling range on their surroundings; conversely, Gao et al. [
32] obtained completely different conclusions by aerial photos with 0.25 m spatial resolution. Thanh Hoan et al. [
25] regressed the multivariate relationship between land-use composition and LST, which had reasonable results at a specific window size only in the summer season. Thus, there is also a need to observe urban thermal environments with multi-scale remote sensing technologies [
33,
34].
Low-altitude airborne platforms as a complement to satellite-based remote sensing technologies are more suited for local or micro-scale studies [
35,
36,
37]. The conventional method to collect TIR aerial data involves the use of manned airborne platforms, which can supply exceedingly high-quality imageries through the big thermal infrared multispectral scanner (with the requirement of a heavy load) [
38,
39,
40,
41,
42]. However, logistical hindrances such as the costs of commissioned flights and limited technical resources circumscribe its wide application. Accordingly, a variety of unmanned airborne systems have in recent years been considered as an alternative due to their low costs and flexibility. Unmanned airborne systems with visible and TIR camera have hitherto been employed for various civil, industrial, and agricultural research and management [
43,
44,
45], but rarely for research on local SUHI and its change patterns. This situation is partially due to technical restrictions such as the undesirably small observation areas of unmanned aerial vehicles (UAV), short flight duration (generally about 15–20 min) and overly low cruising altitudes (generally under 200 m) [
46,
47]; and also partially due to the lack of effective methods for common and inexpensive devices. Therefore, a reliable and affordable method is necessary to assess the SUHI on a local scale.
In this study, we first present a workflow to assess the LC’s impact on SUHI, based on the visible and TIR images with high spatial resolution captured by an unmanned airship system (UAS) [
35] in the Sino-Singapore Guangzhou Knowledge City in 2012 and 2015. Moreover, we assess the accuracy of the LC classification and land surface temperature (LST) retrieval, and the applicability of some well-known indexes from satellite remote sensing field on a local scale, and finally analyze the spatial and temporal changes in the SUHI patterns.
3. Methodology
This study aims to determine the effective method to assess SUHI patterns based on LCTs and their changes on a local scale. The workflow is as follows (
Figure 3). First, the visible and TIR images obtained by observations are mosaiced and orthorectified to visible and TIR orthophotos. Subsequently, the orthophotos are spatially co-registered to a high-resolution non-offset optical Google map. This is followed by the classification of the LCT from the visible map. Then, an LST map is retrieved from the TIR map based on the algorithm of TH9260. Finally, statistical analyses on LC, LST, and SUHI are performed. The following sections detail these phases and present the relevant data.
3.1. Pre-Processing
The hundreds of visible and TIR images need to be pre-processed, with regard to aspects including the mosaicking, geometric correction, and spatial co-registration. The mosaicking for both kinds of images was performed by the Agisoft PhotoScan software, which has been widely used in UAV image mosaicking [
48]. For the visible orthomosaic, the processing with PhotoScan was twofold. Firstly, the software was deployed to identify and match the feature points between images with the Scale Invariant Feature Transform (SIFT) algorithm, and to reconstruct the scene geometry with Structure from Motion (SfM) techniques [
49]. Then, it applied a dense multi-view reconstruction to the aligned images to establish the scene geometry, and finally exported an orthophoto [
50]. The whole processing was automatic and involved no human interactions. The TIR images obtained from the TH9260 camera can be identified only by its companion software "Image Processor Pro II"; accordingly, the ASCII format file with the temperature information of each pixel had to be exported and converted into the Tiff format image through the IDL program in the ENVI software. For the TIR images, the subsequent mosaicking was performed in a manner similar that for the visible images [
51].
The other pre-processing step is to perform spatial co-registration for the visible and TIR orthophotos, which is necessary for subsequent analyses on LCT and LST. We chose the non-offset optical Google map with geographic coordinates in the study area as the reference. The visible and TIR orthophotos in 2012 and 2015 were loaded into ArcMap and co-registered to the Google map with twenty as possible as evenly-distributed, clear, and discernible feature points e.g. intersections of roads and corners of houses in all maps. The second-order polynomial transformation was selected for this process in ArcMap.
3.2. Classification and Accuracy Assessment
Given the central role of the algorithm in ensuring successful classification, five supervised classification algorithms were deployed in the ENVI software [
52]: Maximum likelihood, parallelepiped, minimum distance, Mahal distance, and neural network classification algorithms [
53]. Based on land-use characteristics in the study area and spectral characteristics of the visible image, LC was identified and classified into six types: Dry farmland (including crops and lawns), paddy field, woodland, bare land, built-up land, and water body. After the classification, accuracy assessments were performed by comparing the consistency of validation data (over 500 checkpoints for each LCT) and the classification results for the different algorithms. Overall accuracy and kappa coefficient, as well as user’s accuracy and producer’s accuracy, were used as quantitively assessing indexes [
54]. Finally, each LCT was extracted from the classified maps for discerning the changes in the areas during the period of interest in ArcMap.
3.3. LST Retrieval and Accuracy Assessment
Due to functional limitations of the TH9260, traditional satellite remote sensing methods cannot be directly applied. The LST retrieval has to be performed based on the corresponding function in the "Image Processor Pro II" software, which provides a retrieval algorithm based on the parametric correction of atmospheric transmittance, mean atmospheric temperature, and emissivity. This process is fourfold. First, we retrieved LST from the brightness temperature (BT) in the images (including a representative LCT) by correcting these aforesaid parameters. Next, we extracted the BT and LST of some corresponding sample pixels to regress their linear relationships. Then, we used the mask to extract the TIR maps for each LC according to the previous LC classification. Finally, we engaged spectral calculation to obtain the LST maps of each LCT based on the linear relationship between BT and LST, and then mosaicked them to the whole LST maps for 2012 and 2015.
The atmospheric correction was performed with the MODTRAN 4 model. In this paper, the settings of input parameters were in place [
55]: the model atmosphere was selected as tropical; the atmospheric path was the slant path between two altitudes; the aerosol model was set to Rural Extinction; the default visibility was 5 km; the seasonal aerosol profile was set to Spring–Summer; the weather was fine, and no cloud or rain. The range of wavelengths was from 8 μm to 13 μm, and the wavelength space was set to 0.1 μm. The atmospheric temperature at the first boundary took the observed mean value during the airship’s campaign at flight altitudes (29 °C in 2012 and 28.5 °C in 2015). Assuming the spectral response functions on one broad spectral band of the TH9260 according to a normal distribution, the mean atmospheric transmittances were 0.64 in 2012 and 0.68 in 2015.
Given the lack of in situ measurement of the emissivity of LCTs and non-mixed pixels in our study, reference values released by the MODIS UCSB Emissivity Library were used to retrieve LST [
56,
57]. The mean values of emissivity (
Table 2) are as follows: dry farmland (0.95), paddy field (0.97), woodland (0.96), barren land (0.96), built-up land (0.95), and water body (0.99). After LST retrieval, the surface temperatures recorded synchronously with the in-situ measurements were used to evaluate the accuracy of the retrieval results.
3.4. Thermal Indexes Based on LC
In this section, the thermal effect contribution index (Hi), thermal pixels proportion index (D1), and regional thermal pixels proportion index (D2) were used to discuss the impact of various LCTs on the urban thermal environment.
Hi indicates the impact on the regional mean LST caused by LCTs. For calculating
Hi, we first aggregated the differences between the LSTs of the pixels on each LC higher than the regional mean LST as the initial thermal effect contribution index (
Hi’), and then normalized
Hi’ to compare them among various LCTs.
Hi’ and
Hi are respectively calculated by Equation (1) and Equation (2),
where
Tij represents the LST of pixel
j on the land cover
i more than the regional mean LST,
Ta0 the regional mean LST of the study area,
ni the pixel numbers on the LCT
i more than the regional mean LST. A greater
Hi implies a more significant thermal effect of the LCT.
On the other hand,
D1 and
D2 are respectively represented by:
where
D1 represents the ratio of the area with LST above the mean in an LCT,
D2 the ratio of the area with LST above the mean in the total study area, and
Ni the pixel numbers on the LCT
i, and
N the total pixel numbers in the study area.
3.5. Normalized LST Level Map
Given differences in climate background, it will be challenging to directly compare LST maps across different years [
58]. Accordingly, the normalized LST (NLST) could maintain the spatial distribution of LST and eliminate the influence of varied climate background, providing an objective comparison [
59]. The LST maps need to be normalized by Equation (5) [
60]:
where
LSTi’ is the normalized LST of
i pixel,
LSTi the LST of
i pixel,
LSTmax and
LSTmin respectively the maximum and minimum LST values in a map. Such normalization uniformly rescales the distribution of LST to the 0-and-1 range, as well as reduces the climatic differences.
Then, we classified the NLST into different thermal field levels (from 3 to 6) through the equalization method and the mean value-standard deviation (STD) method (
Table A3), and chose the preferred method and numbers of corresponding thermal field levels based on the results (
Figure A1,
Figure A2,
Figure A3 and
Figure A4) [
61]. Subsequently, the area of each level and SUHI pattern were calculated and investigated during the period of interest. The primary purpose of the classification of the NLST was to understand the changes in the statistical distribution of temperature data and mitigate SUHI.
3.6. SUHI Ratio Index Computation
In order to further study SUHI variations between 2012 and 2015, the surface urban heat island ratio index (
SUHIRI) is computed, as follows:
where
m represents the level number in the previous normalized step,
n the number of the levels higher than the mean temperature, Wi the weight of the levels higher than the mean temperature (the order value of the level in
m, for example , the weight value of high and extra-high levels are 5 and 6, respectively), and
Pi the area proportion of the SUHI level of
i in the study area [
59,
61].
5. Discussion
The primary purpose of this study is to explore the LCT’s impact on SUHI on a local scale by a visible camera and a TIR camera. We first used the visible imagery (RGB channel) to classify six LCTs and obtain the sufficient classification accuracy in 2012 and 2015, according to the recommended 85% to 90% overall accuracy [
53]. Limited by the property of the RGB channel, it is hard to classify more LCTs without other wavebands and geographic data [
64]. For instance, woodland (with its relatively dark color, texture, and shadow) and dry farmland (with its light and uniform color) can be easily distinguished. Conversely, dry cropland and lawns are hardly distinguishable due to their similar texture and the same spectral feature on the RGB channel. They were therefore considered in one LCT in our study.
From
Table 7 and
Table 8, the difference of LST
mean between heat and cooling sources is more significant than the mean of absolute deviation and RMSE. It is very reasonable that the retrieved LST is are acceptable for assessing LST patterns. Moreover, the problems of LST retrieval in our study are different from in satellite remote sensing field. Firstly, LST on the map with the higher spatial resolution usually exhibits more variabilities, especially for the built-up land [
35,
39]. Therefore, the error control of LST retrieval in our study tends to be relatively challenging. Secondly, more LCTs with accurate emissivity are able to obtain more accurate LSTs in a high-spatial-resolution map. However, it is inconvenient to measure the real value of emissivity for every surface in practice and hard to classify more LCTs only using the RGB channel. Fortunately, our results are acceptable based on the data from the MODIS UCSB Emissivity Library [
57]. Finally, the atmospheric transmittance calculated from MODTRAN is easily influenced by atmospheric parameters such as the absorption coefficients of atmospheric molecules and of atmospheric aerosols in satellite-based LST retrieval [
65,
66], while the impact of atmospheric transmittance is much smaller at the shooting height of around 600 m.
We introduced several indexes widely used in satellite remote sensing to assess the thermal environment and SUHI, and their application on a local scale. A decrease in the LST was registered by built-up land, dry farmland or bare land, paddy field, and water body or woodland (the descending order was in agreement with most results from satellites [
67,
68,
69]). While the LSTs between different LCTs exhibited more significant differences than the result from the satellites due to the presence of mixed pixels [
70,
71]. Similarly, the higher spatial resolution shows more details in surface materials and their properties, revealing why the RMSE of LST in each LCT derived from satellites are generally smaller in summer daylight [
39,
72]. For an LST map with high spatial resolution, the LST is determined mainly by the thermal properties of materials such as thermal conductivity and heat capacity. The range of its fluctuations is closely related to the heterogeneity of the materials. The built-up land usually consists of numerous surface materials (such as concrete, asphalt, and metal) and has greater thermal capacity and conductivity. Therefore, the highest mean LSTs and the larger fluctuations of LSTs (most in 2012 and second most in 2015) are found on built-up land. The wider gap of LSTs between different LCTs shows the potential that the local thermal environment can be improved by utilizing the heat transfer between the different LCTs [
73]. It is not clearly illustrated by satellites. Accordingly, compared with the application of satellite remote sensing on an urban or city scale, low-altitude remote sensing is more suitable for assessing the impacts of landscape composition, spatial configuration, and even materials on a local or micro scale. For example, paying attention to the importance of small cold sources can improve the micro thermal environments [
74,
75]. More specifically, the proper compositions between built-up land and water body or woodland need to be ensured and the cool pavements or roofs need to be used [
76,
77,
78]. In the meantime, the difference of LSTs and their fluctuated ranges caused by different resolutions demonstrates that it is improper to directly compare their SUHI intensities, even with the same time and place. This has been presented by Sobrino (STD of LST is from 4.4 °C to 0.8 °C with spatial resolution from 4 m to 1000 m in the same district) [
34].
The LST and LC classification maps depict the patterns of LST and LC and their relationship but do not accurately appear the thermal contribution of the various LCTs. Then, LCT-based indexes i.e. Hi, D1, and D2 are demonstrated to be applicable to our local map. Hi highlights the LCT’s impact on the SUHI: the built-up areas of 9.36% and 18.19% contributed to SUHIs of 48.11% and 60.35%, respectively. To compare the impacts of LC changes on SUHI in the situation with high spatial resolution, we attempted to classify the LSTs with the equalization method and the mean value-standard deviation method, as well as different STD multiple (x = 1,1.5, and 2). We finally determined the most suitable combination (the mean value-standard method with x = 1.5 and LN 6) in our study area. Similar to the LST map, the NLST level map is closely related to the LCs and their shapes, which indicates the importance of LCs together with their compositions and materials. In addition, the role of the woodland and water body as the high-quality cold source is especially highlighted. Woodland and water body were both located on the non-SUHI areas, whereas some of the dry farmland as a kind of vegetation was situated on the high temperature area, even the lawns beside the Phoenix Lake in the extra-high temperature area in 2015. It indicates the differences between woodland and lawns in mitigating SUHI. It also allows us to elucidate the principle for regulating the microclimate on the local scale: more trees with intense evaporation such as arbors, rather than large areas of lawns. The change of landscape composition, spatial configuration, and materials tends to be crucial for the improvement of the civilians’ thermal comfort and reduce their heat stress.
6. Conclusions
We studied the surface thermal characteristics and its spatial and temporal changes in the Sino-Singapore Guangzhou Knowledge City in 2012 and 2015. Meanwhile, we examined the workflow and assessed the accuracy of LC classification, LST retrieval, and applicability of some indexes for evaluating SUHI on a local scale.
Our results reveal that the common digital camera and TIR camera mounted on an unmanned airship can conveniently and affordably capture fine and high-spatial-resolution images, thereby enabling detailed characterization of visible and TIR images for complex landscape composition and spatial configuration.
Several detailed conclusions are drawn:
- (1)
The supervised maximum likelihood method yielded satisfactory results in classifying the high-spatial-resolution imagery; the overall accuracy was 92.88% and 90.32% in 2012 and 2015, as well as the Kappa coefficient was 0.91 and 0.88. Whereas, cropland and lawns were indistinguishable in the RGB channel.
- (2)
The deviation of the retrieved LST and RMSE indicate that the retrieval results are acceptable for assessing LST distributions and further SUHI patterns.
- (3)
The LSTs demonstrated greater consistency with LCTs and more fluctuation within an LCT on a local scale than on an urban scale. Built-up land contributed to the highest SUHI intensity, while woodland and water body had the most cooling contribution. Additionally, more significant discrepancies in LST were found between built-up area and woodland on a local scale, indicating that proper landscape composition is an excellent way to mitigate SUHI and improve thermal comfort in the local microclimate.
- (4)
Indexes from the satellite field such as the Hi and SUHIRI are applicable for a local scale, but the application of the NLST level map necessitates the judicious selection of parameters.
- (5)
Low-altitude remote sensing data can provide a reference for local regulator planning, in comparison to satellite remote sensing data, which provides a reference for urban planning. More specifically, for the local scale, landscape composition, spatial configuration, and even materials are of greater concern.
Furthermore, the following works are envisioned: 1) To continue observing this study area to check the SUHI change whether the aim of urban planning is achieved upon completion of the development; 2) and to further investigate the relationship between the parameters (e.g., landscape composition, spatial configuration, and materials) and SUHI.