Landsat-Derived Annual Maps of Agricultural Greenhouse in Shandong Province, China from 1989 to 2018

Ou, Cong; Yang, Jianyu; Du, Zhenrong; Zhang, Tingting; Niu, Bowen; Feng, Quanlong; Liu, Yiming; Zhu, Dehai

doi:10.3390/rs13234830

Open AccessArticle

Landsat-Derived Annual Maps of Agricultural Greenhouse in Shandong Province, China from 1989 to 2018

by

Cong Ou

^1,2,3

,

Jianyu Yang

^1,3,*,

Zhenrong Du

^1,3,4

,

Tingting Zhang

^1,3,

Bowen Niu

^1,3,

Quanlong Feng

^1,3,

Yiming Liu

^1,3,5 and

Dehai Zhu

^1,3

¹

College of Land Science and Technology, China Agricultural University, Beijing 100083, China

²

Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China

³

Key Laboratory for Agricultural Land Quality Monitoring and Control, Ministry of Natural Resources, Beijing 100083, China

⁴

Ministry of Education Key Laboratory for Earth System Modeling, Department of Earth System Science, Institute for Global Change Studies, Tsinghua University, Beijing 100084, China

⁵

Center of Product Research and Development, China Mobile Communication Group Guangdong Co., Ltd., Guangzhou 510623, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2021, 13(23), 4830; https://doi.org/10.3390/rs13234830

Submission received: 16 November 2021 / Revised: 24 November 2021 / Accepted: 26 November 2021 / Published: 28 November 2021

(This article belongs to the Special Issue Remote Sensing for Future Food Security and Sustainable Agriculture)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Agricultural greenhouse (AG), one of the fastest-growing technology-based approaches worldwide in terms of controlling the environmental conditions of crops, plays an essential role in food production, resource conservation and the rural economy, but has also caused environmental and socio-economic problems due to policy promotion and market demand. Therefore, long-term monitoring of AG is of utmost importance for the sustainable management of protected agriculture, and previous efforts have verified the effectiveness of remote sensing-based techniques for mono-temporal AG mapping in a relatively small area. However, currently, a continuous annual AG remote sensing-based dataset at large-scale is generally unavailable. In this study, an annual AG mapping method oriented to the provincial area and long-term period was developed to produce the first Landsat-derived annual AG dataset in Shandong province, China from 1989 to 2018 on the Google Earth Engine (GEE) platform. The mapping window for each year was selected based on the vegetation growth and the phenological information, which was critical in distinguishing AG from other misclassified categories. Classification for each year was carried out initially based on the random forest classifier after the feature optimization. A temporal consistency correction algorithm based on classification probability was then proposed to the classified AG maps for further improvement. Finally, the average User’s Accuracy, Producer’s Accuracy and F1-score of AG based on visually-interpreted samples over 30 years reached 96.56%, 86.64% and 0.911, respectively. Furthermore, we also found that the ranked features via calculating the importance of each tested feature resulted in the highest accuracy and the strongest stability in the initial classification stage, and the proposed temporal consistency correction algorithm improved the final products by approximately five percent on average. In general, the resultant AG sequence dataset from our study has revealed the expansion of this typical object of “Human–Nature” interaction in agriculture and has a potential application in use of greenhouse-related technology and the scientific planning of protected agriculture.

Keywords:

agricultural greenhouse; annual mapping; Landsat; Google Earth Engine

Graphical Abstract

1. Introduction

With a growing global population, further urbanization and increasing demand for a balanced food supply, protected agriculture has been widely developed all over the world, especially in China [1]. Agricultural greenhouses (AG), as the most typical object of protected agriculture, has been steadily increased throughout the world and reached about at a total area of

3.02 \times 10^{6}

ha in 2016 [2]. Since it has played a great role in the balanced annual supply of food, the utilization of agricultural resources, the increase of farmers’ income and the employment of rural labor, the Chinese government has been vigorously promoting the construction of AG since the late 1980s. China also has the largest area of AG in the world, with a total area of about

1.32 \times 10^{6}

ha as of 2016 [3]. In the meantime, the widespread use of greenhouse-related technology also poses certain threats to ecological and environmental safety, the efficient use of cultivated land and national food security. On the one hand, driven by market demand, with the increase in the number of years of continuous cultivation, especially in terms of irrational fertilizer and water management [4] and plastic waste [5] has resulted in continuous crops [6] and soil biodiversity degradation [7]. On the other, the lack of scientific and unified planning for AG construction [8], especially in its early stage of development, largely caused by farmers’ autonomous behavior [9], has resulted in the scientific design of the structures not being in accordance with the local geographic location and environment. In addition, as one of the typical ways of non-grain cultivation of cultivated land, greenhouse planting has the potential of disorderly expansion under the will of rational economic man, which reduces the possibility of restoring grain planting [10] and threatens China’s most stringent cropland protection policy. In order to prevent and control the surface source pollution caused by the disorderly development of AG, optimize the layout of land for protected agriculture, balance the relationship between non-grain cultivation, ensuring food security, and promoting the sustainable development of agricultural resource utilization, it is urgent to track the long-term dynamics of the AG in the typically protected agriculture regions.

Due to the better availability of satellite images in recent years, remote sensing technology that provides a unique vision for unveiling its growth has been verified to be the most feasible approach to obtain the spatiotemporal information of AG [11]. Several prior studies have proposed various approaches for AG mapping; from the perspective of image classification, it is mainly divided into unsupervised classification and supervised classification. For unsupervised approaches, various index-based approaches have been proposed for AG extraction, such as the Greenhouse Detection Index (GDI) [12], Plastic Greenhouse Index (PGI) or Retrogressive Plastic Greenhouse Index (RPGI) [13], Plastic-Mulched Landcover Index (PMLI) [14] and Index Greenhouse Vegetable Land Extraction (VI) [15]. For supervised classification, the Pixel-based (PB) approach is mainly applied to extract the distribution of AG [16], while the Object-based (OBIA) approach is aimed at AG detection [17,18,19], delineation [20] or even further for greenhouse crop identification [21,22] in a specific protected agricultural area. In addition to the mono-temporal AG mapping studies mentioned, a few studies have focused on the long-term AG dynamics at regional scales. For instance, Picuno et al. analyzed the plasticulture landscapes changes in southern Italy with two-year intervals over 10 years using multitemporal Landsat TM images [23]. Arcidiacono et al. proposed a model to manage crop-shelter spatial development and used it in a highly representative study of the Italian protected cultivation during 1994–1999 [24]. Ou et al. produced seven greenhouses maps in the 1990–2018 period in a typical protected agricultural region of China [25].

However, none of these studies have achieved a continuous AG mapping over a long time at the provincial scale, which is essential for policymakers to have a complete picture of the development of the protected agriculture and to assess its negative impacts from a macro perspective. With the availability of the Landsat archive, a series of annual maps of urban area [26,27,28], cropland [29,30], forest [31,32], lake [33] and land cover [34,35,36] has been carried out. The main challenging part of annual long-term AG mapping is as follows: firstly, the AG is easily mixed with bare cropland and plastic-mulched land at provincial scale; secondly, the construction materials and the planting structure of AG is diverse in different regions and periods, which leads to a strong spatiotemporal heterogeneity of its remote sensing characteristics; and thirdly, the image of individual years is affected by the cloud cover and strip problems, resulting in the inconsistency of annual maps of AG over the long-term.

To address the above issues, an annual remote sensing mapping method of AG oriented to the provincial area and long-term period was proposed in this study. This method integrated the remote sensing characteristics analysis of provincial AG, mapping window selection, classification feature optimization and temporal consistency correction for annual classification results. Based on this method, we developed the provincial AG annual maps in Google Earth Engine (GEE) at 30 m resolution from 1989 to 2018 over Shandong province, China for the first time. The dataset not only provided the first complete description of the annual dynamics of AG over 30 years at the provincial scale, but also has the potential to serve the use of greenhouse-related technology and the scientific planning of protected agriculture.

2. Study Area and Data

2.1. Study Area

The study area is Shandong province, which is part of the eastern coastal region of China, between

34^{°} 22^{'}

∼

38^{°} 24^{'}

N and

114^{°} 47^{'}

∼

122^{°} 42^{'}

E (Figure 1), with a total area of about 157,900 km². This area is dominated by three landforms: mountain, plain and hill, and belongs to a warm temperate monsoon climate zone, with a short spring and autumn and long winter and summer, and the annual average temperature is approximately 11∼14 °C. The drainage area of Shandong province is about

4.8 \times 10^{4}

km², and the average river network density is about 0.24 km/km². Such climate and hydrological conditions favor agricultural production. Therefore, Shandong is not only a major grain-producing province, but also an important economic crop producing province in China, especially in vegetable production. By the end of 2019, Shandong province has ranked first among the vegetable supply provinces in China with an output of 81.92 million tons, accounting for 11.65% of the total vegetable supply in China. In particular, the large-scale specialized greenhouse-based vegetable planting in this area began in the early 1990s and ranks first among the 34 regions for the number of AG, accounting for 15.78% of the AG in China [37], and is the earliest and largest province to develop protected agriculture in China. Therefore, we selected Shandong province as the representative province in China to trace the full change trajectory of AG in this study.

2.2. Landsat Archive

In this study, we used archived Landsat data in its SR (atmospherically corrected surface reflectance) form in GEE from 1989 to 2018 as our remote sensing data, including the Landsat 5 Thematic Mapper (TM), Landsat 7 Enhanced Thematic Mapper Plus (ETM+) and the Landsat 8 Operational Land Imager (OLI) [38], and the extent of the study area was covered by 16 scenes of Landsat imagery (Figure 2a) for each year. After counting the number of available scenes for each year (Figure 2b), and considering the stripe problem of Landsat 7 ETM + due to the SLC-off issue (when the Scan Line Corrector failed and these products have data gaps) after 2003 [39], we finally selected Landsat 5 images for the period of 1989–2001, 2003–2006 and 2008–2010, Landsat 7 images for period 2002, 2007 and 2011–2012, and Landsat 8 images for the period 2013–2018. In total, around 8450 Landsat scenes were collected for the study area over the past three decades. Since the L1T-level Landsat SR products retrieved in GEE have been corrected for the radiometric, topographic, and atmospheric effects [40], we applied the “CFmask” algorithm to create cloud- and cloud-shadow- free Landsat images covering the study area for each year. In addition, in order to reduce the interference of other similar objects in mountainous and hilly areas, the global DSM data (ALOS World 3D) produced by the ALOS satellite launched by the Japan Aerospace Exploration Agency (JAXA) in 2006 with a spatial resolution of 30 m [41] was used to calculate the elevation and slope of the study area.

2.3. Reference Dataset for Supervised Training

The annual maps of AG from 1989 to 2018 were derived separately using a supervised machine learning approach in GEE. The accuracy of the reference dataset has a great impact on the classification accuracy under the supervised learning-based strategy, including the accuracy of samples labeling and the spatial distribution of samples [42]. This study is oriented to the provincial area, which covers an area of more than 150,000 km², while AG in this area are concentrated in several developed regions of protected agriculture. According to the conclusion of the previous study [25], the labeled samples are limited to the above-mentioned regions only, which will lead to large misclassification in other regions. In order to avoid this phenomenon, a 10 km grid sampling structure covering the study area was constructed in this study, which ensured that labeled samples are available in each grid (Figure 3).

Finally, two types of reference datasets, including AG and Non-AG, were labeled via visual inspection of high-resolution imagery such as QuickBird and IKONOS available in Google Earth or Landsat imagery based on the proposed sampling structure. Meanwhile, 70% of this dataset was randomly selected as the training samples and the remaining 30% was randomly selected as the test samples to evaluate the accuracy of classification results for each year. A total of 271,729 samples were finally obtained for the period of 1989–2018, including 44,969 samples for AG and 226,760 samples for Non-AG (Table 1).

3. Methods

This study was aimed at developing a set of continuous annual AG maps at 30 m resolution for Shandong province, China for the period of 1989–2018, and the research workflow mainly included four parts: collecting the Landsat archive and reference dataset, remote sensing characteristics analysis and mapping window selection, input feature optimization and annual classification, as well as temporal consistency correction and accuracy evaluation (Figure 4). The whole procedure was implemented on the GEE platform, which has shown great potential in dealing with massive image processing for multi-temporal classification in previous studies [43,44,45]. Based on the obtained and analyzed Landsat archive and reference dataset (Section 2.2 and Section 2.3), we (1) analyzed the spectral curves of different landcover categories and its NDVI time series, combining the phenological information of cropland and plastic-mulched land, selected the best mapping window within a year for annual AG mapping, (2) calculated the tested features and obtained the best feature combination via evaluated average importance over 30 years for each feature, and derived the initial classifications, and (3) applied a temporal consistency correction algorithm to generate the final annual AG classifications and performed accuracy evaluation. A detailed description of each individual part of the whole procedure is presented in the following sections.

3.1. Remote Sensing Characteristics Analysis

As a kind of agricultural facility to ensure the balanced annual output of crops, the spectral curves of AG will be affected by the change of crop phenology, but theoretically, its seasonal variation is more stable than that of cropland or forest. Therefore, the quantitative analysis of the spectral curves of AG is an important basis for mapping AG. In this study, 600 samples (divided into AG, cropland, impervious, forest, water and plastic-mulched land, 100 samples in each category) were individually labeled on the Landsat 8 OLI imagery in 2018, and the pixel-based surface reflectance of those types was used to analyze its spectral curves in different seasons. In order to reduce the interference of outliers among them, the median value of each band corresponding to each type was used as the SR reflectance of each band.

As shown in Figure 5, firstly, the spectral curves of AG are more stable than cropland, forest and plastic-mulched land and more variable than impervious and water in different seasons, which indicates that vegetation cover is a determinant of seasonal variations in the spectral curves of all land cover types, and differences in vegetation growth are an important point for differentiating AG from other types. Secondly, the spectral curves of AG are similar to those of cropland and plastic-mulched land that leads to a poor separability in autumn and winter, while the spectral curves of AG are obviously different to those of impervious, water and forest that leads to a strong separability in all seasons, which indicates that those non-crop growing land cover types interfere less with the remote sensing identification of AG. Thirdly, in terms of specific bands, the visible band (2, 3, 4) of AG is close to impervious in spring and summer, and close to cropland and plastic-mulched land in autumn and winter; the infrared band (5) of AG is close to the forest in summer, and close to cropland, plastic-mulched land and impervious in autumn, but differs from all other land cover types in spring and winter; the short-wave infrared band (6 and 7) of AG differs from all other land cover types in spring and summer, which indicates that all the above-mentioned bands can play a role for AG mapping, and the key lies in the selection of the mapping window.

3.2. Mapping Window Selection

According to the results of the spectral curves analysis, cropland and plastic-mulched land are most likely to cause misclassification in AG mapping and their differences in vegetation growth during a year are critical to distinguishing them. Therefore, the selection of the mapping window should consider the vegetation growth that best differentiates AG from cropland and plastic-mulched land. Based on this assumption, we constructed the time-series NDVI data of AG, cropland and plastic-mulched land at a 16-day interval from the Landsat 8 OLI imagery in 2018. Specifically, we calculated the median NDVI value of all the samples and used it to represent the NDVI value for each category, and we further smoothed the NDVI time series by the Savitzky-Golay filter to eliminate small fluctuations using a moving window of size 5 and a filter order of 2 [46]. As shown in Figure 6, there are two periods of flourishing vegetation growth within a year, in which the difference of the NDVI value between AG, cropland and plastic-mulched land is pronounced. The first period is day of year (DOY) 73-136 (from 15 March to 15 May), in which the maximum NDVI difference between AG and cropland is about 0.4, and that between AG and plastic-mulched land is about 0.3; The second period is DOY 165-248 (from 15 June to 5 September), in which the maximum difference of NDVI between AG and cropland is about 0.5, and that between AG and plastic-mulched land is about 0.3.

Due to the complex crop structure in AG, we collected the phenological information of cropland and plastic-mulched land in Shandong province (Figure 7). Winter wheat (WW), summer corn (SUC) and spring corn (SPC) are the dominant crop type of cropland in Shandong province, DOY 73-136 are mainly in the reviving, jointing and heading stage of WW and the planting and seeding stage of SPC, and DOY 165-248 are mainly in the ripening and harvest stage of WW, and the seeding, jointing, tesseling and repending-harvest stage of SUC or SPC. Meanwhile, garlic (GA) and cotton (CO) are the dominant crop type of plastic-mulched land in Shandong province, DOY 73-136 are mainly in the differentiation stage of GA and the planting and seeding stage of CO, and DOY 165-248 are mainly in the repending-harvest stage of GA and the square-boll and picking stage of CO. Referring to the crop calendars for the dominant crops in Shandong province, both mapping windows are for the period that the vegetation growth is flourishing. However, as the major gain production crop in Shandong province, WW is usually harvested in late DOY 165-248, while all kinds of crops are usually grown in DOY 73-136, which shows a better differentiation AG from cropland and plastic-mulched land than DOY 165-248. Therefore, combining the smoothed NDVI times-series and the phenological information, we finally selected DOY 73-136 (from 15 March to 15 May) as the best mapping window within a year for annual AG mapping.

3.3. Tested Features

The AG exhibits wider feature variation with respect to both space and time due to the difference in construction type, crop structure and satellite sensors. In order to reduce the mapping error in classification accuracy due to the different contributions of features to AG classification in different years and different regions, the tested features were calculated in terms of spectral-based, index-based, and texture-based features (Table 2). Firstly, based on all available Landsat SR within the selected mapping window, we calculated the median value of the image composite for each spectral band [47], including blue (B), green (G), red (R), near-infrared (NIR), shortwave infrared 1 (SWIR1) and shortwave infrared 2 (SWIR2).

Given that the index-based features can effectively enhance the information of spectral-based features in specific directions, we selected 15 index-based features based on the analysis of remote sensing characteristics of greenhouses and previous research on greenhouse mapping. Enhanced Vegetation Index (EVI), The Normalized Difference Vegetation Index (NDVI), Green NDVI (GNDVI) and Green Red Vegetation indices (GRVI) were used to indicate the vegetation greenness, and Normalized Difference Built-up Index (NDBI) was one of the most popular indices for mapping impervious areas. The Modified Normalized Difference Water Index (MNDWI) and Land Surface Water Index (LSWI) were closed to open surface water bodies and land surface moisture, respectively. Normalized Burn Ratio (NBR) was a good indicator of surface temperature, Bare Soil Index (BSI) and Soil Adjusted Vegetation Index (SAVI) were sensitive to the background information from vegetation and soils, and Normalized Difference Tillage Index (NDTI) was used to distinguish the differences of residue between AG, cropland and plastic-film land. Plastic Greenhouse Index (PGI), Index Greenhouse Vegetable Land Extraction (VI), Retrogressive Plastic Greenhouse Index (RPGI) and Plastic-Mulched Landcover Index (PMLI) were both developed to delineate greenhouse or plastic-film by previous studies. It should be noted that NBR, LSWI and NDTI were first used to identify AG.

Since the construction of a greenhouse can significantly change the texture of the land surface, we further calculated texture-based features based on the gray-level co-occurrence matrix (GLCM) [48]. Previous studies have shown that the blue band is the most sensitive to the greenhouse fraction in each Landsat band [13], therefore we selected the blue band for texture-based features calculation. Referring to the results of previous studies [21,22] and the texture characteristics of the greenhouse in the study area (e.g., contrast, correlation, and entropy, etc.), the eight most common texture-based features were selected and computed via the “glcmTexture” function in GEE, including the angular second moment (ASM), contrast (CON), correlation (CORR), variance (VAR), inverse difference moment (IDM), sum average (SAVG), entropy (ENT) and dissimilarity (DISS).

Table 2. Landsat time-series features. GLCM, gray-level co-occurrence matrix; NIR, near-infrared; SWIR1, shortwave infrared 1; SWIR2, shortwave infrared 2; R, red band; G, green band; B, blue band.

	Tested Features	Description	Reference
Spectral-based features	Median value	Median value of B, G, R, NIR, SWIR1, and SWIR2 bands	[47]
Index-based features	EVI	2.5(NIR-R)/(NIR+6R-7.5*B+1)	[49]
	NDVI	(NIR-R)/(NIR+R)	[50]
	GNDVI	(NIR-G)/(NIR+G)	[51]
	GRVI	(G-R)/(G+R)	[52]
	NDBI	(SWIR1-NIR)/(SWIR1+NIR)	[53]
	MNDWI	(G-SWIR1)/(G+SWIR1)	[54]
	NBR	(NIR-SWIR2)/(NIR+SWIR2)	[55]
	BSI	((SWIR1+R)-(NIR+B))/((SWIR1+R)+(NIR+B))	[56]
	NDTI	(SWIR1-SWIR2)/(SWIR1+SWIR2)	[57]
	LSWI	(NIR-SWIR1)/(NIR+SWIR1)	[58]
	SAVI	(1.5*(NIR-R))/(NIR+R+0.5)	[59]
	PGI	(100R(NIR-R))/(1-(NIR+B+G)/3)	[60]
	VI	((SWIR1-NIR)/(SWIR1+NIR))*((NIR-R)/(NIR+R))	[11]
	RPGI	(100*B)/(1-(NIR+B+G)/3)	[13]
	PMLI	(SWIR1-R)/(SWIR1+R)	[61]
Texture-based features	ASM	Angular Second Moment of GLCM from B	[48]
	CON	Contrast of GLCM from B	[48]
	CORR	Correlation of GLCM from B	[48]
	VAR	Variance of GLCM from B	[48]
	IDM	Inverse Difference Moment of GLCM from B	[48]
	SAVG	Sum Average of GLCM from B	[48]
	ENT	Entropy of GLCM from B	[48]
	DISS	Dissimilarity of GLCM from B	[48]

3.4. Classification and Feature Optimization

As the research flowchart described, we adopted the supervised learning method to produce AG classification for each year. Random forest (RF) classifier, which is a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest [62], has a number of advantages, such as the additional description of data, less manual intervention, the ability to handle high-dimensional data, and the robustness to missing data [63,64,65]. It has also been widely used for large-scale mapping applications [66]. Therefore, the RF classifier was used to generate the annual AG maps in this study. The number of trees (N) was set to 100 according to the total number of features and a large number of experiments [28], and the decision tree divides nodes (M) was set to six based on the arithmetic square root of the total number of features [67].

In addition, for the classifier based on machine learning algorithm, it is not that the more feature dimensions, the higher the final classification accuracy. The fact that certain redundancy between features and exorbitant feature dimension often causes “dimension disaster”, results in the decline of classifier learning ability [68]. During the construction of RF, about 63% of training samples are bootstrapped for the training of the classifier. Meanwhile, about 37% of training samples are not extracted, which is called “Out-of-Bag (OOB)” data. The OOB data not only be used to determine whether the training stage is completed, but also can be used to calculate the importance of each feature. Therefore, we promoted feature optimization by calculating the importance of each tested feature in the process of RF construction.

3.5. Temporal Consistency Correction

The selected mapping window within a year for annual AG mapping is Spring, in which the weather of Shandong province is changeable, which leads to different cloud cover phenomenon every year. In addition, the SLC-off issue of Landsat 7 after 31 May 2003 led to the stripe problem of classification results in 2007, 2011 and 2012. Therefore, it is necessary to modify the long-term mapping results to reduce the mapping errors caused by the inconsistent image quality each year. The long-term mapping error could be reduced by checking whether specific land cover change is reasonable under the temporal context [69], and the basic idea of previous research was to eliminate it by using a temporal filtering algorithm based on the temporal context (previous and after classification results in specific sliding window) [36,70].

In this study, a new algorithm based on classification probability (Probability-based Temporal Consistency Correction, PTCC) was proposed to obtain more consistent long-term mapping results than the individual results. Detailed procedures of the PTCC algorithm are shown in Figure 8, the individual classification results and its classification probability of each pixel were obtained, and then the individual classification results and its adjacent classification probability in the specific sliding window of each pixel were combined. For those pixels that were recognized as AG, if its adjacent classification probability were both greater or equal to threshold a, it would be corrected to Non-AG. For those pixels that were recognized as Non-AG, if its adjacent classification probability were both greater or equal to threshold b, it would be corrected to AG. In light of the demolition and construction characteristics of AG, the sliding window of this study was set to 3 (only the classification probability of the previous year and the next year is considered). Therefore, it can be seen that the key point to implement the PTCC algorithm is to set threshold a and b when the sliding window is determined. Compared to the related research, the revisability of long-term mapping results with different classification probabilities was discussed in this study.

3.6. Accuracy Assessment

In order to evaluate the accuracy of the annual classification results under different scenarios for feature optimization and temporal consistency correction, we calculated the annual classification accuracy based on the confusion matrix. Since the classification of AG in this study is a two-class task, the confusion matrix is a 2-by-2 matrix, including true AG (TAG), false AG (FAG), true non-AG (TNAG) and false non-AG grapes (FNAG). The Producer’s Accuracy (PA) and User’s Accuracy (UA) for each category are calculated as:

U A_{i} = \frac{T P_{i}}{P_{i}} \times 100 %

(1)

P A_{i} = \frac{T P_{i}}{C_{i}} \times 100 %

(2)

where

T P_{i}

represents the total number of pixels correctly classified into a certain category (TAG or TNAG),

P_{i}

represents the total number of pixels of a certain category in the whole classification result (TAG+FAG or TNAG+FNAG), and

C_{i}

represents the total number of pixels of a certain category in the test samples (TAG+FNAG or TNAG+FAG). Finally, The F1-score conveys the balance between PA and UA, which is suitable for the unbalanced samples of the two-class task, is computed as:

F 1_{i} = 2 \times \frac{U A_{i} \times P A_{i}}{U A_{i} + P A_{i}}

(3)

4. Results and Discussion

4.1. Performance of Feature Optimization

In the process of features optimization, we first performed the unbiased estimates of the generalization errors for each feature, and computed the importance of all features from 1989 to 2018. In terms of the different size of OOB data in each year, the importance of each feature was normalized over 30 years, and then the contribution of each feature for AG classification in each year was obtained. As shown in Figure 9a, the importance of spectral features has the smallest variation in 30 years, the importance of index features has the largest variation in 30 years, and the importance of texture features has a slight variation in 30 years. Furtherly, we ranked the average importance of each feature over 30 years. Figure 9b showed that GREEN (0.33) and BLUE (0.28) got the highest average importance among spectral features, NDTI (0.88) and PGI (0.21) got the highest average importance among index features, and SAVG (0.41) and CON (0.35) got the highest average importance among texture features. Notably, we found that NDTI (0.88) and NBR (0.46), which were first applied to remote sensing recognition of AG, ranked first and seventh in the average importance respectively. It means that in addition to the growth of vegetation in AG, the thermal characteristics of AG can also be used as an important factor in AG classification. Finally, all features were divided into four combinations based on the ranked order, including ranked features with one average importance greater than 0.5 (NDTI, GREEN, BLUE), ranked features with two average importance greater than 0.4 (NDTI, GREEN, BLUE, PGI, RED, SWIR1, NBR, NIR, SAVG), ranked features with three average importance greater than 0.3 (NDTI, GREEN, BLUE, PGI, RED, SWIR1, NBR, NIR, SAVG, SWIR2, CON, RPGI, EVI, PMLI, GRVI, MNDWI, DISS), and ranked features with 4 average importance greater than 0.2 (NDTI, GREEN, BLUE, PGI, RED, SWIR1, NBR, NIR, SAVG, SWIR2, CON, RPGI, EVI, PMLI, GRVI, MNDWI, DISS, CORR, VAR, VI, GNDVI, ASM, ENT, IDM, NDBI, NDVI).

We also evaluated the feature-by-feature iteratively F1-score curve based on the order of the ranked features. As shown in Figure 10, the trend of F1-score curves in 1995 (Landsat 5 TM), 2007 (Landsat 7 ETM+) and 2018 (Landsat 8 OIL) were similar, which reached around 0.8 before the top five features and then fluctuated slightly with the increase of features. Specifically, the F1-score reached the highest value (0.881) when it iterated to the top 22 features in 1995, the highest value (0.935) when it iterated to the top 23 features in 2007, and the highest value (0.940) when it iterated to the top 17 features in 2018. It verified that the method based on the multi-year average importance ranking of features could efficiently select those features that have an important impact on the classification accuracy.

Therefore, a total of 10 sets of scenarios (Table 3) were designed to investigate the importance of different types of features for AG classification, and the average and standard deviation of F1-score in each scenario were obtained. From Figure 11 we can see that the average F1-score of scenarios 1 and 2 were equal and significantly higher than scenario 3, but the standard deviation of F1-score in scenario 2 was slightly lower than scenario 1, indicating that the spectral-based and index-based features were more effective in AG classification than the texture-based features, and the index-based features were more stable than the spectral-based features in multi-year AG classification. For scenarios 4 and 5 in which the index-based features and texture-based features were added to the spectral-based features respectively, their average F1-score were improved to 0.844 and 0.836 and the standard deviation of F1-score was reduced to 0.088 and 0.085, indicating that the combination of spectral-based features with index-based or texture-based features can both improve the average accuracy and stability over 30 years. Comparing the average and standard deviation of F1-score in scenarios 6, 7, 8, and 9, which were both ranked feature combinations based on the average importance of the features, we found that the highest average F1-score (0.869) and the lowest standard deviation of F1-score (0.063) were shown in scenario 8, which was also the best performer among the 10 scenarios. All of these results indicated that although each feature showed certain importance for AG mapping, not all of them can achieve the highest and most stable classification accuracy. Therefore, we choose the results of scenario 8, in which the features with an average importance of more than 0.3, as the annual AG classification results.

4.2. Performance of the Temporal Consistency Correction

In this study, we designed 25 combinations based on the PTCC algorithm to explore the conditions under which the classification probability of adjacent pixels in the sliding window of each pixel needs to be corrected. As shown in Figure 12, when the threshold a remains unchanged (i.e., the threshold for judging whether AG needs to be corrected to Non-AG), with the increase of threshold b (i.e., the threshold for judging whether Non-AG needs to be corrected to AG), of the correction results gradually decreased and the standard deviation of F1-score gradually increased, indicating that for Non-AG with a large coverage area, it can be corrected directly if the classification results of the previous and subsequent years were AG (i.e., threshold b equals 0.5). When the threshold b remains unchanged, with the increase of the threshold a, the average F1-score of the correction results gradually increased and reached the maximum value (0.911), indicating that for AG with a small coverage area, the correction should be carried out when the classification probability of Non-AG in the previous and subsequent years were more than 0.8 (i.e., threshold a equals 0.8). Considering the priority of the average F1-score, the correction threshold was finally determined as a = 0.8 and b = 0.5.

Comparing the F1-score of AG before and after correction year by year (Figure 13), it can be seen that the average F1-score of the corrected results has been effectively improved from 0.869 to 0.911. In terms of specific years, the most obvious improvement after the correction occurred in 1996 and 2003, with an improved F1-score by 0.190 and 0.101 respectively. In addition, for 2007, 2011, and 2012 using Landsat 7 ETM + strip problem images, the F1-score of the corrected AG classification was improved by 0.015, 0.028, and 0.047, respectively. In conclusion, the temporal consistency correction based on the PTCC algorithm effectively improved the quality of annual maps of AG in Shandong province, China from 1989 to 2018.

We further compared the results before and after implementing the temporal consistency correction by enlarging the details. As shown in Figure 14, the correction based on the proposed PTCC algorithm effectively eliminated the limitations caused by cloud cover or strip problems, et al. In region (a), the missed information underneath the cloud has been well restored for the cloud contaminated areas. Whereas for data missing areas that are due to the SLC-off issue of Landsat 7, it has been recovered through the temporal consistency correction in the region (b). Besides, the underestimation of AG areas has been considerably complemented in the region (c).

4.3. Annual Maps of AG in Shandong Province from 1989 to 2018

The accuracy of the annual AG mapping product was first assessed for each year via visually-interpreted test samples, respectively (Table 4). Overall, the accuracy of obtained AG sequences was stable and satisfactory, the F1-score of AG within all years are greater than 0.8, and for years 2005, 2009, 2010, and 2011, their accuracies are even better (above 0.95). From the classification accuracy of each year, the average UA of AG from 1989 to 2018 was 96.56%, the minimum value appeared in 2003 (93.67%), the maximum value appeared in 1991 (98.60%), and the standard deviation of UA in 30 years was 1.12%. From 1989 to 2018, the average PA of AG was 86.64%, the minimum value appeared in 1990 (68.31%), the maximum value appeared in 2005 (94.76%), and the standard deviation of user accuracy in 30 years was 7.60%. And the average F1-score of AG was 0.911, the minimum value appeared in 1990 (0.801), the maximum value appeared in 2010 (0.960), and the standard deviation of F1-score in 30 years was 4.33%. In summary, this product had a good agreement between mapped pixels and referenced pixels, which can be used for subsequent analyses or for expansion modeling.

Figure 15 showed the mapped AG areas in each city of Shandong province in 1989, 1999, 2009 and 2018. The map series indicated that greenhouses in 1989 were mainly scattered in the northern plain areas (Dezhou, Jinan, Dongying, Binzhou and Weifang), and distributed in the northwest of Weifang (represented by Shouguang area) in 1999. By 2009, AG in Shandong province appeared in a provincial spread pattern, with concentrated distribution areas including Weifang, Linyi, Jining and Liaocheng, while further expanding on the previous basis in 2018. Generally, it showed the “C” shape spreading trend around the central mountainous area, especially in the north, south and west plain areas. In addition, Figure 16 showed the enlarged AG details in seven typical locations in 1989, 1994, 1999, 2004, 2009, 2014 and 2018, and we can see that the concentrated distribution of AG is often built according to the terrain, diffuses outward around the farmland around rural residential areas, and is divided by rivers and roads.

5. Data Availability

The Landsat-derived annual AG dataset herein is now available in the public domain at https://doi.org/10.5281/zenodo.5633605 (accessed on 15 November 2021). This dataset was tagged in GeoTIFF file format, with a spatial resolution of 30 m under the WGS84 geographic coordinate system, named as “AG_SD_CHN_v01.zip”. A detailed description of the classification system is also provided. The uploaded imagery can be processed using GIS software such as ArcGIS or QGIS.

6. Future Works

Future works should consider several aspects: first, in order to achieve high spatial accuracy and temporal consistency of the final annual AG mapping products over a long time at the provincial scale, the sample acquisition process was still in accordance with the sampling method year by year. The idea of transfer learning should be introduced to explore the spatial and temporal generalizability of the classified samples, reduce the labor cost and improve the intelligence of classification under such scenario. Second, limited by the time span of the study, we can only use Landsat-derived images with a spatial resolution of 30 m, with the development of the Earth observation satellite system and the accumulation of high-resolution remote sensing data, images with higher spatial resolution should be further used in future studies to achieve fine classification of different greenhouse types. Third, the mapping product generated here have a wider coverage and higher spatiotemporal resolution than statistical data and other similar studies. As one of the most typical provinces for the development of protected agriculture in China, Shandong Province has received attention in the fields of industrial development, economic geography and environmental science regarding the socio-economic and ecological impacts brought about by the development of greenhouse-related technology, and it is also a direction worth exploring for future research to break through the disciplinary barriers to achieve interdisciplinary sharing and analysis of this dataset.

7. Conclusions

With the rapid urbanization and agricultural modernization in China, a sequential and fine-resolution AG product is the fundamental information to implement informed and sustainable protected agriculture management. However, previous efforts to map AG have generally used visual interpretation and supervised, and unsupervised methods, and mono-temporal or multi-temporal images at regional scales, while fine-resolution annual AG maps at large scales have rarely been investigated in the current literature. Therefore, to better understand the expansion of AG in the representative province of China, we generated the first Landsat-derived annual AG dataset from 1989–2018 in Shandong province based on the GEE platform. Combining vegetation growth with the phenological information, we found that DOY 73-136 (from 15 March to 15 May) is the best mapping window within a year, which is critical in distinguish AG, cropland and plastic-mulched land in the study area. Furthermore, by employing feature optimization via calculating the importance of each tested feature, the mapping accuracy of each year has been considerably increased. In addition, the proposed temporal consistency correction algorithm played a crucial role in the eventually obtained products, with an improvement of approximately 5% on average. It is strongly recommended that temporal consistency should be corrected based on classification probability when mapping AG over multiple times. Eventually, we got accurate and long-term AG dynamic maps in the most representative province of China, which have an average UA of 96.56%, an average PA of 86.64% and an average F1 accuracy of 0.911. Therefore, the AG maps in this study could provide important reference and scientific guidance for the efficiently planning and managing of protected agriculture. Combined with other data (e.g., environmental data or socio-economic data), it also could serve as the higher spatiotemporal resolution dataset for more comprehensive characterization of its ecological and economic impacts.

Author Contributions

Conceptualization, C.O.; Investigation, J.Y.; Methodology, C.O. and Z.D.; Software, T.Z. and B.N.; Validation, Y.L.; Writing—original draft preparation, C.O.; writing—review and editing, Q.F. and D.Z. All authors have read and agreed to the published version of the manuscript.

Funding

The research was funded by Ministry of land and resources industry public welfare projects (No: 201511010-06).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All relevant data are within the manuscript. The AG dataset produced in this study is now freely available at https://doi.org/10.5281/zenodo.5633605.

Acknowledgments

The authors would like to thank the Google Earth Engine (GEE) team and the user community for their useful feedback during this research process. And thank the journal’s editors and anonymous reviewers for their kind comments and valuable suggestions to improve the quality of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Jensen, M.H.; Malter, A.J. Protected Agriculture: A Global Review; World Bank Publications: Washington, DC, USA, 1995; Volume 253. [Google Scholar]
Briassoulis, D.; Dougka, G.; Dimakogianni, D.; Vayas, I. Analysis of the collapse of a greenhouse with vaulted roof. Biosyst. Eng. 2016, 151, 495–509. [Google Scholar] [CrossRef]
Statistics, N.B. Third National Agricultural Census. 2017; Volume 2.
Min, J.; Zhang, H.; Shi, W. Optimizing nitrogen input to reduce nitrate leaching loss in greenhouse vegetable production. Agric. Water Manag. 2012, 111, 53–59. [Google Scholar] [CrossRef]
Sica, C.; Picuno, P. Spectro-radiometrical characterization of plastic nets for protected cultivation. In Proceedings of the International Symposium on High Technology for Greenhouse System Management: Greensys2007 801, Naples, Italy, 4–6 October 2007; pp. 245–252. [Google Scholar]
He, W.S. Soil problems and countermeasure in facility agriculture in china. Soils 2004, 36, 235–242. [Google Scholar]
Zhang, Y.; Wang, P.; Wang, L.; Sun, G.; Zhao, J.; Zhang, H.; Du, N. The influence of facility agriculture production on phthalate esters distribution in black soils of northeast China. Sci. Total Environ. 2015, 506, 118–125. [Google Scholar] [CrossRef] [PubMed]
He, F.; Ma, C. Development and Strategy of Facility Agriculture in China. Chin. Agric. Sci. Bull. 2007, 23, 462–465. [Google Scholar]
Ge, D.; Wang, Z.; Tu, S.; Long, H.; Yan, H.; Sun, D.; Qiao, W. Coupling analysis of greenhouse-led farmland transition and rural transformation development in China’s traditional farming area: A case of Qingzhou City. Land Use Policy 2019, 86, 113–125. [Google Scholar]
Su, Y.; Li, C.; Wang, K.; Deng, J.; Shahtahmassebi, A.R.; Zhang, L.; Ao, W.; Guan, T.; Pan, Y.; Gan, M. Quantifying the spatiotemporal dynamics and multi-aspect performance of non-grain production during 2000–2015 at a fine scale. Ecol. Indic. 2019, 101, 410–419. [Google Scholar] [CrossRef]
Jiménez-Lao, R.; Aguilar, F.J.; Nemmaoui, A.; Aguilar, M.A. Remote Sensing of Agricultural Greenhouses and Plastic-Mulched Farmland: An Analysis of Worldwide Research. Remote Sens. 2020, 12, 2649. [Google Scholar] [CrossRef]
González-Yebra, Ó.; Aguilar, M.A.; Nemmaoui, A.; Aguilar, F.J. Methodological proposal to assess plastic greenhouses land cover change from the combination of archival aerial orthoimages and Landsat data. Biosyst. Eng. 2018, 175, 36–51. [Google Scholar] [CrossRef]
Yang, D.; Chen, J.; Zhou, Y.; Chen, X.; Chen, X.; Cao, X. Mapping plastic greenhouse with medium spatial resolution satellite data: Development of a new spectral index. ISPRS J. Photogramm. Remote Sens. 2017, 128, 47–60. [Google Scholar] [CrossRef]
Lu, L.; Di, L.; Ye, Y. A decision-tree classifier for extracting transparent plastic-mulched landcover from Landsat-5 TM images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 4548–4558. [Google Scholar] [CrossRef]
Zhao, G.; Li, J.; Li, T.; Yue, Y.; Warner, T. Utilizing landsat TM imagery to map greenhouses in Qingzhou, Shandong Province, China. Pedosphere 2004, 14, 363–369. [Google Scholar]
Perilla, G.A.; Mas, J.F. High-resolution mapping of protected agriculture in Mexico, through remote sensing data cloud geoprocessing. Eur. J. Remote Sens. 2019, 52, 532–541. [Google Scholar] [CrossRef] [Green Version]
Novelli, A.; Aguilar, M.A.; Nemmaoui, A.; Aguilar, F.J.; Tarantino, E. Performance evaluation of object based greenhouse detection from Sentinel-2 MSI and Landsat 8 OLI data: A case study from Almería (Spain). Int. J. Appl. Earth Obs. Geoinf. 2016, 52, 403–411. [Google Scholar] [CrossRef] [Green Version]
Coslu, M.; Sonmez, N.; Koc-San, D. Object-based greenhouse classification from high resolution satellite imagery: A case study antalya-turkey. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2016, 41, 183–187. [Google Scholar] [CrossRef] [Green Version]
Aguilar, M.A.; Bianconi, F.; Aguilar, F.J.; Fernández, I. Object-based greenhouse classification from GeoEye-1 and WorldView-2 stereo imagery. Remote Sens. 2014, 6, 3554–3582. [Google Scholar] [CrossRef] [Green Version]
Agüera, F.; Liu, J. Automatic greenhouse delineation from QuickBird and Ikonos satellite images. Comput. Electron. Agric. 2009, 66, 191–200. [Google Scholar] [CrossRef]
Aguilar, M.A.; Vallario, A.; Aguilar, F.J.; Lorca, A.G.; Parente, C. Object-based greenhouse horticultural crop identification from multi-temporal satellite imagery: A case study in Almeria, Spain. Remote Sens. 2015, 7, 7378–7401. [Google Scholar] [CrossRef] [Green Version]
Nemmaoui, A.; Aguilar, M.A.; Aguilar, F.J.; Novelli, A.; García Lorca, A. Greenhouse crop identification from multi-temporal multi-sensor satellite imagery using object-based approach: A case study from Almería (Spain). Remote Sens. 2018, 10, 1751. [Google Scholar] [CrossRef] [Green Version]
Picuno, P.; Tortora, A.; Capobianco, R.L. Analysis of plasticulture landscapes in Southern Italy through remote sensing and solid modelling techniques. Landsc. Urban Plan. 2011, 100, 45–56. [Google Scholar] [CrossRef]
Arcidiacono, C.; Porto, S. A model to manage crop-shelter spatial development by multi-temporal coverage analysis and spatial indicators. Biosyst. Eng. 2010, 107, 107–122. [Google Scholar] [CrossRef]
Ou, C.; Yang, J.; Du, Z.; Liu, Y.; Feng, Q.; Zhu, D. Long-Term Mapping of a Greenhouse in a Typical Protected Agricultural Region Using Landsat Imagery and the Google Earth Engine. Remote Sens. 2020, 12, 55. [Google Scholar] [CrossRef] [Green Version]
Lyu, H.; Lu, H.; Mou, L.; Li, W.; Wright, J.; Li, X.; Li, X.; Zhu, X.X.; Wang, J.; Yu, L.; et al. Long-term annual mapping of four cities on different continents by applying a deep information learning method to landsat data. Remote Sens. 2018, 10, 471. [Google Scholar] [CrossRef] [Green Version]
Li, X.; Zhou, Y.; Zhu, Z.; Liang, L.; Yu, B.; Cao, W. Mapping annual urban dynamics (1985–2015) using time series of Landsat data. Remote Sens. Environ. 2018, 216, 674–683. [Google Scholar] [CrossRef]
Liu, D.; Chen, N.; Zhang, X.; Wang, C.; Du, W. Annual large-scale urban land mapping based on Landsat time series in Google Earth Engine and OpenStreetMap data: A case study in the middle Yangtze River basin. ISPRS J. Photogramm. Remote Sens. 2020, 159, 337–351. [Google Scholar] [CrossRef]
Waldner, F.; Canto, G.S.; Defourny, P. Automated annual cropland mapping using knowledge-based temporal features. ISPRS J. Photogramm. Remote Sens. 2015, 110, 1–13. [Google Scholar] [CrossRef]
Matton, N.; Canto, G.S.; Waldner, F.; Valero, S.; Morin, D.; Inglada, J.; Arias, M.; Bontemps, S.; Koetz, B.; Defourny, P. An automated method for annual cropland mapping along the season for various globally-distributed agrosystems using high spatial and temporal resolution time series. Remote Sens. 2015, 7, 13208–13232. [Google Scholar] [CrossRef] [Green Version]
Zhang, Y.; Ling, F.; Foody, G.M.; Ge, Y.; Boyd, D.S.; Li, X.; Du, Y.; Atkinson, P.M. Mapping annual forest cover by fusing PALSAR/PALSAR-2 and MODIS NDVI during 2007–2016. Remote Sens. Environ. 2019, 224, 74–91. [Google Scholar] [CrossRef] [Green Version]
Qin, Y.; Xiao, X.; Wang, J.; Dong, J.; Ewing, K.; Hoagland, B.; Hough, D.J.; Fagin, T.D.; Zou, Z.; Geissler, G.L.; et al. Mapping annual forest cover in sub-humid and semi-arid regions through analysis of Landsat and PALSAR imagery. Remote Sens. 2016, 8, 933. [Google Scholar] [CrossRef] [Green Version]
Olthof, I.; Fraser, R.H.; Schmitt, C. Landsat-based mapping of thermokarst lake dynamics on the Tuktoyaktuk Coastal Plain, Northwest Territories, Canada since 1985. Remote Sens. Environ. 2015, 168, 194–204. [Google Scholar] [CrossRef]
Lyons, M.B.; Phinn, S.R.; Roelfsema, C.M. Long term land cover and seagrass mapping using Landsat and object-based image analysis from 1972 to 2010 in the coastal environment of South East Queensland, Australia. ISPRS J. Photogramm. Remote Sens. 2012, 71, 34–46. [Google Scholar] [CrossRef]
Franklin, S.E.; Ahmed, O.S.; Wulder, M.A.; White, J.C.; Hermosilla, T.; Coops, N.C. Large area mapping of annual land cover dynamics using multitemporal change detection and classification of Landsat time series data. Can. J. Remote Sens. 2015, 41, 293–314. [Google Scholar] [CrossRef]
Zhao, Y.; Feng, D.; Yu, L.; Cheng, Y.; Zhang, M.; Liu, X.; Xu, Y.; Fang, L.; Zhu, Z.; Gong, P. Long-term land cover dynamics (1986–2016) of northeast china derived from a multi-temporal landsat archive. Remote Sens. 2019, 11, 599. [Google Scholar] [CrossRef] [Green Version]
Ma, A.; Chen, D.; Zhong, Y.; Zheng, Z.; Zhang, L. National-scale greenhouse mapping for high spatial resolution remote sensing imagery using a dense object dual-task deep learning framework: A case study of China. ISPRS J. Photogramm. Remote Sens. 2021, 181, 279–294. [Google Scholar] [CrossRef]
Loveland, T.R.; Dwyer, J.L. Landsat: Building a strong future. Remote Sens Environ. Remote Sens. Environ. 2012, 122, 22–29. [Google Scholar] [CrossRef]
Markham, B.L.; Storey, J.C.; Williams, D.L.; Irons, J.R. Landsat sensor performance: History and current status. IEEE Trans. Geosci. Remote Sens. 2004, 42, 2691–2694. [Google Scholar] [CrossRef]
Li, X.; Zhou, Y.; Zhu, Z.; Cao, W. A national dataset of 30-m annual urban extent dynamics (1985–2015) in the conterminous United States. Earth Syst. Sci. Data 2019, 12, 357–371. [Google Scholar] [CrossRef] [Green Version]
Tadono, T.; Ishida, H.; Oda, F.; Naito, S.; Minakawa, K.; Iwamoto, H. Precise global DEM generation by ALOS PRISM. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2014, 2, 71. [Google Scholar] [CrossRef] [Green Version]
Cracknell, M.J.; Reading, A.M. Geological mapping using remote sensing data: A comparison of five machine learning algorithms, their response to variations in the spatial distribution of training data and the use of explicit spatial information. Comput. Geosci. 2014, 63, 22–33. [Google Scholar] [CrossRef] [Green Version]
Hansen, M.C.; Potapov, P.V.; Moore, R.; Hancher, M.; Turubanova, S.A.; Tyukavina, A.; Thau, D.; Stehman, S.V.; Goetz, S.J.; Loveland, T.R.; et al. High-resolution global maps of 21st-century forest cover change. Science 2013, 342, 850–853. [Google Scholar] [CrossRef] [Green Version]
Pekel, J.F.; Cottam, A.; Gorelick, N.; Belward, A.S. High-resolution mapping of global surface water and its long-term changes. Nature 2016, 540, 418–422. [Google Scholar] [CrossRef]
Liu, X.; Huang, Y.; Xu, X.; Li, X.; Li, X.; Ciais, P.; Lin, P.; Gong, K.; Ziegler, A.D.; Chen, A.; et al. High-spatiotemporal-resolution mapping of global urban change from 1985 to 2015. Nat. Sustain. 2020, 3, 564–570. [Google Scholar] [CrossRef]
Chen, J. A simple method for reconstructing a high-quality NDVI time-series data set based on the Savitzky–Golay filter. Remote Sens. Environ. 2004, 91, 332–344. [Google Scholar] [CrossRef]
Phan, T.N.; Kuch, V.; Lehnert, L.W. Land Cover Classification using Google Earth Engine and Random Forest Classifier—The Role of Image Composition. Remote Sens. 2020, 12, 2411. [Google Scholar] [CrossRef]
Mohanaiah, P.; Sathyanarayana, P.; GuruKumar, L. Image texture feature extraction using GLCM approach. Int. J. Sci. Res. Publ. 2013, 3, 1–5. [Google Scholar]
Jiang, Z.; Huete, A.R.; Didan, K.; Miura, T. Development of a two-band enhanced vegetation index without a blue band. Remote Sens. Environ. 2008, 112, 3833–3845. [Google Scholar] [CrossRef]
Piao, S.; Fang, J.; Zhou, L.; Guo, Q.; Henderson, M.; Ji, W.; Li, Y.; Tao, S. Interannual variations of monthly and seasonal normalized difference vegetation index (NDVI) in China from 1982 to 1999. J. Geophys. Res. Atmos. 2003, 108. [Google Scholar] [CrossRef]
Shaver, T.; Khosla, R.; Westfall, D. Utilizing green normalized difference vegetation indices (GNDVI) for production level management zone delineation in irrigated corn. In Proceedings of the 18th World Congress of Soil Science, Philadelphia, PA, USA, 9–15 July 2006. [Google Scholar]
Khadanga, G.; Jain, K. Tree Census Using Circular Hough Transform and GRVI. Procedia Comput. Sci. 2020, 171, 389–394. [Google Scholar] [CrossRef]
Guha, S.; Govil, H.; Dey, A.; Gill, N. Analytical study of land surface temperature with NDVI and NDBI using Landsat 8 OLI and TIRS data in Florence and Naples city, Italy. Eur. J. Remote Sens. 2018, 51, 667–678. [Google Scholar] [CrossRef]
Xu, H. A new index for delineating built-up land features in satellite imagery. Int. J. Remote Sens. 2008, 29, 4269–4276. [Google Scholar] [CrossRef]
Escuin, S.; Navarro, R.; Fernandez, P. Fire severity assessment by using NBR (Normalized Burn Ratio) and NDVI (Normalized Difference Vegetation Index) derived from LANDSAT TM/ETM images. Int. J. Remote Sens. 2008, 29, 1053–1073. [Google Scholar] [CrossRef]
Fernandez-Buces, N.; Siebe, C.; Cram, S.; Palacio, J. Mapping soil salinity using a combined spectral response index for bare soil and vegetation: A case study in the former lake Texcoco, Mexico. J. Arid. Environ. 2006, 65, 644–667. [Google Scholar] [CrossRef]
Zheng, B.; Campbell, J.B.; Shao, Y.; Wynne, R.H. Broad-Scale Monitoring of Tillage Practices Using Sequential Landsat Imagery. Soil Sci. Soc. Am. J. 2013, 77, 1755–1764. [Google Scholar] [CrossRef]
Chandrasekar, K.; Sesha Sai, M.; Roy, P.; Dwevedi, R. Land Surface Water Index (LSWI) response to rainfall and NDVI using the MODIS Vegetation Index product. Int. J. Remote Sens. 2010, 31, 3987–4005. [Google Scholar] [CrossRef]
Huete, A.R. A soil-adjusted vegetation index (SAVI). Remote Sens. Environ. 1988, 25, 295–309. [Google Scholar] [CrossRef]
Lin, J.; Jin, X.; Ren, J.; Liu, J.; Liang, X.; Zhou, Y. Rapid Mapping of Large-Scale Greenhouse Based on Integrated Learning Algorithm and Google Earth Engine. Remote Sens. 2021, 13, 1245. [Google Scholar] [CrossRef]
Lu, L.; Hang, D.; Di, L. Threshold model for detecting transparent plastic-mulched landcover using moderate-resolution imaging spectroradiometer time series data: A case study in southern Xinjiang, China. J. Appl. Remote Sens. 2015, 9, 097094. [Google Scholar] [CrossRef]
Liu, W.; Zhang, X.; He, F.; Xiong, Q.; Zan, X.; Liu, Z.; Sha, D.; Yang, C.; Li, S.; Zhao, Y. Open-air grape classification and its application in parcel-level risk assessment of late frost in the eastern Helan Mountains. ISPRS J. Photogramm. Remote Sens. 2021, 174, 132–150. [Google Scholar] [CrossRef]
Bauer, E.; Kohavi, R. An empirical comparison of voting classification algorithms: Bagging, boosting, and variants. Mach. Learn. 1999, 36, 105–139. [Google Scholar] [CrossRef]
Wulder, M.A.; Coops, N.C.; Roy, D.P.; White, J.C.; Hermosilla, T. Land cover 2.0. Int. J. Remote Sens. 2018, 39, 4254–4284. [Google Scholar] [CrossRef] [Green Version]
Yang, J.; Huang, X. 30 m annual land cover and its dynamics in China from 1990 to 2019. Earth Syst. Sci. Data Discuss. 2021, 13, 3907–3925. [Google Scholar] [CrossRef]
Belgiu, M.; Drăguţ, L. Random forest in remote sensing: A review of applications and future directions. ISPRS J. Photogramm. Remote Sens. 2016, 114, 24–31. [Google Scholar] [CrossRef]
Gislason, P.O.; Benediktsson, J.A.; Sveinsson, J.R. Random forests for land cover classification. Pattern Recognit. Lett. 2006, 27, 294–300. [Google Scholar] [CrossRef]
Georganos, S.; Grippa, T.; Vanhuysse, S.; Lennert, M.; Shimoni, M.; Kalogirou, S.; Wolff, E. Less is more: Optimizing classification performance through feature selection in a very-high-resolution remote sensing object-based urban application. GISci. Remote Sens. 2018, 55, 221–242. [Google Scholar] [CrossRef]
Xu, Y.; Yu, L.; Zhao, Y.; Feng, D.; Cheng, Y.; Cai, X.; Gong, P. Monitoring cropland changes along the Nile River in Egypt over past three decades (1984–2015) using remote sensing. Int. J. Remote Sens. 2017, 38, 4459–4480. [Google Scholar] [CrossRef]
Li, X.; Gong, P.; Liang, L. A 30-year (1984–2013) record of annual urban dynamics of Beijing City derived from Landsat data. Remote Sens. Environ. 2015, 166, 78–90. [Google Scholar] [CrossRef]

Figure 1. Study area.

Figure 2. Spatiotemporal distribution (DOY, day of year) of Landsat images used in this study: (a) Landsat tiles in Path 119–123 and Row 33–36, (b) the acquisition time of Landsat images.

Figure 3. (a–e) The 10 km grid sampling structure: (a) and the visual-interpreted samples (red marks for AG, green marks for Non-AG) from Landsat 5 TM (b), Landsat 7 ETM+ (c), and Landsat 8 OLI (d).

Figure 4. The research flowchart of the article.

Figure 5. Spectral curves of different land cover types (SR reflectance in median value) in different seasons.

Figure 6. (a,b) Mapping window selection based on vegetation growth.

Figure 7. The phenological information of cropland and plastic-mulched land in Shandong province includes winter wheat (WW), summer corn (SUC), spring corn (SPC), garlic (GA) and cotton (CO).

Figure 8. Detailed procedures of the PTCC algorithm.

Figure 9. The importance of all features from 1989 to 2018: (a) the importance of each feature from 1989 to 2018; (b) the average importance of each feature over 30 years after ranking.

Figure 10. The change of F1-score curves of AG within the different selected years.

Figure 11. Comparison of accuracy of different feature scenarios.

Figure 12. Comparison of 30-year average classification results after a temporal consistency correction under different classification probability combinations: (a) the average F1-score of the correction results, (b) the standard deviation of the F1-score of the correction results.

Figure 13. Comparison of F1-score of AG before and after temporal consistency correction year by year.

Figure 14. Improved classifications after implementing temporal consistency correction: (a) TM image acquired in 1996, (b) ETM image acquired in 2012; (c) OLI image acquired in 2018. Regions within ellipses are typical areas improved after implementing the proposed PTCC algorithm.

Figure 15. Spatial distribution of AG in Shandong province, China in 1989 (a), 1999 (b), 2009 (c) and 2018 (d).

Figure 16. Landsat image examples (ture color composite) for greenhouses in 1989 (a), 1994 (b), 1999 (c), 2004 (d), 2009 (e), 2014 (f) and 2018 (g).

Table 1. The number of training and testing samples for each class from 1989 to 2018.

Year	1989	1990	1991	1992	1993	1994	1995	1996	1997	1998	1999	2000	2001	2002	2003
AG	568	562	612	654	734	756	797	811	937	1164	1066	1181	1193	1309	1512
Non-AG	8252	8224	8103	7890	7731	7677	7631	7587	7539	7536	7495	7403	7367	7346	7309
Total	8820	8786	8715	8544	8465	8433	8428	8398	8476	8700	8561	8584	8560	8655	8821
Year	2004	2005	2006	2007	2008	2009	2010	2011	2012	2013	2014	2015	2016	2017	2018
AG	1776	1604	1516	1941	1958	1948	1890	2078	2049	1849	1908	2020	2099	2333	2308
Non-AG	7262	7159	7141	7141	7117	7108	7073	7066	7107	7009	6900	6852	6831	6745	6653
Total	9038	8763	8657	9082	9075	9056	8963	9144	9156	8858	8808	8872	8930	9078	8961

Table 3. Different scenarios for feature optimization.

Scenarios	Feature Combination (Count)	Scenarios	Feature Combination (Count)
1	Spectral-based features (6)	6	Ranked features 1 (3)
2	Index-based features (15)	7	Ranked features 2 (9)
3	Texture-based features (8)	8	Ranked features 3 (18)
4	Spectral and index-based features (21)	9	Ranked features 4 (28)
5	Spectral and texture-based features (14)	10	All features (31)

Table 4. Classification performance of AG in specific years.

Class	Non-AG	AG	Total	PA(%)	UA(%)	F1-Score
Confusion matrix—1989
Non-AG	2429	6	2435	98.18	99.75	0.990
AG	45	120	165	95.24	72.73	0.825
Total	2474	126	2600
Confusion matrix—1999
Non-AG	2271	10	2281	98.48	99.56	0.990
AG	35	277	312	96.52	88.78	0.925
Total	2306	287	2593
Confusion matrix—2009
Non-AG	2053	16	2069	98.23	99.23	0.987
AG	37	555	592	97.20	93.75	0.954
Total	2090	571	2661
Confusion matrix—2018
Non-AG	1991	19	2010	96.79	99.05	0.979
AG	66	631	697	97.08	90.53	0.937
Total	2057	650	2707

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ou, C.; Yang, J.; Du, Z.; Zhang, T.; Niu, B.; Feng, Q.; Liu, Y.; Zhu, D. Landsat-Derived Annual Maps of Agricultural Greenhouse in Shandong Province, China from 1989 to 2018. Remote Sens. 2021, 13, 4830. https://doi.org/10.3390/rs13234830

AMA Style

Ou C, Yang J, Du Z, Zhang T, Niu B, Feng Q, Liu Y, Zhu D. Landsat-Derived Annual Maps of Agricultural Greenhouse in Shandong Province, China from 1989 to 2018. Remote Sensing. 2021; 13(23):4830. https://doi.org/10.3390/rs13234830

Chicago/Turabian Style

Ou, Cong, Jianyu Yang, Zhenrong Du, Tingting Zhang, Bowen Niu, Quanlong Feng, Yiming Liu, and Dehai Zhu. 2021. "Landsat-Derived Annual Maps of Agricultural Greenhouse in Shandong Province, China from 1989 to 2018" Remote Sensing 13, no. 23: 4830. https://doi.org/10.3390/rs13234830

APA Style

Ou, C., Yang, J., Du, Z., Zhang, T., Niu, B., Feng, Q., Liu, Y., & Zhu, D. (2021). Landsat-Derived Annual Maps of Agricultural Greenhouse in Shandong Province, China from 1989 to 2018. Remote Sensing, 13(23), 4830. https://doi.org/10.3390/rs13234830

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Landsat-Derived Annual Maps of Agricultural Greenhouse in Shandong Province, China from 1989 to 2018

Abstract

1. Introduction

2. Study Area and Data

2.1. Study Area

2.2. Landsat Archive

2.3. Reference Dataset for Supervised Training

3. Methods

3.1. Remote Sensing Characteristics Analysis

3.2. Mapping Window Selection

3.3. Tested Features

3.4. Classification and Feature Optimization

3.5. Temporal Consistency Correction

3.6. Accuracy Assessment

4. Results and Discussion

4.1. Performance of Feature Optimization

4.2. Performance of the Temporal Consistency Correction

4.3. Annual Maps of AG in Shandong Province from 1989 to 2018

5. Data Availability

6. Future Works

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI