Automated Plantation Mapping in Southeast Asia Using MODIS Data and Imperfect Visual Annotations

Jia, Xiaowei; Khandelwal, Ankush; Carlson, Kimberly M.; Gerber, James S.; West, Paul C.; Samberg, Leah H.; Kumar, Vipin

doi:10.3390/rs12040636

Open AccessArticle

Automated Plantation Mapping in Southeast Asia Using MODIS Data and Imperfect Visual Annotations

by

Xiaowei Jia

¹

,

Ankush Khandelwal

¹

,

Kimberly M. Carlson

^2,3

,

James S. Gerber

⁴,

Paul C. West

⁴

,

Leah H. Samberg

^4,5 and

Vipin Kumar

^1,*

¹

Department of Computer Science, University of Minnesota, Minneapolis, MN 55455, USA

²

Department of Natural Resources and Environmental Management, University of Hawai’i at Mānoa, Honolulu, HI 96822, USA

³

Department of Environmental Studies, New York University, New York, NY 10003, USA

⁴

Institute on the Environment, University of Minnesota, St. Paul, MN 55108, USA

⁵

Rainforest Alliance, New York, NY 10004, USA

^*

Author to whom correspondence should be addressed.

Remote Sens. 2020, 12(4), 636; https://doi.org/10.3390/rs12040636

Submission received: 13 January 2020 / Revised: 9 February 2020 / Accepted: 12 February 2020 / Published: 14 February 2020

(This article belongs to the Special Issue Global Biospheric Monitoring with Remote Sensing)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Expansion of large-scale tree plantations for commodity crop and timber production is a leading cause of tropical deforestation. While automated detection of plantations across large spatial scales and with high temporal resolution is critical to inform policies to reduce deforestation, such mapping is technically challenging. Thus, most available plantation maps rely on visual inspection of imagery, and many of them are limited to small areas for specific years. Here, we present an automated approach, which we call Plantation Analysis by Learning from Multiple Classes (PALM), for mapping plantations on an annual basis using satellite remote sensing data. Due to the heterogeneity of land cover classes, PALM utilizes ensemble learning to simultaneously incorporate training samples from multiple land cover classes over different years. After the ensemble learning, we further improve the performance by post-processing using a Hidden Markov Model. We implement the proposed automated approach using MODIS data in Sumatra and Indonesian Borneo (Kalimantan). To validate the classification, we compare plantations detected using our approach with existing datasets developed through visual interpretation. Based on random sampling and comparison with high-resolution images, the user’s accuracy and producer’s accuracy of our generated map are around 85% and 80% in our study region.

Keywords:

remote sensing; land cover change; plantations; ensemble learning; deforestation; tropical; MODIS

Graphical Abstract

1. Introduction

In Southeast Asia, expansion of tree crops and managed forests that help meet demand for global commodities has resulted in substantial tropical deforestation [1,2,3]. Replacement of natural forest cover with tree plantations, including tree crops for food (e.g., oil palm), fiber (e.g., pulp and paper), and materials (e.g., rubber), leads to net greenhouse gas (GHG) emissions to the atmosphere [3,4] and negatively impacts local environments, including degradation of biodiversity [5] and water quality [6]. These tree crops are frequently grown at the industrial plantation scale across large contiguous patches by capitalized companies [3], but may also be produced by smallholder farmers in small patchy areas [7].

Due to concerns over these large trade-offs between the environment and human material needs [8], several initiatives across government, industry, and non-governmental organization (NGO) sectors aim to ensure that tropical commodities meet rigorous sustainability standards [9]. The United Nations Collaborative Programme on Reducing Emissions from Deforestation and Forest Degradation in Developing Countries (REDD+), launched in 2008, aims to mitigate climate change through enhanced forest management. Since the mid-2000s, many large international corporations that produce, trade, and sell tropical commodities have committed to zero-deforestation supply chains [10]. The companies have also sought to demonstrate the sustainability of products and supply chains through third-party certifications such as Roundtable on Sustainable Palm Oil (RSPO) certification [11,12], and International Sustainability and Carbon Certification [13]. Concurrently, governments have begun to address commodity-driven deforestation through initiatives such as the Indonesian moratorium on allocation of new commodity concession in forested areas [14]. However, evaluating the effectiveness of these diverse policies depends on the ability to monitor land cover change due to tree commodity expansion [15]. Hence, scalable and timely monitoring of these land uses is essential for understanding whether programs and policies are meeting their stated goals [16,17].

Recent advances in access to and processing of remote sensing data have enabled semi-automated monitoring of high resolution changes in tree canopy cover over regional to global areas [18,19,20]. Concurrent progress has been made in classifying individual crop types, such as soybean and sugarcane in Brazil, using remote sensing data [21,22]. However, differentiating natural forests from tree crops remains a major challenge. Tree crops frequently have spectral properties (e.g., greenness) similar to natural forests [23,24], and can thus be difficult to discern from natural forests based on single-date spectral properties alone. For instance, the most widely used global tree cover loss product [18] defines forests on the basis of structure (tree height and percent canopy cover) and therefore does not differentiate between forest and plantations [25]. Moreover, they often have long rotation times (e.g., oil palms are replanted every 25 to 30 years), so that clearing and replanting may not be detected in a relatively long time-series of remote sensing imagery.

For these reasons, many plantation mapping studies have relied on visual interpretation—which makes use of human expertise to recognize patterns—in the plantation detection process [26,27,28,29]. For instance, Miettinen et al. [30] manually delineated industrial palm plantations based on Landsat 7 and Landsat 8 images. Similarly, Petersen et al. [29] divided Landsat images into a grid of 20 × 20 km and visually scanned each gridded structure for multiple types of plantations with the assistance of forest gain and loss information [18]. Miettinen and Hooijer et al. [27] first conducted clustering on Moderate Resolution Imaging Spectroradiometer (MODIS) satellite images and then manually checked the properties of each cluster. Koh et al. [26] clustered Advanced Land Observation Satellite (ALOS) images, visually interpreted each cluster as basic land cover types, and then classified them into finer-grained land cover types.

While such a manual approach is useful for creating maps for tree crops within small regions or for specific dates, this approach does not scale well to regional or global areas and also has limitations for creating maps at high temporal frequency (e.g., annually). This approach may require multiple observers to delineate tree crop plantations, and observers are likely to be inconsistent with one another [31,32]. Most critically, the human resources needed for large-scale annual digitizing are substantial. Given the challenges of time expense and likelihood of inconsistency of observations, mapping methods heavily reliant on visual interpretation are not feasible for large regions with annual repetition.

As an alternative, automated machine learning-based methods have been applied successfully to map plantation agriculture in the humid tropics. For example, Gutierrez Velez et al. [33] used a combination of MODIS Enhanced Vegetation Index (EVI) time-series and Landsat to detect forest conversion to oil palm in Peru. Automated approaches applied to detect tree crops have commonly utilized thresholding-based approaches [33,34,35], and the nearest neighbor method [36]. These approaches are limited in their capabilities, however, because they do not fully exploit crop-specific changes in vegetation characteristics over time (e.g., land clearing, growth cycles, and phenology specific to each tree crop species).

While the complex high-dimensional feature space in remote sensing data poses a challenge for learning processes [37,38,39], automated approaches have great potential to provide annual tree crop maps over large scales especially if several methods are implemented in these approaches. Specifically, a high temporal approach has been shown to be effective at reducing the fuzziness of boundaries by examining the vegetation’s phenology and eliminating cloud-cover pixels [40,41,42].

In addition, there exist varying levels of similarity between different land cover types [43,44,45]. Forests are more like plantations than urban areas. If a simple binary classifier is used to distinguish between plantation and “non-plantation", the classifier is highly likely to confuse plantations with similar land covers (e.g., forests) [46,47]. One method to address this issue is aggregation of similar land cover types into subgroups (aggregated classes) and training of separate classifiers to distinguish between each pair of subgroups. Moreover, the distribution of multiple land covers is often skewed, which necessitates a careful sampling design to collect training data from each subgroup.

Our objective is to automatically generate annual large-scale tree plantation maps. To do so, we introduce an automated approach for annually mapping tree crop plantations, which we call Plantation Analysis by Learning from Multiple Classes (PALM). The method uses a combination of remote sensing data, existing visually-delineated tree plantation maps, and machine learning techniques (ensemble learning, Deep Belief Networks (DBN), Hidden Markov Model (HMM)) to distinguish plantations from other land cover types in Indonesia (Sumatra and Kalimantan).

We use two existing plantation products, the Tree Plantation (TP) and Roundtable on Sustainable Palm Oil (RSPO) datasets, to create training data as an input to PALM. We validate the proposed approach by conducting random sampling to evaluate plantation maps generated by our approach and existing plantation maps available from TP and RSPO using a variety of metrics. Our validation process is based on an independent data source, i.e., high-resolution DigitalGlobe images, which are not used in the training process.

2. Dataset and Study Region

2.1. MODIS Data

To map tree plantations, we use the 500 m resolution MODIS Surface Reflectance product MOD09A1 [48], which consists of seven reflectance bands collected by MODIS instruments onboard Terra satellites. The MODIS product is defined on global sinusoidal grids in fixed geolocated 10 × 10 degree tiles and is publicly available from the Land Processes Distributed Active Archive Center [49]. In this product, 8-day composite images are generated from daily images by selecting the per-pixel reflectance values with the least noise (i.e., clouds and missing values) from every 8-day interval. The product provides 46 composite images per year since year 2000. Because the MODIS level-3 data product has filtered multiple factors including aerosoles and cloud, each image contains pixels with no data. We interpolate these missing values using temporally adjacent reflectance values. For example, if the reflectance value for band d is missing at time t, we will look for the available time for this band at previous time steps and following steps and conduct a linear interpolation for the missing value.

2.2. Study Region

Our study region includes the land area of the three MODIS tiles h29v09, h29v08 and h28v09 in Kalimantan and Sumatra, as shown in Figure 1. Tile h29v09 (“southern Kalimantan”) covers the province of South Kalimantan, as well as parts of West, Central, and East Kalimantan. Tile h29v08 (“northern Kalimantan”) covers North Kalimantan and parts of East and Central Kalimantan. Tile h28v09 (“southern Sumatra”) covers most of the southern part of Sumatra. These regions are extensively planted with oil palm, rubber, pulp and paper, and coconut palm [50,51]. In Table 1, we summarize the study area in these three MODIS tiles.

2.3. Training Data

Our classification method requires training samples to define the locations and the spectral features of various land cover types. Here, we use two training datasets developed by visual interpretation of remote sensing data. We refer to these as Tree Plantation (“TP”) [29] and Roundtable on Sustainable Palm Oil (“RSPO”) [12] datasets. Both datasets provide complete coverage in our region of study.

2.3.1. Tree Plantation Dataset

The Tree Plantation dataset (TP) was developed through visual interpretation of moderate- and high-resolution satellite imagery, and provides the location of tree plantations in selected tropical countries circa 2013–2014 [29]. In this dataset, plantations are defined as “tree established through planting and/or deliberate seeding of native or introduced species”, and include tree crops. In this work, we consider only the major plantation species in our study region, comprised of oil palm, acacia, rubber, and coconut palm [50]. Plantations are further categorized as industrial, medium-sized mosaic, small-sized mosaic, or very young. Random stratified sampling conducted by Perterson et al. [29] found that the user’s accuracy of this dataset is 79% (i.e., 79% of the identified tree plantation locations are in fact plantations) and the producer’s accuracy is 94% (i.e., 94% of all plantations are identified). This dataset identifies around 60,804 km², 26,242 km² and 112,062 km² of plantation area in MODIS tile h29v09 (southern Kalimantan), h29v08 (northern Kalimantan), and h28v09 (southern Sumatra), respectively circa 2013–2014.

2.3.2. RSPO Dataset

This dataset was produced by the 2nd Greenhouse Gas Working Group of the Roundtable on Sustainable Palm Oil (RSPO) [12] via visual interpretation of Landsat satellite images to delineate industrial oil palm plantations across Indonesia, Malaysia, and Papua New Guinea in 1990, 2000, 2005, and 2010. In addition, the study categorized each land pixel into 19 land cover types, which were further aggregated into nine higher level classes (Table 2, columns 2 and 3). We also show the estimated area and the number of MODIS pixels for each land cover in southern Kalimantan in Table 3. Although this study did not provide an accuracy assessment, our validation in Section 4.2 reports that the RSPO approach rarely classifies a non-plantation area as plantation (high user’s accuracy) but misses many real plantation areas (low producer’s accuracy).

3. Method

3.1. PALM Framework

The PALM framework uses samples derived from manually-digitized plantation datasets available for specific years. In particular, we employ an ensemble learning method to identify tree crop plantations. Ensemble learning refers to a set of approaches that jointly utilize multiple models to classify data [52]. Similar ensemble learning techniques have been used to identify land cover types from heterogeneous data, and these studies suggest that such techniques perform better than directly trained individual classifiers [35,53,54]. We use a deep learning model, Deep Belief Networks (DBN), as the classifier in ensemble learning to discover discriminative information from multi-spectral satellite data collected from multiple dates. The success of deep learning [55,56,57,58,59,60,61] in a wide range of applications demonstrates its capacity to learn discriminatively from complex feature space and extract representative features. In ensemble learning, we simultaneously train multiple DBNs to differentiate between each pair of aggregated land cover classes. However, even after using the most suitable classification machinery, classification errors in individual annual maps have a compounding effect in the context of land cover change detection. Therefore, to further improve classification accuracy, we use a post-processing method based on transition characteristics and spatial contiguity of land cover types. In addition, we distinguish different plantation species by conducting a hierarchical classification on the obtained plantation maps. The flow of the PALM framework is shown in Figure 2. There are many possible implementations of these modules. See [62] for the evaluations of different possible implementations for some of these modules (collecting training samples, comparison between ensemble learning and other learning strategies, filtering, post-processing strategies, and different amounts of training data) using imperfect data from TP and RSPO. In the following section we describe the specific implementations that were tested in the context of this study.

3.1.1. Ensemble Learning Method

The RSPO dataset provides training data that are suitable for our ensemble learning method because the dataset has many classes, multiple time points, complete coverage, and high user’s accuracy (see Table 11). We first aggregated the RSPO land cover types into several classes to train a learning model. The number of aggregated classes needs to be carefully chosen since both detailed taxonomy (a large number of aggregated classes) and binary classification (aggregated “plantation" and “non-plantation" classes) have limitations. Although a detailed land cover taxonomy can provide the maximum information about land cover change, the existing RSPO maps do not achieve high accuracy on fine-grained land covers, as described in the RSPO report [12]. Conversely, if all the land cover types other than plantation are aggregated into a “non-plantation” class, the high heterogeneity within the “non-plantation” class will reduce learning performance and increase misclassification risk [47]. While distinguishing between plantation and forest is challenging due to similar per-pixel spectral properties, other land cover types such as urban areas and annual crops are typically dissimilar from forests and tree plantations, and are therefore easier to distinguish from plantation. Based on these considerations, we aggregate all land cover types into three classes: “forest”, “plantation”, and “other” (Table 2, column 1).

The aggregation facilitates the classification process in that the classifier can capture specific discriminative patterns to distinguish tree plantations with similar classes (“forest” class), and also with dissimilar classes (“other” class). To determine which RSPO land cover types are relatively similar to plantations and belong to the aggregated “forest” class, we conduct pairwise binary classification between plantation and every land cover type within “disturbed forest” and “undisturbed forest”. Then, we aggregate the land cover types which result in a relatively low classification accuracy into the “forest” class.

Given the aggregated classes, we train three binary classifiers using a one vs. one strategy: plantation versus forest (P-F), forest versus other (F-O), and other versus plantation (O-P). In this way, each classifier focuses on exploiting the discriminative knowledge between a specific pair of classes while ignoring the other class. This learning strategy [63] is thought to reduce data heterogeneity from multiple classes and improve the learning performance. After training the three classifiers separately, we then aggregate the predicted results from each classifier to make a final classification decision based on majority voting. Since each binary classifier focuses on differentiating between a specific pair of classes, eight possible combinations represent potential outcomes from the three classifiers. Based on the separate predictions from each classifier, we assign the aggregated prediction result as the majority class label. For instance, if both P-F and O-P classifiers predict a test location as “plantation”, then we will label this test location as “plantation” regardless of the prediction of F-O classifier. When the three binary classifiers generate mutually different labels, the test sample will be assigned to the “Unknown” (U) class. This "Unknown” class is handled through our post-processing step as discussed in Section 3.1.5.

3.1.2. Collecting Training Samples

Aggregated land cover classes derived from the RSPO dataset contain multiple land cover types, and relative area under each land cover type varies widely. For instance, in the RSPO dataset, settlement area is much smaller than grassland area. Classification algorithms may over-emphasize the classes with larger population of samples [64]. If we uniformly sampled from each aggregated non-plantation land cover class for supervised classification, the resulting map would likely be dominated by the land cover types with the largest relative area. As a result, minority classes might be misclassified. To address this issue, we randomly sample equal numbers of pixels from each sub-class within the aggregated “forest” and “other” classes.

The spectral features of each land cover can be different in different years because of changing climate conditions (e.g., precipitation, sunlight). Due to such temporal heterogeneity, a model developed for a specific year can result in a poor prediction when applied to another year. To overcome this limitation, we sample each pixel for every year for which we can be confident about its label. This enables the training of a global model that can be applied over an entire observation period. Specifically, according to the availability of RSPO data, we first divide the 2001 to 2014 period into three intervals. For the first two intervals (i.e., 2001–2005 and 2006–2010), we are confident only in locations that are labeled with the same land cover type at both end years. For the 2011–2014 interval, we are confident only in locations that show little change in Enhanced Vegetation Index (EVI) from 2010 to 2014.

Since the quality of labeled data can impact the performance of the machine learning algorithm, we further improve the confidence of selected samples used for training. We utilize the TP dataset to prune less confident samples from this set. If the RSPO dataset identifies a plantation pixel that is not part of the TP dataset, this indicates a potential false positive in the RSPO dataset since the TP dataset includes the clear majority of true plantations. Hence, we keep only those plantation samples that are also labeled as “plantation” in the TP dataset. We also use the TP dataset to prune the confident samples for the “forest” class and “other” class, keeping only those confident samples that are not identified as plantations in the TP dataset.

Since the TP dataset and RSPO dataset were created using satellite imagery at a higher spatial resolution than MODIS pixel, there exist mixed MODIS pixels along the boundaries of each land cover. To ensure that the training process only involves high-quality training data, we exclude these mixed pixels from the classification model.

3.1.3. Learning Model

While ensemble learning assists with distinguishing tree plantations with multiple other land cover types, we still need to carefully choose the learning model that distinguishes between aggregated classes to ensure high classification performance. After gathering training samples, we train a Deep Belief Network (DBN) [60] to learn each of these three binary classifiers. DBNs are effective in exploiting the latent relationship among input features and extracting high-level representative features via the unsupervised pre-training phase and the supervised fine-tuning process, and thus are an excellent tool for such land cover classification. As mentioned in Section 3.1.1, the samples of each class can be associated with multiple land cover types, which requires that the model effectively transforms the spectral features into a subspace where the land cover types within each aggregated class are close and those from different classes are distinct from one another. We take advantage of the spectral and temporal richness of our data by exploiting reflectance values from all seven MODIS bands, and multiple available images each year [58,59], to determine land cover.

The main components of a DBN are Restricted Boltzmann Machines (RBMs). An RBM is an undirected graphical model structured as a fully connected bipartite graph between two layers of binary variables, visible variables

V \in R^{N}

, and hidden variables

H \in R^{M}

(Figure 3). In our problem, V represents concatenation of reflectance values over multiple composite images in a given year, which has the length of

N = N_{b} \times N_{d}

.

N_{b}

and

N_{d}

denote the number of bands and the number of composite images in a year, respectively. The joint distribution of V and H is defined by an energy-based distribution [65].

While RBMs are capable of extracting latent features, a single RBM may fail to reveal truly discriminative features due to the limited ways of combining input features [66]. Therefore, multiple RBM layers are usually stacked to form a DBN. The DBN enables the learning of more representative features since the number of combinations increases exponentially with the number of layers. The DBN model can be trained using Gibbs sampling (alternating sampling of visible and latent variables) in a greedy (layer-wise) fashion [60]. The objective of DBN training is to automatically extract representative features from a large volume of high-dimensional data. After training, the learned DBN model can be folded into standard deep neural networks for fine-tuning processes to fit the training data [60].

Here, we train three DBN models separately to discriminate between each pair of aggregate classes (i.e., plantation-forest, forest-other, other-plantation). We feed each DBN model with the concatenation of seven-band spectral features collected for 46 dates of a year, and it outputs a class label for every pixel every year. For each DBN model, we adopt a stacked four-layer structure with 158, 64, and 20 hidden variables, respectively (The last layer outputs the class label).

3.1.4. Filtering

During the analysis of our ensemble classification machinery, we found a set of locations that were labeled as bare soil by RSPO but that were mistaken for plantation by our classification approach. This bare soil class refers to bare rock, gravel, sand, silt, clay, or other exposed soil, and often includes recently cleared (deforested) areas, landscapes impacted by fire and portions of estates undergoing replanting procedures [12]. Some of these locations may potentially be converted into plantations in future. Therefore, they can be confused with true new plantations as they share similar characteristics for early stage of plantations, e.g., road networks in plantations.

While most bare soil locations and plantations in Southeast Asia can be distinguished based on phenology, another important reason for the classification error is the lack of such training data (e.g., deforested area with road networks) used for three-class classification (see Section 3.1.1).

Even though we collect equal numbers of samples from each land cover type to obtain a rich training set, samples within a given land cover type are selected randomly. Although these locations allow better class separability, they are rarely present in the bare soil class. When these limited samples are mixed with other land cover types in the “other” class, the training process can be dominated by the other samples that are more distinguishable from plantations.

To solve this problem, we train an additional classifier that differentiates between bare soil and plantation (B-P). The separate training of this B-P classifier ensures that we have sufficient samples for both classes, which allows deep learning models to automatically extract proper spectral metrics that reflect temporal differences in reflectance. Then, we utilize this classifier to filter the detected plantation locations obtained from the three-class ensemble classification step. Specifically, we apply the B-P classifier only to locations that are classified as plantation by the three-class ensemble classification model. In this way, we remove bare soil locations that look similar to plantations.

3.1.5. Post-Processing

Due to the natural disturbance and the temporal variation in remote sensing data, conducting separate annual predictions results in classification errors and inconsistencies among years. For instance, a pixel may be classified as {plantation, plantation, plantation, forest, plantation, plantation} for six consecutive years. Here, the “forest” label is highly likely to be a classification error because plantations in Southeast Asia are rarely converted back to forest. Indeed, every location labeled as plantation in 2001 and 2005 in the RSPO dataset remains as a plantation in following plantation maps.

In general, different land cover types have different transition or conversion characteristics across the years. To capture this transition relationship and reduce the interannual classification errors, we utilize a Hidden Markov Model (HMM), which models the transition probability among latent states by a transition matrix T and the mapping between the latent state and the observed class by an emission matrix E. In particular,

T_{i j}

represents the transition probability from state i to state j, and

E_{i k}

denotes the emission probability from state i to the observed class k.

In our post-processing method, each latent state in the HMM represents a real land cover type from {“plantation”, “forest”, “other”}, and each observed class k belongs to plantation, forest, other, or unknown class. We initialize the transition matrix and emission matrix using the visually delineated RSPO land cover dataset. We first interpolate maps in each year from the RSPO dataset which is available in 2001, 2005, and 2010. For example, if M pixels are converted from forest to plantations from 2001 to 2005, then we randomly add

M / 4

of these locations as plantations in each year from 2002 to 2005 under the assumption of constant conversion rate.

T_{i j}

is initialized as the proportion of locations in land cover i at any year to be converted to land cover j at the next year.

E_{i k}

is initialized as the proportion of pixels in land cover type i that are classified as class

k \in

{“plantation”, “forest”, “other”, “unknown”}. In this way, the emission matrix E can capture the confusion between land cover classes. Since our assumption of constant conversion rate is not accurate, this initialization can only serve as an approximation to the transition probability between consecutive years and the emission probability. However, the transition matrix T and emission matrix E are further tuned in the training process of HMM [67] so that they can precisely capture the yearly transition relationships. With the obtained transition matrix and emission matrix, we improve the yearly prediction for each location via the Viterbi algorithm [68].

We also conduct spatial post-processing. New plantations are frequently developed near existing plantations due to improved access and other factors provided by previous developments. Therefore, we implement a spatial filtering process using the following steps. First, we conduct spatial clustering by finding all the connected components of plantation locations. For each cluster, if the cluster size is less than a threshold δ, we will consider this cluster as false positive and remove it. In our implementation, we set δ = 30, which is equivalent to a minimum area of 7.5 km². To determine this threshold, we select a set of confident locations in 2014 which are labeled as plantations by both TP and RSPO. Since these locations are selected conservatively, the failure to detect any of these locations can lead to omission errors. At the same time, since the TP dataset has high producer’s accuracy [29], any locations that are labeled as plantations by PALM but not by the TP dataset can be commission errors. As we increase the threshold δ, we keep track of these estimated omission errors and commission errors. The selected threshold of δ = 30 significantly reduces commission errors with a limited increase of omission errors. Because this procedure removes small (<750 ha) patches, while mean smallholder farm size in Indonesia is <1 ha [69], this post processing likely removes most independent smallholder farms that are not contiguous with industrial scale plantations.

3.1.6. Hierarchical Classification

Having obtained annual plantation maps using the method described above, we further conduct hierarchical classification using another DBN model to distinguish major plantation species in our study region (i.e., oil palm, acacia, rubber, and coconut palm). We take the manually annotated locations for these four species from TP dataset. The hierarchical classification is only conducted for the last year (i.e., 2014) of our study period. We utilize the yearly plantation maps to infer the locations of different species in previous years. For example, if a location converts to plantation in 2006 according to our generated annual plantation maps, and it is classified as oil palm in 2014, then it is identified as oil palm for every year from 2006–2014. Our method does not account for the rare situations where producers switch plantation species across years.

3.2. Validation of Plantation Maps

To evaluate the quality of our maps, we visually inspect a set of randomly selected samples using DigitalGlobe high-resolution imagery. We propose a random sampling-based validation approach which is beyond the sampling approaches typically used to measure user’s and producer’s accuracy.

Producer’s accuracy validation: Existing studies have commonly sampled pixels outside of classified plantations to measure producer’s accuracy. However, this validation approach is inefficient in that only an extremely small portion of selected samples can be true plantations assuming that the proposed method performs reasonably well in plantation mapping. For this reason, we measure producer’s accuracy following the method proposed in [70], which biases the sampling process towards locations that are more likely to be true plantations. This is helpful in obtaining a sufficient number of true plantation samples to robustly estimate omission errors in generated plantation maps.

We randomly sample a set of locations around plantation mill locations (http://data.globalforestwatch.org/datasets/ed8d5951b2a4482a9e62c4fe0bc23b5f$_$27) in each MODIS tile, and then we check the real plantation locations from the sampled locations using DigitalGlobe in 2014 via visual interpretation [1]. Within selected real plantations, we then measure the fraction of locations detected by PALM, TP, and RSPO. This analysis provides insight into the producer’s accuracy of these products.

User’s accuracy validation: To better understand the user’s accuracy of our product relative to TP and RSPO, we individually sample each major difference region, as described below (Figure 4):

R0—locations labeled as plantations by PALM, TP, and RSPO.
R1—locations labeled as plantations by PALM and TP, but not by RSPO.
R2—locations labeled as plantations by TP, but not by PALM and RSPO.
R3—locations labeled as plantations by PALM, but not by TP. This includes both the locations detected by RSPO and the locations not detected by RSPO. Only a few locations in R3 are detected by RSPO.
R4—locations labeled as plantations by RSPO, but not by PALM. This includes both the locations detected by TP and the locations not detected by TP.

This sampling strategy assists in better investigating the difference between our detected plantation locations (from 2001 to 2014) and these two available datasets. Here, we use on the RSPO map on 2010, which is almost a subset of TP (91% of the RSPO map is included in TP in our entire study region).

We quantify the accuracy of our analysis by comparing randomly selected samples from each difference region to high-resolution images DigitalGlobe (https://www.digitalglobe.com/) in 2014. We take 200–300 samples from each difference region, and visually interpret high-resolution DigitalGlobe images [1]. We then compare the results of our visual interpretation with plantation maps to obtain the fraction of samples that are real plantations, which is also referred to as the confidence.

Based on the quantified results of {R0, R1, R2, R3, R4}, we then estimate the user’s accuracy for the PALM, TP, and RSPO. According to the obtained user’s accuracy and producer’s accuracy, as well as the area of detected plantations, we estimate the overall accuracy and Cohen’s κ coefficient for PALM, TP, and RSPO. Finally, to illustrate differences between PALM, TP, and RSPO plantation maps, we conduct case studies by overlaying regions R0–R4 on high spatial resolution DigitalGlobe imagery.

4. Results

4.1. Plantation Map and Basic Statistics

Our classification suggests that, in 2001, southern and eastern Kalimantan and southern Sumatra contained around 81 × 10³ km² of tree plantations, which include oil palm, pulp and paper, rubber, and coconut palm plantations. By 2014, plantations extent had almost doubled to 152 × 10³ km² (Figure 5a). We also show the spatial distribution of PALM, TP, and RSPO in Figure 5b. In Figure 6, we show the growing area of plantations in different MODIS tiles. Kalimantan had substantially greater plantation expansion rates (9.57% and 7.29% per year in southern Kalimantan and northern Kalimantan, respectively) than southern Sumatra (2.27% per year, Figure 6).

Across the study region, in 2014, our maps suggest that around 59% of all plantation area was oil palm, 18% rubber, 16% timber (acacia), and 7% coconut palm. Here, we show the major species in each tile in Table 4, Table 5 and Table 6. According to our results, almost all the detected plantations in northern Kalimantan are oil palm. Oil palm and acacia expanded most rapidly across all regions, increasing by 6% and 5% per year, respectively from 2001 to 2014.

4.2. Validation Using High-Resolution Images

4.2.1. Validation for Producer’s Accuracy

Our producer’s accuracy assessment, generated from a random sample of around 1000 pixels around oil palm plantation mills in each MODIS tile, suggests that both PALM and TP have reasonably good coverage of true plantations, whereas RSPO has much lower plantation detection for all three MODIS tiles (Table 7). RSPO accounts for only around 25% of sampled true plantations for MODIS tile h28v09 (southern Sumatra), which has the largest plantation area.

4.2.2. Validation for User’s Accuracy

To estimate user’s accuracy for PALM and existing products TP and RSPO, we analyze the five difference regions {R0, R1, R2, R3, R4} defined in Section 3.2. Our user’s accuracy assessment (Table 8, Table 9 and Table 10) suggests that our approach outperforms the TP dataset, but has slightly lower performance than the RSPO dataset (Table 11). All sampled locations from R0 are identified as real plantations from high-resolution imagery. About 81%–84% of locations labeled as plantations by PALM and TP, but not by RSPO (R1) are real plantations as identified from high-resolution optical imagery. In contrast, just 20%–40% of pixels labeled as plantations by TP, but not by PALM and RSPO (R2) are real locations. Moreover, although RSPO has high user’s accuracy and TP has high producer’s accuracy, 40%–58% of pixels labeled as plantations by PALM, but not by TP (R3) are real plantations while 52%–63% of pixels labeled as plantations by RSPO, but not by PALM (R4) are false positives.

4.2.3. Overall Accuracy

Given the estimated user’s accuracy and producer’s accuracy, we can estimate the overall accuracy based on the area of detected plantations and the area of the entire study region. Across all study regions, PALM had a 94% overall accuracy, compared to 89% for TP and 84% for RSPO (Table 12). PALM also had higher

κ

coefficient than TP and RSPO (Table 13). These differences were especially pronounced in southern Sumatra.

There commonly exists a trade-off between user’s accuracy and producer’s accuracy. For instance, the RSPO’s conservative classification strategy resulted in higher user’s accuracy but missed many real plantations, and therefore had lower producer’s accuracy. In contrast, the TP classification covered a much larger plantation area and had higher producer’s accuracy, but mislabeled many non-plantation area as plantations, and yielded lower user’s accuracy. Compared with the TP and RSPO datasets, PALM leads to a better balance between the user’s accuracy and producer’s accuracy

4.2.4. Case Studies of Model Performance

Our case studies provide visual illustration of differences between datasets presented here (Figure 7), and cases where PALM outperforms other approaches and datasets.

In Figure 7a, the red region (R1) detected by PALM and TP is a real plantation as confirmed by the DigitalGlobe image, but is missing from the RSPO dataset. In Figure 7b, the blue region (R2) included as plantation by the TP dataset was not a real plantation, and was not picked up by PALM. From the high-resolution image in Figure 7c, we observe that PALM detects the boundary between real plantation and non-plantation area (red), while the TP dataset mislabels nearby non-plantation area as plantations (blue).

Figure 7d provides an example of plantations that are detected by our approach but are missed by the TP dataset (yellow, R3). Finally, Figure 7e shows a case where RSPO incorrectly identifies a plantation but PALM correctly indicates no plantation (green, R4), adjacent to plantations correctly identified by all three datasets (magenta, R0).

5. Discussion

Our study evaluated the accuracy of an automated plantation mapping approach (PALM) using MODIS satellite data in Southeast Asia. Compared to manually-digitized plantation datasets, PALM produced plantation maps with greater overall accuracy (5% higher overall accuracy than the TP dataset and 10% than the RSPO dataset) and higher temporal resolution (annual). Our maps suggest that the extent of industrial-scale tree plantations in the study region is 58% greater than the RSPO dataset in 2010 and 36% smaller than the TP dataset in 2014. The proposed method focuses on capturing contiguous or large-scale plantations, which account for over half of oil palm plantations and almost all of pulp and paper according to [29]. These generated maps are useful for several purposes, including annual assessments of tree plantation expansion.

5.1. Smallholder Tree Plantations

The RSPO dataset that we used to train our model focused on mapping large-scale oil palm plantations and excluded most smallholder oil palm farms [12]. Indeed, validation suggests that the RSPO dataset omitted the most plantation area in southern Sumatra, where smallholder oil palm is more prevalent than in Kalimantan. In addition, we chose to threshold our plantation maps such that the minimum patch size was 7.5 km² (750 ha), substantially larger than any definition of a “smallholder” farm in Indonesia. For instance, the Indonesian Ministry of Agriculture sets this limit at 25 hectares for farmers growing plantation crops [71]. Thus, our classification methodology likely excluded a portion of the independent smallholder tree plantation area that was visually dissimilar to the RSPO-identified plantation classes, or that occurred in patches <750 ha. According to the Indonesian Central Bureau of Statistics (BPS), in 2015 around 40% of oil palm and 85% of rubber plantation area is under smallholders, such that around 50% of the total area of these crops is held by smallholders [72]. Figures are not available for coconut palm or acacia, but most coconut palm is likely smallholder-run, while most pulp and paper is expected to be grown on an industrial scale. In contrast, the methods used to detect plantations in the TP dataset included identification of “small” plantations, with patches less than 10 hectares [29]. While the 30-meter resolution Landsat data used for their plantation identification are not always able to distinguish small patchy tree plantations from other land uses, the relatively greater producers’ accuracy of the TP dataset may be partially explained by the greater focus on smallholder tree plantations. Given increased focus on smallholder land use practices by the Indonesian government (e.g., ISPO) as well as corporations trading tree crop commodities (e.g., zero-deforestation pledges that reference smallholders), adapting PALM to better capture smallholder plantation expansion would be a critical next step to evaluate smallholder-linked land use dynamics.

5.2. Tree Plantation Species-Specific Mapping

To our knowledge, this is the first automated species-specific mapping effort for tree plantations in Indonesia across large spatial and temporal scales. Species-specific mapping is important because it allows identification of the commodity driving land cover change, and enables additional research into questions of legality, forest conversion, yields, land tenure, and effects of changing climate on tree plantations. However, our species-specific mapping was heavily limited by our inability to accurately validate our species-level maps. Distinguishing between different plantation species can be challenging even with high spatial resolution DigitialGlobe data when trees are recently planted, and requires considerable expertise in regional land use or an extensive field dataset on the locations of the tree plantations in question. For this research, we therefore chose not to validate our species mapping, but this is a clear next step for this research.

Our sampling process for producer’s accuracy validation is biased towards oil palm plantations because the mills dataset includes only palm oil facilities. Given that rubber, acacia, and coconut palm represent major land covers in Indonesia (BPS Indonesia [50]), a sampling process that makes use of other datasets such as concession boundaries for oil palm, timber, and rubber plantations, or government census or survey data on the cultivated land area for each of these species, would be needed to address this limitation. Our approach was also limited in its ability to map changes in species over time. In the hierarchical classification process, we assume that plantation species are constant at the same location over years. Since 2012, the price of rubber has declined in comparison to that of oil palm [73], suggesting that conversion from rubber to oil palm may be occurring, a dynamic that we do not track in the current PALM iteration.

Looking forward, a potential solution is to utilize tree structural information to distinguish different plantation species (e.g., rubber as a broad-leaved species may be structurally different with other species, while coconut palm trees usually grow taller than oil palm trees). Indeed, previous research has had success in mapping mature oil palm using structural information from radar data [74]. In addition, the spectral information available at higher resolution, e.g., Landsat 8 or Sentinel-2 satellites, could also help to better distinguish among species. These datasets could be incorporated into PALM by merging the extracted features from deep learning models at intermediate layers, as discussed in previous work [75].

5.3. Classification Model

Compared with other classification models, our selected deep learning approach has two main advantages. First, deep learning methods have shown superior performance in learning from high-dimensional features given large volume of data [76]. The combination of MODIS data with TP and RSPO datasets generates large volume of MODIS multi-spectral data collected on multiple dates, and therefore has high dimensionality. Second, training the DBN enables learning representative features (e.g., spectral properties) from unlabeled data (i.e., years for which training data are not available). Since the TP and RSPO datasets are only available in specific years, we are not certain about the land use classes for many locations in other years even after using the sampling method mentioned in Section 3.1.2. However, the training of the DBN can still make use of these data to extract representative features while the fine-tuning process only focuses on the selected labeled samples.

We do not adopt popular ensemble methods including bagging-based algorithms such as Random Forests [77] or boosting-based algorithms such as Adaboost [78] because these methods cannot use unlabeled data to learning representative features. Deep learning methods also require less feature engineering [79] than these approaches. Moreover, the boosting methods are sensitive to mislabeled samples in training data [80], which can be caused by observers’ mistakes in the manual labeling process.

Another promising approach to land cover classification problems like the one presented here is Fully Convolutional Networks (FCN), which have been used for pixel-wise labeling in computer vision applications [81,82]. FCN is directly trained on the entire image and is thus capable of capturing the spatial correlations in the classification process. In contrast, the PALM approach utilizes a spatial post-processing step to filter potential commission errors, but may misclassify individual pixels with noisy data. However, it is extremely inefficient to train an FCN model directly across a large spatial extent with high dimensional features. One potential solution is to extract spatial context information as a separate step, as discussed in [58,59]. The extracted features could then be combined with PALM for the pixel-wise training.

It is noteworthy that the model training process focuses on the difference region R0 and R4, since we use the RSPO and TP datasets to select samples. However, validation comprised a much larger area including R0–R4. Moreover, the labeling information used in training process is based on the RSPO and TP datasets, while the classification is validated through visual inspection using high-resolution images. Therefore, the overall classification performance provided in Table 12 and Table 13 is not directly impacted by the potential overlap between training and testing data.

5.4. Limitations

While this method performs better than manual plantation delineation, it remains limited in terms of validation, classification, and imagery inputs, which need to be addressed in future work. The first limitation lies in the visual validation process using DigitalGlobe. In this study, we determine a pixel to be plantation if it shows obvious plantation characteristics (e.g., the context of surrounding pixels, alignment structure, crossing road networks). However, some plantations may not be easily identified visually (e.g., due to their advanced age and associated high tree cover) or due to limitations of our expertise. Another assumption that underlies our evaluation relative to RSPO (as shown in Figure 7e) is that a location that is a plantation in 2010 is not converted back to other land cover (e.g., forests) in 2014. This assumption is consistent with the fact that each location that has been classified as plantation in any of the RSPO maps is also labeled as a plantation in each of the future RSPO maps. However, this assumption, if incorrect for some locations, could inflate the errors reported for RSPO.

Since the RSPO dataset is only available through 2010, the selection of training data after 2010 depends on the RSPO land cover class in 2010 and subsequent (lack of) changes in EVI. The assumption of stable land cover under a stable EVI potentially introduces noisy training data and consequently poor classification after 2010. One potential solution is to incorporate the Global Forest Change dataset [18] to further increase certainty about locations with stable post-2010 land covers which can be used as samples.

Since the training is directly conducted on a collection of samples from multiple years, the classification performance can be degraded for the years that have different weather conditions to other years. A potential solution is to study the weather conditions (e.g., rainfall) for each year and train classifiers separately for each type of weather conditions (e.g., wet years vs. dry years).

According to our validation, the RSPO dataset has low producer’s accuracy. One potential reason is that many new plantations were developed after 2010, which conforms to our classification results (Figure 6). We also manually verify this source of error using Google Timelapse, which mainly relies on Landsat images [83]. We randomly select 20 plantation patches newly detected from 2011 to 2013 by PALM but that are not detected by the RSPO dataset. According to this analysis, every selected plantation patch was developed after 2010.

Our analysis was also limited by the resolution of the MODIS data. While the high resolution of Landsat data (30 m) and Sentinel data (10 m) offer potential to map plantations more accurately, the low temporal frequency of Landsat (16 day) and Sentinel (10 day) makes it hard to find images with little noise (e.g., clouds). Learning approaches that are more robust to the noise and the multi-scale learning framework that combines Landsat, Sentinel and MODIS can help address this limitation.

6. Conclusions

Here, we present an ensemble learning method to map plantation areas. We aggregate the land cover types defined by Roundtable on Sustainable Palm Oil and propose to simultaneously sample from multiple land cover types. To learn each individual classifier, we utilize Deep Belief Networks to extract representative information from spectral features over multiple days. Furthermore, we utilize HMM and spatial information to post-process the result. The use of large-scale remote sensing data provides an advantage for the learning process, and our test in Indonesia shows that (1) the method can automatically generate high-quality annual plantation maps, and (2) the detected plantation map achieves a better balance of user’s accuracy and producer’s accuracy than either TP or RSPO maps. Besides the detection of large-scale tree plantations, PALM also has the potential to discern specific plantation species. Since this method generates annual plantation maps, it has great potential to support mapping and monitoring of commodity crop expansion in the tropics.

Author Contributions

X.J., A.K., K.M.C., J.S.G., P.C.W., L.H.S. and V.K. conceived and designed experiments; X.J. and A.K. performed the experiments; X.J., A.K. and K.M.C. analyzed the data and generated figures; All authors have read and agreed to the published version of the manuscript.

Funding

National Science Foundation: 1029711.

Acknowledgments

X.J., A.K., and V.K. were funded by the NSF Awards 1838159 and 1029711. J.S.G. and P.C.W. were supported by the Belmont Forum/FACCE-JPI funded DEVIL project (Delivering Food Security from Limited Land) (NE/ M021327/1). K.M.C. was funded by the NASA New (Early Career) Investigator Program in Earth Science (NNX16AI20G) and the US Department of Agriculture’s National Institute of Food and Agriculture, including Hatch Project HAW01136-H and McIntire Stennis Project HAW01146-M, managed by the College of Tropical Agriculture and Human Resources. Access to computing facilities was provided by NASA Earth Exchange and Minnesota Supercomputing Institute.

Conflicts of Interest

The authors declare no conflict of interest.

References

Gaveau, D.L.; Sheil, D.; Salim, M.A.; Arjasakusuma, S.; Ancrenaz, M.; Pacheco, P.; Meijaard, E. Rapid conversions and avoided deforestation: Examining four decades of industrial plantation expansion in Borneo. Sci. Rep. 2016, 6, 1–13. [Google Scholar] [CrossRef] [PubMed]
Ziegler, A.D.; Fox, J.M.; Xu, J. The rubber juggernaut. Science 2009, 324, 1024–1025. [Google Scholar] [CrossRef] [PubMed]
Carlson, K.M.; Curran, L.M.; Asner, G.P.; Pittman, A.M.; Trigg, S.N.; Adeney, J.M. Carbon emissions from forest conversion by Kalimantan oil palm plantations. Nat. Clim. Chang. 2013, 3, 283. [Google Scholar] [CrossRef]
Carlson, K.M.; Gerber, J.S.; Mueller, N.D.; Herrero, M.; MacDonald, G.K.; Brauman, K.A.; Havlik, P.; O’Connell, C.S.; Johnson, J.A.; Saatchi, S.; et al. Greenhouse gas emissions intensity of global croplands. Nat. Clim. Chang. 2017, 7, 63. [Google Scholar] [CrossRef]
Wilcove, D.S.; Koh, L.P. Addressing the threats to biodiversity from oil-palm agriculture. Biodivers. Conserv. 2010, 19, 999–1007. [Google Scholar] [CrossRef]
Carlson, K.M.; Curran, L.M.; Ponette-González, A.G.; Ratnasari, D.; Lisnawati, N.; Purwanto, Y.; Brauman, K.A.; Raymond, P.A. Influence of watershed-climate interactions on stream temperature, sediment yield, and metabolism along a land use intensity gradient in Indonesian Borneo. J. Geophys. Res. Biogeosci. 2014, 119, 1110–1128. [Google Scholar] [CrossRef]
Samberg, L.H.; Gerber, J.S.; Ramankutty, N.; Herrero, M.; West, P.C. Subnational distribution of average farm size and smallholder contributions to global food production. Environ. Res. Lett. 2016, 11, 124010. [Google Scholar] [CrossRef]
West, P.C.; Gibbs, H.K.; Monfreda, C.; Wagner, J.; Barford, C.C.; Carpenter, S.R.; Foley, J.A. Trading carbon for food: Global comparison of carbon stocks vs. crop yields on agricultural land. Proc. Natl. Acad. Sci. USA 2010, 107, 19645–19648. [Google Scholar] [CrossRef] [Green Version]
Scarlat, N.; Dallemand, J.F. Recent developments of biofuels/bioenergy sustainability certification: A global overview. Energy Policy 2011, 39, 1630–1646. [Google Scholar] [CrossRef]
Lambin, E.F.; Gibbs, H.K.; Heilmayr, R.; Carlson, K.M.; Fleck, L.C.; Garrett, R.D.; de Waroux, Y.l.P.; McDermott, C.L.; McLaughlin, D.; Newton, P.; et al. The role of supply-chain initiatives in reducing deforestation. Nat. Clim. Chang. 2018, 8, 109–116. [Google Scholar] [CrossRef]
Schouten, G.; Glasbergen, P. Creating legitimacy in global private governance: The case of the Roundtable on Sustainable Palm Oil. Ecol. Econ. 2011, 70, 1891–1899. [Google Scholar] [CrossRef]
Gunarso, P.; Hartoyo, M.E.; Agus, F.; Killeen, J.T.; Goon, J. Roundtable on Sustainable Palm Oil, Kuala Lumpur, Malaysia; Reports from the Technical Panels of the 2nd Greenhouse Gas Working Group of the Roundtable on Sustainable Palm Oil; 2013; Available online: https://rspo.org/publications/download/a2ac85181ed4501 (accessed on 1 February 2018).
Moser, C.; Hildebrandt, T.; Bailis, R. International sustainability standards and certification. In Sustainable Development of Biofuels in Latin America and the Caribbean; Springer: New York, NY, USA, 2014; pp. 27–69. [Google Scholar]
Sloan, S. Indonesia’s moratorium on new forest licenses: An update. Land Use Policy 2014, 38, 37–40. [Google Scholar] [CrossRef]
Carlson, K.M.; Heilmayr, R.; Gibbs, H.K.; Noojipady, P.; Burns, D.N.; Morton, D.C.; Walker, N.F.; Paoli, G.D.; Kremen, C. Effect of oil palm sustainability certification on deforestation and fire in Indonesia. Proc. Natl. Acad. Sci. USA 2018, 115, 121–126. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Program, W.C.S.I. Oil Palm, Biodiversity and Indonesian Law; WCS Indonesia Programme: Bogor, Indonesia, 2010. [Google Scholar]
Wakker, E.; Asia, A. Indonesia: Illegalities in Forest Clearance for Large-Scale Commercial Plantations; Forest Trends: Washington, DC, USA; Aidenvironment: Amsterdam, The Netherlands, 2014. [Google Scholar]
Hansen, M.C.; Potapov, P.V.; Moore, R.; Hancher, M.; Turubanova, S.; Tyukavina, A.; Thau, D.; Stehman, S.; Goetz, S.; Loveland, T.; et al. High-resolution global maps of 21st-century forest cover change. Science 2013, 342, 850–853. [Google Scholar] [CrossRef] [Green Version]
Hansen, M.C.; Stehman, S.V.; Potapov, P.V.; Loveland, T.R.; Townshend, J.R.; DeFries, R.S.; Pittman, K.W.; Arunarwati, B.; Stolle, F.; Steininger, M.K.; et al. Humid tropical forest clearing from 2000 to 2005 quantified by using multitemporal and multiresolution remotely sensed data. Proc. Natl. Acad. Sci. USA 2008, 105, 9439–9444. [Google Scholar] [CrossRef] [Green Version]
Margono, B.A.; Turubanova, S.; Zhuravleva, I.; Potapov, P.; Tyukavina, A.; Baccini, A.; Goetz, S.; Hansen, M.C. Mapping and monitoring deforestation and forest degradation in Sumatra (Indonesia) using Landsat time series data sets from 1990 to 2010. Environ. Res. Lett. 2012, 7, 034010. [Google Scholar] [CrossRef]
Rudorff, B.F.T.; Aguiar, D.A.; Silva, W.F.; Sugawara, L.M.; Adami, M.; Moreira, M.A. Studies on the rapid expansion of sugarcane for ethanol production in São Paulo State (Brazil) using Landsat data. Remote Sens. 2010, 2, 1057–1076. [Google Scholar] [CrossRef] [Green Version]
Rudorff, B.F.T.; Adami, M.; Aguiar, D.A.; Moreira, M.A.; Mello, M.P.; Fabiani, L.; Amaral, D.F.; Pires, B.M. The soy moratorium in the Amazon biome monitored by remote sensing images. Remote Sens. 2011, 3, 185–202. [Google Scholar] [CrossRef] [Green Version]
Fan, H.; Fu, X.; Zhang, Z.; Wu, Q. Phenology-based vegetation index differencing for mapping of rubber plantations using Landsat OLI data. Remote Sens. 2015, 7, 6041–6058. [Google Scholar] [CrossRef] [Green Version]
Morel, A.C.; Saatchi, S.S.; Malhi, Y.; Berry, N.J.; Banin, L.; Burslem, D.; Nilus, R.; Ong, R.C. Estimating aboveground biomass in forest and oil palm plantation in Sabah, Malaysian Borneo using ALOS PALSAR data. For. Ecol. Manag. 2011, 262, 1786–1798. [Google Scholar] [CrossRef]
Tropek, R.; Sedláček, O.; Beck, J.; Keil, P.; Musilová, Z.; Šímová, I.; Storch, D. Comment on “High-resolution global maps of 21st-century forest cover change”. Science 2014, 344, 981. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Koh, L.P.; Miettinen, J.; Liew, S.C.; Ghazoul, J. Remotely sensed evidence of tropical peatland conversion to oil palm. Proc. Natl. Acad. Sci. USA 2011, 108, 5127–5132. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Miettinen, J.; Hooijer, A.; Shi, C.; Tollenaar, D.; Vernimmen, R.; Liew, S.C.; Malins, C.; Page, S.E. Extent of industrial plantations on Southeast Asian peatlands in 2010 with analysis of historical expansion and future projections. Gcb Bioenergy 2012, 4, 908–918. [Google Scholar] [CrossRef] [Green Version]
Miettinen, J.; Shi, C.; Tan, W.J.; Liew, S.C. 2010 land cover map of insular Southeast Asia in 250-m spatial resolution. Remote Sens. Lett. 2012, 3, 11–20. [Google Scholar] [CrossRef]
Petersen, R.; Goldman, E.; Harris, N.; Sargent, S.; Aksenov, D.; Manisha, A.; Esipova, E.; Shevade, V.; Loboda, T.; Kuksina, N.; et al. Mapping Tree Plantations with Multispectral Imagery: Preliminary Results for Seven Tropical COUNTRIES; World Resources Institute: Washington, DC, USA, 2016. [Google Scholar]
Miettinen, J.; Shi, C.; Liew, S.C. Land cover distribution in the peatlands of Peninsular Malaysia, Sumatra and Borneo in 2015 with changes since 1990. Glob. Ecol. Conserv. 2016, 6, 67–78. [Google Scholar] [CrossRef] [Green Version]
Margono, B.A.; Usman, A.B.; Sugardiman, R.A. Indonesia’s forest resource monitoring. Indones. J. Geogr. 2016, 48, 7. [Google Scholar] [CrossRef]
Jia, X.; Khandelwal, A.; Gerber, J.; Carlson, K.; West, P.; Kumar, V. Learning large-scale plantation mapping from imperfect annotators. In Proceedings of the IEEE International Conference on Big Data (Big Data), Washington, DC, USA, 5–8 December 2016; pp. 1192–1201. [Google Scholar]
Gutiérrez-Vélez, V.H.; DeFries, R. Annual multi-resolution detection of land cover conversion to oil palm in the Peruvian Amazon. Remote Sens. Environ. 2013, 129, 154–167. [Google Scholar] [CrossRef]
Dong, J.; Xiao, X.; Sheldon, S.; Biradar, C.; Xie, G. Mapping tropical forests and rubber plantations in complex landscapes by integrating PALSAR and MODIS imagery. Isprs J. Photogramm. Remote. Sens. 2012, 74, 20–33. [Google Scholar] [CrossRef]
Tabassian, M.; Ghaderi, R.; Ebrahimpour, R. Combination of multiple diverse classifiers using belief functions for handling data with imperfect labels. Expert Syst. Appl. 2012, 39, 1698–1707. [Google Scholar] [CrossRef]
Li, Z.; Fox, J.M. Mapping rubber tree growth in mainland Southeast Asia using time-series MODIS 250 m NDVI and statistical data. Appl. Geogr. 2012, 32, 420–432. [Google Scholar] [CrossRef]
Friedman, J.H. On bias, variance, 0/1—loss, and the curse-of-dimensionality. Data Min. Knowl. Discov. 1997, 1, 55–77. [Google Scholar] [CrossRef]
Jia, X.; Li, S.; Zhao, H.; Kim, S.; Kumar, V. Towards Robust and Discriminative Sequential Data Learning: When and How to Perform Adversarial Training? In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019. [Google Scholar]
Melgani, F.; Bruzzone, L. Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans. Geosci. Remote Sens. 2004, 42, 1778–1790. [Google Scholar] [CrossRef] [Green Version]
Bargiel, D. A new method for crop classification combining time series of radar images and crop phenology information. Remote Sens. Environ. 2017, 198, 369–383. [Google Scholar] [CrossRef]
Jakimow, B.; Griffiths, P.; van der Linden, S.; Hostert, P. Mapping pasture management in the Brazilian Amazon from dense Landsat time series. Remote Sens. Environ. 2017. [Google Scholar] [CrossRef]
Peuquet, D.J. Making space for time: Issues in space-time data representation. Geoinformatica 2001, 5, 11–32. [Google Scholar] [CrossRef]
Ahlqvist, O. Extending post-classification change detection using semantic similarity metrics to overcome class heterogeneity: A study of 1992 and 2001 US National Land Cover Database changes. Remote Sens. Environ. 2008, 112, 1226–1241. [Google Scholar] [CrossRef]
Zhu, Z.; Yang, L.; Stehman, S.V.; Czaplewski, R.L. Accuracy assessment for the US Geological Survey regional land-cover mapping program: New York and New Jersey region. Photogramm. Eng. Remote Sens. 2000, 66, 1425–1438. [Google Scholar]
Ka, Z.; Olson, C. Using Multi-dimensional Scaling Technique To Examine The Similarity Among Land Cover Types. In Proceedings of the 10th Annual International Symposium on Geoscience and Remote Sensing, College Park, MD, USA, 20–24 May 1990; pp. 925–928. [Google Scholar]
Karpatne, A.; Khandelwal, A.; Boriah, S.; Kumar, V. Predictive learning in the presence of heterogeneity and limited training data. In Proceedings of the 2014 SIAM International Conference on Data Mining, Philadelphia, PA, USA, 24–26 April 2014; pp. 253–261. [Google Scholar]
Karpatne, A.; Jiang, Z.; Vatsavai, R.R.; Shekhar, S.; Kumar, V. Monitoring land-cover changes: A machine-learning perspective. IEEE Geosci. Remote Sens. Mag. 2016, 4, 8–21. [Google Scholar] [CrossRef]
Vermote, E.; Kotchenova, S.; Ray, J. MODIS Land Surface Reflectance Science Computing Facility, MODIS Surface Reflectance User’s Guide, version 1.4; NASA: Greenbelt, MD, USA, 2015. [Google Scholar]
Data Pool, LP DAAC, NASA Land Data Products and Services. Available online: https://lpdaac.usgs.gov/data_access/data_pool (accessed on 1 February 2018).
Statistics Indonesia. Available online: https://www.bps.go.id/dynamictable/2015/09/04%2000\protect\kern+.2222em\relax00\protect\kern+.2222em\relax00/838/luas-tanaman-perkebunan-menurutpropinsi-dan-jenis-tanaman-indonesia-000-ha-2011-2016-.html (accessed on 7 July 2018).
Abood, S.A.; Lee, J.S.H.; Burivalova, Z.; Garcia-Ulloa, J.; Koh, L.P. Relative contributions of the logging, fiber, oil palm, and mining industries to forest loss in Indonesia. Conserv. Lett. 2015, 8, 58–67. [Google Scholar] [CrossRef] [Green Version]
Sagi, O.; Rokach, L. Ensemble learning: A survey. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2018, 8. [Google Scholar] [CrossRef]
Karpatne, A.; Kumar, V. Building Predictive Models for Noisy and Heterogeneous Data: An Application in Global Monitoring of Inland Water Dynamics. In Proceedings of the 2015 IEEE International Conference on Data Mining Workshop (ICDMW), Atlantic City, NJ, USA, 14–17 November 2015; pp. 1530–1531. [Google Scholar]
Rodriguez-Galiano, V.F.; Ghimire, B.; Rogan, J.; Chica-Olmo, M.; Rigol-Sanchez, J.P. An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J. Photogramm. Remote Sens. 2012, 67, 93–104. [Google Scholar] [CrossRef]
Chen, Y.; Lin, Z.; Zhao, X.; Wang, G.; Gu, Y. Deep learning-based classification of hyperspectral data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 2094–2107. [Google Scholar] [CrossRef]
Glorot, X.; Bordes, A.; Bengio, Y. Domain adaptation for large-scale sentiment classification: A deep learning approach. In Proceedings of the 28th international conference on machine learning (ICML-11), Bellevue, DC, USA, 28 June 2011; pp. 513–520. [Google Scholar]
Jia, X.; Willard, J.; Karpatne, A.; Read, J.; Zwart, J.; Steinbach, M.; Kumar, V. Physics guided RNNs for modeling dynamical systems: A case study in simulating lake temperature profiles. In Proceedings of the 2019 SIAM International Conference on Data Mining, Calgary, AL, Canada, 2–4 May 2019; pp. 558–566. [Google Scholar]
Jia, X.; Khandelwal, A.; Nayak, G.; Gerber, J.; Carlson, K.; West, P.; Kumar, V. Incremental dual-memory lstm in land cover prediction. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, Halifax, NS, Canada, 13–17 August 2017; pp. 867–876. [Google Scholar]
Jia, X.; Khandelwal, A.; Nayak, G.; Gerber, J.; Carlson, K.; West, P.; Kumar, V. Predict land covers with transition modeling and incremental learning. In Proceedings of the 2017 SIAM International Conference on Data Mining, Houston, FL, USA, 27–29 April 2017; pp. 171–179. [Google Scholar]
Bengio, Y.; Lamblin, P.; Popovici, D.; Larochelle, H. Greedy layer-wise training of deep networks. In Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada, 3 December 2007; pp. 153–160. [Google Scholar]
Read, J.S.; Jia, X.; Willard, J.; Appling, A.P.; Zwart, J.A.; Oliver, S.K.; Karpatne, A.; Hansen, G.J.; Hanson, P.C.; Watkins, W.; et al. Process-guided deep learning predictions of lake water temperature. Water Resour. Res. 2019, 55, 9173–9190. [Google Scholar] [CrossRef] [Green Version]
Jia, X.; Khandelwal, A.; Carlson, K.; Gerber, J.S.; West, P.C.; Kumar, V. Plantation mapping in Southeast Asia. Front. Big Data 2019, 2, 46. [Google Scholar] [CrossRef] [Green Version]
Nasrabadi, N.M. Pattern recognition and machine learning. J. Electron. Imag. 2007, 16, 049901. [Google Scholar]
Sun, Y.; Wong, A.K.; Kamel, M.S. Classification of imbalanced data: A review. Int. J. Pattern Recognit. Artif. Intell. 2009, 23, 687–719. [Google Scholar] [CrossRef]
Hinton, G.E. A practical Guide to training restricted Boltzmann machines. In Neural Networks: Tricks of the Trade; Springer: Berlin/Heidelberg, Germany, 2012; pp. 599–619. [Google Scholar]
Hinton, G.E.; Osindero, S.; Teh, Y.W. A fast learning algorithm for deep belief nets. Neural Comput. 2006, 18, 1527–1554. [Google Scholar] [CrossRef]
Welch, L.R. Hidden Markov models and the Baum-Welch algorithm. IEEE Inf. Theory Soc. Newsl. 2003, 53, 10–13. [Google Scholar]
Forney, G.D. The viterbi algorithm. Proc. IEEE 1973, 61, 268–278. [Google Scholar] [CrossRef]
Garrett, R.D.; Carlson, K.M.; Rueda, X.; Noojipady, P. Assessing the potential additionality of certification by the Round table on Responsible Soybeans and the Roundtable on Sustainable Palm Oil. Environ. Res. Lett. 2016, 11, 045003. [Google Scholar] [CrossRef]
Padilla, M.; Stehman, S.V.; Chuvieco, E. Validation of the 2008 MODIS-MCD45 global burned area product using stratified random sampling. Remote Sens. Environ. 2014, 144, 187–196. [Google Scholar] [CrossRef]
Indonesia Kicks off Scheme for Palm Oil Farmers to Meet New Sustainability Standards. Available online: http://www.undp.org/content/undp/en/home/presscenter/pressreleases/2015/02/24/indonesia-kicks-off-scheme-for-palm-oil-farmers-to-meet-new-sustainability-standards.html (accessed on 13 May 2018).
Indonesia, Central Bureau of Statistics (BPS), The Abdul Latif Jameel Poverty Action Lab. Available online: https://www.povertyactionlab.org/partners/indonesia-central-bureau-statistics-bps (accessed on 13 May 2018).
Commodity Prices–Price Charts, Data, and News–IndexMundi. Available online: https://www.indexmundi.com/commodities/ (accessed on 13 May 2018).
Pohl, C. Mapping Palm Oil Expansion Using SAR to Study the IMPACT on the CO₂ Cycle; IOP Publishing: Bristol, UK, 2014; Volume 20, p. 012012. [Google Scholar]
Hu, J.; Mou, L.; Schmitt, A.; Zhu, X.X. FusioNet: A two-stream convolutional neural network for urban scene classification using PolSAR and hyperspectral data. In Proceedings of the 2017 Joint Urban Remote Sensing Event (JURSE), Dubai, UAE, 6–8 March 2017; pp. 1–4. [Google Scholar]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436. [Google Scholar] [CrossRef] [PubMed]
Liaw, A.; Wiener, M. Classification and regression by randomForest. News 2002, 2, 18–22. [Google Scholar]
Freund, Y.; Schapire, R.; Abe, N. A short introduction to boosting. J. Jpn. Soc. Artif. Intell. 1999, 14, 1612. [Google Scholar]
Heaton, J. An empirical analysis of feature engineering for predictive modeling. In Proceedings of the SoutheastCon 2016, Norfolk, VA, USA, 30 March–3 April 2016; pp. 1–6. [Google Scholar]
Long, P.M.; Servedio, R.A. Random classification noise defeats all convex potential boosters. Mach. Learn. 2010, 78, 287–304. [Google Scholar] [CrossRef] [Green Version]
Long, J.; Shelhamer, E.; Darrell, T. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, Araucano Park, Chile, 11–18 December 2015; pp. 3431–3440. [Google Scholar]
Noh, H.; Hong, S.; Han, B. Learning deconvolution network for semantic segmentation. In Proceedings of the IEEE International Conference on Computer Vision, Araucano Park, Chile, 11–18 December 2015; pp. 1520–1528. [Google Scholar]
Timelapse–Google Earth Engine. Available online: https://earthengine.google.com/timelapse/ (accessed on 20 May 2018).

Figure 1. Our study region in three MODIS tiles—h29v09 (southern Kalimantan), h29v08 (northern Kalimantan) and h28v09 (southern Sumatra). Indonesia is shown in light grey; other countries are dark grey.

Figure 2. Flow chart of the PALM classification method. The input data on the left side include MODIS data, Tree Plantation (TP) dataset, and Roundtable on Sustainable Palm Oil (RSPO) dataset. The objective of PALM is to generate annual plantation maps (marked with the red circle). PALM is also designed to distinguish plantation species via hierarchical classification.

Figure 3. Restricted Boltzmann Machine (RBM). V denotes visible variables/input features and H denotes hidden variables. W,

B h

, and

B v

are the model parameters of RBM.

Figure 3. Restricted Boltzmann Machine (RBM). V denotes visible variables/input features and H denotes hidden variables. W,

B h

, and

B v

are the model parameters of RBM.

Figure 4. The five difference regions R0–R4 among TP, RSPO, and PALM.

Figure 5. (a) Plantation maps generated by PALM in MODIS tile h29v09 (southern Kalimantan), h29v08 (northern Kalimantan), and h28v09 (southern Sumatra) in 2014. Plantation locations are marked in yellow; (b) Comparison between plantations detected by PALM, TP, and RSPO.

Figure 6. Annual plantation area (km²) detected by PALMinMODIS tile (a) h29v09 (southernKalimantan), (b) h29v08 (northern Kalimantan) and (c) h28v09 (southern Sumatra) from 2001 to 2014. The plantation area detected by TP (in 2014) and RSPO (in 2001, 2005, 2010) is depicted in green and yellow, respectively.

Figure 7. (a–e) the high-resolution DigitalGlobe images in 2014 with the mask of difference regions R0–R4. (a–c) examples of R1 (labeled as plantation by TP and PALM) and R2 (TP only); (d) an example of R3 (labeled as plantation by PALM but not by TP); (e) an example of R4 (labeled as plantation by RSPO but not PALM) and R0 (labeled as plantation by TP, RSPO and PALM). The plantations can be identified based on special alignment structure and crossing road networks.

Table 1. Study area in MODIS tiles h29v09 (southern Kalimantan), h29v08 (northern Kalimantan) and h28v09 (southern Sumatra). Each MODIS pixel covers a 500 m × 500 m area.

Tile	Region	Study Area (km²)	MODIS Pixels
h29v09	Southern Kalimantan	328,028	1,312,112
h29v08	Northern Kalimantan	262,386	1,049,546
h28v09	Southern Sumatra	309,474	1,237,895

Table 2. Correspondence between the aggregated classes defined in this study (see Section 3.1.1), and high-level classes and land cover types in the Roundtable on Sustainable Palm Oil (RSPO) dataset (see Section 2.3.2). The last column provides a brief description of each land cover type.

Aggregated	High-Level Class	RSPO Land Cover Type	Description
Plantation	Oil palm	Oil Palm Plantation	Large industrial estates planted with oil palm
Plantation	Timber plantation	Timber Plantation	Large industrial estates planted to timber or pulp species
Plantation	Agriculture	Rubber Plantation	Large/medium sized industrial estates planted to rubber
Other	Agriculture	Coastal Fish Pond	Permanently flooded open areas
Other	Agriculture	Dry Cultivated Land	Herbaceous vegetation managed for row crops/pasture
Other	Agriculture	Mixed Tree Crops	Mosaic of cultivated and fallow land
Other	Agriculture	Rice Fields	Rice paddy with seasonal or permanent inundation
Other	Built-up	Settlements	Villages, urban areas, industrial areas, open mining
Other	Mining	Mining	Open area with surface mining activities
Other	Bare soil	Upland Grassland	Open vegetation dominated by grasses
Other	Bare soil	Upland Shrub land	Open woody vegetation, including forest and grassland
Other	Bare soil	Swamp Grassland	Extensive cover of herbaceous plants with shrubs/trees
Other	Bare soil	Swamp Shrub land	Open woody vegetation on poorly drained soils
Other	Water body	Water Bodies	Rivers, streams and lakes
Other	Disturbed forest	Disturbed Mangrove	Forest of mangrove species with evidence of clearing
Other	Disturbed forest	Disturbed Swamp Forest	Swamp forest with evidence of logging and clearings
Forest	Disturbed forest	Disturbed Upland Forest	Basal area reduced significantly due to logging
Forest	Undisturbed forest	Undisturbed Upland Forest	Natural forest, highly diverse species and high basal area
Forest	Undisturbed forest	Undisturbed Swamp Forest	Natural forest with temporary or permanent inundation

Table 3. Count of MODIS pixels by land cover for the years 2000, 2005 and 2010 (columns 3–5), and the estimated area (10³ km²) of each land cover type for the years 2000, 2005, and 2010 (columns 6–8) for MODIS tile h29v09 (southern Kalimantan), reported by the RSPO dataset.

Full Name	Land Cover	2000	2005	2010	A₂₀₀₀	A₂₀₀₅	A₂₀₁₀
Coastal Fish Pond	CFP	5120	5159	6324	1.28	1.29	1.58
Rubber Plantation	CPL	18398	19813	19741	4.60	4.95	4.94
Dry Cultivated Land	DCL	44640	57555	86230	11.16	14.39	21.56
Disturbed Upland Forest	DIF	413561	404786	386326	103.39	101.20	96.58
Disturbed Mangrove	DIM	6731	6731	6500	1.68	1.68	1.63
Disturbed Swamp Forest	DSF	81790	83001	66836	20.45	20.75	16.71
Upland Grassland	GRS	14772	12026	12273	3.69	3.01	3.07
Mining	MIN	1249	2308	4168	0.31	0.58	1.04
Mixed Tree Crops	MTC	6944	7657	7995	1.74	1.91	2.00
Oil Palm Plantation	OPL	27948	42572	101806	6.99	10.64	25.45
Rice Fields	RCF	28697	29416	30419	7.17	7.35	7.60
Upland Shrub land	SCH	288002	294930	258922	72.00	73.73	64.73
Settlements	SET	2776	2839	2840	0.69	0.71	0.71
Swamp Grassland	SGR	16713	13887	13525	4.18	3.47	33.8
Swamp Shrub land	SSH	98669	103509	108240	24.67	25.88	27.06
Timber Plantation	TPL	12008	12531	12117	3.00	3.13	3.03
Undisturbed Upland Forest	UDF	136217	115656	97007	34.05	28.91	24.25
Undisturbed Swamp Forest	USF	88069	77928	71035	22.02	19.48	17.76
Water Bodies	WAB	19808	19808	19808	4.95	4.95	4.95

Table 4. The area (×10³ km²) of major species detected by PALM in MODIS tile h29v09 (southern Kalimantan) annually from 2001–2014.

Species	2001	2002	2003	2004	2005	2006	2007	2008	2009	2010	2011	2012	2013	2014
Acacia	2.96	2.96	3.95	4.46	5.20	5.87	6.62	7.33	7.97	8.46	8.91	9.26	9.62	9.62
Rubber	0.41	0.41	0.66	0.84	1.04	1.23	1.50	1.95	2.42	2.92	3.53	3.78	4.24	4.24
Oil Palm	4.76	5.38	7.24	8.48	10.45	12.42	14.78	17.78	20.63	23.87	26.29	27.89	30.04	31.05

Table 5. The area (×10³ km²) of major species detected by PALM in MODIS tile h29v08 (northern Kalimantan) annually from 2001–2014.

Species	2001	2002	2003	2004	2005	2006	2007	2008	2009	2010	2011	2012	2013	2014
Oil Palm	8.19	10.18	10.69	11.08	12.05	12.61	13.32	14.29	15.26	17.07	18.08	19.17	20.12	20.44

Table 6. The area (×10³ km²) of major species detected by PALM in MODIS tile h28v09 (southern Sumatra) annually from 2001–2014.

Species	2001	2002	2003	2004	2005	2006	2007	2008	2009	2010	2011	2012	2013	2014
Acacia	9.30	9.32	9.91	10.37	11.01	11.78	12.31	12.70	13.14	13.75	14.07	14.19	14.19	14.19
Coconut	9.54	9.54	9.85	10.06	10.27	10.50	10.64	10.82	10.97	11.13	11.21	11.23	11.23	11.23
Rubber	17.24	17.24	18.24	18.69	19.14	19.79	20.47	21.65	21.98	22.34	22.53	22.49	22.54	22.55
Oil Palm	28.58	28.59	30.03	31.03	32.40	33.80	34.84	36.09	36.98	38.05	38.46	38.59	38.59	38.60

Table 7. Producer’s accuracy for PALM, TP, and RSPO derived from a sample of detected plantations around plantation mills in Indonesia. We report the total number of locations in each category, as well as the percent of true plantations detected by each data product (i.e., the producer’s accuracy).

Region	Sampled Locations	True Plantations	PALM Plantations	TP Plantations	RSPO Plantations
Southern Kalimantan	1116	951	890 (93.59%)	864 (90.85%)	751 (78.97%)
Northern Kalimantan	1004	724	578 (79.83%)	626 (86.46%)	388 (53.59%)
Southern Sumatra	1030	872	702 (80.50%)	744 (85.32%)	221 (25.34%)

Table 8. Analysis of random samples taken in 2014 by comparing to TP and RSPO (MODIS tile h29v09, southern Kalimantan). Confidence represents the percentage of samples that are real plantations. R0 are locations labeled as plantations by PALM, TP, and RSPO; R1 are locations labeled as plantations by PALM and TP, but not by RSPO; R2 are locations labeled as plantations by TP, but not by PALM and RSPO; R3 are locations labeled as plantations by PALM, but not by TP. R4 are locations labeled as plantations by RSPO, but not by PALM.

Metric	R0	R1	R2	R3	R4
number of pixels	85,539	71,801	77,916	25,790	13,471
confidence	100%	81.87%	39.00%	42.86%	41.90%

Table 9. Analysis of random samples taken in 2014 by comparing to TP and RSPO (MODIS tile h29v08, northern Kalimantan). Confidence represents the percentage of samples that are real plantations.

Metric	R0	R1	R2	R3	R4
number of pixels	20,984	31,220	48,567	11,408	4948
confidence	100%	81.65%	26.37%	58.33%	37.40%

Table 10. Analysis of random samples taken in 2014 by comparing to TP and RSPO (MODIS tile h28v09, southern Sumatra). Confidence represents the percentage of samples that are real plantations.

Metric	R0	R1	R2	R3	R4
number of pixels	60,488	210,037	169,565	67,795	16,256
confidence	100%	84.16%	31.98%	53.77%	48.33%

Table 11. Estimation of user’s accuracy for PALM, TP, and RSPO using DigitalGlobe high-resolution images in 2014.

MODIS Tile	Region	PALM	TP	RSPO
h29v09	Southern Kalimantan	85.53%	72.51%	92.10%
h29v08	Northern Kalimantan	85.93%	57.83%	88.04%
h28v09	Southern Sumatra	82.59%	65.59%	89.06%

Table 12. Estimation of overall accuracy for PALM, TP, and RSPO in each MODIS tile and the entire study region using DigitalGlobe high-resolution images in 2014.

MODIS Tile	Region	PALM	TP	RSPO
h29v09	Southern Kalimantan	97.15%	93.04%	96.48%
h29v08	Northern Kalimantan	97.70%	94.22%	95.77%
h28v09	Southern Sumatra	88.35%	81.31%	60.88%
-	Entire study region	94.29%	89.35%	84.03%

Table 13. Estimation of

κ

coefficient for PALM, TP, and RSPO in each MODIS tile and the entire study region using DigitalGlobe high-resolution images in 2014.

Table 13. Estimation of

κ

coefficient for PALM, TP, and RSPO in each MODIS tile and the entire study region using DigitalGlobe high-resolution images in 2014.

MODIS Tile	Region	PALM	TP	RSPO
h29v09	Southern Kalimantan	0.9616	0.8948	0.9547
h29v08	Northern Kalimantan	0.9738	0.9126	0.9521
h28v09	Southern Sumatra	0.7236	0.4472	−0.0275
-	Entire study region	0.9149	0.8220	0.7622

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jia, X.; Khandelwal, A.; Carlson, K.M.; Gerber, J.S.; West, P.C.; Samberg, L.H.; Kumar, V. Automated Plantation Mapping in Southeast Asia Using MODIS Data and Imperfect Visual Annotations. Remote Sens. 2020, 12, 636. https://doi.org/10.3390/rs12040636

AMA Style

Jia X, Khandelwal A, Carlson KM, Gerber JS, West PC, Samberg LH, Kumar V. Automated Plantation Mapping in Southeast Asia Using MODIS Data and Imperfect Visual Annotations. Remote Sensing. 2020; 12(4):636. https://doi.org/10.3390/rs12040636

Chicago/Turabian Style

Jia, Xiaowei, Ankush Khandelwal, Kimberly M. Carlson, James S. Gerber, Paul C. West, Leah H. Samberg, and Vipin Kumar. 2020. "Automated Plantation Mapping in Southeast Asia Using MODIS Data and Imperfect Visual Annotations" Remote Sensing 12, no. 4: 636. https://doi.org/10.3390/rs12040636

APA Style

Jia, X., Khandelwal, A., Carlson, K. M., Gerber, J. S., West, P. C., Samberg, L. H., & Kumar, V. (2020). Automated Plantation Mapping in Southeast Asia Using MODIS Data and Imperfect Visual Annotations. Remote Sensing, 12(4), 636. https://doi.org/10.3390/rs12040636

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Automated Plantation Mapping in Southeast Asia Using MODIS Data and Imperfect Visual Annotations

Abstract

1. Introduction

2. Dataset and Study Region

2.1. MODIS Data

2.2. Study Region

2.3. Training Data

2.3.1. Tree Plantation Dataset

2.3.2. RSPO Dataset

3. Method

3.1. PALM Framework

3.1.1. Ensemble Learning Method

3.1.2. Collecting Training Samples

3.1.3. Learning Model

3.1.4. Filtering

3.1.5. Post-Processing

3.1.6. Hierarchical Classification

3.2. Validation of Plantation Maps

4. Results

4.1. Plantation Map and Basic Statistics

4.2. Validation Using High-Resolution Images

4.2.1. Validation for Producer’s Accuracy

4.2.2. Validation for User’s Accuracy

4.2.3. Overall Accuracy

4.2.4. Case Studies of Model Performance

5. Discussion

5.1. Smallholder Tree Plantations

5.2. Tree Plantation Species-Specific Mapping

5.3. Classification Model

5.4. Limitations

6. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI