Next Article in Journal
Coastal Upwelling in the Western Bay of Bengal: Role of Local and Remote Windstress
Previous Article in Journal
Evaluating Long-Term Variability of the Arctic Stratospheric Polar Vortex Simulated by CMIP6 Models
Previous Article in Special Issue
Impact of Training Set Configurations for Differentiating Plantation Forest Genera with Sentinel-2 Imagery and Machine Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Landsat Data Based Prediction of Loblolly Pine Plantation Attributes in Western Gulf Region, USA

1
College of Environmental and Resource Sciences, Zhejiang University, Hangzhou 310058, China
2
School of Information Engineering, Zhejiang A&F University, Hangzhou 311300, China
3
Arthur Temple College of Forestry and Agriculture, Stephen F. Austin State University, Nacogdoches, TX 75965, USA
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(19), 4702; https://doi.org/10.3390/rs14194702
Submission received: 8 August 2022 / Revised: 15 September 2022 / Accepted: 16 September 2022 / Published: 21 September 2022
(This article belongs to the Special Issue Satellite-Based Forest Structure Mapping)

Abstract

:
The suitability of using Landsat sensor variables to predict key stand attributes, including stand average dominant/codominant tree height (HT), mean diameter at breast height (DBH), the number of trees per hectare (NT), basal area per hectare (BA), and stand density index (SDI), of intensively managed loblolly pine plantations in the Western Gulf Region at the plot/stand level was assessed. In total, thirty Landsat sensor variables including six original bands, three vegetation indices, three Tasseled Cap transformed indices, and eighteen texture measure variables were used as predictors. Field data of 125 permanent plots located across east Texas and western Louisiana were used as reference data. Individual trees of those plots were measured at plot establishment (referred to as the first cycle measurement; average about 4.5 years old) and remeasured in three-year intervals (the second cycle measurement at approximately seven years old and the third cycle measurement at approximately 10 years old). Thus, field reference data represent stand development from open- (first cycle) to closed-canopy (third cycle). Models to predict stand HT, DBH, NT, BA, and SDI were developed by cycle using multiple linear regression (MLR) and also random forests (RF) methods. Results indicated that the first cycle stands HT, DBH, BA, and SDI were well predicted using the Landsat sensor variables with R2 > 0.7 and low RMSEs. These relationships weakened with stand age, although still moderate with R2 being around 0.45 for the second cycle measurement and became practically useless (R2 < 0.30) for the third cycle measurement. For NT, no meaningful models were achieved regardless of the measurement cycle. The MLR and RF models were comparable in accuracy and had similar key predictors. Overall, the shortwave infrared bands, red band, and wetness index were the most important predictors, but their dominance declined with the cycle. Texture measure variables were relatively less important but a trend of increasing their importance with cycle was noted. Results show promise for operationally predicting stand variables for young pine plantations, an age class that typically presents significant challenges using conventional forest measurement methodologies. Potential methods to further improve model accuracy and how to use the results within the context of pine plantation management planning in the region were discussed.

1. Introduction

Accurate forest inventory data are essential in forest management planning. Conventional, forest inventory data are collected via ground-based survey methods [1]. These methods, while collecting information with acceptable accuracy, have been criticized for their high cost, time consumption, and inability to update in a timely manner [1]. Indeed, exploring methods that avoid the disadvantages of ground-based methods and achieve an acceptable accuracy is of great interest to foresters. Remote sensing techniques likely are a viable alternative.
While numerous sources of remotely sensed data are available, the Landsat sensors are still the most widely used in forestry [2,3]. This is because the Landsat sensor series data, in addition to their potential relationships with forest attributes, provide a large, free archive of imagery with medium spatial resolution, relatively large spatial extent, and frequent revisit time. Great efforts have been made to evaluate the suitability of using Landsat sensor data including individual spectral bands, vegetation indices, transformed images, and textural images, to estimate forest attributes. Mixed results have been reported, with some revealing a high level of accuracy (i.e., R2 > 0.60) [4,5,6], and others reporting great uncertainties [7,8]. Landsat sensor variables are sensitive to many factors such as stand structure, geographic region, and human disturbance, compromising their application in predicting forest attributes. Depending on stand attribute and condition, often Landsat images were found to estimate forest stand attributes with 30% or greater error [2,9], higher than the allowable error (15~20%) of what is often required for operational forest management. Furthermore, most of the previous studies have focused on natural forests with complicated stand structures of multiple species, age classes, various densities, and understory conditions.
Little research has been dedicated to commercial plantations despite their economic importance. Compared to natural forests, the application of remotely sensed data to plantation forests potentially faces different problems [10]. Most plantations are intensively managed, single-species monocultures, resulting in distinct stand structural and temporal characteristics. Silvicultural activities such as implementing site preparation, competition control, and fertilization homogenize the understory of young (open crown) plantations, facilitating the relation between remote sensing data and stand attributes, but homogenize stand canopy structure and enhance stand productivity, compromising the ability of images in discerning canopy-closed plantations. Stand structure is a complex combination of several parameters such as stand average height, diameter, density, biomass, and volume. Previous studies have focused on using remotely sensed data to predict forest stand biomass and volume. For plantation forests, biomass and volume are secondary variables, often calculated from traits such as stand tree size (average height and diameter) and density. Therefore, the usefulness of Landsat sensor variables to recover plantation stand attributes should be evaluated by their adequacy in predicting stand tree size and density. Density management is paramount for the success of managing plantations. Previous studies on the use of Landsat sensor data have unanimously adopted the number of trees per unit area (stocking) as an indicator of density, which, however, has intrinsic limitations when applied to plantations since it does not consider tree size [11]. Basal area or stand density index combines stocking and tree size and has been widely adopted in decision-making in managing plantation density by foresters [1]. Thus, the suitability of using Landsat sensor data to predict basal area or stand density index is of great practical value, which, however, has seldom been investigated. Recent research [11] reported promising results of using Landsat sensor data to predict the height and basal area of young spruce plantations. However, much less optimistic findings were indicated for predicting the number of trees per hectare and volume of coniferous plantations [10].
Plantation management planning schedules activities aimed at optimizing the utilities of a plantation, in particular wood production. Often the key activities include, but are not limited to, projection of stand conditions and economic value (growth and yield modeling), mid-rotation silvicultural treatments, and harvest. Timely inventory data are needed to support these activities. These data should be sufficiently accurate, the level of which depends on the activity. Less precise data may be acceptable for projection purposes while a higher accuracy is desirable for supporting silvicultural activities [9]. To integrate Landsat sensor data into forest management, their usefulness must be assessed by how well they can support the activities, which, however, is missing in the literature. Most previous studies have adopted a simplistic view of the information needs in the forest planning process, without relating the analysis to management decisions [9].
Loblolly pine (Pinus taeda L.) is the most widely planted commercial species in the West Gulf (WG) Region. For example, in east Texas, forestland occupies about 4.9 million hectares, of which, 1.2 million hectares (24%) are classified as pine plantations, with most being composed of loblolly pine [12]. Intensive silvicultural activities such as competition control, chemical site preparation, tillage, fertilization, thinning, and other activities have been routinely applied to loblolly pine plantations [13], resulting in rapid stand development including canopy closure over a short period post-establishment. For current operational forest management, forest inventory data are needed for young (around five years) and mid-rotation ages (around 10 years), yet traditional methods are costly. Landsat sensor data could be useful for this purpose, yet has not been fully investigated. Landsat sensor data have been demonstrated useful in managing loblolly pine plantations in ways such as mapping plantations [14] and predicting leaf area index [15,16,17,18]. Only one study [19] reported the application of Landsat sensor data to estimate loblolly pine plantation age and stocking (trees per hectare) and the results were encouraging (R2 = 0.70 and 0.60, respectively). None of the above studies linked results to specific planning activities, limiting operational application. Overall, the potential of Landsat sensor data for estimating pine plantation attributes has not been fully explored, yet the interests of forest companies in using remotely sensed data for forest inventory purposes remain high (Mr. Trevor Terry, personal communication).
Initiated in 1982, the East Texas Pine Plantation Research Project (ETPPRP) has been exploring models for managing loblolly pine plantations in the region (http://etpprp.hzgzsoft.com:92/ (accessed on 8 July, 2022)). In 2004, the ETPPRP began establishing permanent plots in intensively managed loblolly pine plantations across the region, which were repeatedly measured on a three-year scale, providing temporal reference ground data. The objective of this study was to analyze the suitability of using Landsat sensor data to predict loblolly pine plantation attributes from before and after crown closure. If satellite reflectance data can be sufficiently related to measured plantation stand attributes, information from this study can be incorporated into growth and yield modeling and forest management planning.

2. Materials and Methods

2.1. Field Data

Between 2004 and 2007, permanent plots were installed in loblolly pine plantations in East Texas and into Western Louisiana to best represent the growing conditions unique to the WG region. These plantations are artificially planted, even-aged monocultures and intensively managed, and belong to a paradigm of high-yielding, short-rotation, coniferous forest plantations. Plots were generally located between 30 to 33° north latitude and 93 to 96° west longitude (Figure 1). The region belongs to the humid subtropical climate, receiving an annual average rainfall of 123 cm, paired with a mean temperature range from 2.2 °C in January to 33.9 °C in July, and an elevation ranges from 46 to 107 m, and mostly sandy acidic loamy soils.
At each study location, one square plot measuring approximately 0.1 ha (30.48 × 30.48 m) was installed [20]. A Trimble Juno SB handheld GPS receiver was used to record the latitude and longitude of the center of each plot. With ArcPad installed on the handheld, an average of 30 positions were used for capturing each point location. No post-processing differential correction was performed. It achieved a horizontal accuracy of 2–5 m HRMS. Plots were operationally managed along with the plantations, i.e., being thinned and/or fertilized when the plantations were thinned and/or fertilized. At each plot, all planted loblolly pine trees and also volunteer pine trees with a diameter at breast height larger than four inches (10.2 cm) were permanently tagged. Trees were measured when the plot was installed (referred to as the first cycle measurement) and remeasured in summer on a three-year interval (referred to as second, third, …, cycles) for diameter at breast height (nearest 0.1 inches) and total height (nearest 1.0 foot), among other traits. Since plots were established at different years, multiple-year data were involved for each cycle (i.e., the first cycle data ranged from 2004 to 2007 and the second cycle from 2007 to 2010). Original English unit values were converted to metric values prior to analysis. Individual tree basal area was calculated using tree diameter (=0.0000785 × diameter2). Individual tree data were first examined, outliers removed, and then summarized to obtain plot and stand level mean values such as the height of the dominant and codominant trees (using the tallest 10 trees per plot) (HT in m), plot mean diameter (DBH in cm), number of trees per hectare (NT ha−1) and basal area per hectare (BA m2 ha−1). Stand density index (SDI) was calculated as SDI = NT × (Dq/25.4)−1.605 [21], where Dq is the stand quadratic mean diameter in cm and calculated as Dq = √(BA/(0.0000785 × NT)). Plantation age (years) was determined as the time between the current measurement date and the plantation establishment date derived from stand records.
To date, five measurement cycles have been completed for most plots. Since this study focused on using Landsat sensor data to support two critical management activities for pine plantations (projecting stand development and guiding mid-rotation silviculture), only the first three measurement cycles were used. Data of the first cycle represent a scenario of open-canopy plantations, the second cycle of partly open-canopy plantations, and the third cycle of closed-canopy plantations. Note that plot data were further cleaned by cycle; for example, while plots of the first cycle with age = one year were excluded from the first cycle data, those plots might be included in subsequent cycles. Additionally, note that four plots have been thinned in the third cycle. Details of summary statistics of the dataset by cycle are provided in Table 1. In total, data of 116, 125, and 121 plots were used in the first, the second, and the third cycles, respectively. When the plots were established (the 1st cycle), tree age ranged from two to 11 years old (average 4.5 years) and stocking from 829 to 2121 (average 1290) trees ha−1. During the period of six years from age 4.5 to 9.9 years, the stand grew rapidly; the average increased from 5.4 to 9.9 m for HT, 7.1 to 15.2 cm for DBH, and 6.6 to 23.3 m2 ha−1 for BA. Additionally, during the period, the density increased substantially from 194.2 to 557.5 in SDI, and the stand canopy advanced from open to closed. While many attributes can be used to describe stand structure, the response variables in the study included those which are often required in plantation planning: HT, DBH, BA, NT, and SDI [22].

2.2. Satellite Data

Google Earth Engine (GEE) platform is a cloud-based platform providing a plethora of satellite images including Landsat sensor time series data from the United States Geological Survey (USGS, http://earthexplorer.usgs.gov/ (accessed on 18 June 2022)). Researchers can use GEE for pre-processing and extracting large datasets for free, which has been widely used in recent years [23].
We used longitude and latitude of a plot center to match the spatial location of the plot and its remote sensing data. We used the 30 m Collection1, Tier1, Surface Reflectance (C01/T1_SR) Landsat sensor time series data including Landsat 5 Thematic Mapper (TM), Landsat 7 Enhanced Thematic Mapper Plus (ETM+), and Landsat 8 Operational Land Imager (OLI) from GEE, depending on availability, to create satellite datasets. Thus, the sizes between reference plots and Landsat image pixels were similar to avoid spatial errors as much as possible. The C01/T1_SR datasets were geometrically and atmospherically corrected using Landsat Disturbance Adaptive Processing System (Landsat 5 and Landsat 7) [24] and the Land Surface Reflectance Code algorithm (Landsat 8). In addition, all the clouded, shadowed, and snowed pixels were masked out using QA bands with C Function of the Mask algorithm (CFMask) [25,26]. Since reference plots were installed from 2004 to 2007, Landsat sensor data were obtained correspondingly to match year of field data collection. Since field data were collected in summer, images were acquired for an early July through late August timeframe (Table 2). Most images were cloud free or had cloud cover of less than 5%. Excessively clouded or poor-quality data were disqualified and the neighboring time data instead or an alternative Landsat source were used to insure complete Landsat data coverage.
Original bands used include BLUE (450–520 nm), GREEN (520–560 nm), RED (630–690 nm), Near Infrared (NIR, 770–900 nm), Shortwave Infrared 1 (SWIR1, 1550–1750 nm), and Shortwave Infrared 2 (SWIR2, 2080–2350 nm) bands. Three commonly used vegetation indices, including Normalized Difference Vegetation Index (NDVI) [27], Enhanced Vegetation Index (EVI) [28], and Modified Soil Adjusted Vegetation Index (MSAVI) [29], were derived and used. Tasseled Cap (TC) transformation was applied for each image to produce brightness, greenness, and wetness indices [30]. Eighteen texture features were calculated by the Gray Level Co-occurrence Matrix (GLCM) (Table 3) of NDVI for each pixel with 3 × 3, 5 × 5, and 7 × 7 windows [31,32], respectively, but only those with 5 × 5 pixels were reported since adding the other two did not change results substantially. The GLCM method is one of the most commonly used approaches for textural analysis in forestry applications [33]. Thus, a total of 30 Landsat image variables were used in this study as candidate predictors (Table 3).

2.3. Model Development and Validation

Both multiple linear regression (MLR) and random forests (RF) [34] methods were used to develop models predicting the stand attributes using the Landsat variables. These methods were selected because MLR is the most popular, while RF is the most accepted nonparametric method in forest statistical data analysis [35]. Data were split into two datasets, one for model development (75% of the data, referred to as the training dataset) and the other for model tests (25% of the data, referred to as the test dataset).
For MLR, a candidate model was first selected using the training dataset and stepwise selection, with a significant level of 0.05 being required to allow a variable into and stay in the model. To mitigate the issue of multicollinearity, the model predictors’ variance inflation factors were calculated and some predictors with large values were dropped (a maximum value of five was applied). The lm function in R [36] was used for the analysis.
For RF, models were trained using the training dataset and the randomForest package in R [37]. All Landsat variables were initially included as predictors. Two algorithm parameters, the number of trees (ntree) and the number of variables per level (mtry) were tuned and results suggested a mtry of 7 and an n.trees of 500 for modeling all the attributes. The models were trained, and the importance of predictors was evaluated by calculating %IncMSE, which shows mean decrease in accuracy of the predictions when a variable is removed from the model. In the subsequent step, only those with %IncMSE ≥ 2 were used as predictors and used to train the final models. An %IncMSE ≥ 2 is an arbitrary value, which, however, simplifies the models and can increase the use by foresters. This seems reasonable since our preliminary analyses show that including predictors with %IncMSE < 2 did not change model predictability.
For the resulting MLR and RF models, the model goodness-of-fit to the training dataset was evaluated by calculating the model coefficient of determination (R2) and root mean standard error (RMSE). Note that for RF models, the goodness-of-fit parameters for RF were automatically based on out-of-bag cross-validation, leading to underestimates. Models with larger R2 and smaller RMSE have greater predictability, although the latter is a better choice than the former for comparing the accuracy among different models. For MLR models, residual plots were visually checked to assess model assumptions (normality, equal variance, and independence).
All final models were then tested based on the independent test dataset. R2 and mean absolute error (MAE) were calculated by using model predicted and field observed values as indicators of model accuracy. MAE is preferred over RMSE in model tests due to its ease of interpretation. We did not calculate relative RMSE or MAE (express RMSE or MAE as a percentage of the sample average) since it can be misleading in repeated data since attributes with larger averages tend to have lower relative values.

3. Results

Selected MLR models for predicting the stand attributes using the Landsat sensor variables are shown in Table 4. The model’s goodness-of-fit varied with cycle and stand attributes. For the first cycle measurement, HT, DBH, BA, and SDI were predicted well. The achieved R2 ranged from 0.61 (DBH) to 0.74 (SDI) and their respective RMSE were fairly low, 1.22 m, 2.11 cm, 3.23 m2, and 78.16 for HT, DBH, BA, and SDI, relative to their respective minimum (Table 1). The selected model predictors were similar for all the attributes (Table 4); the SWIR1 was consistently the top predictor for all the attributes, followed by RED (for HT, BA, and SDI) or NIR (for DBH). The lower SWIR1 or larger RED values were associated with faster stand growth and higher density. No index variables stayed in any models other than the texture measures of ASM and IDM for predicting BA and SDI. The goodness-of-fit for the HT, DBH, BA and SDI models of the second cycle measurement were greatly reduced, but still moderate, with R2 ranging from 0.43 (HT) to 0.48 (SDI). SWIR1 remained the top predictor for all the models, while indices SAVG (for DBH and BA) and NDVI (for BA and SDI) were also selected. The model goodness-of-fit of the third cycle models worsened further to the point that the Landsat sensor variables were practically useless for predicting HT and DBH. For BA and SDI, the models, respectively, explained 30% and 21% of the total variation and SWIR1 and RED were still the best predictors. Residual plots showed that model assumptions (normality, equal variance and independence) were acceptable in general, although some imperfections were present (data not shown). Differently, the NT was poorly predicted within the training datasets for all cycles, with R2 being 0.19, 0.08 and 0.17 for cycles one to three, respectively. Identified predictors varied with cycle and neither SWIR1 nor RED was important predictor.
Applying the models to the test dataset confirmed the MLR model fitness (Table 4; Figure 2, Figure 3 and Figure 4). The obtained R2 values were similar in magnitude to respective ones in the model training for each cycle. The MAE estimates were low, e.g., being 0.87 m, 1.49 cm, 1.77 m2 ha−1, and 52.49 for the first cycle HT, DBH, BA, and SDI, respectively. Overestimations occurred for small values and underpredictions were true for large values, and this trend was more evident for models of the latter cycles (Figure 2, Figure 3 and Figure 4).
Models for predicting selected stand attributes were trained using RF to evaluate the importance of predictors, and Table 5, Table 6 and Table 7 list all the selected predictors with %IncMSE ≥ 2. For the first cycle measurement, the reflectance bands SWIR1, SWIR2, and RED, and the TCT variables wetness or brightness were the important (%IncMSE ≥ 10) predictors for modeling HT, DBH, BA, and SDI, while vegetative indices were the top predictors for predicting NT. The identified top predictors for the second cycle models remained similar, although texture index SAVG was also important for the HT, DBH, and NT models. For the third cycle, while the SWIR1, SWIR2, and wetness were still the top predictors, more texture indices were identified as the top predictors as well. Evidently, contributions were dominated by a few top predictors in the first cycle, this dominance, however, decreased with increasing cycles. For example, SWIR1 had a %IncMSE value of 30.42 for predicting BA in the first cycle, which was reduced to 24.19 and 12.53, respectively, in the second and third cycles. Many texture variables were included with %IncMSE between 2~5, only a few, e.g., SAVG and IMCORR1, had a %IncMSE > 7. The relative importance of texture variables seemly increased with the cycle. None of the vegetative indices were identified in the top four predictors for all models other than predicting NT for cycles one and two.
The final RF models were re-trained using the important predictors (Table 5, Table 6 and Table 7) and the training dataset, and the model goodness can be found in Table 8. For the first cycle, the models for predicting HT, DBH, BA, and SDI explained 65% or more of the total variation, and achieved reasonably low RMSEs of 1.33 m, 2.20 cm, 3.40 m2 ha−1 and 80.49 for HT, DBH, BA and SDI, respectively. The corresponding models for the second cycle showed a medium fitness with R2 being around 0.40 and those for the third cycle having no practical values. Similar to the MLR results, the NT models were poor in fitness for all the cycles, with a R2 ≤ 0.16. Our model test using the test dataset showed that the models of the first cycle had the accuracy as expected, as demonstrated by the similar R2 between training and test datasets and the low MAEs of 0.84 m, 1.34 cm, 2.06 m2 ha−1 and 60.2 for HT, DBH, BA and SDI, respectively (Table 8; Figure 2, Figure 3 and Figure 4). However, the models of the second cycle had lower test R2 than respective ones of the model fitting for all attributes other than SDI, of which a similar R2 was achieved. Similar to the MLR models, the attributes were overestimated for low observed values and underestimated at the high end for all the attributes; this trend was more evident for latter cycles (Table 8; Figure 2, Figure 3 and Figure 4).

4. Discussion

Intensively managed loblolly pine plantations are often subjected to a variety of silvicultural treatments at the time of planting or shortly thereafter [38], and their canopy structures develop in a rapid, yet predictable, way. Intensive management enhances tree growth and suppresses underground vegetation, creating a favorable environment for using images for open-canopy plantations, but can lead to homogenous, dense stand canopy structures, weakening penetrability of Landsat sensor signals and compromising the estimation of stand attributes after clown closure. The repeated (cycles one to three) data used represent crown development from pre- to post-closure over a period of six years in operational loblolly pine plantations. Results reveal that HT, DBH, BA, and SDI of loblolly pine plantations may be predicted adequately when crowns were open, but such predictability weakened as stands developed, being moderate when crowns were partially closed and becoming practically useless after crowns were completely closed. To our knowledge, this study is the first to quantify changes in Landsat sensor data suitability within a critical period of stand development from open crown to complete crown closure. The general belief that strong relationships exist between Landsat sensor data and stand attributes in homogeneous forests such as in conifer plantations [39,40] is an oversimplification. Since the plantation canopy was at such a high level of density after crown closure that Landsat sensors lost the ability to penetrate the canopy and were unable to differentiate between individual tree canopies, compromising the estimates of DBH, and consequently the estimates of BA and SDI. In upland conifer plantations of various ages in Scotland, stand HT and BA were strongly (R2 > 0.77) correlated with Landsat sensor bands but the strength of the relationships decreased substantially after canopy closure occurred [39]. The strong predictability of Landsat sensor variables for young (pre-crown closure, 2–17 years old) Sitka spruce plantations has been reported [11]. In even-aged Eucalyptus plantations, correlations between high spectral resolution ASTER satellite data and stand trees ha−1, DBH, HT, BA, and volume were stronger for plantations of ages 4–6 years than those of 7–9 years [41]. These studies paired with our findings suggest that the suitability of using Landsat sensor data to estimate plantation stand attributes varies with the stand development stage, being more useful for young and open-canopy plantations.
One important finding was that Landsat images were not practically useful for predicting NT during the study period (Table 4 and Table 8), suggesting that the counting of trees is difficult when the satellite images are of moderate resolution, especially when tree canopies overlap. This disagrees with findings from a study also on loblolly pine plantations in east Texas [19] where as high as 60% of the variation in NT was accounted for by Landsat sensor variables such as NDVI. This inconsistency could partly be due to sampling; Their NT ranged from 222 to 2396 trees ha−1, whereas our study sampled a much narrower NT range (Table 1), increasing the difficulty of discerning populations from Landsat sensor variables. Additionally, reflection of satellite on a pine plantation is more determined by crown characteristics, in particular crown width, a strong predictor for tree DBH. Thus, a stronger relationship of Landsat sensor data with BA or SDI over NT is not unexpected. Poor predictability for NT using Landsat sensor variables has been confirmed in young commercial plantations of other species [11,39]. Thus, there is uncertainty associated with the application of using Landsat variables to predict NT for commercial plantations, suggesting further investigation of other variables or methods to explain forest variability in NT is necessary. Further discussion focuses more on stand attributes of HT, DBH, BA and SDI.
While used widely, the MLR method is often criticized for model development to estimate stand attributes since several assumptions (linear relationship, equal variance, normality and independence) are required. RF has theoretical advantages over MLR modeling by overcoming the assumption-related limitations and also its ability to address the issues of high-dimensionality and multicollinearity. RF, relatively new in forest statistical data analysis, has recently been adopted to predict forest stand attributes using climate and biophysical data [35,42,43]. For this data, even with many strongly correlated Landsat variables being involved, the models from RF and MLR were competitive with one another in predictability; neither approach produced substantially better models (Table 4 and Table 8). The selected predictors by both methods were similar, in particular for the first cycle models. In a study to modeling stand BA, volume and biomass using airborne Lidar and multispectral imagery in heterogeneous mixed species forests in Alabama, models developed by MLR and RF were comparable [44]. Fewer Landsat variables were selected as predictors of the MLR models, and their predictability may be improved by including highly related predictors such as SWIR2, which, however, complicates biological explanations. Further improvements in predictability may be realized by adding extra predictors to RF models too, but the improvement is unlikely to be substantial. One drawback of using RF technique to develop models is the implementation of the models which is difficult without programming the models into source codes [35]. Currently the MLR models proposed in this study are recommended over RF models due to their ease of use, comparable accuracy and straight-forward interpretation.
Both the MLR and RF models tended to bias for low (overestimate) and high (underestimate) observed values (Figure 2, Figure 3 and Figure 4), which has been identified in previous studies [45]. For plantations with closed crowns and high productivity, the limited sensitivity of the optical sensors for interpreting the canopy structure leads to saturated measurements while for young stands with open crowns the confounding effect of understory vegetation may overestimate stand attributes. Our results further indicated that the biased tendencies varied with stand development, being relatively minor for the first cycle measurement and becoming more evident with stand crown closing.
Landsat sensor data are commonly used for assessing stand attributes due to their potential correlations, free availability, and broad spatial and temporal coverage. While numerous Landsat variables are available, the selection of the ideal ones is challenging, varying with stand structure and environment. The dynamic changes in stand density (crown closure) and understory background as stands develop can lead to various variables being the optimal predictors at different stages. In this regard, some patterns stood out based on our results for a short period of six years of the plantation’s early growth. First, the bands, SWIR and RED, were consistently the most important predictors. Previous studies on loblolly pine plantations indirectly support our findings, where SWIR1, RED, and NDMI (an index strongly influenced by SWIR) strongly correlated with stand age and leaf area index, both growth-related stand attributes [15,18,19]. The SWIR1 was also identified as the top predictor for forest stand properties on other pine stands [39,46,47]. Second, wetness could be a key variable for predicting tree size of open-canopy plantations (Table 5) and for predicting stand density of plantations with partly or completely closed crowns (Table 6 and Table 7), which is in parallel to the findings on other species [19]. Our MLR models did not include wetness due to its strong correlation with SWIR, leading to an issue of multicollinearity. Third, all vegetative indices except NDVI were relatively unimportant. NDVI was selected as a predictor for predicting BA and SDI for canopy-closed plantations (Table 4). Vegetation indices are useful in estimating stand attributes such as biomass because they enhance green vegetation signals and minimize the impacts from the surface and atmospheric effects in complex vegetation structures [4]; their usefulness likely reduces in relatively simple and homogeneous structures such as intensively managed pine plantations. Fourth, many texture variables contributed somewhat to improving model prediction, with most having a %IncRME between 2–4. The models centralized the importance at a few band variables such as SWIR1 and RED before crown closure, this dominance declined as stands develop, and models split the importance to more texture measure variables after crowns were completely closed (Table 5, Table 6 and Table 7). Texture features performed better than spectral factors for modeling stand attributes in areas with multiple layers and complex forest stand structures [48,49], and our results support the pattern that the importance of texture variables increased from the first cycle in a situation of simple structure to the more complex structure of the third cycle.
Basal area (BA) and SDI are numerical values that capture intensity of competition within forest stands. Operationally, BA and SDI are parameters often used to help foresters with thinning decisions, and thus accurately predicting BA or SDI in particular at the mid-rotation age has great practical value. Even models predicting BA and particularly SDI had larger R2 than the respective ones for predicting HT and DBH (Table 4 and Table 8), their accuracies in predicting the mid-rotation age (third cycle) BA and SDI were low, with R2 < 0.25, suggesting infeasible of the method to guide thinning of loblolly pine plantations unless the models can be otherwise improved in future.
Can the findings of this study be incorporated into plantation management of loblolly pine in the region? This question has to be answered within the context of operational pine plantation planning. For loblolly pine plantations in the region, often a timber cruise around 5 years is carried out to obtain stand information such as mean HT, DBH, and NT, which is then entered into regional growth and yield models [22] to project future stand characteristics (BA, volume, and NT) and economic values under scenarios of various silvicultural activities. For this purpose, the required accuracies for predicting HT, DBH, and NT are relatively low. Our results are encouraging for predicting HT and DBH of the plantations around age five (Table 4 and Table 8). The issue was NT, which was poorly predicted for all cycles. In reality, however, loblolly pine plantation survival is routinely surveyed at the end of the first or second growing season, and this number is unlikely to change substantially in the subsequent years until competition-induced mortality occurs [38]. In other words, NT of young loblolly pine plantations are typically known. Thus, our results do support the use of Landsat sensor data to facilitate the long-term management planning. Forest inventory data are indispensable for mid-rotation (around age 10 years) loblolly pine plantations to support silvicultural activities such as commercial thinning and/or fertilization. Toward this end, Landsat sensor data were found not to be reliable indicators of stand attributes and, therefore, caution should be taken for using Landsat sensor data to replace field measurements.
Are there options to improve the predictability of the models? With the inclusion of other factors, a few studies have shown positive results in predicting stand attributes. Ages are known accurately in any well-managed loblolly pine plantations and adding age as a predictor was reported to improve the performance of Landsat sensor data-based models [45]. In this study, when stand age was added as a predictor to the MLR models for predicting HT, DBH, BA, and SDI, the model R2 was improved to 0.82, 0.75, 0.80, and 0.80 for the first cycle, 0.72, 0.62, 0.62, and 0.61 for the second cycle, and 0.55, 0.44, 0.41, and 0.32 for the third cycle (note that multicollinearity did not become an issue), greatly reducing the uncertainties of model estimates. Geographic factors, such as stand elevation, slope, and aspect, are known to affect spectral reflectance values [50], and thus including them may improve model performance. We are currently collecting this information so that a test on the topic can be carried out. Synergistic use of data from different sources or from different sampling dates (or seasons) of one source to estimate stand attributes can be more accurate than data from a single source or a single date [3,51]. For this study, Landsat sensor data were extracted for summer only. The suitability of incorporating Landsat sensor data from other seasons to improve model performance should be further investigated. Clearly, applying high-resolution remotely sensed data would improve model performance, which, however, is accompanied by higher data costs. Reference [52] estimated stand attributes of pine plantations using a WorldView-2 multispectral imager (2m spatial resolution) and achieved acceptable estimates for HT and DBH (errors of <13%) but relatively large errors (up to 30%) for BA, volume, and NT estimates. In another study, researchers estimated BA for a plantation of loblolly pine using airborne LiDAR data, achieving an accuracy of R2 = 0.97 [40].
Field reference data well representing the truth are indispensable for model development. There are a few issues about field data which need to be clarified for this study. Four plots were installed in plantations with ages between 8~11 years, and therefore their first cycle data do not represent open-canopy plantations. Additionally, four plots at the third measurement cycle have been commercially thinned once. We did not exclude these plots in analyses since our preliminary analyses showed that removing them would not change the results. Our field plots were in the square of 900 m2, very similar to the pixel shape and size, nonetheless, a potential mismatch between the field coordinates and the points of spectral variable extraction might occur. Note that the plots measured in this study are considered operational for loblolly pine plantations in the WG region. Consequently, our results well represent the relationships between Landsat sensor data and observed growth and density in operational settings there, and thus, the models may be valuable tools for managing the current regional loblolly pine plantations.

5. Conclusions

In current industrial pine plantation management in Southern US, inventory data at the young stand development (i.e., for projecting stand development) and mid-rotation stage (for guiding silvicultural activities) are indispensable. Using repeated data collected from loblolly pine plantations in the WG region, USA, the suitability of using Landsat sensor data to predict key plantation attributes for the early (open-crown) to mid-rotation (closed crown) stage was evaluated. Landsat sensor data predicted the young, open-canopy plantation tree height, DBH, basal area, and stand density index adequately. This predictability, however, weakened with stand development, becoming moderate for plantations with partly closed crowns and poor for plantations with completely closed crowns. Regardless of stage, Landsat sensor data predicted the number of trees poorly, which, however, often has been known. The shortwave infrared bands, red band, and wetness index were the most important predictors, but their dominance declined with stand development. Texture measure variables were relatively less important, but a trend of increasing importance was noted as stands developed during the study period. Model predictability may be improved by adding other predictors such as stand age. We concluded that Landsat sensor data may be used to obtain stand-level information for projecting stand development but may not be accurate enough to guide mid-rotation silviculture activities such as thinning.
Developed models are based on the training dataset, extrapolation outside the range of the training dataset can be biased, and caution should be used. We suggest that the procedure adopted in this study should be tested in loblolly pine and other commercial pine plantations of various stages across Southern US while investigating the accuracy of using Landsat sensor series and other remote sensing data.

Author Contributions

Conceptualization, C.C. and Y.W.; methodology, C.C., Y.W., K.W. and L.F.; software, C.C., K.W. and L.F.; formal analysis, C.C., Y.W. and C.T.; investigation, C.C. and J.G.; resources, Y.W. and KW; writing—original draft preparation, C.C. and Y.W.; writing—review and editing, J.G., K.W., L.F. and C.T.; supervision, K.W. and Y.W.; project administration, Y.W.; funding acquisition, Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the ETPPRP and the McIntire—Stennis program.

Acknowledgments

We would like to thank the ETPPRP members (Rayonier, Resource Management Services, and Forest Resources Consultants) and the Stephen F. Austin State University for their support of the program as well as all the ETPPRP student workers who helped collect data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Burkhart, H.E.; Avery, T.E.; Bullock, B.P. Forest Measurements, 6th ed.; Waveland Press Inc.: Long Grove, IL, USA, 2019; pp. 352–371. [Google Scholar]
  2. Hyyppä, J.; Hyyppä, H.; Inkinen, M.; Engdahl, M.; Linko, S.; Zhu, Y.H. Accuracy comparison of various remote sensing data sources in the retrieval of forest stand attributes. For. Ecol. Manag. 2000, 128, 109–120. [Google Scholar] [CrossRef]
  3. Lefsky, M.A.; Cohen, W.B.; Spies, T.A. An evaluation of alternate remote sensing products for forest inventory, monitoring, and mapping of Douglas-fir forests in western Oregon. Can. J. For. Res. 2011, 31, 78–87. [Google Scholar] [CrossRef]
  4. Lu, D.S.; Mausel, P.; Brondizio, E.S.; Moran, E. Relationships between forest stand parameters and Landsat TM spectral responses in the Brazilian Amazon Basin. For. Ecol. Manag. 2004, 198, 149–167. [Google Scholar] [CrossRef]
  5. Zheng, D.; Rademacher, J.; Chen, J.; Crow, T.; Bressee, M.; Moine, J.L.; Ryu, S. Estimating aboveground biomass using Landsat ETM+ data across a managed landscape in northern Wisconsin, USA. Remote Sens. Environ. 2004, 93, 402–411. [Google Scholar] [CrossRef]
  6. Hall, R.J.; Skakun, R.S.; Arsenault, E.J. Modeling forest stand structure attributes using Landsat ETM+ data: Application to mapping of aboveground biomass and stand volume. For. Ecol. Manag. 2006, 225, 378–390. [Google Scholar] [CrossRef]
  7. Mäkelä, H.; Pekkarinen, A. Estimation of forest stand volumes by Landsat TM imagery and stand-level field-inventory data. For. Ecol. Manag. 2004, 196, 245–255. [Google Scholar] [CrossRef]
  8. de la Cueva, A.V. Structural attributes of three forest types in central Spain and Landsat ETM+ information evaluated with redundancy analysis. Int. J. Remote Sens. 2008, 29, 5657–5676. [Google Scholar] [CrossRef]
  9. Holmgren, P.; Thuresson, T. Satellite remote sensing for forestry planning: A review. Scand. J. For. Res. 1998, 13, 90–110. [Google Scholar] [CrossRef]
  10. Trotter, C.M.; Dymond, J.R.; Goulding, C.J. Estimation of timber volume in a coniferous plantation forest using Landsat TM. Int. J. Remote Sens. 1997, 18, 2209–2223. [Google Scholar] [CrossRef]
  11. Donoghue, D.N.M.; Watt, P.J.; Cox, N.J.; Dunford, R.W.; Wilson, J.; Stables, S.; Smith, S. An evaluation of the use of satellite data for monitoring Sitka spruce plantation forest growth. Forestry 2004, 77, 383–396. [Google Scholar] [CrossRef]
  12. Miles, P.D. Forest Inventory EVALIDator Web-Application Version 1.5.1.04. USDA Forrest Service; North Centre Research Station: St. Paul, MN, USA, 2013; Available online: http://apps.fs.fed.us/Evalidator/tmprcPost.jsp (accessed on 8 June 2022).
  13. Fox, T.R.; Jokela, E.J.; Allen, H.L. The development of pine plantation silviculture in the Southern United States. J. For. 2007, 105, 337–347. [Google Scholar]
  14. Fagan, M.E.; Morton, D.C.; Cook, B.D.; Masek, J.; Zhao, F.; Nelson, R.F.; Huang, C. Mapping pine plantations in the southeastern U.S. using structural, spectral, and temporal remote sensing data. Remote Sens. Environ. 2018, 216, 415–426. [Google Scholar] [CrossRef]
  15. Blinn, C.E.; House, M.; Wynne, R.; Thomas, V.; Fox, T.; Sumnall, M. Landsat 8 based leaf area index estimation in loblolly pine plantations. Forests 2019, 10, 222. [Google Scholar] [CrossRef]
  16. Blinn, C.E.; Albaugh, T.J.; Fox, T.R.; Wynne, R.H.; Stape, J.L.; Rubilar, R.A.; Allen, H.L. A method for estimating deciduous competition in pine stands using Landsat. South. J. Appl. For. 2012, 36, 71–78. [Google Scholar] [CrossRef]
  17. Shoemaker, D.; Cropper, W. Prediction of leaf area index for southern pine plantations from satellite imagery using regression and artificial neural networks. In Proceedings of the 6th Southern Forestry and Natural Resources GIS Conference, Orlando, FL, USA, 24–26 March 2008. [Google Scholar]
  18. Kinane, S.M.; Montes, C.R.; Albaugh, T.J.; Mishra, D.R. A model to estimate leaf area index in loblolly pine plantations using Landsat 5 and 7 Images. Remote Sens. 2021, 13, 1140. [Google Scholar] [CrossRef]
  19. Sivanpillai, R.; Smith, C.T.; Srinivasan, R.; Messina, M.G.; Wu, X.B. Estimation of managed loblolly pine stand age and density with Landsat ETM+ data. For. Ecol. Manag. 2006, 223, 247–254. [Google Scholar] [CrossRef]
  20. Trim, K.R.; Coble, D.W.; Weng, Y.H.; Stovall, J.P.; Hung, I.K. A new site index model for intensively managed loblolly pine plantations in the West Gulf Coastal Plain. For. Sci. 2020, 66, 2–13. [Google Scholar] [CrossRef]
  21. Reineke, L.H. Perfecting a stand density index for even-aged forests. J. Agric. Res. 1933, 46, 627–638. [Google Scholar]
  22. Coble, D.W.; McTague, J.P.; Weng, Y.H. A Whole-Stand Growth and Yield Model for Intensively and Extensively Managed Loblolly Pine Plantations in East Texas Prior to First Thin; ETPPRP Rep. No. 67; Stephen, F., Ed.; Austin State University: Nacogdoches, TX, USA, 2017; p. 25. [Google Scholar]
  23. Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
  24. Schmidt, G.L.; Jenkerson, C.B.; Masek, J.; Vermote, E.; Gao, F. Landsat Ecosystem Disturbance Adaptive Processing System (LEDAPS) Algorithm Description; Open-File Report 2013–1057; U.S. Geological Survey: Sioux Falls, SD, USA, 2013; p. 17.
  25. Zhu, Z.; Woodcock, C.E. Object-based cloud and cloud shadow detection in Landsat imagery. Remote Sens. Environ. 2012, 118, 83–94. [Google Scholar] [CrossRef]
  26. Zhu, Z.; Wang, S.X.; Woodcock, C.E. Improvement and expansion of the Fmask algorithm: Cloud, cloud shadow, and snow detection for Landsats 4–7, 8, and Sentinel 2 images. Remote Sens. Environ. 2015, 159, 269–277. [Google Scholar] [CrossRef]
  27. Rouse, J.W.; Haas, R.H.; Deering, D.W.; Schell, J.A. Monitoring the Vernal Advancement and Retrogradation (Greenwave Effect) of Natural Vegetation; Progress Rep. Apr–Sep 73; National Aeronautics and Space Administration: Pasadena, CA, USA, 1974.
  28. Huete, A.; Didan, K.; Miura, T.; Rodriguez, E.P.; Gao, X.; Ferreira, L.G. Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sens. Environ. 2002, 83, 195–213. [Google Scholar] [CrossRef]
  29. Qi, J.G.; Chehbouni, A.R.; Huete, A.R.; Kerr, Y.H.; Sorooshian, S. A modified soil adjusted vegetation index. Remote Sens. Environ. 1994, 48, 119–126. [Google Scholar] [CrossRef]
  30. Kauth, R.J.; Thomas, G.S. The tasselled cap—A graphic description of the spectral-temporal development of agricultural crops as seen by Landsat. In Proceedings of the Symposium on Machine Processing of Remotely Sensed Data, West Lafayette, IN, USA, 29 June–1 July 1976; pp. 41–51. [Google Scholar]
  31. Haralick, R.M.; Shanmugam, K.; Dinstein, I. Textural features for image classification. IEEE Trans. Syst. Man Cybern. 1973, 3, 610–621. [Google Scholar] [CrossRef]
  32. Conners, R.W.; Harlow, C.A. Toward a structural textural analyzer based on statistical methods. Comput. Graph. Image Process. 1980, 12, 224–256. [Google Scholar] [CrossRef]
  33. Coburn, C.A.; Roberts, A.C.B. A multiscale texture analysis procedure for improved forest stand classification. Int. J. Remote Sens. 2004, 25, 4287–4308. [Google Scholar] [CrossRef]
  34. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  35. Lou, X.W.; Weng, Y.H.; Fang, L.M.; Gao, H.L.; Grogan, J.; Hung, I.K.; Oswald, B.P. Predicting stand attributes of loblolly pine in West Gulf Coastal Plain using gradient boosting and random forests. Can. J. For. Res. 2021, 51, 807–816. [Google Scholar] [CrossRef]
  36. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. 2013. Available online: http://www.R-project.org/ (accessed on 20 June 2022).
  37. Liaw, A.; Wiener, M. Classification and regression by random forest. R News 2002, 2, 18–22. [Google Scholar]
  38. Westfall, J.A.; Burkhart, H.E.; Allen, H.L. Young stand growth modeling for intensively-managed loblolly pine plantations in southeastern US. For. Sci. 2004, 50, 823–835. [Google Scholar]
  39. Puhr, C.B.; Donoghue, D.N.M. Remote sensing of upland conifer plantations using Landsat TM data: A case study from Galloway, south-west Scotland. Int. J. Remote Sens. 2000, 21, 633–646. [Google Scholar] [CrossRef]
  40. Silva, C.A.; Klauberg, C.; Hudak, A.T.; Vierling, L.A.; Fennema, S.J.; Corte, A.P.D. Modeling and mapping basal area of Pinus taeda L. plantation using airborne LiDAR data. An. Acad. Bras. Ciênc. 2017, 89, 1895–1905. [Google Scholar] [CrossRef]
  41. Gebreslesie, M.T.; Ahmed, F.B.; van Aardt, J. Estimating plot-level forest structural attributes using high spectral resolution ASTER satellite data in even-aged Eucalyptus plantations in southern KwaZulu-Natal, South Africa. South. For. 2008, 70, 227–236. [Google Scholar] [CrossRef]
  42. Weiskittel, A.R.; Crookston, N.L.; Radtk, P.J. Linking climate, gross primary productivity, and site index across forests of the western United States. Can. J. For. Res. 2011, 41, 1710–1721. [Google Scholar] [CrossRef]
  43. Sabatia, C.O.; Burkhart, H.E. Predicting site index of plantation loblolly pine from biophysical variables. For. Ecol. Manag. 2014, 326, 142–156. [Google Scholar] [CrossRef]
  44. Brown, S.; Narine, L.L.; Gilbert, J. Using airborne lidar, multispectral imagery, and field inventory data to estimate basal area, volume, and aboveground biomass in heterogeneous mixed species forests: A case study in southern Alabama. Remote Sens. 2022, 14, 2708. [Google Scholar] [CrossRef]
  45. Ou, G.L.; Li, C.; Lv, Y.Y.; Wei, A.C.; Xiong, H.X.; Xu, H.; Wang, G.X. Improving aboveground biomass estimation of Pinus densata forests in Yunnan using Landsat 8 imagery by incorporating age dummy variable and method comparison. Remote Sens. 2019, 11, 738. [Google Scholar] [CrossRef]
  46. Jakubauskas, M.E.; Price, K.P. Empirical relationships between structural and spectral factors of Yellowstone lodgepole pine forests. Photogramm. Eng. Remote Sens. 1997, 63, 1375–1381. [Google Scholar]
  47. Grabska, E.; Socha, J. Evaluating the effect of stand properties and site conditions on the forest reflectance from Sentinel-2 time series. PLoS ONE 2021, 16, e0248459. [Google Scholar] [CrossRef]
  48. Lu, D.S. Aboveground biomass estimation using Landsat TM data in the Brazilian Amazon. Int. J. Remote Sens. 2005, 26, 2509–2525. [Google Scholar] [CrossRef]
  49. Gao, Y.Q.; Lu, D.S.; Li, G.Y.; Wang, G.X.; Chen, Q.; Liu, L.J.; Li, D.Q. Comparative analysis of modeling algorithms for forest aboveground biomass estimation in a subtropical region. Remote Sens. 2018, 10, 627. [Google Scholar] [CrossRef]
  50. Cook, E.A.; Iverson, L.R.; Graham, R.L. Estimating forest productivity with Thematic Mapper and biogeographical data. Remote Sens. Environ. 1989, 28, 131–141. [Google Scholar] [CrossRef]
  51. Zhu, X.L.; Liu, D.S. Improving forest aboveground biomass estimation using seasonal Landsat NDVI time-series. ISPRS J. Photogramm. Remote Sens. 2015, 102, 222–231. [Google Scholar] [CrossRef]
  52. Shamsoddini, A.; Trinder, J.C.; Turner, R. Pine plantation structure mapping using WorldView-2 multispectral image. Int. J. Remote Sens. 2013, 34, 3986–4007. [Google Scholar] [CrossRef]
Figure 1. Geographic location of field permanent plots of loblolly plantations.
Figure 1. Geographic location of field permanent plots of loblolly plantations.
Remotesensing 14 04702 g001
Figure 2. Relationships between observed and predicted values based on the independent test dataset of the 1st cycle measurement: multiple linear regression models on left column and random forests models on the right column.
Figure 2. Relationships between observed and predicted values based on the independent test dataset of the 1st cycle measurement: multiple linear regression models on left column and random forests models on the right column.
Remotesensing 14 04702 g002aRemotesensing 14 04702 g002b
Figure 3. Relationships between observed and predicted values based on the independent test dataset of the 2nd cycle measurement: multiple linear regression models on left column and random forests models on the right column.
Figure 3. Relationships between observed and predicted values based on the independent test dataset of the 2nd cycle measurement: multiple linear regression models on left column and random forests models on the right column.
Remotesensing 14 04702 g003aRemotesensing 14 04702 g003b
Figure 4. Relationships between observed and predicted values based on the independent test dataset of the 3rd cycle measurement: multiple linear regression models on left column and random forests models on the right column.
Figure 4. Relationships between observed and predicted values based on the independent test dataset of the 3rd cycle measurement: multiple linear regression models on left column and random forests models on the right column.
Remotesensing 14 04702 g004aRemotesensing 14 04702 g004b
Table 1. Summary statistics of field reference data by cycle.
Table 1. Summary statistics of field reference data by cycle.
CycleAttributeMeanStandard DeviationNMinimumMaximum
1 aAge (years)4.461.79116211
HT (m)5.372.111162.0513.28
DBH (cm)7.113.341162.0515.07
BA (m2 ha−1)6.615.711160.2628.14
NT (trees ha−1)1289.72267.67116828.822120.49
SDI194.21143.9911615.42702.10
2Age (years)6.931.93125314
HT (m)8.982.151253.6014.97
DBH (cm)12.112.721253.7918.48
BA (m2 ha−1)16.086.491251.5232.7
NT (trees ha−1)1298.38255.29125818.062098.96
SDI413.28144.3412558.34790.92
3 bAge (years)9.941.97121617
HT (m)11.672.181216.6721.26
DBH (cm)15.162.531219.1527.54
BA (m2 ha−1)23.316.101218.1836.10
NT (trees ha−1)1263.74291.31121365.972066.67
SDI557.51129.71121225.76859.39
a 7 plots with ages between 8 to 11 years; b 4 plots have been thinned once.
Table 2. Path/Row and dates of acquiring Landsat image data.
Table 2. Path/Row and dates of acquiring Landsat image data.
YearSensorPath/Row
P24/R38P24/R39P25/R38P25/R39P26/R38
2004ETM+9-August24-July15-July16-August7-August
2005ETM+11-July11-July19-August2-July26-August
2006TM7-August22-July30-August30-August5-August
2007ETM+2-August2-August9-August9-August15-July
2008ETM+4-August4-August26-July26-July1-July
2009TM15-August14-July22-August22-August12-July
2010TM2-August2-August9-August25-August31-July
2011TM5-August5-August28-August28-August3-August
2012ETM+30-July30-July22-August5-July28-July
2013OLI9-July9-July17-August17-August8-August
Table 3. The variables derived from the Landsat series images.
Table 3. The variables derived from the Landsat series images.
VariablesDescriptionNumber of Variables
Original bandBand 2 (450–520 nm): BLUE, band 3 (520–560): GREEN, band 4 (630–690): RED, band 5 (770–900): Near Infrared (NIR), band 6 (1550–1750): Shortwave Infrared 1 (SWIR1), band 7 (2080–2350): Shortwave Infrared 2 (SWIR2)6
Vegetation indicesNormalized difference vegetation index (NDVI = (NIR − RED)/(NIR + RED)) [27]; Enhanced vegetation index (EVI = (NIR − RED)/(NIR + 6 (RED) − 7.5 (BLUE) + 1)) [28]; Modified soil adjusted vegetation index (MSAVI = NIR + 0.5 − ( NIR   + 0.5 2 2 × NIR     R ) 1 2 ) [29]3
Tasseled Cap TransformationBrightness, Greenness, Wetness [30]3
Texture measuresGrey-level co-occurrence matrix-based texture measures including the angular second moment (ASM), contrast (CONTRAST), correlation (CORR), variance (VAR), inverse difference moment (IDM), sum average (SAVG), sum variance (SVAR), sum entropy (SENT), entropy (ENT), difference variance (DVAR), difference entropy (DENT), information measure of Corr.1 (IMCORR1), information measure of Corr.2 (IMCORR2), max Corr. Coefficient (MAXCORR), DISSimilarity (DISS), INERTIAtia (INERTIATIA), cluster shade (SHADE) and cluster PROMinence (PROM) features of NDVI for pixel size with 5 × 5 [31,32]18
Table 4. Final multiple linear regression models by cycle and attribute including model predictors, model goodness of fit on the training data, and model validation on the test data.
Table 4. Final multiple linear regression models by cycle and attribute including model predictors, model goodness of fit on the training data, and model validation on the test data.
CycleAttributePredictorsTrainingTest
R2RMSER2MAE
1HTSWIR1, RED0.681.220.660.87
DBHSWIR1, NIR0.612.110.611.49
BASWIR1, RED, ASM, IDM0.723.230.741.77
NTDISS, Brightness, DVAR, NIR0.19237.080.19184.65
SDISWIR1, RED, ASM, IDM0.7478.160.7952.49
2HTSWIR10.431.680.351.20
DBHSWIR1, SAVG, BLUE0.453.430.362.57
BASWIR1, NDVI, SAVG0.464.590.523.63
NTNIR, BLUE, Brightness0.08236.230.10182.66
SDISWIR1, NDVI, SHADE0.48102.560.5278.71
3HTNANANANANA
DBHRED, Wetness, SAVG, Brightness0.082.470.261.48
BASWIR1, RED, NDVI0.305.240.144.23
NTSWIR1, VAR, PROM0.17253.150.23180.78
SDISWIR1, RED0.21117.130.3381.83
Table 5. Relative importance of predictors with %IncMSE ≥ 2 by random forests models using the training dataset for the 1st cycle measurement.
Table 5. Relative importance of predictors with %IncMSE ≥ 2 by random forests models using the training dataset for the 1st cycle measurement.
HTDBHBASDINT
Variables%IncMSEVariables%IncMSEVariables%IncMSEVariables%IncMSEVariables%IncMSE
SWIR128.35SWIR126.46SWIR130.42SWIR133.57MSAVI9.31
SWIR219.37SWIR218.46SWIR223.40SWIR221.45NIR8.76
Wetness11.99Wetness12.86RED11.45RED12.13EVI7.58
RED11.01RED12.32Brightness11.36Wetness9.12NDVI6.84
GREEN7.77IMCORR18.24GREEN6.73Brightness7.58Greenness6.30
ASM6.38GREEN7.24NDVI6.29SAVG5.58IDM6.24
SENT5.73IMCORR26.03Wetness5.56NDVI4.41ENT4.78
IMCORR25.67SENT5.73ASM4.53ASM3.80Wetness4.37
SAVG5.18ASM5.23SAVG4.34SVAR3.65PROM3.57
Brightness4.83Brightness4.85Greenness4.01IMCORR23.56Brightness3.52
Greenness4.67SAVG4.64SENT3.97GREEN3.47DISS3.36
IMCORR14.35Greenness4.44IDM3.28SENT3.15SWIR13.31
NDVI3.94VAR3.77IMCORR12.82PROM2.26IMCORR22.44
DENT3.35BLUE2.46SVAR2.13ENT2.25SWIR22.38
ENT2.86MSAVI2.12EVI2.00IMCORR12.10SHADE2.19
BLUE2.48ENT2.09 VAR2.10DVAR2.16
IDM2.17 INERTIATIA2.14
MSAVI2.01
Table 6. Relative importance of predictors with %IncMSE ≥ 2 by random forests models using the training dataset for the 2nd cycle measurement.
Table 6. Relative importance of predictors with %IncMSE ≥ 2 by random forests models using the training dataset for the 2nd cycle measurement.
HTDBHBASDINT
Variables%IncMSEVariables%IncMSEVariables%IncMSEVariables%IncMSEVariables%IncMSE
SWIR123.97SWIR125.05SWIR124.19SWIR124.11NIR10.52
SWIR215.20SWIR211.07SWIR217.75SWIR220.45EVI7.95
SAVG7.23RED9.40Wetness7.94Wetness8.48SAVG5.30
VAR6.58SAVG8.67RED6.14IMCORR18.45MSAVI4.91
MSAVI6.19NIR6.66SAVG5.69IMCORR26.33IMCORR14.79
GREEN5.80BLUE5.71IMCORR15.02RED5.78SVAR3.92
NIR5.24VAR5.44PROM4.67VAR5.43SENT3.47
Wetness4.42PROM4.68IDM4.03SENT5.35DENT3.42
INERTIA3.99SVAR4.60IMCORR23.81NDVI3.82IMCORR23.19
DISS3.89Wetness4.52INERTIA3.37DENT2.45DVAR2.75
PROM3.76GREEN4.07Brightness3.22BLUE2.17DISS2.44
DVAR3.73INERTIA4.02NDVI3.15 INERTIA2.40
RED3.48IMCORR23.54CONTRAST3.11 CONTRAST2.03
DENT3.34DISS3.49VAR2.73
NDVI3.12DENT3.22DVAR2.71
SVAR2.61NDVI3.06SVAR2.71
CONTRAST2.56SHADE3.03SENT2.54
EVI2.53MSAVI3.02SHADE2.35
SHADE2.49IDM2.85DISS2.29
BLUE2.19Brightness2.72
CONTRAST2.18
Table 7. Relative importance of predictors with %IncMSE ≥ 2 by random forests models using the training dataset for the 3rd cycle measurement.
Table 7. Relative importance of predictors with %IncMSE ≥ 2 by random forests models using the training dataset for the 3rd cycle measurement.
HTDBHBASDINT
Variables%IncMSEVariables%IncMSEVariables%IncMSEVariables%IncMSEVariables%IncMSE
RED6.18DISS8.58SWIR112.53SWIR116.91SWIR112.02
DVAR5.64SWIR26.09Wetness9.60Wetness14.43IDM8.96
DISS5.42CONTRAST5.40SWIR26.36ENT6.95SWIR27.13
SWIR15.22SWIR15.37RED5.96SWIR25.96ENT5.83
INERTIA4.89INERTIA4.98NDVI5.64IMCORR25.74ASM4.84
CONTRAST4.80IMCORR24.61ENT5.01NDVI4.17IMCORR14.64
Wetness4.18ASM4.53BLUE4.41ASM3.88MSAVI4.38
SWIR23.94DVAR4.13DISS4.08RED3.81EVI4.05
IDM3.88NDVI4.08GREEN3.62IDM3.71NDVI3.77
GREEN3.53DENT4.06IMCORR23.39EVI3.53Wetness3.64
IMCORR23.43RED4.04DENT2.63CORR3.36RED3.32
Greenness3.32IMCORR13.88INERTIA2.35IMCORR13.09NIR3.16
VAR2.85SENT3.88IDM2.29DVAR2.50SENT3.11
EVI2.67MSAVI3.87SVAR2.17DISS2.25VAR3.09
PROM2.64ENT3.67ASM2.06VAR1.97DENT3.05
NDVI2.38SHADE3.65 SVAR2.54
BLUE2.34GREEN3.49 PROM2.40
MSAVI2.10IDM3.44 IMCORR22.02
IMCORR12.08EVI3.03
CORR2.93
Wetness2.59
VAR2.39
SVAR2.36
Table 8. Final random forests models by cycle and stand attribute including model goodness-of-fit on the training data and model validation on the test data.
Table 8. Final random forests models by cycle and stand attribute including model goodness-of-fit on the training data and model validation on the test data.
CycleAttributeTrainTest
R2RMSER2MAE
1HT0.681.330.710.84
DBH0.652.200.771.34
BA0.693.400.732.06
NT0.16231.110.25192.75
SDI0.7080.490.7260.20
2HT0.411.740.161.36
DBH0.422.220.271.38
BA0.365.360.293.89
NT0.08253.110.06154.07
SDI0.42114.160.4685.16
3HT0.062.460.081.61
DBH0.132.780.131.85
BA0.076.250.054.98
NT0.08288.810.09222.83
SDI0.25110.080.1072.29
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chen, C.; Wang, K.; Fang, L.; Grogan, J.; Talmage, C.; Weng, Y. Landsat Data Based Prediction of Loblolly Pine Plantation Attributes in Western Gulf Region, USA. Remote Sens. 2022, 14, 4702. https://doi.org/10.3390/rs14194702

AMA Style

Chen C, Wang K, Fang L, Grogan J, Talmage C, Weng Y. Landsat Data Based Prediction of Loblolly Pine Plantation Attributes in Western Gulf Region, USA. Remote Sensing. 2022; 14(19):4702. https://doi.org/10.3390/rs14194702

Chicago/Turabian Style

Chen, Chongzhi, Ke Wang, Luming Fang, Jason Grogan, Clinton Talmage, and Yuhui Weng. 2022. "Landsat Data Based Prediction of Loblolly Pine Plantation Attributes in Western Gulf Region, USA" Remote Sensing 14, no. 19: 4702. https://doi.org/10.3390/rs14194702

APA Style

Chen, C., Wang, K., Fang, L., Grogan, J., Talmage, C., & Weng, Y. (2022). Landsat Data Based Prediction of Loblolly Pine Plantation Attributes in Western Gulf Region, USA. Remote Sensing, 14(19), 4702. https://doi.org/10.3390/rs14194702

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop