Next Article in Journal
Non-Cognitive Skills and Farmers’ Entrepreneurial Performance: Evidence from Chinese Family Panel Studies
Previous Article in Journal
A Diameter Measurement Method of Red Jujubes Trunk Based on Improved PSPNet
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimating Yield-Related Traits Using UAV-Derived Multispectral Images to Improve Rice Grain Yield Prediction

Graduate School of Bioagricultural Sciences, Nagoya University, Furo-Cho, Chikusa-Ku, Nagoya 464-8601, Japan
*
Author to whom correspondence should be addressed.
Agriculture 2022, 12(8), 1141; https://doi.org/10.3390/agriculture12081141
Submission received: 5 July 2022 / Revised: 26 July 2022 / Accepted: 30 July 2022 / Published: 2 August 2022
(This article belongs to the Section Agricultural Technology)

Abstract

:
Rice grain yield prediction with UAV-driven multispectral images are re-emerging interests in precision agriculture, and an optimal sensing time is an important factor. The aims of this study were to (1) predict rice grain yield by using the estimated aboveground biomass (AGB) and leaf area index (LAI) from vegetation indices (VIs) and (2) determine the optimal sensing time in estimating AGB and LAI using VIs for grain yield prediction. An experimental trial was conducted in 2020 and 2021, involving two fertility conditions and five japonica rice cultivars (Aichinokaori, Asahi, Hatsushimo, Nakate Shinsenbon, and Nikomaru). Multi-temporal VIs were used to estimate AGB and LAI throughout the growth period with the extreme gradient boosting model and Gompertz model. The optimum time windows for predicting yield for each cultivar were determined using a single-day linear regression model. The results show that AGB and LAI could be estimated from VIs (R2: 0.56–0.83 and 0.57–0.73), and the optimum time window for UAV flights differed between cultivars, ranging from 4 to 31 days between the tillering stage and the initial heading stage. These findings help researchers to save resources and time for numerous UAV flights to predict rice grain yield.

1. Introduction

Precision agriculture involves real-time crop monitoring to aid in necessary field management intervention to ensure a good harvest. This is made possible by the innovation of using unmanned aerial vehicles (UAVs) in the field. The images taken from the UAV serve as an alternative parameter to actual plant traits such as plant height [1,2,3], leaf area index (LAI) [4,5,6], and aboveground biomass (AGB) [7,8,9]. In recent years, this type of remote sensing has also been applied in creating prediction models using machine learning algorithms [10,11,12].
Rice grain yield prediction presents several benefits. With the market demand, the basis for the import and export inventories in a district is the grain yield estimates [13]. Yield forecast, to some extent, can be a strategy for crop insurance providers in evaluating if a farm is entitled to an indemnity due to low production. Local agricultural officers can use grain yield prediction to determine if the current crop management program in the area is bridging the yield gap while considering normal year-to-year variation. Inter-annual variations in rice grain yield can be attributed to year-to-year climate variability such as anomalous weather. The increase in nighttime temperature and longer hot days are expected to affect rice productivity. The presence of cloud cover, reduced solar radiation and change in fertility application can influence grain yield variation between years [14,15].
These situations require reliable results yet with a short-term process. The use of UAVs presents an advantage, as it provides higher spatial–temporal resolution than the use of satellites, allowing for more precise crop monitoring than the situations mentioned above.
However, rice grain yield prediction models using UAVs are not without difficulty. The need to acquire data during certain rice growth stages, which are important for yield determination, and differences in observed phenotype trends due to differences in plant architecture among rice cultivars are inevitable problems in yield prediction by remote sensing.
Previous research has shown that the use of vegetation indices (VIs) to predict grain yield resulted in a good coefficient of determination (R2) when yield–VI models were incorporated with other variables such as phenological data [16,17] and other plant parameters such as days to maturity, plant height [9], and ground coverage [18,19]. Better results were also observed when different VIs [20,21] and multi-temporal VIs [22,23] were incorporated into the model. These prediction models work on the hypothesis that grain yield can be predicted using plant parameters and weather data that indicate the plant’s physiological efficiency during grain filling. Additionally, increasing the predictors may improve the determination efficiency to a certain number of terms. Combining VIs with texture indices measured at the booting stage had comparable results to VIs measured at the booting and heading stages [24,25].
However, incorporating these parameters did not always result in better prediction. In particular, the inclusion of growing degree-days did not improve the model compared to using the NDVI alone [26]. Rice prediction models use a generalized optimal sensing time and multi-temporal Vis to predict grain yield, and their performances depend on the accuracy of monitoring certain rice growth stages that are known to be important in grain yield determination such as panicle initiation and flowering stages. Sensing beyond panicle differentiation resulted in a lower rice grain yield prediction than during the interval between panicle initiation and differentiation [22,26].
Other research has also shown that the prediction method is more important than the sample size, with the random forest model being the best method in most cases. The sample size can influence the performance of cross-validated prediction models, and a small sample size is used in a high number of k-fold cross-validations, where a high R2 in the test dataset can be observed, resulting in a biased result [9,17,27].
The target cultivar used in the prediction is also crucial. The importance of VIs is often evaluated on a case-to-case basis using correlation analyses and variable importance plots, meaning that a particular VI may show a linear relationship in one study but not in another study. Using VIs that can represent the morphological differences among cultivars can improve the variability aspect of the prediction model [28]. Grain yield prediction models for different cultivars have attempted to avoid high variance in prediction model research [29].
The results of the above experiments indicate that the current UAV-based grain yield prediction model consists of multi-temporal VIs, other image-derived variables, and the selection of the best prediction model. Little attention has been paid to estimated plant parameters using UAVs and these estimates to predict rice grain yield.
Our study hypothesized that reasonable estimates of individual rice plant AGB and LAI throughout the growth period from UAV-derived data could predict grain yield without incorporating the actual plant parameters measured manually. The extreme gradient boosting algorithm (XGBoost), a robust ensemble machine learning algorithm, was chosen to estimate AGB and LAI. Previous research has shown that XGBoost outperformed other machine learning algorithms in the yield forecasting of several crops [30,31]. It can also determine trends in crop-yield-related parameters such as rainfall and temperature [32]. XGBoost uses a gradient boosting method wherein the model learns from the weak learners and corrects the mistake in the prediction while regularizing the model due to complexity. Compared to the simple random forest and ridge regression, XGBoost has both strengths of regularization (ridge) and decision trees (random forest). The grain yield prediction using the estimated AGB and LAI depends on the estimation accuracy of AGB and LAI first. Using predicted values to predict another variable poses an inherent prediction error but can be mitigated using a model that can bode well.
Once the AGB and LAI are estimated using VIs, the AGB and LAI dynamics throughout the growth duration can be calculated using the developed estimation models and the Gompertz growth curve. A sigmoid curve such as the Gomperz growth curve is often used to describe the plant weight and LAI as a function of time [33,34]. The Gompertz curve has also been applied in determining a relationship between plant dry matter and vegetation cover derived from computer vision techniques [35].
When the dynamics have been estimated, the optimum time for UAV flights can be determined. This is to optimize the UAV data acquisition by avoiding unnecessary and redundant UAV flights for AGB and LAI estimation that can be used for grain yield prediction. The proposed prediction method was developed for each of the cultivars tested in the study. This study attempted to build a two-step grain yield prediction model by estimating the in-season AGB and LAI using correlated VIs and then utilizing these estimated parameters to predict grain yield. This study also aimed to determine the optimum time window for UAV flights.

2. Materials and Methods

The workflow of the study consisted of the following: (1) data collection and processing of AGB, LAI, grain yield, and UAV spectral data for the years 2020 and 2021, (2) establishment of baseline grain yield prediction, and (3) estimation models for AGB and LAI to be used as predictors for the GY prediction model. The details are described in Figure 1.

2.1. Study Area

The study site is located at the Togo Field, Field Science Research Center of the Nagoya University in Hatajiri, Morowa District, Togo, Aichi Prefecture, Japan (35°06′37.1″ N 137°04′59.4″ E) (Figure 2).
Aichi Prefecture belongs to a humid subtropical climate zone. During the growing season, the average monthly total sunshine duration is 165.38 h, the average monthly total precipitation is 223.35, and the average daily temperature is 25.7 °C [36]. The primary soil type is ultisols.

2.2. Experimental Design

Two experimental trials, namely, a trial with no fertilizer and a trial with fertilizer, were conducted in 2020 and 2021, involving five japonica cultivars with different heading times and plant architectures, Aichinokaori, Asahi, Hatsushimo, Nakate Shinsenbon, and Nikomaru (Figure 2). These cultivars were divided into two groups at the study site: the early-heading group (Aichinokaori, Nakate Shinsenbon, and Nikomaru) and the late-heading group (Asahi and Hatsushimo). Nakate Shinsenbon has a large number of small-sized panicles with a short plant stature, Nikomaru has a small number of large-sized panicles with a tall plant stature, and Aichinokaori, Asahi, and Hatsushimo have an intermediate stature. Basal fertilization was carried out according to local practices for the fertilizer trials, and top-dressing was not performed for both trials. The amount of fertilizer applied in the fertilizer plot was 48 kg N, 51 kg P2O5, and 53 kg K2O per hectare. The plot size was 1.68 m × 5.52 m with a 24 cm plant spacing. The experiments were laid out in a randomized complete block design with three replications.

2.3. Field Data Collection

AGB was sampled for each cultivar at different growth stages: (1) tillering stage, (2) stem elongation, (3) booting stage and (4) heading stage. The tillering stage is referred to as the late tillering stage (V8), the stem elongation stage is when the panicle branches have formed (R1), the booting stage is the flag leaf collar formation (R2), and the heading stage is when one or more florets of the main stem panicle achieve anthesis (R4) [37].
Sampling was conducted after the UAV flight for spectral data collection (Table 1). Four adjacent hills of rice plants were sampled in each plot. The number of tillers per sampled hill was counted and separated into leaves and stems. The detached leaves were used to determine the LAI using an leaf area meter, AAM-9 (Hayashi Denko Co., Ltd., Tokyo, Japan). After measuring the leaf area, the detached plant parts were oven-dried at 70 °C for 48 h until a constant weight was attained and weighed. LAI determination was carried out only until the booting stage. Grain yield was measured at physiological maturity. Eight hills per plot were harvested. The grains were cleaned and weighed, and the grain yield was converted to ton/ha.

2.4. UAV Image Acquisition and Preprocessing

This study utilized a UAV system: Matrice 210 RTK v2 with a Zenmuse X7 50 mm camera (DJI, Shenzhen, Guangdong, China) and Micasense RedEdge-MX sensors on board. The multispectral bands used in this study were the following: blue (475 ± 32 nm), green (560 ± 27 nm), red (668 ± 14 nm), red edge (717 ± 12 nm), and near-IR (842 ± 57 nm). The flight altitude was 20.0 m with a forward overlap of 80% and a side overlap of 75%. The shooting mode was hovering with a flight speed of 1.2 m/s. The approximate resolution was 0.20 cm/pix.
For each UAV flight, the sensors were calibrated using a calibrated reflectance panel (CRP) provided by the sensor manufacturer according to its instructions. UAV flights were conducted between 10:00 am and 4:00 pm under windless and clear-sky conditions once per week until the heading stage. The number of UAV flights used in this study was 11 in 2020 and 17 in 2021.
The acquired images were orthomosaicked using the Pix4D mapper software (Pix4D SA, Prilly, Switzerland) to generate the spectral reflectance images of the experimental plots and georeferenced using ground control points.
An orthomosaicked true (RGB) image was used to create a region of interest (ROI). A shapefile consisting of circular polygons with a 10 cm radius corresponding to each hill of rice plants served as the ROI.
The reflectance images were then rasterized using the ‘raster’ [38] package in R. The orthomosaicked reflectance images were clipped into smaller raster images covering the experimental plot and cut for individual hills using the ROI shapefile. The mean values of reflectance from the clipped hill images were calculated. A total of 24 vegetation indices (Table A1) were calculated using different combinations of the mean values of the five spectral reflectance images.

2.5. Data Analysis

Data from rice growing seasons (2020 and 2021) were pulled together to comprise the whole dataset for the study—the study was composed of three prediction models for each cultivar. To establish a baseline for predicting grain yield by AGB and LAI, a linear regression model was conducted using repeated k-fold cross-validation (k = 3) provided by the ‘caret’ package in R [39]. A regression model for each growth stage was conducted to determine which growth stage can best predict grain yield. The growth stage corresponding to the model with the lowest RMSE in the model evaluation using an independent test dataset was considered as the optimum growth stage for predicting grain yield using AGB and LAI.
A correlation analysis was performed between AGB, LAI, and VIs using the ‘corrplot’ [40] package in R, and VIs were chosen as the feature variable for further XGBoost modeling. Since the XGBoost algorithm cannot handle high-dimensional data, the top ten correlated VIs were used to estimate AGB and LAI.
The ‘xgboost’ package in R [41] was used to train the XGBoost model to estimate AGB and LAI. A hold-out method was used with 70% as the training dataset and 30% as the independent test dataset. With the training dataset, the model was trained by a repeated k-fold cross-validation method (k = 3, 10 times). The optimum estimation model was determined using the lowest RMSE as the evaluation metric criterion.
The XGBoost algorithm is a boosting ensemble machine learning algorithm combining boosted trees’ features and the gradient descent method with regularization functions. In the tree-based learning method, a prediction is made by creating a regression tree based on a decision tree score. Therefore, correlated features do not necessarily influence the performance of XGBoost. Then, the weak learners in the regression tree are given more weight in the next iteration. This occurs by incorporating the prediction of the residuals or errors of the prior regression tree into the new regression tree. Then, a regularization parameter is included in the new regression tree to account for the increase in the complexity of the model. The gradient descent method minimizes the cost function, which measures the error between the predicted and observed values.
To further explain the prediction process, feature importance was summarized using SHAP (SHapley Additive exPlanations) from the ‘SHAPforxgboost’ package in R [42]. The independent test dataset was evaluated using the normalized RMSE.
To achieve an optimized prediction model using XGBoost, there are several tuning parameters that can be explored as described by [41,43]. This study attempted to optimize the following parameters: (1) max depth, which is the depth of the decision tree, which means a tree must have features equal to the number of depths to create a prediction; (2) colsample by tree, which is the number of features supplied to a tree.
The estimation models for AGB and LAI were then utilized to predict the AGB and LAI across the growth period using the VIs calculated from other UAV flights not used in the training data for the estimation models. A predicted value corresponding to the extracted ROI of one hill was eliminated when it was less than 85% of the preceding predicted value and the records with less than 3 predicted values were removed from the next analyses. The remaining predicted AGB and LAI for each hill were fitted into the Gompertz growth curve using the ‘drc’ package in R [44] to interpolate AGB and LAI values at 1 to 100 DAT using 0.1 as the starting value (DAT 0) for AGB and LAI. Then, to determine which days were the optimum time window for conducting a UAV flight, a single-day linear regression model between grain yield and the predicted AGB and LAI was conducted using the ‘caret’ package in R [34] by a repeated k-fold cross-validation method (k = 3, 10 times). In total, 960 records of AGB and grain yield and 720 records of LAI were used in our study, but only 48 were used for each stage per cultivar. Training single-day linear regression models with a limited number of data can lead to overfitting and inaccurate results. To expand the yield data quantitatively and to eliminate the chance success produced by bias in data partitioning, we conducted a trial of 10 repeats of 3 CVs. The Akaike information criteria (AICc) score was computed for each regression model using the ‘AICcmodavg’ package in R [45]. The AIC weights were then calculated for each of the 100 regression models based on the regression model with the lowest AIC score. The DAT values corresponding to the regression models with an AIC weight of fewer than 1.0 units were considered the optimum time window for the UAV flights. The single-day regression models that fall in this criterion are considered the models that can best estimate the target variable, grain yield. A smoothing method, namely, LOESS (locally weighted scatterplot smoothing), in the ‘ggplot2’ [46] package in R was used to determine the trend in the single-day regression series. In this smoothing method, the R2 for a single-day regression was recalculated based on the R2 values of the nearby R2 data points. The nearby data points were set at 50% of the total R2 data points.

3. Results

3.1. Relationship of AGB and LAI with Grain Yield

Five rice cultivars with different architectures and heading timings were used in this study and grown at different fertilizer levels to ensure diversity in the data. The AGB, LAI, and grain yields differed between the five rice cultivars on a hill basis (Figure 3 and Figure 4), suggesting that the grain yield prediction model for each cultivar is reasonable. In 2020, there were no significant varietal differences in AGB as observed from the high AGB variability within a cultivar while in 2021, and varietal differences in AGB were observed with less AGB variability within a cultivar. Conversely, the AGB between cultivars in the fertilizer trial showed similar AGB variability patterns in both years. AGB was also higher in the fertilizer trial compared to the trial with no fertilizer, regardless of years.
The range of LAI values per growth stage was similar among cultivars in the fertilizer trial except with Aichinokaori in 2020 and Nakate Shinsenbon in 2021 (Figure 3e,f). Higher LAI values were observed among cultivars in 2020 than in 2021. LAI measured from the fertilizer trial was also higher than in the trial with no fertilizer. Higher LAI variability within a cultivar was also observed in the fertilizer trial, especially at later growth stages (Figure 3g,h).
Grain yields across cultivars showed slight variation in the trial with no fertilizer compared to grain yields in the fertilizer trial. In 2020, Aichinokaori had a lower grain yield than Hatsushimo and Nikomaru, while in 2021, Hatsushimo had a higher grain yield than Nakate Shinsenbon. Aichinokaori had the lowest grain yield across trials and years (Figure 4a–d).
Simple linear regression modeling between grain yield and the actual AGB and LAI was conducted to determine the varietal difference and to establish a baseline for predicting grain yield by AGB and LAI. Grain yield could be predicted using the actual AGB and LAI by accounting for the 19.0% to 53.0% variation in grain yield (Figure 5). In this study, grain yield could be optimally predicted from the actual AGB and LAI at the stem elongation stage for all the cultivars used except for Nikomaru in this study. From these results, it may be possible to develop a model to estimate yield using LAI and AGB at a particular timing.

3.2. AGB and LAI Estimation

Based on the results of the yield estimation model using the actual LAI and AGB, it was thought that LAI and AGB could be used for yield estimation. For the estimation of LAI and AGB using UAVs, a model was developed to estimate AGB and LAI from VIs obtained from the flights.
In this study, 24 VIs were calculated from reflectance images, but since there were too many to include all of them in the XGBoost model to estimate LAI and AGB, VIs were selected to be used as explanatory variables. To select the VIs for AGB and LAI estimation modeling, a correlation analysis was performed between AGB, LAI, and VIs (Table A2 and Table A3). The AGB of Nakate Shinsenbon had the highest range of the top ten (10) correlated VIs (R2: 0.772–0.806), followed by the AGB of cultivar Nikomaru (R2: 0.679–0.752). Medium levels of correlation were found between the VIs and AGB of cultivars Asahi (R2: 0.538–0.698) and Aichinokaori (R2: 0.528–0.621). The AGB of cultivar Hatsushimo showed the lowest correlation (R2: 0.447–0.631) (Table A2).
The LAI was positively correlated with the different VIs (Table A3). The correlation coefficient of the top ten (10) VIs highly correlated with the LAI of Nakate Shinsenbon ranged from 0.772 to 0.818 in R2. Medium levels of correlation were found for Asahi (R2: 0.681–0.805), Hatsushimo (R2: 0.665–0.810), and Nikomaru (R2: 0.609–0.707). Aichinokaori had the lowest correlation (R2: 0.553–0.674).
Using the top ten (10) correlated VIs indicated in Table A2 and Table A3, an AGB and LAI estimation model was developed using XGBoost. The different hyperparameters of the XGBoost model were tuned to obtain the optimum estimation for each cultivar (Table A4 and Table A5). For some select cultivars, the AGB and LAI estimation models tuned using the lowest RMSE selection method showed decreased prediction performance when evaluated using the test data (Table 2, Figure 6 and Figure 7). The AGB estimation model for Hatsushimo and the LAI estimation models for Asahi, Hatsushimo, and Nakate Shinsenbon showed a lower R2 when estimating the target variables in the test dataset. A higher number of feature variables per tree and depth of trees may result in an estimation model that finds patterns particular to the sample used for building the prediction, resulting in overfitting (Table A4 and Table A5).
In contrast, LAI estimation models that used 50% of the feature variables and regression tree depths of five and three did not show overfitting AGB estimation models for Aichinokaori, Asahi, Nakate Shinsenbon, and Nikomaru, and LAI estimation models for Aichinokaori and Nikomaru.
The XGBoost model has a variable importance computation that summarizes the feature variables’ level of influence on the training process to predict AGB and LAI based on the parameter gain of a feature variable in the prediction process. The parameter gain measures the relative contribution of a feature variable in a tree created in the model. A higher gain value of a feature close to other features indicates that the former feature is more critical in the prediction. The relative importance of the correlated VIs in the AGB estimation models for each cultivar was calculated (Figure 8). All of the feature variables used in the training of the XGBoost model were significantly correlated with AGB, with varying correlations (Table A2).
Comparing the levels of correlation of VIs to the levels of importance in the XGBoost model showed a difference in trend. There were VIs with a medium correlation with AGB, but they showed the highest level of importance in the XGBoost model. In particular, the variable importance of the AGB estimation model for Aichinokaori showed that EVI2.2 was the essential feature in the prediction (Figure 8a). Likewise, the variable importance of the AGB estimation model for Nakate Shinsenbon showed that ATSAVI was the most crucial feature in the prediction model (Figure 8d) but was the least correlated VI with AGB among the ten VIs (Table A2) used in the training process. However, this difference in feature ranking was compensated by making the most correlated VIs the second most important feature in XGBoost, as seen in the variable importance of the AGB estimation models for Asahi, Hatsushimo, and Nikomaru. Nevertheless, the variable importance of the AGB estimation models for other cultivars such as Asahi, Hatsushimo, and Nikomaru showed that BWDRVI was the essential feature, which was also the VI most correlated with AGB (Table A2). The SHAP values of the AGB estimation models show that at least four feature variables per estimation model did not influence the prediction.
The variable importance in the LAI estimation model for each cultivar is shown in Figure 9a–e. The topmost vital features in the training process of the LAI estimation models were the VIs in the middle in the order of ascending correlation with LAI. The least correlated VIs also ranked the lowest in variable importance, such as GRNDVI in the estimation model for Hatsushimo and NDVI in the estimation model for Nikomaru. Similar to the AGB estimation models, the most correlated VIs were the second most important feature in the variable importance for the LAI estimation model.

3.3. VI-Estimated AGB and LAI Fitted to the Gompertz Growth Curve

Aerial photography by UAV flights was conducted once per week until heading, and by using the reflectance data and the estimation model described above, it is possible to estimate changes in AGB and LAI for each rice cultivar over the growing season. The estimated AGB and LAI of each flight day were fitted to the Gompertz growth curve (Figure 10 and Figure 11). Although the pattern of growth varied by year and fertilizer conditions, the relative growth trends for each cultivar were common. In our study, Hatsushimo was the fastest growing cultivar, with relatively high AGB and LAI at 85 DAT. At the same time, Nikomaru was the slowest growing cultivar, with a low AGB and LAI at the early growth stage. To evaluate the accuracy of the estimation and curve fitting, the estimated AGB and LAI values for each cultivar, year, and trial were compared to the actual values of AGB at the heading stage and LAI at the booting stage. Based on this comparison, the highest accuracy for AGB estimation was observed in Nikomaru, while the lowest accuracy was observed in Asahi. The estimation for LAI across years and trials showed that the highest accuracy was observed in Asahi, and the lowest accuracy was observed in Hatsushimo (data not shown).

3.4. Grain Yield Prediction Using VI-Estimated AGB and LAI

To identify the best time of year for UAV yield estimation, the linear regression between grain yield and VI-estimated AGB and LAI per day which fitted into the Gompertz growth curve was established for each cultivar on daily basis. A repeated k-fold cross-validation was performed to determine the regression function of the VI-estimated AGBand LAI per day for grain yield (Figure 12 and Figure A1a–e). In the case of the linear regression with no parameter tuning, the mean R2 values were also the R2 of the final model for each single-day regression.
Using the VI-estimated AGB and LAI, the RMSE of grain yield prediction across cultivars varied throughout the growth period, and the lowest RMSE values were observed at same DAT of the highest R2 peak (Figure 12 and Figure A1a–e). However, the strength of the grain yield prediction differed between cultivars. The VI-estimated AGB and LAI could explain only 10% and 30% of the grain yield variation for cultivars Aichinokaori and Hatsushimo and 40–50% for Asahi, Nakate Shinsenbon, and Nikomaru. The optimum time window for UAV flights was 42–87 DAT without referring to the actual growth stage dates observed in the study. Correspondingly, the RMSE values in these periods (25, 35, and 85 DAT) were also relatively higher, but the RMSE values did not fluctuate strongly compared to the R2 series, particularly in cultivars Aichinokaori and Asahi (Figure A1a,b). In contrast, Hatsushimo and Nikomaru had a wide range of RMSE values (0.74–1.66 ton/ha and 0.46–1.16 ton/ha, respectively) which makes the determination of the optimum time window necessary for accurate grain yield prediction.
Using the AIC, the optimum single-day regression model can be determined for each cultivar by calculating the AIC weights. The DAT values whose AIC weights are less than 1.0 units are considered the optimum time window for the UAV flight (Figure 12). Concerning the growth stage dates in the 2020 and 2021 rice seasons, the training process showed that the cultivars had different optimum time windows. For Aichinokaori, the optimum time window was 57–87 DAT, corresponding to the stem elongation to heading stages. The optimum time window was observed at 72–83 DAT for Asahi, coinciding with the stem elongation to booting stages. For Hatsushimo, the optimum time window was 50–56 DAT, corresponding to the stem elongation stage. Nakate Shinsenbon’s optimum time window was 42–45 DAT, coinciding with the stem elongation stage. For Nikomaru, the optimum time window was 53–58 DAT, corresponding to the stem elongation stage.
The independent test dataset evaluated the prediction models corresponding to the optimum time windows (Figure 13a–e). The results show that prediction models corresponding to the optimum time window for cultivars Asahi and Nikomaru did not generalize well on the unseen dataset in R2. The grain yield prediction model for Nakate Shinsenbon performed better than the previous two cultivars (RMSE = 0.52 ton/ha). Conversely, a prediction model for cultivar Hatsushimo showed a better performance compared to the other cultivars. The prediction model for Aichinokaori at a later optimum time window showed a good prediction performance (RMSE = 0.61 ton/ha). Asahi was found to have a higher RMSE in the independent test dataset, and the prediction error (RMSE = 1.04 ton/ha) was two times higher than the prediction error of the other cultivars.
The difference between the result of the training dataset and the independent test dataset in the single-day regression model was attributed to the following factors: (1) the repeated k-fold cross validation method that we employed; (2) the prediction errors that were carried over by the AGB/LAI estimation models and the Gompertz growth curve to the single-day regression model. The large reduction of R2 from the training dataset (optimum time graph) to the independent test dataset might be due to the low sampling size of the independent test dataset

4. Discussion

Based on the research results, we developed a model to estimate AGB and LAI based on the reflectance obtained from UAV flights (Figure 6 and Figure 7) and to provide the optimal time window to fly UAVs for yield prediction using the AGB and LAI for five rice cultivars based on the AIC weight of the single-day regression models (Figure 12 and Figure A1).
Although the relationship between the in-season AGB and grain yield is continuously studied and can be generalized, a higher grain yield has been observed in rice cultivars that produce AGB before the heading stage [47], while the opposite is true for other cultivars [48]. With this, the selection of cultivars and the timing of UAV flights are critical concerns in grain yield prediction. Thus, a grain yield prediction model for each tested cultivar was developed to determine if a similar optimum time window relative to the growth stage for UAV flights can be obtained for different rice cultivars (Figure 5 and Figure 12). In this study, UAVs were used to estimate AGB and LAI, and the estimated values were used to search for suitable flight times for yield estimation.
To estimate the appropriate time for yield estimation using UAVs, we conducted yield estimation on a hill basis. Because the spatial arrangement of the leaves is different from that of the population as a plot, it is assumed that the reflectance obtained from the plant will be different from that obtained from the community. However, it is quite possible that the reflectance obtained from individual plants will show a similar trend to that of a community composed of plants of comparable growth.
In a previous study [23], the grain yield prediction model determined the correlated multi-temporal VIs calculated from the UAV-derived multispectral images and employed a multi-linear regression model to predict grain yield. The same research also observed that VIs correlated with LAI were good predictors of grain yield. However, an LAI estimation model from these correlated VIs was not developed. The results from that study showed that the optimum times for UAV flights were as follows: combined UAV flights at the booting and heading stages and combined UAV flights at the jointing and booting stages. Another study [22] used the field-estimated AGB and leaf chlorophyll content (LCC) to predict grain yield. The results showed that destructive AGB sampling can predict grain yield, with the R2 ranging from 0.69 to 0.75. The same paper used VIs and UAV-derived canopy structural information such as canopy height to predict the rice grain yield.
The previous research above showed that actual plant parameters, such as AGB and LAI, and the VIs derived from UAVs are feasible for grain yield prediction. This study investigated the relationship between AGB, LAI, VIs, and grain yield, and AGB and LAI estimation models were developed (Figure 6 and Figure 7). Then, the AGB and LAI estimates were simulated using the Gompertz growth curve to observe the AGB and LAI dynamics (Figure 10 and Figure 11). These dynamics were then examined for their relationship with grain yield to determine the optimum time where these estimates can predict grain yield.
Considering the linear relationship of the selected VIs with the AGB and LAI of the five cultivars, several remarks about the model establishment were made (Table A2 and Table A3). Using VIs with a smaller range of correlation coefficient values at a high level of correlation with the target variable can result in a better model accuracy than using VIs with a smaller range of correlation coefficient values at a medium level of correlation with the target variable. This was shown by comparing the AGB estimation model for Nakate Shinsenbon with that of Aichinokaori and Asahi, and the LAI estimation for Nakate Shinsenbon with that of Aichinokaori. VIs with a broader range of correlation values at medium to low levels of correlation with the target variable can result in a similar or better model accuracy than VIs with a smaller range of correlation coefficient values. The LAI estimation model for Hatsushimo was better than the LAI estimation model for Nikomaru.
Before the XGBoost model training for AGB and LAI estimation, feature selection was conducted using correlation analysis to determine feature variables—VIs—that linearly correlated with AGB and LAI (Table A2 and Table A3). Even though some cultivars had similar correlated VIs, no two had the same set of correlated VIs. Among the VIs used in the AGB estimation models of the different cultivars, BWDRVI was the most common highly correlated VI. In past research, BWDRVI has provided good contrast between soybean soil and plant leaves at the maturity stage [49]. Other correlated VIs observed in this study confirm the findings of previous research in rice AGB estimation, where modified normalized vegetation indices showed a high correlation with AGB [50]. Conversely, the LAI of the five rice cultivars also showed a high correlation with the modified normalized difference vegetation indices. A similar research result has been observed between the LAI of rice and VIs such as GBNDVI and GNDVI [51]. Similar to the previous research, this study also shows that GBNDVI and GNDVI were more correlated with LAI than NDVI. The correlation analysis to select VIs for AGB and LAI estimations seemed reliable and reasonable.
The AGB and LAI can be estimated with good model accuracy using XGBoost (Table 2). Previous research on the estimation of aboveground forest biomass showed that the random forest algorithm (c = 0.71) works better than the stochastic gradient boosting (R2 = 0.60) [52]. In this study, the model evaluation using test datasets of the XGBoost models for AGB estimation resulted in an R2 ranging from 0.565 to 0.827. For the LAI estimation models, this resulted in an R2 ranging from 0.574 to 0.727. Comparing the model performances between the AGB estimation and LAI estimation in this study shows that AGB estimation models had better accuracy than LAI estimation models. Using the UAV-derived VIs, AGB estimation models showed less overfitting than LAI estimation models (Table 2).
XGBoost and stochastic gradient boosting utilize the gradient descent method to find the minimum cost function for predicting the target. However, the XGBoost algorithm has regularization as one of its booster parameters, which prevents the overfitting of prediction models (Table A4 and Table A5). The AGB estimation model for Hatsushimo used 90% of the feature variables to build regression trees, as represented by the booster parameter colsample_bytree. Conversely, LAI estimation models for Asahi, Hatsushimo, and Nakate Shinsenbon had higher depths of regression trees at 10. A higher number of feature variables per tree and depth of trees may result in an estimation model that finds patterns particular to the sample used for building the prediction, resulting in overfitting. Much earlier research conducted in remote sensing for AGB and LAI estimation demonstrated that AGB estimation is more suited to remote sensing than LAI estimation [53]. Nevertheless, comparing the NRMSE results of the test dataset showed that all the AGB and LAI estimation models of the five cultivars had good levels of model accuracy (Table 2). Moreover, the variable importance shown through the SHAP plots indicates that boosting the ensemble algorithm can be an excellent strategy to predict AGB and LAI when variation in correlations between the VIs and target exists in the training process (Figure 8 and Figure 9).
After the AGB and LAI estimation, each cultivar’s grain yield prediction model was developed using the VI-estimated AGB and LAI fitted into the Gompertz growth curve (Figure 10 and Figure 11). Comparing the accurate estimations of the growth curve fitted, AGB and LAI estimates from the NRMSE of the AGB and LAI estimation models showed a difference in varietal performance (data not shown). Cultivars Aichinokaori and Nikomaru had higher accuracy in the test dataset than in the datasets from the growth curve fitting. In contrast, cultivars Asahi and Nakate Shinsenbon had higher accuracy in the dataset from the growth curve fitting than in the test dataset. The cultivar Hatsushimo showed the lowest accuracy for both datasets.
Using the single-day VI-estimated AGB and LAI simulated from the growth curve, a single-day regression was conducted to predict grain yield. The optimum time window for UAV flights was determined, indicating good model accuracy based on the AIC weights (Figure 12 and Figure A1). Cultivars such as Asahi, Hatsushimo, and Nakate Shinsenbon had optimum time windows at the stem elongation stage. Aichinokaori had an optimum time window at the initial heading stage, while Nikomaru had an optimum time window at the tillering stage. A similar optimum growth stage was observed in the grain yield prediction model using the actual AGB and LAI (Figure 5a–e), except for Aichinokaori, which could be predicted at the stem elongation stage using the actual AGB and LAI, or at the later growth stage using the VI-estimated AGB and LAI.
By evaluating the performances of single-day regression models using the VI-estimated AGB and LAI (Figure 13a–e) and the grain yield prediction using the actual AGB and LAI (Figure 5a–e) on an independent test dataset, the following results were observed: (1) the grain yields of cultivars Hatsushimo and Nakate Shinsenbon were better predicted using the VI-estimated AGB and LAI; (2) the grain yield prediction of cultivars Aichinokaori and Nikomaru w better using the actual AGB and LAI, and (3) the grain yield prediction of cultivar Asahi could not be predicted well by either VI-estimated AGB and LAI or the actual AGB and LAI. These results show that using AGB and LAI was not sufficient enough to accurately predict the rice grain yield for all cultivars (e.g., Asahi).
Concerning each cultivar’s AGB and LAI estimations (Table 2), an accurate time window for UAV flights to predict grain yield could be determined. However, the percent variation in grain yield that can be explained still depends on the correlation of the VIs with AGB and LAI and the robustness of the machine learning algorithm. The optimum time windows determined in this study are consistent with previous studies for the pre-heading to initial heading stages [22,26], and our results suggest that the lengths of the optimal time windows for each cultivar differed from 4 to 31 days (Figure 12 and Figure A1). In practical terms, a longer optimum time window for UAV flights is an advantage, as flight days will be flexible. A shorter time window will make UAV flights a time constraint, especially in cases where unfavorable weather conditions prevent UAV flights.
This study shows that a robust boosting ensemble machine learning algorithm—in this case, XGBoost—can be used to estimate agronomic traits, namely, AGB and LAI. Then, using the developed estimation models, AGB and LAI can be calculated using the UAV-derived reflectance data gathered during the entire growth period. The estimated AGB and LAI throughout the growth period determined the optimum time window for the UAV flights. The simulated VI-estimated AGB and LAI dynamics allowed for grain yield prediction at any point throughout the growth period. This method will enable researchers to save resources and time for numerous UAV flights to predict grain yield from UAV-derived reflectance data.

5. Conclusions

The aims of this study were to (1) predict rice yield by using the estimated aboveground biomass (AGB) and leaf area index (LAI) until the heading stage from vegetation indices (VIs) and (2) determine the optimal sensing time in estimating AGB and LAI using VIs for yield prediction.
The AGB and LAI estimation models from the VIs using the XGBoost algorithm resulted in an R2 ranging from 0.565 to 0.827 and from 0.574 to 0.727, respectively, and the VI-estimated AGB and LAI predicted grain yield better than the actual AGB and VI for cultivars Hatsushimo and Nakate Shinsenbon (RMSE: 0.69 ton/ha and 0.52 ton/ha, respectively). The actual AGB and LAI predicted grain yields of Aichinokaori and Nikomaru (RMSE: 0.75 ton/ha and 0.7 ton/ha, respectively) better than the VI-estimated AGB and LAI, while the grain yield of Asahi could not be predicted well by both methods. The optimum time window for UAV flights was at the stem elongation stage for cultivars Asahi, Hatsushimo, and Nakate Shinsenbon and at the initial heading stage for Aichinokaori, ranging from 4 to 31 days.
These results indicate that the accuracy of rice grain yield estimation using the VI-s estimated AGB and LAI and the suitable period for yield estimation are cultivar dependent. Conversely, due to the small number of cultivars tested in this study, it is not clear whether this model specificity is due to differences in plant architecture and panicle traits. Further research should be conducted on multiple cultivars using these UAV-derived features to develop a generalized yield estimation model that is tolerant of differences among rice cultivars.

Author Contributions

Conceptualization, M.V.B., K.D., J.M. and S.N.; methodology, M.V.B. and S.N.; software, M.V.B. and S.N.; validation, M.V.B. and S.N.; formal analysis, M.V.B., N.K., T.N., N.K., S.S. and S.N.; investigation, M.V.B., T.N., I.T., N.K., Y.K., S.I. and S.N.; resources, S.N.; data curation, M.V.B. and S.N.; writing—original draft preparation, M.V.B.; writing—review and editing, M.V.B. and S.N.; visualization, M.V.B.; supervision, K.D. and S.N.; project administration, S.N.; funding acquisition, J.M. and S.N. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by JST, PRESTO grant number JPMJPR16O4, and JSPS KAKENHI grant number 20H02887, Japan.

Data Availability Statement

The raw data presented in this study are openly available on Figshare at doi:10.6084/m9.figshare.20174159.

Acknowledgments

M.V.B. is grateful to MEXT for the scholarship. The authors would like to thank our field staffs, Mizuno and Kuwahata for the assistance.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. List of vegetation indices used in the study.
Table A1. List of vegetation indices used in the study.
Vegetation IndexFormulaReferences
Atmospherically Resistant Vegetation Index 2 (ARVI2)−0.18 + 1.17[(NIR − R)/(NIR + R)][54]
Adjusted Transformed Soil Adjusted VI (ATSAVI)a[(NIR − a × R − b)/(a × NIR + R − a × b + X(1 + a2))][55]
Blue-Wide Dynamic Range Vegetation Index (BWDRVI)(0.1NIR − B)/(0.1NIR + B)[54]
Chlorophyll Vegetation Index (CVI)NIR × (R/G2)[56]
Difference Vegetation Index (DVI)NIR − R[23]
Enhanced Vegetation Index (EVI)2.5[(NIR − R)/((NIR + 6R−7.5B) + 1)][57]
Enhanced Vegetation Index 2 (EVI2)2.4[(NIR − R)/(NIR + R + 1)][58]
Enhanced Vegetation Index 2-2 (EVI2-2)2.5[(NIR − R)/(NIR + 2.4R + 1)][59]
Green-Blue Normalized Difference Vegetation Index (GBNDVI)[NIR − (G + B)]/[NIR + (G + B)][60]
Green Difference Vegetation Index (GDVI)NIR − G[61]
Green Normalized Difference Vegetation Index (GNDVI)(NIR − G)/(NIR + G)[62]
Green Optimal Soil Adjusted Vegetation Index (GOSAVI)(1 + 0.16) × (NIR − G)/(NIR + G + 0.16)[63]
Green-Red Normalized Difference Vegetation Index (GRNDVI)[NIR − (G + R)]/[NIR + (G + R)][61]
Green Soil Adjusted Vegetation Index (GSAVI)1.5(NIR − G)/(NIR + G + 0.5)][64]
Normalized Difference Red-Edge (NDRE)(NIR − RE)/(NIR + RE)[65]
Normalized Difference Vegetation Index (NDVI)(NIR − R)/(NIR + R)[51]
Pan Normalized Difference Vegetation Index (PNDVI)[NIR − (G + R + B)]/[NIR + (G + R + B)][66]
Red-Blue Normalized Difference Vegetation Index (RBNDVI)[NIR − (R + B)]/[NIR + (R + B)][67]
Red Edge Difference Vegetation Index (REDVI)NIR − RE[68]
Red Edge Soil Adjusted Vegetation Index (RESAVI)1.5(NIR − RE)/(NIR + RE + 0.5)][69]
Red Edge Wide Dynamic Range Vegetation Index (REWDRVI)(a × NIR − RE)/(a × NIR + RE) (a = 0.12)[70]
Ratio Vegetation Index (RVI)NIR/R[71]
Soil and Atmospherically Resistant Vegetation Index 2 (SARVI2)2.5(NIR − R)/(1 + NIR + 6R − 7.5B)][72]
Wide Dynamic Range Vegetation Index (WDRVI)(a × NIR − R)/(a × NIR + R) (a = 0.12)[73]
R, red reflectance; G, green reflectance; B, blue reflectance; NIR, near-infrared reflectance; RE, red-edge reflectance.
Table A2. Pearson correlation coefficient between aboveground biomass (ton/ha) and vegetation indices across growth stages.
Table A2. Pearson correlation coefficient between aboveground biomass (ton/ha) and vegetation indices across growth stages.
AichinokaoriAsahiHatsushimoNakate ShinsenbonNikomaru
ARVI20.569 ***0.538 ***0.531 ***0.641 ***0.586 ***
ATSAVI 0.499 ***0.413 ***0.319 ***0.779 ***0.605 ***
BWDRVI0.621 ***0.698 ***0.631 ***0.789 ***0.752 ***
CVI0.0490.0870.447 ***0.1360.486 ***
DVI0.535 ***0.427 ***0.345 ***0.788 ***0.665 ***
EVI0.516 ***0.413 ***0.308 ***0.800 ***0.629 ***
EVI20.532 ***0.419 ***0.319 ***0.807 ***0.650 ***
EVI2.20.539 ***0.431 ***0.330 ***0.818 ***0.658 ***
GBNDVI0.435 ***0.578 ***0.625 ***0.714 ***0.700 ***
GDVI0.527 ***0.433 ***0.369 ***0.787 ***0.682 ***
GNDVI0.343 ***0.524 ***0.619 ***0.660 ***0.680 ***
GOSAVI0.472 ***0.445 ***0.404 ***0.772 ***0.679 ***
GRNDVI0.436 ***0.550 ***0.592 ***0.690 ***0.668 ***
GSAVI0.508 ***0.428 ***0.358 ***0.798 ***0.680 ***
NDRE0.260 ***0.327 ***0.306 ***0.582 ***0.551 ***
NDVI0.569 ***0.538 ***0.531 ***0.641 ***0.586 ***
PNDVI0.484 ***0.581 ***0.603 ***0.721 ***0.690 ***
RBNDVI0.603 ***0.590 ***0.565 ***0.692 ***0.638 ***
REDVI0.508 ***0.437 ***0.351 ***0.806 ***0.684 ***
RESAVI0.447 ***0.393 ***0.292 ***0.758 ***0.634 ***
REWDRVI0.276 ***0.327 ***0.295 ***0.595 ***0.577 ***
RVI0.528 ***0.518 ***0.446 ***0.691 ***0.683 ***
SARVI20.516 ***0.413 ***0.308 ***0.800 ***0.629 ***
WDRVI0.575 ***0.569 ***0.529 ***0.717 ***0.676 ***
*** Significant at the 0.01 level of significance. VIs in bold were used in the XGBoost model training.
Table A3. Pearson correlation coefficient between leaf area index (m2/m2) and vegetation indices across growth stages.
Table A3. Pearson correlation coefficient between leaf area index (m2/m2) and vegetation indices across growth stages.
AichinokaoriAsahiHatsushimoNakate ShinsenbonNikomaru
ARVI20.562 ***0.616 ***0.607 ***0.717 ***0.609 ***
ATSAVI 0.396 ***0.636 ***0.581 ***0.706 ***0.516 ***
BWDRVI0.535 ***0.736 ***0.760 ***0.777 ***0.707 ***
CVI0.582 ***0.645 ***0.499 ***0.394 ***0.417 ***
DVI0.362 ***0.605 ***0.572 ***0.683 ***0.429 ***
EVI0.376 ***0.626 ***0.567 ***0.695 ***0.436 ***
EVI20.376 ***0.623 ***0.577 ***0.689 ***0.455 ***
EVI2.20.385 ***0.640 ***0.589 ***0.710 ***0.472 ***
GBNDVI0.571 ***0.797 ***0.727 ***0.782 ***0.688 ***
GDVI0.393 ***0.648 ***0.592 ***0.717 ***0.467 ***
GNDVI0.553 ***0.805 ***0.724 ***0.754 ***0.677 ***
GOSAVI0.484 ***0.762 ***0.665 ***0.752 ***0.589 ***
GRNDVI0.586 ***0.768 ***0.706 ***0.782 ***0.666 ***
GSAVI0.443 ***0.715 ***0.627 ***0.732 ***0.527 ***
NDRE0.286 ***0.619 ***0.658 ***0.621 ***0.562 ***
NDVI0.562 ***0.616 ***0.607 ***0.717 ***0.609 ***
PNDVI0.593 ***0.770 ***0.720 ***0.795 ***0.680 ***
RBNDVI0.563 ***0.654 ***0.649 ***0.738 ***0.645 ***
REDVI0.379 ***0.671 ***0.622 ***0.691 ***0.480 ***
RESAVI0.385 ***0.697 ***0.639 ***0.662 ***0.511 ***
REWDRVI0.345 ***0.652 ***0.718 ***0.652 ***0.586 ***
RVI0.674 ***0.718 ***0.810 ***0.849 ***0.649 ***
SARVI20.376 ***0.626 ***0.567 ***0.695 ***0.436 ***
WDRVI0.628 ***0.681 ***0.732 ***0.817 ***0.675 ***
*** Significant at the 0.01 level of significance. VIs in bold were used in the XGBoost model training.
Figure A1. R2 result of the training dataset of the single-day regression model between grain yield and VI-estimated AGB and LAI of the five rice cultivars. The red box indicates the optimum time window for UAV flights as determined from the AIC score comparison. The black line indicates the mean R2 values from the repeated k-fold cross-validation conducted for each single-day regression. The darker blue line indicates the smoothed R2 values that can be expected from the plotted R2 series derived from the repeated k-fold cross-validation. The smoothed R2 values were determined by using the LOESS (locally weighted scatterplot smoothing) method. The lighter blue region indicates the range of smoothed R2 values to be expected 95% of the time from each single-day regression. (a) Aichinokaori; (b) Asahi; (c) Hatsushimo; (d) Nakate Shinsenbon; (e) Nikomaru.
Figure A1. R2 result of the training dataset of the single-day regression model between grain yield and VI-estimated AGB and LAI of the five rice cultivars. The red box indicates the optimum time window for UAV flights as determined from the AIC score comparison. The black line indicates the mean R2 values from the repeated k-fold cross-validation conducted for each single-day regression. The darker blue line indicates the smoothed R2 values that can be expected from the plotted R2 series derived from the repeated k-fold cross-validation. The smoothed R2 values were determined by using the LOESS (locally weighted scatterplot smoothing) method. The lighter blue region indicates the range of smoothed R2 values to be expected 95% of the time from each single-day regression. (a) Aichinokaori; (b) Asahi; (c) Hatsushimo; (d) Nakate Shinsenbon; (e) Nikomaru.
Agriculture 12 01141 g0a1
Table A4. Tuned booster parameters for optimizing the XGBoost model for the AGB estimation of five rice cultivars using the lowest RMSE as the selection criterion.
Table A4. Tuned booster parameters for optimizing the XGBoost model for the AGB estimation of five rice cultivars using the lowest RMSE as the selection criterion.
CultivarEtaMax_
Depth
GammaColsample_
Bytree
Min_
Child_Weight
Sub
Sample
N Rounds
Aichinokaori 0.150.10.811100
Asahi 0.150.10.511100
Hatsushimo 0.130.10.911100
Nakate Shinsenbon0.130.10.511100
Nikomaru 0.150.10.511100
Table A5. Tuned booster parameters for optimizing the XGBoost model for the LAI estimation of five rice cultivars using the lowest RMSE as the selection criterion.
Table A5. Tuned booster parameters for optimizing the XGBoost model for the LAI estimation of five rice cultivars using the lowest RMSE as the selection criterion.
CultivarEtaMax_
Depth
GammaColsample_
Bytree
Min_
Child_Weight
SubsampleN Rounds
Aichinokaori 0.130.10.611100
Asahi 0.1100.10.811100
Hatsushimo 0.1100.10.611100
Nakate Shinsenbon0.1100.10.811100
Nikomaru 0.150.10.511100

References

  1. Hu, P.; Chapman, S.C.; Wang, X.; Potgieter, A.; Duan, T.; Jordan, D.; Guo, Y.; Zheng, B. Estimation of Plant Height Using a High Throughput Phenotyping Platform Based on Unmanned Aerial Vehicle and Self-Calibration: Example for Sorghum Breeding. Eur. J. Agron. 2018, 95, 24–32. [Google Scholar] [CrossRef]
  2. Tao, H.; Feng, H.; Xu, L.; Miao, M.; Yang, G.; Yang, X.; Fan, L. Estimation of the Yield and Plant Height of Winter Wheat Using UAV-Based Hyperspectral Images. Sensors 2020, 20, 1231. [Google Scholar] [CrossRef] [PubMed]
  3. Madec, S.; Baret, F.; De Solan, B.; Thomas, S.; Dutartre, D.; Jezequel, S.; Comar, A. High-Throughput Phenotyping of Plant Height: Comparing Unmanned Aerial Vehicles and Ground LiDAR Estimates. Front. Plant Sci. 2017, 8, 2002. [Google Scholar] [CrossRef]
  4. Gong, Y.; Yang, K.; Lin, Z.; Fang, S.; Wu, X.; Zhu, R.; Peng, Y. Remote Estimation of Leaf Area Index (LAI) with Unmanned Aerial Vehicle (UAV) Imaging for Different Rice Cultivars throughout the Entire Growing Season. Plant Methods 2021, 17, 88. [Google Scholar] [CrossRef]
  5. Hasan, U.; Sawut, M.; Chen, S. Estimating the Leaf Area Index of Winter Wheat Based on Unmanned Aerial Vehicle RGB-Image Parameters. Sustainability 2019, 11, 6829. [Google Scholar] [CrossRef]
  6. Roosjen, P.P.; Brede, B.; Suomalainen, J.M.; Bartholomeus, H.M.; Kooistra, L.; Clevers, J.G. Improved Estimation of Leaf Area Index and Leaf Chlorophyll Content of a Potato Crop Using Multi-Angle Spectral Data—Potential of Unmanned Aerial Vehicle Imagery. Int. J. Appl. Earth Obs. Geoinform. 2018, 66, 14–26. [Google Scholar] [CrossRef]
  7. Yue, J.; Yang, G.; Tian, Q.; Feng, H.; Xu, K.; Zhou, C. Estimate of Winter-Wheat above-Ground Biomass Based on UAV Ultrahigh-Ground-Resolution Image Textures and Vegetation Indices. ISPRS J. Photogramm. Remote Sens. 2019, 150, 226–244. [Google Scholar] [CrossRef]
  8. Niu, Y.; Zhang, L.; Zhang, H.; Han, W.; Peng, X. Estimating Above-Ground Biomass of Maize Using Features Derived from UAV-Based RGB Imagery. Remote Sens. 2019, 11, 1261. [Google Scholar] [CrossRef]
  9. Lu, N.; Zhou, J.; Han, Z.; Li, D.; Cao, Q.; Yao, X.; Tian, Y.; Zhu, Y.; Cao, W.; Cheng, T. Improved Estimation of Aboveground Biomass in Wheat from RGB Imagery and Point Cloud Data Acquired with a Low-Cost Unmanned Aerial Vehicle System. Plant Methods 2019, 15, 17. [Google Scholar] [CrossRef]
  10. Yang, H.; Li, F.; Wang, W.; Yu, K. Estimating Above-Ground Biomass of Potato Using Random Forest and Optimized Hyperspectral Indices. Remote Sens. 2021, 13, 2339. [Google Scholar] [CrossRef]
  11. Zhang, J.; Cheng, T.; Guo, W.; Xu, X.; Qiao, H.; Xie, Y.; Ma, X. Leaf Area Index Estimation Model for UAV Image Hyperspectral Data Based on Wavelength Variable Selection and Machine Learning Methods. Plant Methods 2021, 17, 49. [Google Scholar] [CrossRef] [PubMed]
  12. Osco, L.P.; Junior, J.M.; Ramos, A.P.M.; Furuya, D.E.G.; Santana, D.C.; Teodoro, L.P.R.; Gonçalves, W.N.; Baio, F.H.R.; Pistori, H.; da Silva Junior, C.A.; et al. Leaf Nitrogen Concentration and Plant Height Prediction for Maize Using UAV-Based Multispectral Imagery and Machine Learning Techniques. Remote Sens. 2020, 12, 3237. [Google Scholar] [CrossRef]
  13. Prasad, N.R.; Patel, N.R.; Danodia, A.; Manjunath, K.R. Comparative Performance of Semi-Empirical Based Remote Sensing and Crop Simulation Model for Cotton Yield Prediction. Model. Earth Syst. Environ. 2022, 8, 1733–1747. [Google Scholar] [CrossRef]
  14. Rehmani, M.I.A.; Ding, C.; Li, G.; Ata-Ul-Karim, S.T.; Hadifa, A.; Bashir, M.A.; Ding, Y. Vulnerability of Rice Production to Temperature Extremes during Rice Reproductive Stage in Yangtze River Valley, China. J. King Saud Univ.-Sci. 2021, 33, 101599. [Google Scholar] [CrossRef]
  15. Rehmani, M.I.A.; Wei, G.; Hussain, N.; Ding, C.; Li, G.; Liu, Z.; Ding, Y. Yield and Quality Responses of Two Indica Rice Hybrids to Post-Anthesis Asymmetric Day and Night Open-Field Warming in Lower Reaches of Yangtze River Delta. Field Crops Res. 2014, 156, 231–241. [Google Scholar] [CrossRef]
  16. Onwuchekwa-Henry, C.B.; Ogtrop, F.V.; Roche, R.; Tan, D.K.Y. Model for Predicting Rice Yield from eflectance Index and Weather Variables in Lowland Rice Fields. Agriculture 2022, 12, 130. [Google Scholar] [CrossRef]
  17. Ge, H.; Ma, F.; Li, Z.; Du, C. Grain Yield Estimation in Rice Breeding Using Phenological Data and Vegetation Indices Derived from UAV Images. Agronomy 2021, 11, 2439. [Google Scholar] [CrossRef]
  18. Freeman, K.W.; Girma, K.; Arnall, D.B.; Mullen, R.W.; Martin, K.L.; Teal, R.K.; Raun, W.R. By-Plant Prediction of Corn Forage Biomass and Nitrogen Uptake at Various Growth Stages Using Remote Sensing and Plant Height. Agron. J. 2007, 99, 530–536. [Google Scholar] [CrossRef]
  19. Rahman, M.M.; Crain, J.; Haghighattalab, A.; Singh, R.P.; Poland, J. Improving Wheat Yield Prediction Using Secondary Traits and High-Density Phenotyping Under Heat-Stressed Environments. Front. Plant Sci. 2021, 12, 1977. [Google Scholar] [CrossRef]
  20. Barzin, R.; Pathak, R.; Lotfi, H.; Varco, J.; Bora, G.C. Use of UAS Multispectral Imagery at Different Physiological Stages for Yield Prediction and Input Resource Optimization in Corn. Remote Sens. 2020, 12, 2392. [Google Scholar] [CrossRef]
  21. Johansen, K.; Morton, M.J.L.; Malbeteau, Y.; Aragon, B.; Al-Mashharawi, S.; Ziliani, M.G.; Angel, Y.; Fiene, G.; Negrão, S.; Mousa, M.A.A.; et al. Predicting Biomass and Yield in a Tomato Phenotyping Experiment Using UAV Imagery and Random Forest. Front. Artif. Intell. 2020, 3, 28. [Google Scholar] [CrossRef]
  22. Wan, L.; Cen, H.; Zhu, J.; Zhang, J.; Zhu, Y.; Sun, D.; Du, X.; Zhai, L.; Weng, H.; Li, Y.; et al. Grain Yield Prediction of Rice Using Multi-Temporal UAV-Based RGB and Multispectral Images and Model Transfer—A Case Study of Small Farmlands in the South of China. Agric. For. Meteorol. 2020, 291, 108096. [Google Scholar] [CrossRef]
  23. Zhou, X.; Zheng, H.B.; Xu, X.Q.; He, J.Y.; Ge, X.K.; Yao, X.; Cheng, T.; Zhu, Y.; Cao, W.X.; Tian, Y.C. Predicting Grain Yield in Rice Using Multi-Temporal Vegetation Indices from UAV-Based Multispectral and Digital Imagery. ISPRS J. Photogramm. Remote Sens. 2017, 130, 246–255. [Google Scholar] [CrossRef]
  24. Wang, F.; Yi, Q.; Hu, J.; Xie, L.; Yao, X.; Xu, T.; Zheng, J. Combining Spectral and Textural Information in UAV Hyperspectral Images to Estimate Rice Grain Yield. Int. J. Appl. Earth Obs. Geoinform. 2021, 102, 102397. [Google Scholar] [CrossRef]
  25. Zhou, J.; Lu, X.; Yang, R.; Chen, H.; Wang, Y.; Zhang, Y.; Huang, J.; Liu, F. Developing Novel Rice Yield Index Using UAV Remote Sensing Imagery Fusion Technology. Drones 2022, 6, 151. [Google Scholar] [CrossRef]
  26. Harrell, D.L.; Tubaña, B.S.; Walker, T.W.; Phillips, S.B. Estimating Rice Grain Yield Potential Using Normalized Difference Vegetation Index. Agron. J. 2011, 103, 1717–1723. [Google Scholar] [CrossRef]
  27. Fassnacht, F.E.; Hartig, F.; Latifi, H.; Berger, C.; Hernández, J.; Corvalán, P.; Koch, B. Importance of Sample Size, Data Type and Prediction Method for Remote Sensing-Based Estimations of Aboveground Forest Biomass. Remote Sens. Environ. 2014, 154, 102–114. [Google Scholar] [CrossRef]
  28. Kanke, Y.; Tubana, B.; Dalen, M.; Harrell, D. Evaluation of Red and Red-Edge Reflectance-Based Vegetation Indices for Rice Biomass and Grain Yield Prediction Models in Paddy Fields. Precis. Agric. 2016, 17, 507–530. [Google Scholar] [CrossRef]
  29. Perros, N.; Kalivas, D.; Giovos, R. Spatial Analysis of Agronomic Data and UAV Imagery for Rice Yield Estimation. Agriculture 2021, 11, 809. [Google Scholar] [CrossRef]
  30. Bouras, E.H.; Jarlan, L.; Er-Raki, S.; Balaghi, R.; Amazirh, A.; Richard, B.; Khabba, S. Cereal Yield Forecasting with Satellite Drought-Based Indices, Weather Data and Regional Climate Indices Using Machine Learning in Morocco. Remote Sens. 2021, 13, 3101. [Google Scholar] [CrossRef]
  31. Kang, Y.; Ozdogan, M.; Zhu, X.; Ye, Z.; Hain, C.; Anderson, M. Comparative Assessment of Environmental Variables and Machine Learning Algorithms for Maize Yield Prediction in the US Midwest. Environ. Res. Lett. 2020, 15, 064005. [Google Scholar] [CrossRef]
  32. Shah, A.; Agarwal, R.; Baranidharan, B. Crop Yield Prediction Using Remote Sensing and Meteorological Data. In Proceedings of the 2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS), Coimbatore, India, 25–27 March 2021; pp. 952–960. [Google Scholar]
  33. Archontoulis, S.V.; Miguez, F.E. Nonlinear Regression Models and Applications in Agricultural Research. Agron. J. 2015, 107, 786–798. [Google Scholar] [CrossRef]
  34. Ismail, Z.; Khamis, A.; Jaafar, M. Fitting Nonlinear Gompertz Curve to Tobacco Growth Data. J. Agron. 2003, 2, 223–236. [Google Scholar] [CrossRef]
  35. Kataoka, T.; Kaneko, T.; Okamoto, H.; Hata, S. Crop Growth Estimation System Using Machine Vision. In Proceedings of the Proceedings 2003 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM 2003), Kobe, Japan, 20–24 July 2003; Volume 2, pp. b1079–b1083. [Google Scholar]
  36. Japan Meteorological Agency. Tables of Climatological Normals (1991–2020). Available online: https://www.data.jma.go.jp/stats/data/en/normal/normal.html (accessed on 27 May 2022).
  37. Counce, P.; Keisling, T.; Mitchell, A. A Uniform, Objective, and Adaptive System for Expressing Rice Development. Crop Sci. 2000, 40, 436–443. [Google Scholar] [CrossRef]
  38. Hijmas, R.J. Raster: Geographic Data Analysis and Modeling. R Package Version. 3.5-15. 2022. Available online: https://CRAN.R-project.org/package=raster (accessed on 27 December 2021).
  39. Kuhn, M. Building Predictive Models in R Using the Caret Package. J. Stat. Softw. 2008, 28, 1–26. [Google Scholar] [CrossRef]
  40. Taiyun Wei and Viliam Simko. R package ‘corrplot’: Visualization of a Correlation Matrix (Version 0.92). 2021. Available online: https://github.com/taiyun/corrplot (accessed on 27 December 2021).
  41. Chen, T.; He, H.T.; Benesty, M.; Khotilovich, V.; Tang, Y.; Cho, H.; Chen, K.; Mitchell, R.; Cano, I.; Zhou, T.; et al. Xgboost: Extreme Gradient Boosting. R Package Version 1.6.0.1. 2022. Available online: https://CRAN.R-project.org/package=xgboost (accessed on 3 May 2022).
  42. Yang, L.; Just, A. SHAPforxgboost: SHAP Plots for ‘XGBoost’. R Package Version 0.1.1. 2021. Available online: https://CRAN.R-project.org/package=SHAPforxgboost (accessed on 3 May 2022).
  43. Srinivas, P.; Katarya, R. HyOPTXg: OPTUNA Hyper-Parameter Optimization Framework for Predicting Cardiovascular Disease Using XGBoost. Biomed. Signal Process. Control 2022, 73, 103456. [Google Scholar] [CrossRef]
  44. Ritz, C.; Baty, F.; Streibig, J.C.; Gerhard, D. Dose-Response Analysis Using R. PLoS ONE 2015, 10, e0146021. [Google Scholar] [CrossRef] [PubMed]
  45. Mazerolle, M.J. AICcmodavg: Model Selection and Multimodel Inference Based on (Q)AIC(c). R Package Version 2.3-1. 2020. Available online: https://cran.r-project.org/package=AICcmodavg (accessed on 3 May 2022).
  46. Wickham, H. ggplot2: Elegant Graphics for Data Analysis; Springer: New York, NY, USA, 2016; Available online: https://ggplot2.tidyverse.org (accessed on 27 December 2021).
  47. Pan, Y.; Gao, S.; Xie, K.; Lu, Z.; Meng, X.; Wang, S.; Lu, J.; Guo, S. Higher Radiation Use Efficiency Produces Greater Biomass before Heading and Grain Yield in Super Hybrid Rice. Agronomy 2020, 10, 209. [Google Scholar] [CrossRef]
  48. Pieters, A.J.; El Souki, S.; Nazar, C. Growth, Yield and Nitrogen Allocation in Two Rice Cultivars under Field Conditions in Venezuela. Interciencia 2006, 31, 671–676. [Google Scholar]
  49. Zhou, J.; Yungbluth, D.; Vong, C.N.; Scaboo, A.; Zhou, J. Estimation of the Maturity Date of Soybean Breeding Lines Using UAV-Based Multispectral Imagery. Remote Sens. 2019, 11, 2075. [Google Scholar] [CrossRef]
  50. Cen, H.; Wan, L.; Zhu, J.; Li, Y.; Li, X.; Zhu, Y.; Weng, H.; Wu, W.; Yin, W.; Xu, C.; et al. Dynamic Monitoring of Biomass of Rice under Different Nitrogen Treatments Using a Lightweight UAV with Dual Image-Frame Snapshot Cameras. Plant Methods 2019, 15, 32. [Google Scholar] [CrossRef] [PubMed]
  51. Wang, F.; Huang, J.; Tang, Y.-L.; Wang, X. New Vegetation Index and Its Application in Estimating Leaf Area Index of Rice. Rice Sci. 2007, 14, 195–203. [Google Scholar] [CrossRef]
  52. Ghosh, S.M.; Behera, M.D. Aboveground Biomass Estimation Using Multi-Sensor Data Synergy and Machine Learning Algorithms in a Dense Tropical Forest. Appl. Geogr. 2018, 96, 29–40. [Google Scholar] [CrossRef]
  53. Casanova, D.; Epema, G.F.; Goudriaan, J. Monitoring Rice Reflectance at Field Level for Estimating Biomass and LAI. Field Crops Res. 1998, 55, 83–92. [Google Scholar] [CrossRef]
  54. Sonobe, R.; Yamaya, Y.; Tani, H.; Wang, X.; Kobayashi, N.; Mochizuki, K. Crop Classification from Sentinel-2-Derived Vegetation Indices Using Ensemble Learning. J. Appl. Remote Sens. 2018, 12, 026019. [Google Scholar] [CrossRef]
  55. Zhang, J.; He, Y.; Yuan, L.; Liu, P.; Zhou, X.; Huang, Y. Machine Learning-Based Spectral Library for Crop Classification and Status Monitoring. Agronomy 2019, 9, 496. [Google Scholar] [CrossRef]
  56. Clevers, J.G.P.W.; Kooistra, L.; Van den Brande, M.M.M. Using Sentinel-2 Data for Retrieving LAI and Leaf and Canopy Chlorophyll Content of a Potato Crop. Remote Sens. 2017, 9, 405. [Google Scholar] [CrossRef]
  57. Xiao, X.; Boles, S.; Liu, J.; Zhuang, D.; Frolking, S.; Li, C.; Salas, W.; Moore, B. Mapping Paddy Rice Agriculture in Southern China Using Multi-Temporal MODIS Images. Remote Sens. Environ. 2005, 95, 480–492. [Google Scholar] [CrossRef]
  58. Qiu, B.; Li, W.; Tang, Z.; Chen, C.; Qi, W. Mapping Paddy Rice Areas Based on Vegetation Phenology and Surface Moisture Conditions. Ecol. Indic. 2015, 56, 79–86. [Google Scholar] [CrossRef]
  59. Jiang, Z.; Huete, A.; Didan, K.; Miura, T. Development of a Two-Band Enhanced Vegetation Index without a Blue Band. Remote Sens. Environ. 2008, 112, 3833–3845. [Google Scholar] [CrossRef]
  60. Qiu, C.; Liao, G.; Tang, H.; Liu, F.; Liao, X.; Zhang, R.; Zhao, Z. Derivative Parameters of Hyperspectral NDVI and Its Application in the Inversion of Rapeseed Leaf Area Index. Appl. Sci. 2018, 8, 1300. [Google Scholar] [CrossRef]
  61. Kang, Y.; Nam, J.; Kim, Y.; Lee, S.; Seong, D.; Jang, S.; Ryu, C. Assessment of Regression Models for Predicting Rice Yield and Protein Content Using Unmanned Aerial Vehicle-Based Multispectral Imagery. Remote Sens. 2021, 13, 1508. [Google Scholar] [CrossRef]
  62. Teal, R.K.; Tubana, B.; Girma, K.; Freeman, K.W.; Arnall, D.B.; Walsh, O.; Raun, W.R. In-Season Prediction of Corn Grain Yield Potential Using Normalized Difference Vegetation Index. Agron. J. 2006, 98, 1488–1494. [Google Scholar] [CrossRef]
  63. Marin, D.B.; Ferraz, G.A.e.S.; Guimarães, P.H.S.; Schwerz, F.; Santana, L.S.; Barbosa, B.D.S.; Barata, R.A.P.; Faria, R.d.O.; Dias, J.E.L.; Conti, L.; et al. Remotely Piloted Aircraft and Random Forest in the Evaluation of the Spatial Variability of Foliar Nitrogen in Coffee Crop. Remote Sens. 2021, 13, 1471. [Google Scholar] [CrossRef]
  64. Candiago, S.; Remondino, F.; De Giglio, M.; Dubbini, M.; Gattelli, M. Evaluating Multispectral Images and Vegetation Indices for Precision Farming Applications from UAV Images. Remote Sens. 2015, 7, 4026–4047. [Google Scholar] [CrossRef]
  65. Tian, Y.C.; Yao, X.; Yang, J.; Cao, W.X.; Hannaway, D.B.; Zhu, Y. Assessing Newly Developed and Published Vegetation Indices for Estimating Rice Leaf Nitrogen Concentration with Ground- and Space-Based Hyperspectral Reflectance. Field Crops Res. 2011, 120, 299–310. [Google Scholar] [CrossRef]
  66. Xu, R.; Zhao, S.; Ke, Y. A Simple Phenology-Based Vegetation Index for Mapping Invasive Spartina Alterniflora Using Google Earth Engine. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 190–201. [Google Scholar] [CrossRef]
  67. Tanaka, S.; Goto, S.; Maki, M.; Akiyama, T.; Muramoto, Y.; Yoshida, K. Estimation of leaf chlorophyll concentration in winter wheat [Triticum aestivum] before maturing stage by a newly developed vegetation index-rbNDVI. J. Jpn. Agric. Syst. Soc. Jpn. 2007, 4, 297–303. [Google Scholar]
  68. Zha, H.; Miao, Y.; Wang, T.; Li, Y.; Zhang, J.; Sun, W.; Feng, Z.; Kusnierek, K. Improving Unmanned Aerial Vehicle Remote Sensing-Based Rice Nitrogen Nutrition Index Prediction with Machine Learning. Remote Sens. 2020, 12, 215. [Google Scholar] [CrossRef]
  69. Cao, Q.; Miao, Y.; Wang, H.; Huang, S.; Cheng, S.; Khosla, R.; Jiang, R. Non-Destructive Estimation of Rice Plant Nitrogen Status with Crop Circle Multispectral Active Canopy Sensor. Field Crops Res. 2013, 154, 133–144. [Google Scholar] [CrossRef]
  70. Lu, J.; Miao, Y.; Shi, W.; Li, J.; Hu, X.; Chen, Z.; Wang, X.; Kusnierek, K. Developing a Proximal Active Canopy Sensor-Based Precision Nitrogen Management Strategy for High-Yielding Rice. Remote Sens. 2020, 12, 1440. [Google Scholar] [CrossRef]
  71. Zhu, Y.; Yao, X.; Tian, Y.; Liu, X.-J.; Cao, W. Analysis of Common Canopy Vegetation Indices for Indicating Leaf Nitrogen Accumulations in Wheat and Rice. Int. J. Appl. Earth Obs. Geoinform. 2008, 10, 1–10. [Google Scholar] [CrossRef]
  72. Basso, B.; Cammarano, D. Remotely Sensed Vegetation Indices: Theory and Applications for Crop Management. Riv. Ital. Agrometeorol. 2004, 1, 36–53. [Google Scholar]
  73. Cao, Y.; Li, G.L.; Luo, Y.K.; Pan, Q.; Zhang, S.Y. Monitoring of Sugar Beet Growth Indicators Using Wide-Dynamic-Range Vegetation Index (WDRVI) Derived from UAV Multispectral Images. Comput. Electron. Agric. 2020, 171, 105331. [Google Scholar] [CrossRef]
Figure 1. Workflow of the study.
Figure 1. Workflow of the study.
Agriculture 12 01141 g001
Figure 2. Location of the study site in Aichi, Japan. Orthomosaic image of the two experimental trials: trial with no fertilizer (in blue box) and trial with fertilizer (red box). On the right, from the top to bottom, are the plot images clipped from the orthomosaic image of rice cultivars, namely, Aichinokaori, Hatsushimo, Asahi, Nakate Shinsenbon, and Nikomaru.
Figure 2. Location of the study site in Aichi, Japan. Orthomosaic image of the two experimental trials: trial with no fertilizer (in blue box) and trial with fertilizer (red box). On the right, from the top to bottom, are the plot images clipped from the orthomosaic image of rice cultivars, namely, Aichinokaori, Hatsushimo, Asahi, Nakate Shinsenbon, and Nikomaru.
Agriculture 12 01141 g002
Figure 3. Aboveground biomass (ton/ha) of five cultivars grown in 2020 and 2021 in the (a,b) trial with no fertilizer and (c,d) trial with fertilizer. Leaf area index (m2/m2) of five cultivars grown in 2020 and 2021 in the (e,f) trial with no fertilizer and (g,h) trial with fertilizer.
Figure 3. Aboveground biomass (ton/ha) of five cultivars grown in 2020 and 2021 in the (a,b) trial with no fertilizer and (c,d) trial with fertilizer. Leaf area index (m2/m2) of five cultivars grown in 2020 and 2021 in the (e,f) trial with no fertilizer and (g,h) trial with fertilizer.
Agriculture 12 01141 g003aAgriculture 12 01141 g003b
Figure 4. Grain yield (ton/ha) of five rice cultivars grown in 2020 and 2021 in the (a,b) trial with no fertilizer and (c,d) trial with fertilizer.
Figure 4. Grain yield (ton/ha) of five rice cultivars grown in 2020 and 2021 in the (a,b) trial with no fertilizer and (c,d) trial with fertilizer.
Agriculture 12 01141 g004aAgriculture 12 01141 g004b
Figure 5. Model validation of the linear regression model between grain yield and actual AGB and LAI of the five rice cultivars: (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru, at the optimum growth stage. The optimum growth stage was determined by selecting the growth stage corresponding to the simple linear model that resulted in the lowest RMSE in the model validation. Cultivars Aichinokaori, Asahi, Hatsushimo, and Nakate Shinsenbon were optimally predicted at the stem elongation stage, while Nikomaru was optimally predicted at the tillering stage.
Figure 5. Model validation of the linear regression model between grain yield and actual AGB and LAI of the five rice cultivars: (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru, at the optimum growth stage. The optimum growth stage was determined by selecting the growth stage corresponding to the simple linear model that resulted in the lowest RMSE in the model validation. Cultivars Aichinokaori, Asahi, Hatsushimo, and Nakate Shinsenbon were optimally predicted at the stem elongation stage, while Nikomaru was optimally predicted at the tillering stage.
Agriculture 12 01141 g005aAgriculture 12 01141 g005b
Figure 6. Model evaluation of AGB estimation model using the test dataset for cultivars (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru.
Figure 6. Model evaluation of AGB estimation model using the test dataset for cultivars (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru.
Agriculture 12 01141 g006
Figure 7. Model evaluation of LAI estimation model using the test dataset for cultivars (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru.
Figure 7. Model evaluation of LAI estimation model using the test dataset for cultivars (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru.
Agriculture 12 01141 g007
Figure 8. Variable importance in the AGB estimation model for cultivars (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru.
Figure 8. Variable importance in the AGB estimation model for cultivars (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru.
Agriculture 12 01141 g008
Figure 9. Variable importance in the LAI estimation model for cultivars (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru.
Figure 9. Variable importance in the LAI estimation model for cultivars (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru.
Agriculture 12 01141 g009
Figure 10. Estimated aboveground biomass (ton/ha) of five cultivars grown in 2020 and 2021 in the trial with no fertilizer (a,b) and trial with fertilizer (c,d).
Figure 10. Estimated aboveground biomass (ton/ha) of five cultivars grown in 2020 and 2021 in the trial with no fertilizer (a,b) and trial with fertilizer (c,d).
Agriculture 12 01141 g010
Figure 11. Estimated leaf area index (m2/m2) of five cultivars grown in 2020 and 2021 in the trial with no fertilizer (a,b) and trial with fertilizer (c,d).
Figure 11. Estimated leaf area index (m2/m2) of five cultivars grown in 2020 and 2021 in the trial with no fertilizer (a,b) and trial with fertilizer (c,d).
Agriculture 12 01141 g011aAgriculture 12 01141 g011b
Figure 12. RMSE result of the training dataset of the single-day regression model between grain yield and VI-estimated AGB and LAI of the five rice cultivars. The red box indicates the optimum time window for UAV flights as determined from the AIC score comparison. The black line indicates the mean R2 values from the repeated k-fold cross-validation conducted for each single-day regression. The darker blue line indicates the smoothed R2 values that can be expected from the plotted R2 series derived from the repeated k-fold cross-validation. The smoothed R2 values were determined by using the LOESS (locally weighted scatterplot smoothing) method. The lighter blue region indicates the range of smoothed R2 values to be expected 95% of the time from each single-day regression. (a) Aichinokaori; (b) Asahi; (c) Hatsushimo; (d) Nakate Shinsenbon; (e) Nikomaru.
Figure 12. RMSE result of the training dataset of the single-day regression model between grain yield and VI-estimated AGB and LAI of the five rice cultivars. The red box indicates the optimum time window for UAV flights as determined from the AIC score comparison. The black line indicates the mean R2 values from the repeated k-fold cross-validation conducted for each single-day regression. The darker blue line indicates the smoothed R2 values that can be expected from the plotted R2 series derived from the repeated k-fold cross-validation. The smoothed R2 values were determined by using the LOESS (locally weighted scatterplot smoothing) method. The lighter blue region indicates the range of smoothed R2 values to be expected 95% of the time from each single-day regression. (a) Aichinokaori; (b) Asahi; (c) Hatsushimo; (d) Nakate Shinsenbon; (e) Nikomaru.
Agriculture 12 01141 g012aAgriculture 12 01141 g012b
Figure 13. Model evaluation of the optimum single-day regression model using the test dataset between grain yield and VI-estimated aboveground biomass and leaf area index of the five rice cultivars: (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru.
Figure 13. Model evaluation of the optimum single-day regression model using the test dataset between grain yield and VI-estimated aboveground biomass and leaf area index of the five rice cultivars: (a) Aichinokaori, (b) Asahi, (c) Hatsushimo, (d) Nakate Shinsenbon, and (e) Nikomaru.
Agriculture 12 01141 g013
Table 1. UAV flight and sampling dates for the corresponding growth stages of the two-year experiment trials.
Table 1. UAV flight and sampling dates for the corresponding growth stages of the two-year experiment trials.
Growth Stage2020 UAV
Flight Date (DAT)
2021 UAV
Flight Date (DAT)
Aichinokaori
Tillering21-July (49)31-July (52)
Stem Elongation11-August (70)18-August (70)
Booting17-August (76)25-August (77)
Heading25-August (84)31-August (83)
Asahi
Tillering29-July (57)5-August (57)
Stem Elongation11-August (70)18-August (70)
Booting23-August (82)31-August (83)
Heading11-September (101)9-September (92)
Hatushimo
Tillering21-July (49)28-July (49)
Stem Elongation5-August (64)11-August (63)
Booting23-August (82)31-August (83)
Heading11-September (101)9-September (92)
Nakate Shinsenbon
Tillering15-July (43)22-July (43)
Stem Elongation29-July (57)5-August (57)
Booting17-August (76)20-August (72)
Heading25-August (84)31-August (83)
Nikomaru
Tillering21-July (49)28-July (49)
Stem Elongation5-August (64)11-August (63)
Booting17-August (76)25-August (77)
Heading25-August (84)9-September (92)
Table 2. R2 and RMSE results of the cross-validated estimation model for aboveground biomass (ton/ha) and leaf area index (m2/m2) using XGBoost models.
Table 2. R2 and RMSE results of the cross-validated estimation model for aboveground biomass (ton/ha) and leaf area index (m2/m2) using XGBoost models.
Aboveground Biomass (Ton/ha)Leaf Area Index (m2/m2)
CultivarTraining ResultTest ResultTraining ResultTest Result
R2RMSER2Normalized RMSER2RMSER2Normalized RMSE
Aichinokaori0.66 ± 0.071.430.780.260.46 ± 0.050.650.710.20
Asahi0.67 ± 0.071.500.660.290.85 ± 0.090.480.570.29
Hatsushimo0.72 ± 0.071.540.560.360.84 ± 0.080.610.650.27
Nakate Shinsenbon0.85 ± 0.091.240.830.290.82 ± 0.080.560.630.39
Nikomaru0.75 ± 0.071.210.800.250.55 ± 0.050.650.730.22
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Bascon, M.V.; Nakata, T.; Shibata, S.; Takata, I.; Kobayashi, N.; Kato, Y.; Inoue, S.; Doi, K.; Murase, J.; Nishiuchi, S. Estimating Yield-Related Traits Using UAV-Derived Multispectral Images to Improve Rice Grain Yield Prediction. Agriculture 2022, 12, 1141. https://doi.org/10.3390/agriculture12081141

AMA Style

Bascon MV, Nakata T, Shibata S, Takata I, Kobayashi N, Kato Y, Inoue S, Doi K, Murase J, Nishiuchi S. Estimating Yield-Related Traits Using UAV-Derived Multispectral Images to Improve Rice Grain Yield Prediction. Agriculture. 2022; 12(8):1141. https://doi.org/10.3390/agriculture12081141

Chicago/Turabian Style

Bascon, Maria Victoria, Tomohiro Nakata, Satoshi Shibata, Itsuki Takata, Nanami Kobayashi, Yusuke Kato, Shun Inoue, Kazuyuki Doi, Jun Murase, and Shunsaku Nishiuchi. 2022. "Estimating Yield-Related Traits Using UAV-Derived Multispectral Images to Improve Rice Grain Yield Prediction" Agriculture 12, no. 8: 1141. https://doi.org/10.3390/agriculture12081141

APA Style

Bascon, M. V., Nakata, T., Shibata, S., Takata, I., Kobayashi, N., Kato, Y., Inoue, S., Doi, K., Murase, J., & Nishiuchi, S. (2022). Estimating Yield-Related Traits Using UAV-Derived Multispectral Images to Improve Rice Grain Yield Prediction. Agriculture, 12(8), 1141. https://doi.org/10.3390/agriculture12081141

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop