1. Introduction
Agriculture is an important industry as the basis of food security, and as a significant aspect of the world economy. However, factors such as rapidly increasing global demand, fluctuations in production due to climate change, and a greater awareness of the negative environmental impact of agriculture on surrounding ecosystems, contribute to an increasing need for more efficient and sustainable farming practices. Especially in Canada, where agriculture is a significant industry, developing agricultural methods to be adaptable and resilient is necessary [
1]. This is possible through precision agriculture (PA), a management technique that selectively applies crop farming resources such as fertilizer, water, pesticides, and herbicides based on the plant needs within a field [
2,
3,
4].
One of the main fields of applications of precision agriculture is the management of nitrogen fertilizers [
5,
6]. Nitrogen is an essential macronutrient to plants, as a major constituent of organic material, enzymic processes, and oxidation-reduction reactions [
7]. As such, nitrogen content in above-ground plant tissue is an important indicator of crop health and yield potentials. Several global studies demonstrate that the mean nitrogen recovery efficiency by annual crops was less than 50% of the amount of fertilizer applied [
6,
8]. Nitrogen is one of the most expensive nutrients to supply, and commercial fertilizers represent a major cost in plant production [
9]. Rates of nitrogen fertilizer application depend on the crop type, desired yield, nitrogen present in the soil, and subsequently in the plants [
7]. Excess nitrogen can reduce crop yield and can be leached from the soil, contaminating surface and groundwater, leading to harmful effects on human health and ecosystem consequences such as algal blooms and hypoxia in water bodies [
10]. The United Nations Food and Agriculture Organization identifies classes of agricultural climate adaptation, one major class being management of field operation inputs including fertilizers [
11]. As such, optimizing the management of nitrogen fertilizers is an important field of research as new methods and technology are developed to improve nutrient use efficiency, quality, and crop yield while minimizing significantly negative environmental impacts and cost of production.
Literature does include much research on crop canopy nitrogen retrieval using UAVs, but model parameters are often focused on spectroscopy with the use of vegetation indices and spectral remote sensors [
12]. PA incorporates the use of many different types of spatial technologies such as geographic information systems (GIS), precision machinery, and remote sensing imagery to ground-based data collection [
13]. In PA, remote sensing imagery is especially useful because it does not require physical or destructive contact with plants to gather valuable crop information. The spectral information provided by the imagery can be transformed into vegetation indices (VIs). VIs are mathematical combinations or transformations of spectral bands that have been widely used in agricultural research because they allow for deriving of specific plant properties such as chlorophyll or nutrient content by taking advantage of the differential spectral properties of plants in the visible and near-infrared wavelengths [
14,
15,
16]. The resulting VI data can then provide timely information for monitoring field conditions and crop health, allowing for the optimal number of resources to be placed where they are needed, when they are needed.
In PA, crop monitoring has largely been conducted using optical satellites [
17]. As demand for timely, accurate, and cost-effective data on the earth’s surface increased in the last few decades, numerous satellite systems have been launched. Examples of optical satellites in operation include Landsat 8 (since 2013) and Sentinel-2 (since 2015), which have been used in studies on agriculture management [
4]. The Landsat program, which began in 1972 with the launch of Landsat 1, is the longest-running program for satellite imagery of the earth [
18]. Landsat 8 Operational Land Imager has nine spectral bands including visible, near-infrared, and shortwave infrared, with varying spatial resolutions of 15 to 30 m. Taking more than 700 scenes a day, it has a 16-day revisit time to the same area. Sentinel-2 has 13 spectral bands in the visible, near-infrared, and shortwave infrared with varying spatial resolutions of 10 m, 20 m, and 60 m [
19]. With the constellation of twin satellites, the revisit cycle over an area is five days.
Limitations in optical satellite imagery include low spatial sensitivity, as the spatial resolution in the range of meters allows for analysis of larger-scale regional or national areas but is too coarse for small-scale crop fields. The temporal sensitivity can be rather low, as in the case of Landsat 8; within a 16-day revisit time, crops would have changed significantly and valuable information on the different stages of growth would not be obtained. In addition to factors such as geometric distortion, atmospheric distortion, and cloud cover obscuring view of the land, advanced processing expertise may be required to ensure sufficient image quality [
20]. In comparison, UAV-based remote sensing can provide lower cost and higher spatial and temporal resolution data for crop management. Individuals with basic training can operate a UAV using programmed routes and collect images with <10 cm resolutions [
21]. They can be flown to capture more frequent image data, including monitoring each significant stage of crop growth and offer flexibility in operation for times when weather is most suitable. This makes them ideal for field management conducted in a timely and accurate manner according to the needs of the crop [
22]. Compared to satellites, overall UAV-based systems are often lower in cost for data collection and processing. As such, the use of UAV imagery in PA has become a research area of great interest due to its potential for larger environmental and economic impacts [
4].
Corn was selected for this study because it is among the most grown crops in Ontario [
17]. Recent studies have tested the use of linear regression, Random Forest (RF), and Support Vector Regression (SVR) models in UAV-based canopy nitrogen weight prediction models [
23]. Although linear regression is a commonly used method to predict nitrogen, some VIs (e.g., NDVI) may saturate beyond the early growth crop stages and models may have reduced accuracy due to multicollinearity [
24]. By contrast, machine learning-based regression methods such as RF and SVR have been found to produce more accurate models compared to classical linear regression methods, as they are unaffected by the assumptions of linear regression [
24]. However, Lee et al. (2020) considered only UAV spectral information and canopy nitrogen weight prediction in their study. The nitrogen prediction may be improved if plant physiology, topographic metrics, and soil variables are included in the analysis, given that crop nitrogen highly depends on these variables [
7].
To make well-informed fertilization management decisions, knowledge about the plant nutrient supply, health, and several environmental factors such as water availability, soil quality, and micro-topography of a field are key. The objectives of this study include, (i) studying the relationship between the spatial variation of canopy nitrogen weight and factors such as plant height, topographic metrics, soil chemical properties, and soil moisture conditions within a corn field in Southwestern Ontario using multispectral UAV-based imagery; (ii) determining the optimal combination(s) of spectral variable(s), crop plant physiology variables, and/or environmental conditions (soil, water, topographic data) for corn canopy nitrogen estimation and prediction; and (iii) evaluating the temporal variation of nitrogen estimation and prediction during early growth stages of corn using UAV images and select variables.
4. Discussion
In this study, RF and SVM regression methods were used to predict canopy nitrogen weight of corn using UAV Micasense individual band reflectance, associated VIs, plant physiology variables, topographic metrics, and soil metrics. The variation of the in-situ canopy nitrogen weight measurements was very low in the earliest growth stage on 8 June and gradually increased until the latest sampling date of 15 July, with a marked decrease afterwards. The increase in canopy nitrogen variation during the early growth stages of BBCH 00-49 can be explained by the leaf growth and stem elongation because the crop biomass increases rapidly during that period. Then, as the plant reaches the BBCH 51 stage that corresponds to the inflorescence emergence and heading, the canopy nitrogen variation decreases because of the dilution effect [
6].
The RF and SVR models were first calibrated with all the 29 variables using single and multi-date datasets. With the validation datasets, single-date models had overall poor performance. Combinations of multi-date models led to better results, with the best performance obtained with the RF model. In the variance importance plot of the best RF model, the plant height was the most important predictor out of all variables used. Freeman et al. [
55] already found that plant height is a useful variable in identifying nitrogen uptake in corn. Precision agriculture studies have used crop height for phenology, biomass, and yield prediction successfully, and this crop height can be derived from UAV point cloud datasets [
56]. Among all the individual MicaSense band reflectances, the reflectance of the red-edge band has the poorest performance. The red-edge region (680–800 nm) represents a sharp change in the canopy reflectance and can provide important details about phenology [
47]. Our result agrees with Lee et al.’s [
23] work that uses the same Micasense camera. Likely, the narrow 10 mm band range of the red-edge band of the Micasense camera did not capture the change in the region well. This could explain why our results are not in agreement with several other studies that find that the red-edge region is a sensitive indicator of leaf chlorophyll content, because of the high absorption in the red radiation and the high reflectance in the near-infrared region during plant growth stages [
57,
58]. Overall, most of the soil metrics had little to no effect on the models, but the soil was sampled once at the beginning of the growing season. With the consideration of costs and historical farm operations where recommendations for soil tests are only once a year, this study emphasizes the limitations of the current soil testing practices. Soil metrics results from Mulvaney et al.’s [
59] and Tremblay et al.’s [
60] studies on a soil-based approach in corn nitrogen management, found that soil tests were useful in their models when field conditions were conducive to soil nitrogen mineralization, crop uptake, and utilization. With different sampling methods, soil metrics may still be useful in models. There is therefore the need to conduct soil tests at different dates to better characterize the soil condition changes, for example, because of fertilizer applications, precipitation patterns, and crop growth [
7].
The RF model’s variable importance plot allowed selecting groups of top 6, 10, 15, 18, and 20 variables for developing new RF and SVM models. The top 20 variables included plant height, all the 11 VIs used in this study, all the MicaSense band reflectance mosaic but the red-edge one, the soil moisture, and the profile curvature, as well as the topographic wetness indices #1 and #2. The group of top 15 variables that performed best has only the plant height and profile curvature as non-spectral parameters. Considering that topographic metrics were derived from the UAV Phantom 4 RTK imagery along with the possibility of deriving crop height across the field from point cloud data, all variables in the best model can be measured from in-situ, non-destructive, UAV-based data collection [
55]. Having all model data that can be collected by remote sensing could be a greater benefit, as common limitations of in-situ studies and subsequent application methods are the intensive labour and high costs required to obtain model input data.
In this study, the final validation of canopy nitrogen models with various combinations of variables indicated RF models had better performance than SVR in terms of R
2 values. This is consistent with results from Liu et al. [
61], Zha et al. [
62], and Lee et al. [
23], with RF yielding better nitrogen content prediction in wheat, rice, and corn crops compared to SVR models. Although SVR had lower RMSE values in comparison to RF, overall RF RMSE values were low as well in the context of nitrogen estimation for g/m
2. In comparison to the study by Lee et al. [
23], the RMSE values of this study’s models are much lower, which can be beneficial for fertilizer management recommendations to farmers in general. In the case of this study, the RF algorithmic method of using many decision trees may better suit the use of numerous variables in regression models. In comparison, using many variables in SVR requires user hyper-tuning and the kernel trick function to separate data into groups, relying on the radial distance between points to be meaningful in the model. Overall, the performance of SVR models was good, but RF models can be considered more useful in terms of ease of use and the quality of results.
5. Conclusions
This study tested machine learning regression methods to predict corn canopy nitrogen weight using UAV Micasense band reflectance mosaics, associated VIs, plant physiology variables, topographic metrics, and soil metrics. With all 29 variables in RF and SVR models, the combination of all three dates with the RF model produced the best results: The validation model having an R2 of 0.75 and an RMSE of 2.29 g/m2. From the multi-date RF model’s variable importance plot, the top 6, 10, 15, 18, and 20 variables were tested in RF and SVR models. The best validation model was the RF model (R2 value at 0.73 and RMSE at 2.21 g/m2) with the top 15 variables, most of them being spectral variables.
We developed models for estimating canopy nitrogen weight from spectral, plant, soil, and topographic variables using machine learning algorithms, but the resulting models are still empirical, and their applicability can be limited to the dataset on which they were built and validated. This is a common limitation in agricultural research as in-situ measurements often require intensive labour, costs, and variable conditions. Overall, many factors need to be considered to define plant growth conditions such as plant species, soil condition, environmental factors of field topology, moisture supply, weather, and more. There is a need to test the developed models on other datasets to determine their efficacy and to understand their applicability in precision agriculture. Future work can consider using a more deterministic modelling approach, for example, the PROSAIL model [
63], as it is less empirical and applies to a high variety of conditions but requires more advanced parameter calibration. The PROSAIL model uses spectral data of leaf and canopy level parameters to retrieve chlorophyll and nitrogen content, with robust results from lab and field studies [
63]. Eventually, methods of crop height extraction from RTK UAVs [
56], in addition to UAV-derived topographic and spectral variables, can be used to develop a final map for a whole field. The model information can be used to improve field nitrogen prediction, leading to more effective and efficient N fertilizer management.