1. Introduction
Rice false smut (RFS), resulting from
Ustilaginoidea virens, grows on rice grains and leads to heavy losses of rice yield in most major rice-producing areas. RFS was previously recorded as a minor disease of rice and considered a symbol of a good harvest in old times. In recent years, increasing occurrences of RFS have been reported in most major rice-growing regions throughout the world, such as China, India, and USA [
1], causing chalkiness, and reducing "1000-grain weight" and seed germination (by up to 35%). In damp weather, the disease could be severe with losses reaching 25%. In India, a yield loss of 7–75% was observed [
2]. Moreover, it is still viable in the soil and infects seedlings after planting [
3]. Ustiloxins produced by
U. virens pose as a serious hazard to human health and to the ecological safety of farmlands [
4].
Suitable management practices need to be made to avoid the disease, in order to minimize direct economic loss. Breeding and utilization of a resistant cultivar is the most effective and economical way to control RFS disease and ensure the high yield of rice. Culture management will affect the incidence of RFS and early-planted rice has less RFS balls than late-planted rice; moreover, high nitrogen increases disease incidence. Chemical control is effective, for example, using fungicides with high efficiency, low toxicity, and low residue is currently the best choice to control RFS disease, but it is often not environmentally-friendly. If the detection of RFS is more accurate in farmlands, plant protection will be more precise.
In the literature, many researchers have looked into the biological and chemical methods for RFS detection. For instance, Tang [
5] developed a nested polymerase chain reaction (PCR)-based assay to detect
U. virens using the genes of
U. virens as specific targets. The study might be conducted to detect infection by
U. virens at an early stage—to look into how to mitigate disease spread and to study the ecology. To evaluate disease severity, Dhua [
6] presented a precise assessment method; a yield based on the florets and actual grains was simulated for the disease severity assessment of RFS disease. The above methods are relatively tedious. If a nondestructive and rapid method, e.g., remote sense, is used to detect RFS, it will be very convenient.
Lin [
7] started with the pathogenesis of false smut bacteria and occurrence; he combined meteorological, agricultural, and remote sensing elements, and considered the meteorological factors conducive to false smut pathogen infection and dissemination, field management, rice varieties, nitrogen fertilizer, etc. From the perspective of bacteria pathogenesis and the occurrence of RFS, the study selected five indicators to simulate a forecast model of the RFS index (50 samples in 2 years) through the model verification; the results generally agreed with the actual situation, and the verification accuracy reached 83.32%.
Fewer studies on RFS have been based on near earth remote sensing. A large number of researchers have emphasized other crop diseases, such as satellite remote sensing for wheat Fusarium head blight [
8], soybean sudden death syndrome [
9], tobacco crop [
10], rice bacterial leaf blight [
11], soybean sudden death syndrome [
12], near earth remote sensing for cucumber leaves in response to angular leaf spot disease [
13], early disease in wheat fields [
14], watermelon disease detection [
15], rye leaf rust symptoms [
16], paddy leaf disease [
17], onion purple blotch [
18], etc.
Among the literature, for satellite remote sensing research, Landsat imagery is free, but the spatial resolution is too low to accurately map smaller infestations. There are also satellites with pixel sizes of 5 m or less, such as GeoEye-1, Pleiades, WorldView-3 and -4, and GaoJing-1, or imagery with resolutions of 5–10 m, such as RapidEye, SPOT 6 and 7, and Sentinel-2, which are only suitable for macro-crop detection—near remote sensing imagery has a fine pixel size [
19].
Although many crop diseases can be successfully detected and mapped using airborne or satellite imagery—understanding how to convert remote sensing data to practical prescription maps is still lacking. More research is needed to develop operational procedures for transforming image classification maps to applications maps. Each disease has its own characteristics and requires different procedures for detection and management.
For RFS detection, there are various influencing factors affecting RFS detection in near earth remote sensing, such as different sowing dates and farm management types. In this study, to obtain a more convincing and generalized model for RFS detection, we considered the sowing data and farm management types; 14 paddy fields were prepared, including 7 different sowing dates and 2 different farm management types. The data were obtained using a frame-based hyperspectral image device, based on near earth remote sensing; 23 different differentiation tradition models were built to obtain the most useful model. Spectral based deep learning was also used for detection. Finally, the selected model was verified by actual field investigation results. Our specific objectives were: (1) to ensure differentiation between healthy ears of rice (HER) and disease ears of rice (DER) at different sowing dates and farm management types. (2) To develop different models and get the most reliable model. (3) To develop an appropriate structure of the spectral deep learning model. (4) To verify the model by actual field investigation results; the disease prescription map was generated. The workflow of this study is shown in
Figure 1.
3. Results and Discussion
3.1. Comparisons between Different Test Fields
Table 2,
Table 3,
Table 4 and
Table 5 shows the accuracy between DER and HER in the farm management of FMT and NGT. The name rules in the table are based on different test plots. For instance, 1A denotes the earliest and 7A the latest sowing plot with farm management. Likewise, 1D is the earliest and 7D is the latest sowing plot with nature growing. The remaining names in table from 1A (D) to 7A (D) are consistent one-to-one matches between different sowing plots.
PCA was employed to reduce feature dimension, which kept enough components to explain the 95% variance in this study. The maximum accuracy could be obtained by more than one method. The best methods were statistically analyzed according to the selected frequency (
Figure 4). In
Table 2; the accuracy ranged from 87 to 100%, mostly above 95%.
3.2. Accuracy between Different Farm Managements of FMT and NGT
Figure 5 shows the results of the differentiation between FMT and NGT. The plot covers 196 results with different arrangements. Box-plots for the sample set involved the representatives of different sowing periods and different farm managements of rice growth. In
Figure 4, the horizontal ordinate represents the table results different from that of
Table 2,
Table 3,
Table 4 and
Table 5 above. From the box-plots, the accuracies ranged from 92 to 99%.
The differentiation methods between DER and HER among the different farm managements and sowing dates were analyzed. Among the arrangements, WNN was the most frequently used differentiation method for RFS analysis. Many researchers [
26] have already estimated various differentiation models; moreover, there are different opinions about which model is the most appropriate for DER and HER differentiation. This study used 23 discrimination methods, a very common algorithm at present, to detect all possible combinations, to ensure the effectiveness of the differentiation results obtained. Those combinations not only covered different farm management types, but different sowing periods. For each combination, there will be more than one method to get the highest accuracy, so the selection frequencies of all discrimination methods were counted, as shown in
Figure 6. It is suggested that QSVM, WNN, and LSVM were the top-three highest accuracies for A-DER and D-HER; QSVM, WNN, and FSVM were the top-three for A-HER and D-DER; FSVM, MNN, and QSVM were the top-three for A-HER and A-DER; WNN, FSVM, and LSVM were the top-three for D-HER and D-DER. Generally, WNN and QSVM were the most selective and, therefore, these two methods could be considered the best among traditional models.
3.3. Spectral-Based Deep Learning
Spectral-based deep learning required approximately 3 min to assess these datasets. Compared to deep learning for slide-images, the elapsed time was shorter (the platform was based on a Lenovo P53, CPU Intel Core i5-9400H 2.50 GHz, RAM 32 GB ). If there are special requirements for time, this method is feasible. For images, it will take days to train. Therefore, spectral-based deep learning greatly improves the efficiency of detection.
According to the detection results using LSTM, most accuracies achieved over 95%. The real accuracies ranged from 90 to 100%. The accuracy increased significantly from the beginning of training to the 500 iteration. Then it slowly rose. As for loss value—it decreased gradually with the number of iterations. The smoothed value was less than 0.1. In
Table 6 below, DN is DER in NGT, DF is DER in FMT, HN is HER in NGT, and HF is HER in FMT. The line is 14 kinds of disease samples and the row is 14 kinds of health samples, under two different farm management types.
Each crop (different from industrial products) is different; thus, the diversity of rice planting should be considered in disease detection. In this study, we considered different the planting methods of rice and attempted to use a variety of detection methods. The main purpose was to make the research results closer to the actual production and to have better popularization.
According to the analysis results, it can be found that the accuracies nearly ranged from 92 to 99%. Generally, better results could be obtained, which is irrelevant to the management mode and planting date.
Data have a great impact on the results, especially in rice fields. Moreover, one must consider the impact of data collection methods and data inconsistency caused by different manufacturers. Therefore, in order to ensure the consistency of data, it is necessary to avoid the influences caused by the change of sunlight through radiation correction devices, and avoid the impact caused by the inconsistency of equipment manufacturers and personal use habits.
3.4. Method Validation
Grading of the rice disease index divided RFS into 5 grades, of which, 0 indicated no diseases, 1 was the least, and 5 was the most serious [
27]. The incidence rate of RFS in the unit area was used as the classification standard of the disease grades. The specific description was as follows:
Grade 0: The prevalence rate of RFS was 0;
Grade 1: The prevalence rate of RFS was under 5%(include 5%);
Grade 2: The prevalence rate of RFS was under 10%(include 10%);
Grade 3: The prevalence rate of RFS was under 20%(include 20%);
Grade 4: The prevalence rate of RFS was under 50%(include 50%);
Grade 5: The prevalence rate of RFS was more than 50%.
According to this standard, the disease index was calculated according to the field disease investigation results, the disease index was as follows:
where,
DI is the disease index;
is the number of diseased panicles at all levels (number/plot);
is the representative value at all levels;
N is the total number of panicles investigated (number/plot);
k is the highest representative value. In order to facilitate the calculation, experimental fields were finally divided into several grades according to the size of the disease index.
The disease grade of RFS, as the control group, was obtained through a field investigation. In this study, the RFS with different diseased grades were distinguished and counted according to the rice disease index. The incidence index of each area was calculated (
Table 6). The incidence grade was divided into four grades 1, 3, 5, and 7, according to the disease index. The conditions corresponding to the disease index were
DI < 1, 1 <
DI < 3, 3 <
DI < 5 and
DI > 5. The classification of grades in this study was mainly based on the distribution gradient of the disease index.
According to the detection method above (top-three highest accuracies and LSTM), we compared the results with the actual field investigation data, and the results were basically consistent.
Figure 7 is a prescription map of RFS detection in the experimental field. We not only carried out disease detection, but we also subdivided the disease grade. Deep green indicates less disease, red indicates serious disease, and light green and light red are between the two.
Due to the large number of rice plants, we cannot guarantee that each disease area can be detected. In the process of near earth remote sensing data acquisition, we should make the resolution appropriate. If the data collected by UAV are too close to the ground and the wind force of UAV is too strong, the rice canopy fluctuates too much and better data cannot be obtained. If they are too far away from the rice plant, it is impossible to get a better resolution.
4. Conclusions
This study considered several combinations of rice plant forms, which covered different planting types and management methods. Of those samples, the most convenient method of the submitted algorithms was based on deep convolutional neural networks. As for traditional methods, the important step involves the features extracted; therefore, different statistical and structural features were extracted, combined with widely used supervised classifiers.
The real accuracy was mostly above 95%. Among these models, support machines and nearest neighbors with different kernel functions were generally better. WNN and QSVM were the most frequency selected methods. Moreover, there was no obvious characteristic showing which arrangement was better. Considering so many factors, the differentiations between healthy and diseased rice plants turned out to be more reliable.
According to these methods, the disease prescription map of RFS could be produced, which provides a theoretical basis to take corresponding control measures in the future.
Although this paper has “reference significance” in the detection of RFS, there were still some limitations.
(1) Near earth remote sensing was used to detect and map RFS in this study; early detection is still a challenge, in most cases, this delayed detection may be early enough to reduce further damage with certain measures for RFS; for others, it may be too late to stop the infection for the current growing season.
(2) Rice planting is affected by many factors; only the two most important factors for the incidence of RFS were considered in this study.
(3) This work was carried out in China, with a typical subtropical climate, and involving a certain popularization. If the same experiment is carried out in other rice producing areas, there may be differences.
(4) In addition, compared with the convenience of satellite data, image acquisition with UAV is a technical work, especially in preventing the damage of very expensive hyperspectral equipment. Unlike satellite data, there are some uncertainties in the consistency of near ground remote sensing data.