Next Article in Journal
Influence of Elytral Color Pattern, Size, and Sex of Harmonia axyridis (Coleoptera, Coccinellidae) on Parasite Prevalence and Intensity of Hesperomyces virescens (Ascomycota, Laboulbeniales)
Next Article in Special Issue
Pheromone-Trap Monitoring System for Pea Leaf Weevil, Sitona lineatus: Effects of Trap Type, Lure Type and Trap Placement within Fields
Previous Article in Journal
Honey Bee Survival and Pathogen Prevalence: From the Perspective of Landscape and Exposure to Pesticides
Previous Article in Special Issue
Towards the Development of a More Accurate Monitoring Procedure for Invertebrate Populations, in the Presence of an Unknown Spatial Pattern of Population Distribution in the Field
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Research on Vegetable Pest Warning System Based on Multidimensional Big Data

Mathematics and Informatics College of South China Agricultural University, Guangzhou 510642, China
*
Author to whom correspondence should be addressed.
Insects 2018, 9(2), 66; https://doi.org/10.3390/insects9020066
Submission received: 20 April 2018 / Revised: 10 June 2018 / Accepted: 11 June 2018 / Published: 13 June 2018
(This article belongs to the Special Issue Insect Monitoring and Trapping in Agricultural Systems)

Abstract

:
Pest early warning technology is part of the prerequisite for the timely and effective control of pest outbreaks. Traditional pest warning system with artificial mathematical statistics, radar, and remote sensing has some deficiency in many aspects, such as higher cost, weakness of accuracy, low efficiency, and so on. In this study, Pest image data was collected and information about four major vegetable pests (Bemisia tabaci (Gennadius), Phyllotreta striolata (Fabricius), Plutella xylostella (Linnaeus), and Frankliniella occidentalis (Pergande) (Thysanoptera, Thripidae)) in southern China was extracted. A multi-sensor network system was constructed to collect small-scale environmental data on vegetable production sites. The key factors affecting the distribution of pests were discovered by multi-dimensional information, such as soil, environment, eco-climate, and meteorology of vegetable fields, and finally, the vegetable pest warning system that is based on multidimensional big data (VPWS-MBD) was implemented. Pest and environmental data from Guangzhou Dongsheng Bio-Park were collected from June 2017 to February 2018. The number of pests is classified as level I (0–56), level II (57–131), level III (132–299), and level IV (above 300) by K-Means algorithm. The Pearson correlation coefficient and the grey relational analysis algorithm were used to calculate the five key influence factors of rainfall, soil temperature, air temperature, leaf surface humidity, and soil moisture. Finally, Back Propagation (BP) Neural Network was used for classification prediction. The result shows: I-level warning accuracy was 96.14%, recall rate was 97.56%; II-level pest warning accuracy was 95.34%, the recall rate was 96.45%; III-level pest warning accuracy of 100%, the recall rate was 96.28%; IV-level pest warning accuracy of 100%, recall rate was 100%. It proves that the early warning system can effectively predict vegetable pests and achieve the early warning of vegetable pest’s requirements, with high availability.

1. Introduction

China’s vegetable harvesting area is about 24.68 million hectares, with a total output of 758 million tons; the plantation area and total output account for 42% and 65% of the world respectively. China’s vegetable cultivation area, total output, per capita vegetable consumption, and export volume rank first in the world. The damage that is caused by pests and diseases of vegetables is generally up to 20~30%, and in severe situations up to 50% [1]. In southern China, the climate is mild in winter, humid in the summer, and the cultivation of vegetables takes a long time. It is practically feasible throughout the year. At the same time, this climate also caused vegetable pests to occur more frequently and to a greater degree. Especially the major pests of southern vegetables, such as Bemisia tabaci (Gennadius), Phyllotreta striolata (Fabricius), Plutella xylostella (Linnaeus), and Frankliniella occidentalis (Pergande) (Thysanoptera, Thripidae) are highly susceptible to reproduction [2]. The large number of pesticides that are used to control vegetable pests has caused serious vegetable quality and safety problems [3]. The occurrence and development of vegetable pests are affected by many factors, such as the layout of crops, the migration of pests, the diapause patterns, the microclimate of the farmland, and meteorological conditions [4]. The occurrence and succession of pests have become more complicated due to the changes in global climatic conditions and changes in farming systems in different regions. The irrational use of pesticides has severe impacts on natural enemies of pests, destroys the ecological balance, and causes the resurgence of important pests and the outbreak of secondary pests. Therefore, it is hard to explain the outbreak mechanism of pests and to carry out pest warning [5].
The traditional monitoring and early warning technology of vegetable pests is mainly based on the analysis of pest infestation and outbreak mechanism [6], and speculates the comprehensive technology for the distribution and damage trends of pests in a period of time in the future [7]. These need to apply relevant biological, ecological knowledge and mathematical statistics, system analysis, and other methods [8,9]. It usually requires experts to make predictions through biological and mathematical statistical analysis methods [10]. At present, some interdisciplinary and cross-disciplinary technologies, such as insect radar [11], remote sensing monitoring [12,13], and multispectral development [14,15,16] have been developed, which have promoted the rapid development of monitoring and early warning of vegetable pests. However, these methods require strong professional knowledge for analysis and early warning and have low efficiency, large scale, small scope, and poor real-time performance, and it is difficult to meet the requirements of modern agriculture. With the continued maturation of the Internet of Things and big data technology, information can be collected on the integration of various aspects and obtained unexpected conclusions through multidimensional big data analysis [17,18]. Big data has only just started in pest control. There are many problems in the collection, transmission, analysis, and early warning of field data. It still requires a lot of research [19]. The growth, development, reproduction, and distribution of vegetable pests have important relationships with climatic factors and soil environment [20]. Pests need to have a certain range of survival temperature to adapt to and within the appropriate temperature range, the growth rate of pests will accelerate. Therefore, environmental factors have an important relationship with pest occurrence, and it is feasible to use environmental factors for the early warning of pests.
In summary, this study uses small-scale multi-sensor networks to collect substantial data on pests, soil, environment, ecological climate, and meteorology, and conducts research on pest outbreak mechanisms and early warning systems based on multi-dimensional big data. After comparing the classical algorithms used in the previous construction models, we combined the methods of the correlation coefficient, grey relational analysis, and Back Propagation (BP) neural network, a vegetable pest warning system based on multidimensional big data (VPWS-MBD) was explored. To study the extent of the occurrence of four major pests in southern China vegetables, the framework of the system is described, the process of system construction is explained, and the original algorithm of each sub-module of the system is described. Finally, we have analyzed and evaluated the system performance, summarized the advantages and disadvantages of the system, and proposed further measures for improvement.

2. The Architecture of the VPWS-MBD

The system contains the collection, transmission, storage, analysis, and distribution of vegetable pest data. As shown in Figure 1, it is divided into a sensory layer, transport layer, and application layer. The sensory layer uses the large-scale data collection of pests, soil, environment, ecological climate, and meteorology at a small scale using a multi-sensor network, which changes the traditional data acquisition model, and greatly increases the amount and accuracy of data acquisition. The transport layer is the data transmission layer. With the help of the 4G network, the collected vegetable pest data is sent to a remote server. At the application layer, users can access the VPWS-MBD through personal computers, laptops, and smart phones, use farmland sensor control functions and intuitive data visualization capabilities, and conduct pest warnings.

3. Detailed Design and Implementation

The design flow chart of VPWS-MBD is shown in Figure 2. It mainly includes data preprocessing module, feature selection and extraction module, normalization processing module, training pest early warning model module, and model prediction module. Next, the design details of the five core modules of VPWS-MBD will be introduced in detail. The algorithms involved are all written in the Python programming language.

3.1. Data Preprocessing

3.1.1. Discard Missing Values Using Lagrange Interpolation Polynomial

Discard missing values using Lagrange Interpolation Polynomial. In the process of collecting relevant data through multi-dimensional sensors, data loss may occur due to sensor damage and unstable signal transmission. Loss of data is not conducive to the analysis and the training of data. It is necessary to use certain methods to interpolate some missing values. In this study, a missing value processing method based on Lagrange Interpolation Polynomial is used [21].
Scan the original data one by one and see if there are missing values, and label missing and abnormal values. Depending on mathematical knowledge, we can find a n − 1 degree polynomial for the known n points on the plane.
y = a 0 + a 1 x + a 2 x 2 + + a n 1 x n 1
Substituting the coordinates of the n points (x1, y1), (x2, y2), ..., (xn, yn) into a polynomial and solving the Lagrange interpolation polynomial is:
L ( x )   = i = 0 n p i j = 0 , j i n x x j x i S j
By substituting the point x corresponding to the missing value that has been labeled into the above formula, the approximate value L(x) of the missing value can be obtained.

3.1.2. Classification of Pest Occurrence Based on K-Means Clustering

The division of the occurrence of pests needs to be divided according to the actual conditions in different places. Different pests are inconsistent in their growth and development in different places. It is very difficult to have a uniform standard. The artificial division of the occurrence of pests can be traced to some extent. There is a rough classification, but the description of the margins between the categories is very vague. Using the K-Means clustering algorithm for discretization [22], the occurrence of pests is divided into multiple levels in the form of maximizing the interval between samples, which avoids subjectivity. The K-Means clustering algorithm flowchart is shown in Figure 3.
The standard measure function is:
E = i = 1 k x c k d ( d ( x , m k ) ) 2
In the above formula, ck is the k-th cluster, mk is the centroid of the cluster ck, d(x, mk) is the distance between x and the centroid mk. In the method that is proposed in this study, the Euclidean distance is:
d ( x , m k ) = x , m k = ( x 1 m k 1 ) 2 + ( x 2 m k 2 ) 2 + + ( x n m k n ) 2

3.2. Feature Selection and Extraction

The quality of data and features determines the upper limit of machine learning. Feature selection and extraction can eliminate irrelevant data and redundant data, and can effectively guarantee the efficiency and effectiveness of machine learning. It is an indispensable step in large-scale machine learning. This study selects and extracts the features by correlation coefficient and grey relational analysis.

3.2.1. Pearson Correlation Coefficient

In this study, the Pearson correlation coefficient is first used in the feature selection module in order to calculate the degree of linearity between the data of each dimension and the degree of occurrence of pests, so as to describe the qualitative relationship between the impact factors of pests and the degree of the occurrence of pests. The Pearson correlation coefficient generally analyzes the relationship between two consecutive variables [23], and its calculation formula is as follows:
r = i = 1 n ( x i x ¯ ) ( y i y ¯ ) i = 1 n ( x i x ¯ ) 2 i = 1 n ( y i y ¯ ) 2
The range of the correlation coefficient r is −1 ≤ r ≤ 1. When |r| > 0.5, it is a significant linear correlation.

3.2.2. Grey Relational Calculation

The use of correlation coefficient analysis alone has certain flaws. The linear relationship between a single feature and the result can only be qualitatively analyzed from the characteristics and results. Thus, the quantitative analysis between the two is lacking to some extent. Therefore, this study uses the grey correlation analysis method to carry out the quantitative analysis of the characteristics. The grey correlation analysis method determines whether the connection is tight based on the degree of similarity of the sequence curve geometry. The closer the curves, the greater the degree of correlation between the corresponding sequences, and vice versa, the smaller the degree of correlation, based on the degree of similarity or dissimilarity between the trends of factors, as a measure of the degree of correlation between factors [24].
The set of reference sequences is X0 = {x0(k), k = 1, 2, …, n}, The sequence of comparison is Xi = {x(k), k = 1, 2, …, n}, (i = 1, 2, …, m). The grey relational degree of X0 and Xi is defined as:
r ( x 0 , x i ) = 1 n k 1 n r ( x 0 ( k ) , x i ( k ) )
In the above formula,
r ( x 0 ( k ) , x i ( k ) ) = min i min k | x 0 ( k ) x i ( k ) | + max i max k | x 0 ( k ) x i ( k ) | | x 0 ( k ) x i ( k ) | + max i max k | x 0 ( k ) x i ( k ) |
In the formula, is the resolution coefficient and ∈ (0,1).The grey relational degree r(x0, xi) of all m sequences is ranked from the largest to the smallest, and the correlation order set is obtained. Finally, we can determine the degree of correlation between sequence Xi and X0.

3.2.3. Key Influence Factor Extraction

After Pearson correlation coefficient and grey relational analysis were calculated for each dimension of vegetable pests’ data, the impact factors of pests and the development trend of pests were described qualitatively and quantitatively. According to the correlation degree, influence factors were extracted, and the specific pest impact factors were selected. The evaluation criteria for key factor is:
E k = ( α i M i n ( α i ) M a x ( α i ) M i n ( α i ) + ρ i M i n ( ρ i ) M a x ( ρ i ) M i n ( ρ i ) , k )
In the formula, α i is the Pearson correlation coefficient sequence and ρ i is the gray correlation factor sequence.

3.3. Normalization

Normalization is the basic step of data processing. Because there are different dimensions between different features, the distance between features may vary. If you do not perform the normalization, then it may affect the results of modeling. This study uses a Z-score normalization [25]. After the data is processed, the average value of all feature data is 0, the standard deviation is 1, and its conversion formula is as follows:
s i * = s i s i ¯ σ i
In the formula, s i ¯ is the average value of the feature data and σ is the standard variance of the feature data.

3.4. Training Based on BP Neural Network Model

BP Neural Networks have strong nonlinear mapping capabilities and self-learning capabilities. Generalization and fault tolerance are superior to decision trees, logistic regression, and support vector machines, and their effects are relatively accurate As a kind of multi-layer feedforward network trained according to the error back-propagation algorithm, it provides a very robust method for building highly linear partitioned model functions [26].
The BP neural network algorithm is to find the minimum of the error function. Through repeated training of multiple groups of samples, the gradient descent method is used to make the weight change along the negative gradient of the error function, and finally converge to the minimum point [27]. The error function is the following formula:
E = 1 2 k = 1 n ( y k d k ) 2
In the formula: n is the learning sample point, yk is the actual output, dk is the ideal output.
The main content of the BP neural network algorithm is to calculate the gradient information of weight between neurons by sample’s learning errors.
δ k = ( y k d k ) f ( n e t j k )
In the formula: δ k is the gradient information of the k sample learning error function for neuron k output and n e t j k represents the input of node j when the k-th sample is input.
The occurrence and growth of vegetable pests are complex and affected by multidimensional and complex factors, making BP neural network a great advantage in the classification and warning of vegetable pests. The network model is usually divided into three layers: input layer, hidden layer, and output layer. The network topology is shown in Figure 4.
As shown in Figure 3, xi is a set of feature vectors, which is obtained from the feature selection and extraction mentioned above, yi is the level of pest occurrence, obtained by the K-means clustering algorithm in the data preprocessing mentioned above. In the BP neural network model, 134 parameters need to be determined, including 65 weights from the input layer to the hidden layer, 52 weights from the hidden layer to the output layer, 13 hidden layer thresholds, and four output layer thresholds.
Before the BP neural network algorithm starts running, the initial learning times are set to 0 and the upper limit of the learning times is set, and the thresholds and weights are randomly set. Generally, the decimal within the closed interval between −1 and 1 is taken. The input layer accepts the feature set data and performs forward propagation to obtain the parameters of the output layer and the hidden layer. Compare the output layer data calculated by forwarding propagation with the data of the real output layer to obtain the error value between them. The error function is used to determine whether the error is less than the upper limit of error. If the error determined is less than the upper limit of error, then the weight and the threshold of each neuron can be updated by reverse propagation. After updating the thresholds and weights, repeat the above steps again and check whether the number of learning reaches the preset upper limit. If the number of times of learning has reached the preset upper limit, learning stops. When the number of learning has reached the preset upper limit value, the model training is completed.

4. Experimental Testing and Analysis

4.1. Experimental Data Acquisition Environment Deployment

The test base of this study is in the vegetable garden of Nanshan Base of Dongsheng Farm Co., Ltd. in Guangzhou. The park is managed by professionals and the cultivation is scientific and reasonable. We choose the organic planting of vegetables, not spraying pesticides, as far as possible to ensure the planting environment close to the natural planting environment and reduce the occurrence of pests by human factors. This study uses a data acquisition system that was manufactured by CAIPOS, Inc. in Graz, Austria, which integrates a soil moisture sensor, a soil temperature, an air temperature and humidity sensor, a leaf wetness sensor, and a rain gauge. Specific manufacturer information of all the sensors refers to Table 1.
Two solar stations were installed in the vegetable garden to provide wireless sensors with AC power. The sensors were installed around two solar panels. The signal collection station was installed above the vegetable garden to avoid obstacles. The signal collection station acquires sensor information through both wired and wireless methods, and it sends the information to the server. Field trials are shown in Figure 5.
In order to automatically collect data on vegetable pests in agricultural land for a long period of time, pest trapping and monitoring equipment for vegetables were designed in this experiment. As shown in Figure 6, Bemisia tabaci (Gennadius), Phyllotreta striolata (Fabricius), Plutella xylostella (Linnaeus), and Frankliniella occidentalis (Pergande) (Thysanoptera, Thripidae) were collected through yellow traps. We waited for several types of major vegetable pests to take photos of these yellow sticky boards and use the number of pests on the day minus the number of pests on the previous day to calculate the number of new pests on the day. The device includes a solar power supply device, a trapping device, and a monitoring device, and it develops a control program. After the program is started, the program automatically captures the image of the pest according to preset parameters and sends it to the remote server via the 4G network. At the same time, the remote command is obtained, and executes it.
From June 2017 to February 2018, a total of 2536 environmental and pest data were collected, of which 136 were processed for missing values. The collected environmental data combined with the image data of the pest can be obtained, as shown in Table 2.

4.2. Evaluation of Pest Levels Based on K-Means Clustering

In this experiment, in order to more objectively obtain the level of vegetable pest warning, K-means clustering was used to cluster the number of pests that were collected. The calculated results are shown in Figure 7.
As can be seen from Figure 6, the degree of occurrence of pests can be clearly divided into four levels. The number of pests is classified as level I in 0–56, the number of pests is classified as level II in 57–131, and the number of pests is classified as level III in 132–299. Grades and pests are divided into level IV above 300.

4.3. The Effective Calibration of Multi-Dimensional Key Factors

The results that were obtained by calculating the impact factors of pests collected by the Pearson correlation coefficient and Grey relational analysis are shown in Table 3.
Using the southern China vegetable pest optimization feature selection algorithm to evaluate each influencing factor, the results are shown in Table 4.
It is important to select the influencing factors with high correlation to train the prediction model. From Table 3, the impact factor of air humidity is only 0.08. If the feature with too low impact factors is used, prediction model training will be inaccurate. As we expected, the impact factors of leaf surface humidity, soil temperature, air temperature, and rainfall were between 1.11–1.94, which was highly related to pest occurrence. The impact factor of soil moisture was 0.72, which was a little low, but it also meets the model training requirements. Therefore, the feature set of the pest prediction model finally selected the leaf surface humidity, soil temperature, air temperature, rainfall, and soil moisture.

4.4. BP Neural Network Training

It was observed that the number of neuron hidden layers is related to the number of input neurons and the number of output neurons. The number of hidden layers is usually set as the sum of the number of input layers and output layers plus 0 to 10 neurons. According to this experience, this study will set the number of neurons in the hidden layer to 13, put the data selected by the features into the BP neural network model, iterative learning 10,000 times, and finally, get the vegetable pest warning model.

4.5. Verification of Early Warning Effect of VPWS-MBD

In this study, 30% of the multidimensional dataset is used for model verification, and the rainfall, soil temperature, air temperature, carbon dioxide concentration, and leaf wetness are selected as the input tuples. The obtained results can be represented by a confusion matrix, as shown in Figure 8.
Each column of the confusion matrix represents a vegetable pest prediction category, the total number of each column indicates the number of data to be predicted to be the category, each row represents the true attribution category of the data, and the total number of data in each row indicates the number of data instances in the category. According to the confusion matrix diagram, the predicted results are compared with the actual results, and the results obtained are shown in Table 5.
It can be seen from Table 4 that the accuracy of early warning models for pests at level I, level II, level III, and level IV is 96%, 95%, 100%, and 100%, respectively, indicating that different pest occurrences are predicted. The accuracy of the early warning is high. At the same time, from the F1 measurement point of view, the early warning model of pests in south China has a good evaluation of the forecasting and forecasting of the degree of occurrence of pests, especially for severe pests, the early warning model will basically not be wrong. The main errors of the analysis and early warning model come from two aspects. The first aspect is that the algorithm has a certain difference between the missing value of the interpolation value and the actual value; in the second aspect, the number of pests is identified by artificial eyes, and there is a certain amount of artificial error.

5. Conclusions

Aiming at the disadvantages of large-scale, low-precision, and time-ineffectiveness in vegetable pest warning, a vegetable pest warning system that is based on multi-dimensional big data was designed. A large-scale multi-sensor network was used to collect large data on pests, soil, environment, ecological climate, and weather, and the images of vegetable pests were automatically collected and classified and counted for vegetable pests. A series of algorithms, such as BP neural network machine learning, were used. The training and learning of the model eventually achieved VPWS-MBD. The experimental results show that VPWS-MBD has high accuracy and stability, which is conducive to the establishment of scientific and effective prevention and control plans for pests and diseases, scientific guidance of agricultural production, and the improvement of the quality and yield of vegetable pests. All of the data used in this study comes from the real environment of the southern China farmland. The actual environmental data ensures that the prediction results are closer to the actual situation on the ground. At the same time, the characteristics of training can be adjusted according to different data, making the system’s architecture more flexible. The BP neural network has a strong fault tolerance capability. Under the condition of sufficient data, it can obtain excellent prediction results. The system can be further expanded and it can be used to predict and warn any particular pest in the future.

Author Contributions

C.Z. and J.C. designed the research, C.Z. and D.X. performed the analysis and drafted the paper. Y.Y. and M.C. contributed to the interpretation of the results and polished the English.

Funding

This work was supported by the National Spark Program (2015GA780002) and Guangdong Province Science and Technology Program (2016B010110005).

Acknowledgments

The authors would like to thank M.C. for his help with correcting grammatical errors.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Lei, Z.R. Biological Control of Diseases and Insect Pests is Valid Method to Ensure Vegetable Safe Producing. Sci. Agric. Sin. 2016, 15, 2932–2934. [Google Scholar]
  2. Zhang, Y.J.; Wu, Q.G.; Wang, S.L.; Chu, D.; Xie, W. Current status and prospects of the important insect pests on the vegetable crops in China. Plant Prot. 2013, 39, 38–45. [Google Scholar]
  3. Zhang, F.; Zhang, M.J.; Luo, C.; Wang, G. Overview of Biological Control Techniques of Vegetable Pests. China Veg. 2011, 1, 23–24. [Google Scholar]
  4. Obopile, M.; Munthali, D.C.; Matilo, B. Farmers’ knowledge, perceptions and management of vegetable pests and diseases in Botswana. Crop Prot. 2008, 27, 1220–1224. [Google Scholar] [CrossRef]
  5. Zhang, Y.H.; Cheng, D.H. Progress in monitoring and forecasting of insect pests in China. Plant Prot. 2013, 39, 55–61. [Google Scholar]
  6. Li, K.B.; Wang, H.X.; Hu, W.X. Route of seasonal migration of the oriental armyworm moth in the eastern part of China as indicated by a three year result of releasing and recapturing of marked moths. J. Plant Prot. 1964, 3, 101–110. [Google Scholar]
  7. Li, K.B. The Survey and Main Progress of Mythimna Separatea in China. Plant Prot. 1993, 19, 2–4. [Google Scholar]
  8. Zhang, Y.; Lin, C.; Cheng, D.; Zhang, Y.; Jiang, Y.; Jiang, J. Radar observation and population analysis on the migration of the clover. Acta Entomol. Sin. 2007, 50, 494–500. [Google Scholar]
  9. Si, S.Y.; Zhou, L.L.; Wang, S.L.; Jiang, X.F.; Xu, Z.F.; Mu, W.; Wang, D.S.; Wang, X.P.; Chen, H.T.; Yang, Y.H.; et al. Progress in research on prevention and control of beet armyworm Spodoptera exigua in China. Chin. J. Appl. Entomol. 2012, 49, 1432–1438. [Google Scholar]
  10. Jiang, X.F.; Chen, J.; Zhang, L.; Sappington, T.W.; Luo, L.Z. Increased long-flight activity triggered in beet armyworm by larval feeding on diet containing Cry1Ac protoxin. PLoS ONE 2013, 8, e63554. [Google Scholar] [CrossRef] [PubMed]
  11. Chapman, J.W.; Drake, V.A.; Reynolds, D.R. Recent insights from radar studies of insect flight. Ann. Rev. Entomol. 2011, 56, 337–356. [Google Scholar] [CrossRef] [PubMed]
  12. Hu, G.; Lim, K.S.; Reynolds, D.R.; Reynolds, A.M.; Chapman, J.W. Wind-Related Orientation Patterns in Diurnal, Crepuscular and Nocturnal High-Altitude Insect Migrants. Front. Behav. Neurosci. 2016, 10, 32. [Google Scholar] [CrossRef] [PubMed]
  13. Qi, H.H.; Zhang, Y.H.; Cheng, D.F.; Han, E.B.; Sun, J.R. Radar observation and trajectory analysis on the autumn return migration of the brown planthopper, Nilaparvata lugens (Homoptera: Delphacidae) in 2009 in China. Acta Entomol. Sin. 2010, 53, 1256–1264. [Google Scholar]
  14. Cao, X.; Luo, Y.; Zhou, Y.; Duan, X.; Cheng, D. Detection of powdery mildew in two winter wheat cultivars using canopy hyperspectral reflectance. Crop Prot. 2013, 45, 124–131. [Google Scholar] [CrossRef]
  15. Lei, F.; Gao, J.; He, Y.; Liu, F. Research Development of Spectral Imaging Technology in Early Detection of Botanical Diseases. Trans. Chin. Soc. Agric. 2013, 44, 169–176. [Google Scholar]
  16. Machado, M.L.; Pinto, F.A.C.; Queiroz, D.M.D.; Paula Júnior, T.J.D.; Vieira, R.F.; Machado, M.L.; Pinto, F.A.C.; Queiroz, D.M.D.; Paula Júnior, T.J.D.; Vieira, R.F. Estimative of white mold severity in common bean crops using hyper and multispectral sensors. Rev. Bras. Eng. Agríc. Ambient 2015, 19, 426–432. [Google Scholar] [CrossRef]
  17. Kramer, A.D.I.; Guillory, J.E.; Hancock, J.T. Experimental evidence of massive-scale emotional contagion through social networks. Proc. Natl. Acad. Sci. USA 2014, 111, 8788. [Google Scholar] [CrossRef] [PubMed]
  18. Network, T.C.G.A.; Weinstein, J.N.; Collisson, E.A.; Mills, G.B.; Shaw, K.R.M.; Ozenberger, B.A.; Ellrott, K.; Shmulevich, I.; Sander, C.; Stuart, J.M. The Cancer Genome Atlas Pan-Cancer analysis project. Nat. Genet. 2013, 45, 1113–1120. [Google Scholar] [Green Version]
  19. Sun, X.; Liu, Z.; Sun, T.; Chen, W. The Application and Development of Big Data in Agricultural Research. China Veg. 2015, 1, 1–5. [Google Scholar]
  20. Theunissen, J. Intercropping in field vegetable crops: Pest management by agrosystem diversification—An overview. Pest Manag. Sci. 2010, 42, 65–68. [Google Scholar] [CrossRef]
  21. Li, B.; Zhang, H.; Lu, H. User mobility prediction based on Lagrange’s interpolation in ultra-dense networks. In Proceedings of the 2016 IEEE 27th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC), Valencia, Spain, 4–8 September 2016; pp. 1–6. [Google Scholar] [CrossRef]
  22. Juan, H.; Liu, X. Dissolved oxygen prediction in water based on K-means clustering and ELM neural network for aquaculture. Trans. Chin. Soc. Agric. Eng. 2016, 32, 174–181. [Google Scholar]
  23. Egan, H.; Kirk, R.S.; Sawyer, R. Pearson’s Chemical Analysis of Foods; Middlesex: London, UK, 1991. [Google Scholar]
  24. Tsai, M.S.; Hsu, F.Y. Application of Grey Correlation Analysis in Evolutionary Programming for Distribution System Feeder Reconfiguration. IEEE Trans. Power Syst. 2010, 25, 1126–1133. [Google Scholar] [CrossRef]
  25. Jain, A.; Nandakumar, K.; Ross, A. Score normalization in multimodal biometric systems. Pattern Recognit. 2013, 38, 2270–2285. [Google Scholar] [CrossRef]
  26. Wang, Y.; Gang, W.; Feng, S. The Dynamic Model Prediction Study of the Forest Disease, Insect Pest and Rat Based on BP Neural Networks. J. Agric. Sci. 2012, 4. [Google Scholar] [CrossRef]
  27. Yu, S.; Zhu, K.; Diao, F. A dynamic all parameters adaptive BP neural networks model and its application on oil reservoir prediction. Appl. Math. Comput. 2008, 195, 66–75. [Google Scholar] [CrossRef]
Figure 1. Structure of vegetable pest warning system that is based on multidimensional big data (VPWS-MBD).
Figure 1. Structure of vegetable pest warning system that is based on multidimensional big data (VPWS-MBD).
Insects 09 00066 g001
Figure 2. VPWS-MBD design flow chart.
Figure 2. VPWS-MBD design flow chart.
Insects 09 00066 g002
Figure 3. Flow chart of the K-Means clustering algorithm.
Figure 3. Flow chart of the K-Means clustering algorithm.
Insects 09 00066 g003
Figure 4. Structure diagram of Back Propagation (BP) neural network.
Figure 4. Structure diagram of Back Propagation (BP) neural network.
Insects 09 00066 g004
Figure 5. Sensor devices of Caipos.
Figure 5. Sensor devices of Caipos.
Insects 09 00066 g005
Figure 6. Pests image acquisition equipment.
Figure 6. Pests image acquisition equipment.
Insects 09 00066 g006
Figure 7. The Division of Pests in Vegetables in South China.
Figure 7. The Division of Pests in Vegetables in South China.
Insects 09 00066 g007
Figure 8. Confusion matrix diagram of warning results.
Figure 8. Confusion matrix diagram of warning results.
Insects 09 00066 g008
Table 1. Caipos Sensors.
Table 1. Caipos Sensors.
ModelMeasurement RangeAccuracy
Soil MoistureC200A0% VWC to saturation2–5% VWC
Soil TemperatureST−0 °C + 70 °C±1 °C%
Air Temperature & Relative HumidityCaipoRHTRel.Hum.: 0.100% Temp.: −5 °C + 60 °CRel.Hum.: ±2.5%Temp.: <±0.4 °C
Leaf WetnessCaipoLWS0.100%3%
Rain GaugeCaipos Rain Gauge0.2mm±1%
Table 2. Pest monitoring data.
Table 2. Pest monitoring data.
DatePrecipitationSoil Temperature [°C]Air Temperature [°C]Soil Moisture [%]Leaf Wetness [%]Air Humidity [%]Pest Level
2017/11/2024.4521.71143.886.52581
2017/11/3024.3220.58279.299.67663
2017/11/4024.3621.94210.1782
2017/11/5024.6423.33143.528.04812
2017/11/6024.824.04193.2111.21790
2017/11/7024.9722.95250.529.29811
2017/11/8023.981820.623.6762
2017/11/90.525.0825.4141.5411.58892
Table 3. Pearson Correlation Coefficient and Grey relational analysis for Southern China Vegetable Pest Data Set Computation Results.
Table 3. Pearson Correlation Coefficient and Grey relational analysis for Southern China Vegetable Pest Data Set Computation Results.
Characteristic FactorsCorrelation CoefficientCorrelation
Rainfall0.2833270.7125
Air humidity0.1618550.621
Air temperature0.4045260.7011
Soil temperature0.4659470.6903
Soil moisture0.3927910.6218
leaf wetness0.4735990.75399
Table 4. Evaluation Results of Affecting Factors on Vegetable Pests in the South.
Table 4. Evaluation Results of Affecting Factors on Vegetable Pests in the South.
Influencing FactorsImpact Factor
Rainfall1.11
Air humidity0.08
Soil temperature1.54
Air temperature1.37
Soil moisture0.72
leaf wetness1.94
Table 5. Analysis of Early Warning Results of VPWS-MBD.
Table 5. Analysis of Early Warning Results of VPWS-MBD.
IIIIIIIV
True positive (TP)36720614320
False Negative (FN)11860
False positive (FP)141100
True Nagative (TN)369536612741
Precision (p)0.960.9511
Recall (r)0.970.960.961
F1 value0.960.950.981

Share and Cite

MDPI and ACS Style

Zhang, C.; Cai, J.; Xiao, D.; Ye, Y.; Chehelamirani, M. Research on Vegetable Pest Warning System Based on Multidimensional Big Data. Insects 2018, 9, 66. https://doi.org/10.3390/insects9020066

AMA Style

Zhang C, Cai J, Xiao D, Ye Y, Chehelamirani M. Research on Vegetable Pest Warning System Based on Multidimensional Big Data. Insects. 2018; 9(2):66. https://doi.org/10.3390/insects9020066

Chicago/Turabian Style

Zhang, Changzhen, Jiahao Cai, Deqin Xiao, Yaowen Ye, and Mohammad Chehelamirani. 2018. "Research on Vegetable Pest Warning System Based on Multidimensional Big Data" Insects 9, no. 2: 66. https://doi.org/10.3390/insects9020066

APA Style

Zhang, C., Cai, J., Xiao, D., Ye, Y., & Chehelamirani, M. (2018). Research on Vegetable Pest Warning System Based on Multidimensional Big Data. Insects, 9(2), 66. https://doi.org/10.3390/insects9020066

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop