applsci-logo

Journal Browser

Journal Browser

Applications of Machine Learning in Agriculture

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Agricultural Science and Technology".

Deadline for manuscript submissions: closed (31 May 2024) | Viewed by 23423

Special Issue Editor


E-Mail Website
Guest Editor
College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou 310058, China
Interests: food safety; food quality; food authenticity; hyperspectral imaging; NIR; machine learning
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

The development of agriculture is changing to the direction of automation, digitalization and intelligence worldwide. Rapid and accurate data processing methods are key for agriculture to enter a new stage. Machine learning is the core of artificial intelligence, and opens up new possibilities for solving, analyzing and understanding intensive agricultural data.

Machine learning based on multiple types of algorithms, including supervised learning, semi-supervised learning, unsupervised learning and reinforcement learning models, can be carried out to mine and learn sensor data. Machine learning can be applied to the whole agricultural industry chain to optimize the industrial structure and improve industrial efficiency. Machine learning in agriculture can provide more solutions for maintaining the world population, coping with climate change and achieving sustainable development.

This Special Issue will publish high-quality and original research papers in the fields of agrometeorological monitoring, farmland element identification, soil attribute measurement, plant phenotyping, pest and disease diagnosis, agricultural product quality detection, crop yield prediction, crop breeding, etc. This Special Issue encourages agricultural scientists to develop more accurate and real-time machine learning methods in the above and broader agricultural fields.

Prof. Dr. Zhengjun Qiu
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • artificial intelligence
  • deep learning
  • neural network
  • big data
  • data mining
  • image processing
  • spectral analysis

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • e-Book format: Special Issues with more than 10 articles can be published as dedicated e-books, ensuring wide and rapid dissemination.

Further information on MDPI's Special Issue polices can be found here.

Published Papers (11 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

17 pages, 6488 KiB  
Article
A Novel Method for Wheat Spike Phenotyping Based on Instance Segmentation and Classification
by Ziang Niu, Ning Liang, Yiyin He, Chengjia Xu, Sashuang Sun, Zhenjiang Zhou and Zhengjun Qiu
Appl. Sci. 2024, 14(14), 6031; https://doi.org/10.3390/app14146031 - 10 Jul 2024
Cited by 1 | Viewed by 865
Abstract
The phenotypic analysis of wheat spikes plays an important role in wheat growth management, plant breeding, and yield estimation. However, the dense and tight arrangement of spikelets and grains on the spikes makes the phenotyping more challenging. This study proposed a rapid and [...] Read more.
The phenotypic analysis of wheat spikes plays an important role in wheat growth management, plant breeding, and yield estimation. However, the dense and tight arrangement of spikelets and grains on the spikes makes the phenotyping more challenging. This study proposed a rapid and accurate image-based method for in-field wheat spike phenotyping consisting of three steps: wheat spikelet segmentation, grain number classification, and total grain number counting. Wheat samples ranging from the early filling period to the mature period were involved in the study, including three varieties: Zhengmai 618, Yannong 19, and Sumai 8. In the first step, the in-field collected images of wheat spikes were optimized by perspective transformation, augmentation, and size reduction. The YOLOv8-seg instance segmentation model was used to segment spikelets from wheat spike images. In the second step, the number of grains in each spikelet was classified by a machine learning model like the Support Vector Machine (SVM) model, utilizing 52 image features extracted for each spikelet, involving shape, color, and texture features as the input. Finally, the total number of grains on each wheat spike was counted by adding the number of grains in the corresponding spikelets. The results showed that the YOLOv8-seg model achieved excellent segmentation performance, with an average precision (AP) @[0.50:0.95] and accuracy (A) of 0.858 and 100%. Meanwhile, the SVM model had good classification performance for the number of grains in spikelets, and the accuracy, precision, recall, and F1 score reached 0.855, 0.860, 0.865, and 0.863, respectively. Mean absolute error (MAE) and mean absolute percentage error (MAPE) were as low as 1.04 and 5% when counting the total number of grains in the frontal view wheat spike images. The proposed method meets the practical application requirements of obtaining trait parameters of wheat spikes and contributes to intelligent and non-destructive spike phenotyping. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

13 pages, 2617 KiB  
Article
Citrus Pest Identification Model Based on Improved ShuffleNet
by Yan-Nan Yu, Chun-Lin Xiong, Ji-Chi Yan, Yong-Bin Mo, Shi-Qing Dou, Zuo-Hua Wu and Rong-Feng Yang
Appl. Sci. 2024, 14(11), 4437; https://doi.org/10.3390/app14114437 - 23 May 2024
Cited by 1 | Viewed by 786
Abstract
To address the current issues of complex structures and low accuracies in citrus pest identification models, a lightweight pest identification model was proposed. First, a parameterized linear rectification function was introduced to avoid neuronal death. Second, the model’s attention to pest characteristics was [...] Read more.
To address the current issues of complex structures and low accuracies in citrus pest identification models, a lightweight pest identification model was proposed. First, a parameterized linear rectification function was introduced to avoid neuronal death. Second, the model’s attention to pest characteristics was improved by incorporating an improved mixed attention mechanism. Subsequently, the network structure of the original model was adjusted to reduce architectural complexity. Finally, by employing transfer learning, an SCHNet model was developed. The experimental results indicated that the proposed model achieved an accuracy rate of 94.48% with a compact size of 3.84 MB. Compared to the original ShuffleNet V2 network, the SCHNet model showed a 3.12% accuracy improvement while reducing the model size by 22.7%. The SCHNet model exhibited an excellent classification performance for citrus pest identification, enabling the accurate identification of citrus pests. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

18 pages, 2531 KiB  
Article
Research on Evaluation Methods of Black Soil Farmland Productivity Based on Field Block Scale
by Zihao Zhu and Yonghua Xie
Appl. Sci. 2024, 14(7), 3130; https://doi.org/10.3390/app14073130 - 8 Apr 2024
Viewed by 917
Abstract
Black soil plays an important role in maintaining a healthy ecosystem, promoting high-yield and efficient agricultural production, and conserving soil resources. In this paper, a typical black soil area of Keshan Farm in Qiqihar City, Heilongjiang Province, China, is used as a case [...] Read more.
Black soil plays an important role in maintaining a healthy ecosystem, promoting high-yield and efficient agricultural production, and conserving soil resources. In this paper, a typical black soil area of Keshan Farm in Qiqihar City, Heilongjiang Province, China, is used as a case study to investigate the black soil farmland productivity evaluation model. Based on the analysis of the composite index (CI) model, productivity index (PI) model and various machine learning models, the soil productivity evaluation method was improved and a prediction model was established. The results showed that the support vector machine regression model based on simulated annealing algorithm (SA-SVR), as well as the Gaussian process regression model (GPR), had obvious advantages in data preprocessing, feature selection, and model optimization compared to the modified composite index model (MCI), the modified productivity index model (MPI), and the coefficients of determination (R2) of their modelling, which were up to 0.70 and 0.71, respectively, and these machine learning prediction models can reflect the effects on maize cultivation and its yield through soil parameters even with small datasets, which can better capture the nonlinear relationship and improve the accuracy and stability of yield prediction, and is an effective method for guiding agricultural production as well as soil productivity evaluation. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

13 pages, 2149 KiB  
Article
Non-Destructive Detection of Golden Passion Fruit Quality Based on Dielectric Characteristics
by Fan Lin, Dengjie Chen, Cheng Liu and Jincheng He
Appl. Sci. 2024, 14(5), 2200; https://doi.org/10.3390/app14052200 - 6 Mar 2024
Cited by 2 | Viewed by 1053
Abstract
This study pioneered a non-destructive testing approach to evaluating the physicochemical properties of golden passion fruit by developing a platform to analyze the fruit’s electrical characteristics. By using dielectric properties, the method accurately predicted the soluble solids content (SSC), Acidity and [...] Read more.
This study pioneered a non-destructive testing approach to evaluating the physicochemical properties of golden passion fruit by developing a platform to analyze the fruit’s electrical characteristics. By using dielectric properties, the method accurately predicted the soluble solids content (SSC), Acidity and pulp percentage (PP) in passion fruit. The investigation entailed measuring the relative dielectric constant (ε′) and dielectric loss factor (ε″) for 192 samples across a spectrum of 34 frequencies from 0.05 to 100 kHz. The analysis revealed that with increasing frequency and fruit maturity, both ε′ and ε″ showed a declining trend. Moreover, there was a discernible correlation between the fruit’s physicochemical indicators and dielectric properties. In refining the dataset, 12 outliers were removed using the Local Outlier Factor (LOF) algorithm. The study employed various advanced feature extraction techniques, including Recursive Feature Elimination with Cross-Validation (RFECV), Permutation Importance based on Random Forest Regression (PI-RF), Permutation Importance based on Linear Regression (PI-LR) and Genetic Algorithm (GA). All the variables and the selected variables after screening were used as inputs to build Extreme Gradient Boosting (XGBoost) and Categorical Boosting (Cat-Boost) models to predict the SSC, Acidity and PP in passion fruit. The results indicate that the PI-RF-XGBoost model demonstrated superior performance in predicting both the SSC (R2 = 0.9240, RMSE = 0.2595) and the PP (R2 = 0.9092, RMSE = 0.0014) of passion fruit. Meanwhile, the GA-CatBoost model exhibited the best performance in predicting Acidity (R2 = 0.9471, RMSE = 0.1237). In addition, for the well-performing algorithms, the selected features are mainly concentrated within the frequency range of 0.05–6 kHz, which is consistent with the frequency range highly correlated with the dielectric properties and quality indicators. It is feasible to predict the quality indicators of fruit by detecting their low-frequency dielectric properties. This research offers significant insights and a valuable reference for non-destructive testing methods in assessing the quality of golden passion fruit. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

24 pages, 6254 KiB  
Article
Long Short-Term Memory Recurrent Neural Network and Extreme Gradient Boosting Algorithms Applied in a Greenhouse’s Internal Temperature Prediction
by Juan M. Esparza-Gómez, Luis F. Luque-Vega, Héctor A. Guerrero-Osuna, Rocío Carrasco-Navarro, Fabián García-Vázquez, Marcela E. Mata-Romero, Carlos Alberto Olvera-Olvera, Miriam A. Carlos-Mancilla and Luis Octavio Solís-Sánchez
Appl. Sci. 2023, 13(22), 12341; https://doi.org/10.3390/app132212341 - 15 Nov 2023
Cited by 8 | Viewed by 1919
Abstract
One of the main challenges agricultural greenhouses face is accurately predicting environmental conditions to ensure optimal crop growth. However, the current prediction methods have limitations in handling large volumes of dynamic and nonlinear temporal data, which makes it difficult to make accurate early [...] Read more.
One of the main challenges agricultural greenhouses face is accurately predicting environmental conditions to ensure optimal crop growth. However, the current prediction methods have limitations in handling large volumes of dynamic and nonlinear temporal data, which makes it difficult to make accurate early predictions. This paper aims to forecast a greenhouse’s internal temperature up to one hour in advance using supervised learning tools like Extreme Gradient Boosting (XGBoost) and Recurrent Neural Networks combined with Long-Short Term Memory (LSTM-RNN). The study uses the many-to-one configuration, with a sequence of three input elements and one output element. Significant improvements in the R2, RMSE, MAE, and MAPE metrics are observed by considering various combinations. In addition, Bayesian optimization is employed to find the best hyperparameters for each algorithm. The research uses a database of internal data such as temperature, humidity, and dew point and external data such as temperature, humidity, and solar radiation, splitting the data into the year’s four seasons and performing eight experiments according to the two algorithms and each season. The LSTM-RNN model produces the best results for the metrics in summer, achieving an R2 = 0.9994, RMSE = 0.2698, MAE = 0.1449, and MAPE = 0.0041, meeting the acceptability criterion of ±2 °C hysteresis. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

16 pages, 1788 KiB  
Article
Machine Learning Classification–Regression Schemes for Desert Locust Presence Prediction in Western Africa
by L. Cornejo-Bueno, J. Pérez-Aracil, C. Casanova-Mateo, J. Sanz-Justo and S. Salcedo-Sanz
Appl. Sci. 2023, 13(14), 8266; https://doi.org/10.3390/app13148266 - 17 Jul 2023
Cited by 4 | Viewed by 1145
Abstract
For decades, humans have been confronted with numerous pest species, with the desert locust being one of the most damaging and having the greatest socio-economic impact. Trying to predict the occurrence of such pests is often complicated by the small number of records [...] Read more.
For decades, humans have been confronted with numerous pest species, with the desert locust being one of the most damaging and having the greatest socio-economic impact. Trying to predict the occurrence of such pests is often complicated by the small number of records and observations in databases. This paper proposes a methodology based on a combination of classification and regression techniques to address not only the problem of locust sightings prediction, but also the number of locust individuals that may be expected. For this purpose, we apply different machine learning (ML) and related techniques, such as linear regression, Support Vector Machines, decision trees, random forests and neural networks. The considered ML algorithms are evaluated in three different scenarios in Western Africa, mainly Mauritania, and for the elaboration of the forecasting process, a number of meteorological variables obtained from the ERA5 reanalysis data are used as input variables for the classification–regression machines. The results obtained show good performance in terms of classification (appearance or not of desert locust), and acceptable regression results in terms of predicting the number of locusts, a harder problem due to the small number of samples available. We observed that the RF algorithm exhibited exceptional performance in the classification task (presence/absence) and achieved noteworthy results in regression (number of sightings), being the most effective machine learning algorithm among those used. It achieved classification results, in terms of F-score, around the value of 0.9 for the proposed Scenario 1. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

16 pages, 3041 KiB  
Article
Image-Based Arabian Camel Breed Classification Using Transfer Learning on CNNs
by Sultan Alfarhood, Atheer Alrayeh, Mejdl Safran, Meshal Alfarhood and Dunren Che
Appl. Sci. 2023, 13(14), 8192; https://doi.org/10.3390/app13148192 - 14 Jul 2023
Cited by 5 | Viewed by 3070
Abstract
Image-based Arabian camel breed classification is an important task for various practical applications, such as breeding management, genetic improvement, conservation, and traceability. However, it is a challenging task due to the lack of standardized criteria and methods, the high similarity among breeds, and [...] Read more.
Image-based Arabian camel breed classification is an important task for various practical applications, such as breeding management, genetic improvement, conservation, and traceability. However, it is a challenging task due to the lack of standardized criteria and methods, the high similarity among breeds, and the limited availability of data and resources. In this paper, we propose an approach to tackle this challenge by using convolutional neural networks (CNNs) and transfer learning to classify images of six different Arabian camel breeds: Waddeh, Majaheem, Homor, Sofor, Shaele, and Shageh. To achieve this, we created, preprocessed, and annotated a novel dataset of 1073 camel images. We then pre-trained CNNs as feature extractors and fine-tuned them on our new dataset. We evaluated several popular CNN architectures with diverse characteristics such as InceptionV3, NASNetLarge, PNASNet-5-Large, MobileNetV3-Large, and EfficientNetV2 (small, medium, and large variants), and we found that NASNetLarge achieves the best test accuracy of 85.80% on our proposed dataset. Finally, we integrated the best-performing CNN architecture, NASNetLarge, into a mobile application for further validation and actual use in a real-world scenarios. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

20 pages, 7327 KiB  
Article
Research on an Improved Non-Destructive Detection Method for the Soluble Solids Content in Bunch-Harvested Grapes Based on Deep Learning and Hyperspectral Imaging
by Junhong Zhao, Qixiao Hu, Bin Li, Yuming Xie, Huazhong Lu and Sai Xu
Appl. Sci. 2023, 13(11), 6776; https://doi.org/10.3390/app13116776 - 2 Jun 2023
Cited by 4 | Viewed by 1751
Abstract
The soluble solids content (SSC) is one of the important evaluation indicators for the internal quality of fresh grapes. However, the current non-destructive detection method based on hyperspectral imaging (HSI) relies on manual operation and is relatively cumbersome, making it difficult to achieve [...] Read more.
The soluble solids content (SSC) is one of the important evaluation indicators for the internal quality of fresh grapes. However, the current non-destructive detection method based on hyperspectral imaging (HSI) relies on manual operation and is relatively cumbersome, making it difficult to achieve automatic detection in batches. Therefore, in this study, we aimed to conduct research on an improved non-destructive detection method for the SSC of bunch-harvested grapes. This study took the Shine-Muscat grape as the research object. Using Mask R-CNN to establish a grape image segmentation model based on deep learning (DL) applied to near-infrared hyperspectral images (400~1000 nm), 35 characteristic wavelengths were selected using Monte Carlo Uninformative Variable Elimination (MCUVE) to establish a prediction model for SSC. Based on the two abovementioned models, the improved non-destructive detection method for the SSC of bunch-harvested grapes was validated. The comprehensive evaluation index F1 of the image segmentation model was 95.34%. The Rm2 and RMSEM of the SSC prediction model were 0.8705 and 0.5696 Brix%, respectively, while the Rp2 and RMSEP were 0.8755 and 0.9177 Brix%, respectively. The non-destructive detection speed of the improved method was 16.6 times that of the existing method. These results prove that the improved non-destructive detection method for the SSC of bunch-harvested grapes based on DL and HSI is feasible and efficient. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

29 pages, 21878 KiB  
Article
Convolutional Neural Network-Based Automated System for Dog Tracking and Emotion Recognition in Video Surveillance
by Huan-Yu Chen, Chuen-Horng Lin, Jyun-Wei Lai and Yung-Kuan Chan
Appl. Sci. 2023, 13(7), 4596; https://doi.org/10.3390/app13074596 - 5 Apr 2023
Cited by 3 | Viewed by 4393
Abstract
This paper proposes a multi–convolutional neural network (CNN)-based system for the detection, tracking, and recognition of the emotions of dogs in surveillance videos. This system detects dogs in each frame of a video, tracks the dogs in the video, and recognizes the dogs’ [...] Read more.
This paper proposes a multi–convolutional neural network (CNN)-based system for the detection, tracking, and recognition of the emotions of dogs in surveillance videos. This system detects dogs in each frame of a video, tracks the dogs in the video, and recognizes the dogs’ emotions. The system uses a YOLOv3 model for dog detection. The dogs are tracked in real time with a deep association metric model (DeepDogTrack), which uses a Kalman filter combined with a CNN for processing. Thereafter, the dogs’ emotional behaviors are categorized into three types—angry (or aggressive), happy (or excited), and neutral (or general) behaviors—on the basis of manual judgments made by veterinary experts and custom dog breeders. The system extracts sub-images from videos of dogs, determines whether the images are sufficient to recognize the dogs’ emotions, and uses the long short-term deep features of dog memory networks model (LDFDMN) to identify the dog’s emotions. The dog detection experiments were conducted using two image datasets to verify the model’s effectiveness, and the detection accuracy rates were 97.59% and 94.62%, respectively. Detection errors occurred when the dog’s facial features were obscured, when the dog was of a special breed, when the dog’s body was covered, or when the dog region was incomplete. The dog-tracking experiments were conducted using three video datasets, each containing one or more dogs. The highest tracking accuracy rate (93.02%) was achieved when only one dog was in the video, and the highest tracking rate achieved for a video containing multiple dogs was 86.45%. Tracking errors occurred when the region covered by a dog’s body increased as the dog entered or left the screen, resulting in tracking loss. The dog emotion recognition experiments were conducted using two video datasets. The emotion recognition accuracy rates were 81.73% and 76.02%, respectively. Recognition errors occurred when the background of the image was removed, resulting in the dog region being unclear and the incorrect emotion being recognized. Of the three emotions, anger was the most prominently represented; therefore, the recognition rates for angry emotions were higher than those for happy or neutral emotions. Emotion recognition errors occurred when the dog’s movements were too subtle or too fast, the image was blurred, the shooting angle was suboptimal, or the video resolution was too low. Nevertheless, the current experiments revealed that the proposed system can correctly recognize the emotions of dogs in videos. The accuracy of the proposed system can be dramatically increased by using more images and videos for training the detection, tracking, and emotional recognition models. The system can then be applied in real-world situations to assist in the early identification of dogs that may exhibit aggressive behavior. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

17 pages, 3265 KiB  
Article
A Novel Generative Adversarial Network Model Based on GC-MS Analysis for the Classification of Taif Rose
by Hala M. Abdelmigid, Mohammed Baz, Mohammed A. AlZain, Jehad F. Al-Amri, Hatim G. Zaini, Maissa M. Morsi, Matokah Abualnaja and Nawal Abdallah Alhuthal
Appl. Sci. 2023, 13(5), 3052; https://doi.org/10.3390/app13053052 - 27 Feb 2023
Cited by 2 | Viewed by 2038
Abstract
Rose oil production is believed to be dependent on only a few genotypes of the famous rose Rosa damascena. The aim of this study was to develop a novel GC-MS fingerprint based on the need to expand the genetic resources of oil-bearing [...] Read more.
Rose oil production is believed to be dependent on only a few genotypes of the famous rose Rosa damascena. The aim of this study was to develop a novel GC-MS fingerprint based on the need to expand the genetic resources of oil-bearing rose for industrial cultivation in the Taif region (Saudi Arabia). Gas chromatography-mass spectrometry (GC-MS) is a widely used analytical technique for determining the volatile composition of distilled rose oil from flower data. Because biosample availability, prohibitive costs, and ethical concerns limit observations in agricultural research, we aimed to enhance the quality of analysis by combining real observations with samples generated in silico. This study proposes a novel artificial intelligence model based on generative adversarial neural networks (GANs) to classify Taif rose cultivars using raw GC-MS data. We employed a variant of the GAN known as conditional stacked GANs (cSGANs) to predict Taif rose’s oil content and other latent characteristics without the need to conduct laboratory tests. A hierarchical stack of conditional GANs is used in this algorithm to generate images. A cluster model was developed based on the dataset provided, to quantify the diversity that should be implemented in the proposed model. The networks were trained using the cross-entropy and minimax loss functions. The accuracy of the proposed model was assessed by measuring losses as a function of the number of epochs. The results prove the ability of the proposed model to perfectly generate new real samples of different classes based on the GC-MS fingerprint. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

17 pages, 7391 KiB  
Article
Data Augmentation Method for Plant Leaf Disease Recognition
by Byeongjun Min, Taehyun Kim, Dongil Shin and Dongkyoo Shin
Appl. Sci. 2023, 13(3), 1465; https://doi.org/10.3390/app13031465 - 22 Jan 2023
Cited by 17 | Viewed by 4025
Abstract
Recently, several plant pathogens have become more active due to temperature increases arising from climate change, which has caused damage to various crops. If climate change continues, it will likely be very difficult to maintain current crop production, and the problem of a [...] Read more.
Recently, several plant pathogens have become more active due to temperature increases arising from climate change, which has caused damage to various crops. If climate change continues, it will likely be very difficult to maintain current crop production, and the problem of a shortage of expert manpower is also deepening. Fortunately, research on various early diagnosis systems based on deep learning is actively underway to solve these problems, but the problem of lack of diversity in some hard-to-collect disease samples remains. This imbalanced data increases the bias of machine learning models, causing overfitting problems. In this paper, we propose a data augmentation method based on an image-to-image translation model to solve the bias problem by supplementing these insufficient diseased leaf images. The proposed augmentation method performs translation between healthy and diseased leaf images and utilizes attention mechanisms to create images that reflect more evident disease textures. Through these improvements, we generated a more plausible diseased leaf image compared to existing methods and conducted an experiment to verify whether this data augmentation method could further improve the performance of a classification model for early diagnosis of plants. In the experiment, the PlantVillage dataset was used, and the extended dataset was built using the generated images and original images, and the performance of the classification models was evaluated through the test set. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Agriculture)
Show Figures

Figure 1

Back to TopTop