Balancing Data through Data Augmentation Improves the Generality of Transfer Learning for Diabetic Retinopathy Classification
Abstract
:1. Introduction
- (1)
- Application of three pre-trained models, VGG16, DenseNet169 and ResNet50, on a publicly available diabetic retinopathy dataset and the data-augmented version of the dataset to solve the class imbalance problem;
- (2)
- Enhance the pre-trained models to improve the performance obtained in (1);
- (3)
- Apply the enhanced models on a blind Mauritian local cohort to predict the different stages of diabetic retinopathy;
- (4)
- Compare the predicted results obtained for the Mauritian dataset using the enhanced models to an actual ophthalmologist’s diagnosis.
2. Materials and Methods
2.1. Proposed Workflow and Components
2.2. Datasets
2.3. Data Pre-Processing
2.4. Transfer Learning Using ResNet50, VGG16 and DenseNet169
2.5. Enhanced CNN Models
3. Results and Discussions
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- GBD 2019 Blindness and Vision Impairment Collaborators; Vision Loss Expert Group of the Global Burden of Disease Study. Causes of blindness and vision impairment in 2020 and trends over 30 years, and prevalence of avoidable blindness in relation to VISION 2020: The Right to Sight: An analysis for the Global Burden of Disease Study. Lancet Glob. Health 2021, 9, E144–E160. [Google Scholar] [CrossRef]
- Saeedi, P.; Petersohn, I.; Salpea, P.; Malanda, B.; Karuranga, S.; Unwin, N.; Colagiuri, S.; Guariguata, L.; Motala, A.A.; Ogurtsova, K.; et al. Global and regional diabetes prevalence estimates for 2019 and projections for 2030 and 2045: Results from the International Diabetes Federation Diabetes Atlas, 9th edition. Diabetes Res. Clin. Pract. 2019, 157, 107843. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Shah, A.R.; Gardner, T.W. Diabetic retinopathy: Research to clinical practice. Clin. Diabetes Endocrinol. 2017, 3, 9. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Lam, C.; Yi, D.; Guo, M.; Lindsey, T. Automated Detection of Diabetic Retinopathy using Deep Learning. AMIA Jt. Summits Transl. Sci. Proc. 2018, 2018, 147–155. [Google Scholar]
- Oltu, B.; Karaca, B.K.; Erdem, H.; Özgür, A. A systematic review of transfer learning based approaches for diabetic retinopathy detection. arXiv 2021, arXiv:2105.13793. [Google Scholar]
- Alyoubi, W.L.; Shalash, W.M.; Abulkhair, M.F. Diabetic retinopathy detection through deep learning techniques: A review. Inform. Med. Unlocked 2020, 20, 100377. [Google Scholar] [CrossRef]
- Kandel, I.; Castelli, M. Transfer Learning with Convolutional Neural Networks for Diabetic Retinopathy Image Classification. A Review. Appl. Sci. 2020, 10, 2021. [Google Scholar] [CrossRef] [Green Version]
- Simonyan, K.; Zisserman, A. Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Zoph, B.; Vasudevan, V.; Shlens, J.; Le, Q.V. Learning transferable architectures for scalable image recognition. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 8697–8710. [Google Scholar]
- Masood, S.; Luthra, T.; Sundriyal, H.; Ahmed, M. Identification of diabetic retinopathy in eye images using transfer learning. In Proceedings of the 2017 International Conference on Computing, Communication and Automation (ICCCA), Greater Noida, India, 5–6 May 2017; pp. 1183–1187. [Google Scholar]
- Li, X.; Pang, T.; Xiong, B.; Liu, W.; Liang, P.; Wang, T. Convolutional neural networks based transfer learning for diabetic retinopathy fundus image classification. In Proceedings of the 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), Shanghai, China, 14–16 October 2017; pp. 1–11. [Google Scholar]
- Challa, U.K.; Yellamraju, P.; Bhatt, J.S. A Multi-class Deep All-CNN for Detection of Diabetic Retinopathy Using Retinal Fundus Images. In Pattern Recognition and Machine Intelligence: 8th International Conference, PReMI 2019, Tezpur, India, December 17–20, 2019, Proceedings, Part I; Deka, B., Maji, P., Mitra, S., Bhattacharyya, D.K., Bora, P.K., Pal, S.K., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2019; Volume 11941, pp. 191–199. [Google Scholar]
- Kaggle. APTOS 2019 Blindness Detection|Kaggle. Available online: https://www.kaggle.com/c/aptos2019-blindness-detection/ (accessed on 15 February 2022).
- Kassani, S.H.; Kassani, P.H.; Khazaeinezhad, R.; Wesolowski, M.J.; Schneider, K.A.; Deters, R. Diabetic retinopathy classification using a modified xception architecture. In Proceedings of the 2019 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), Ajman, United Arab Emirates, 10–12 December 2019; pp. 1–6. [Google Scholar]
- Khalifa, N.E.M.; Loey, M.; Taha, M.H.N.; Mohamed, H.N.E.T. Deep transfer learning models for medical diabetic retinopathy detection. Acta Inform. Med. 2019, 27, 327–332. [Google Scholar] [CrossRef]
- Hagos, M.T.; Kant, S. Transfer Learning based Detection of Diabetic Retinopathy from Small Dataset. arXiv 2019, arXiv:1905.07203. [Google Scholar] [CrossRef]
- Sikder, N.; Chowdhury, M.S.; Shamim Mohammad Arif, A.; Nahid, A.-A. Early blindness detection based on retinal images using ensemble learning. In Proceedings of the 2019 22nd International Conference on Computer and Information Technology (ICCIT), Dhaka, Bangladesh, 18–20 December 2019; pp. 1–6. [Google Scholar]
- Shaban, M.; Ogur, Z.; Mahmoud, A.; Switala, A.; Shalaby, A.; Abu Khalifeh, H.; Ghazal, M.; Fraiwan, L.; Giridharan, G.; Sandhu, H.; et al. A convolutional neural network for the screening and staging of diabetic retinopathy. PLoS ONE 2020, 15, e0233514. [Google Scholar] [CrossRef] [PubMed]
- Mushtaq, G.; Siddiqui, F. Detection of diabetic retinopathy using deep learning methodology. IOP Conf. Ser. Mater. Sci. Eng. 2021, 1070, 012049. [Google Scholar] [CrossRef]
- Thota, N.B.; Umma Reddy, D. Improving the Accuracy of Diabetic Retinopathy Severity Classification with Transfer Learning. In Proceedings of the 2020 IEEE 63rd International Midwest Symposium on Circuits and Systems (MWSCAS), Springfield, MA, USA, 9–12 August 2020; pp. 1003–1006. [Google Scholar]
- Gangwar, A.K.; Ravi, V. Diabetic retinopathy detection using transfer learning and deep learning. In Evolution in Computational Intelligence: Frontiers in Intelligent Computing: Theory and Applications (FICTA 2020), Volume 1; Bhateja, V., Peng, S.-L., Satapathy, S.C., Zhang, Y.-D., Eds.; Advances in Intelligent Systems and Computing; Springer: Singapore, 2021; Volume 1176, pp. 679–689. [Google Scholar]
- Dai, L.; Wu, L.; Li, H.; Cai, C.; Wu, Q.; Kong, H.; Liu, R.; Wang, X.; Hou, X.; Liu, Y.; et al. A deep learning system for detecting diabetic retinopathy across the disease spectrum. Nat. Commun. 2021, 12, 3242. [Google Scholar] [CrossRef] [PubMed]
- Benson, J.; Maynard, J.; Zamora, G.; Carrillo, H.; Wigdahl, J.; Nemeth, S.; Barriga, S.; Estrada, T.; Soliz, P. Transfer learning for diabetic retinopathy. In Medical Imaging 2018: Image Processing; Angelini, E.D., Landman, B.A., Eds.; SPIE: Bellingham, WA, USA, 2018; p. 70. [Google Scholar]
- Söderberg, S.; Zimmet, P.; Tuomilehto, J.; de Courten, M.; Dowse, G.K.; Chitson, P.; Gareeboo, H.; Alberti, K.G.M.M.; Shaw, J.E. Increasing prevalence of Type 2 diabetes mellitus in all ethnic groups in Mauritius. Diabet. Med. 2005, 22, 61–68. [Google Scholar] [CrossRef]
- Housing and Population Census. Available online: https://web.archive.org/web/20121114114018/http://www.gov.mu/portal/goc/cso/file/2011VolIIPC.pdf (accessed on 15 February 2022).
- Sudre, C.H.; Li, W.; Vercauteren, T.; Ourselin, S.; Jorge Cardoso, M. Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations. In Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support; Cardoso, M.J., Arbel, T., Carneiro, G., Syeda-Mahmood, T., Tavares, J.M.R.S., Moradi, M., Bradley, A., Greenspan, H., Papa, J.P., Madabhushi, A., et al., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2017; Volume 10553, pp. 240–248. [Google Scholar]
- Lopez-Nava, I.H.; Valentín-Coronado, L.M.; Garcia-Constantino, M.; Favela, J. Gait Activity Classification on Unbalanced Data from Inertial Sensors Using Shallow and Deep Learning. Sensors 2020, 20, 4756. [Google Scholar] [CrossRef]
- Zhou, Y.; Wang, B.; He, X.; Cui, S.; Shao, L. DR-GAN: Conditional Generative Adversarial Network for Fine-Grained Lesion Synthesis on Diabetic Retinopathy Images. IEEE J. Biomed. Health Inform. 2020, 26, 56–66. [Google Scholar] [CrossRef]
- Agustin, T.; Utami, E.; Fatta, H.A. Implementation of data augmentation to improve performance CNN method for detecting diabetic retinopathy. In Proceedings of the 2020 3rd International Conference on Information and Communications Technology (ICOIACT), Yogyakarta, Indonesia, 24–25 November 2020; pp. 83–88. [Google Scholar]
- Tan, C.; Sun, F.; Kong, T.; Zhang, W.; Yang, C.; Liu, C. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning—ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III; Kůrková, V., Manolopoulos, Y., Hammer, B., Iliadis, L., Maglogiannis, I., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2018; Volume 11141, pp. 270–279. [Google Scholar]
- Da Rocha, D.A.; Ferreira, F.M.F.; Peixoto, Z.M.A. Diabetic retinopathy classification using VGG16 neural network. Res. Biomed. Eng. 2022. [Google Scholar] [CrossRef]
- Mule, N.; Thakare, A.; Kadam, A. Comparative analysis of various deep learning algorithms for diabetic retinopathy images. In Health Informatics: A Computational Perspective in Healthcare; Patgiri, R., Biswas, A., Roy, P., Eds.; Studies in Computational Intelligence; Springer: Singapore, 2021; Volume 932, pp. 97–106. [Google Scholar]
- Huang, G.; Liu, Z.; van der Maaten, L.; Weinberger, K.Q. DenseNet Densely connected convolutional networks. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 2261–2269. [Google Scholar]
- Taormina, V.; Cascio, D.; Abbene, L.; Raso, G. Performance of Fine-Tuning Convolutional Neural Networks for HEp-2 Image Classification. Appl. Sci. 2020, 10, 6940. [Google Scholar] [CrossRef]
- Zhang, C.-L.; Luo, J.-H.; Wei, X.-S.; Wu, J. In defense of fully connected layers in visual representation transfer. In Advances in Multimedia Information Processing—PCM 2017; Zeng, B., Huang, Q., El Saddik, A., Li, H., Jiang, S., Fan, X., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2018; Volume 10736, pp. 807–817. [Google Scholar]
- ElBedwehy, M.N.; Behery, G.M.; Elbarougy, R. Face recognition based on relative gradient magnitude strength. Arab. J. Sci. Eng. 2020, 45, 9925–9937. [Google Scholar] [CrossRef]
- Shorten, C.; Khoshgoftaar, T.M. A survey on Image Data Augmentation for Deep Learning. J. Big Data 2019, 6, 60. [Google Scholar] [CrossRef]
- Khan, A.H.; Cao, X.; Li, S.; Katsikis, V.N.; Liao, L. BAS-ADAM: An ADAM based approach to improve the performance of beetle antennae search optimizer. IEEE/CAA J. Autom. Sinica 2020, 7, 461–471. [Google Scholar] [CrossRef]
- Keras Transfer Learning & Fine-Tuning. Available online: https://keras.io/guides/transfer_learning/ (accessed on 2 March 2022).
- Peng, P.; Wang, J. How to fine-tune deep neural networks in few-shot learning? arXiv 2020, arXiv:2012.00204. [Google Scholar]
- Ismail, A. View of Improving Convolutional Neural Network (CNN) Architecture (MiniVGGNet) with Batch Normalization and Learning Rate Decay Factor for Image Classification. Available online: https://publisher.uthm.edu.my/ojs/index.php/ijie/article/view/4558/2976 (accessed on 29 March 2022).
- Usha Ruby, A. Binary cross entropy with deep learning technique for Image classification. Int. J. Adv. Trends Comput. Sci. Eng. 2020, 9, 5393–5397. [Google Scholar] [CrossRef]
- Song, H.; Kim, M.; Park, D.; Lee, J.-G. How does Early Stopping Help Generalization against Label Noise? arXiv 2019, arXiv:1911.08059. [Google Scholar] [CrossRef]
- Jayalakshmi, G.S.; Kumar, V.S. Performance analysis of Convolutional Neural Network (CNN) based Cancerous Skin Lesion Detection System. In Proceedings of the 2019 International Conference on Computational Intelligence in Data Science (ICCIDS), Chennai, India, 21–23 February 2019; pp. 1–6. [Google Scholar]
Training Data | Number of Images in Training/Validation Dataset | Number of Images in Testing Dataset | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
Class 0 | Class 1 | Class 2 | Class 3 | Class 4 | Class 0 | Class 1 | Class 2 | Class 3 | Class 4 | |
Original APTOS dataset | 1265 | 272 | 697 | 138 | 191 | 540 | 98 | 302 | 55 | 104 |
Total images—2563 | Total images—1099 | |||||||||
Augmented APTOS dataset | 1265 | 1306 | 697 | 935 | 1264 | 540 | 98 | 302 | 55 | 104 |
Total images—5467 | Total images—1099 | |||||||||
Mauritian dataset | No training performed using Mauritian data | 54 | 62 | 45 | 12 | 33 | ||||
Total images—208 |
Precision | Recall | F1 Score | |
---|---|---|---|
Class 0 | 0.9600 | 0.8889 | 0.9231 |
Class 1 | 0.8600 | 0.6935 | 0.7679 |
Class 2 | 0.7551 | 0.8222 | 0.7872 |
Class 3 | 0.7778 | 0.5000 | 0.6087 |
Class 4 | 0.6000 | 0.9091 | 0.7229 |
Weighted Average | 0.8165 | 0.7933 | 0.7945 |
Authors | Techniques Used | Discussions |
---|---|---|
Dai et al. [23] | Model: deep model based on ResNet Dataset: Shanghai Integrated Diabetes Prevention and Care System (Shanghai Integration Model, SIM) between 2014 and 2017 Number of images: 666,383 images | Pre-trained models (ResNet and R-CNN) were used. ROC was used to evaluate performance. Performance: AUC scores of 0.943, 0.955, 0.960 and 0.972 for mild, moderate, severe and proliferative cases were achieved, showing good performance using transfer learning |
Masood et al. [11] | Model: pre-trained Inception V3 model Dataset: Eye-PACS dataset Number of images: 3908 images (800 from each class except 708 from class 4) | Performance: accuracy—48.2%, limitations: low accuracy |
Li et al. [12] | Model: different pre-trained networks such as AlexNet, VGG-S, VGG16 and VGG19 Dataset: the Messidor and DR1 datasets Number of images: 1014 images (DR1), 1200 images (Messidor) | Performance: best area under the curve (AUC) (VGG-S)—98.34% (Messidor dataset), 97.86% (DR1 dataset) Limitations: number of classes is limited to DR and No DR only |
Challa et al. [13] | Model: developed a deep All-CNN architecture Dataset: Eye-PACS dataset Number of images: 35,126 images | Performance: accuracy—86.64%, loss—0.46, average F1 score—0.6318 Limitation: no detailed information on overfitting |
Khalifa et al. [16] | Model: AlexNet, Res-Net18, SqueezeNet and GoogleNet Dataset: APTOS dataset Number of images: 3662 images | Performance: best accuracy (AlexNet)—97.9% Limitation: high computational power needed (Intel Xeon E5-2620 processor (2 GHz), 96 GB of RAM) since the model needed to train on 14,648 images. Additionally, no detailed information was given for model overfitting during the training phase. The only method used to counter overfitting was data augmentation, which takes place before the model training phase. |
Hagos et al. [17] | Model: pre-trained Inception V3 model Dataset: APTOS dataset Number of images: 2500 images (1250 for NoDR and 1250 for DR) | Performance: accuracy—90.9%, loss—3.94% Limitation: number of classes is limited to DR and No DR only |
Gangwar et al. [22] | Model: deep learning hybrid model with pre-trained Inception-ResNet-v2 as a base model Dataset: Messidor-1 and APTOS dataset Number of images: 1200 images (Messidor-1), 3662 images (APTOS) | Performance: accuracy—72.33% (Messidor-1), 82.18% (APTOS dataset) Limitation: did not check whether model was overfitting |
Benson et al. [24] | Model: pre-trained Inception V3 model Dataset: DR dataset obtained from VisionQuest Biomedical database Number of images: 6805 images | Performance: sensitivity—90%, specificity—90%, AUC—95% Limitation: results for No DR, MildDR, Moderate DR were 47%, 50% and 35% |
Thota et al. [21] | Model: Fine-tuned and pre-trained VGG16 model Dataset: Eye-PACS dataset Number of images: 34,126 images | Performance: accuracy—74%, sensitivity—80%, a specificity—65%, AUC—80%Limitation: low accuracy compared to similar experimentations |
Our proposed Model | Model: Fine-tuned and pre-trained ResNet50, VGG16, DenseNet169 models Dataset: APTOS dataset, Mauritian dataset Number of images: 3662 images (APTOS), 208 images (Mauritius) | Performance: accuracy (ResNet50)—82% (APTOS dataset), 79% (Mauritian dataset) Novelty: performed multiclass classification (5 different classes) for Mauritian dataset |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Mungloo-Dilmohamud, Z.; Heenaye-Mamode Khan, M.; Jhumka, K.; Beedassy, B.N.; Mungloo, N.Z.; Peña-Reyes, C. Balancing Data through Data Augmentation Improves the Generality of Transfer Learning for Diabetic Retinopathy Classification. Appl. Sci. 2022, 12, 5363. https://doi.org/10.3390/app12115363
Mungloo-Dilmohamud Z, Heenaye-Mamode Khan M, Jhumka K, Beedassy BN, Mungloo NZ, Peña-Reyes C. Balancing Data through Data Augmentation Improves the Generality of Transfer Learning for Diabetic Retinopathy Classification. Applied Sciences. 2022; 12(11):5363. https://doi.org/10.3390/app12115363
Chicago/Turabian StyleMungloo-Dilmohamud, Zahra, Maleika Heenaye-Mamode Khan, Khadiime Jhumka, Balkrish N. Beedassy, Noorshad Z. Mungloo, and Carlos Peña-Reyes. 2022. "Balancing Data through Data Augmentation Improves the Generality of Transfer Learning for Diabetic Retinopathy Classification" Applied Sciences 12, no. 11: 5363. https://doi.org/10.3390/app12115363
APA StyleMungloo-Dilmohamud, Z., Heenaye-Mamode Khan, M., Jhumka, K., Beedassy, B. N., Mungloo, N. Z., & Peña-Reyes, C. (2022). Balancing Data through Data Augmentation Improves the Generality of Transfer Learning for Diabetic Retinopathy Classification. Applied Sciences, 12(11), 5363. https://doi.org/10.3390/app12115363