Biomedical Diagnosis of Breast Cancer Using Deep Learning and Multiple Classifiers
Abstract
:1. Introduction
- Examining the current state-of-the-art systems for breast cancer identification and classification to identify their vulnerabilities and how to enhance their identification and classification accuracy;
- Utilizing the Fuzzy C-Means clustering algorithm and multiple classifiers to execute the classification operation precisely and achieve high accuracy;
- Evaluating the performance of the proposed system in terms of various metrics, which are accuracy, precision, recall, specificity, and F-score. This evaluation indicates that the applied method of diagnosis and classification for breast cancer reaches over 98.84% accuracy and surpasses other state-of-the-art approaches.
1.1. Research Problem and Motivations
1.2. Related Work
2. Materials and Methods
2.1. Problem Statement
2.2. Deep Convolutional Neural Network (DCNN)
2.3. Datasets
2.4. The Proposed Methodology
Algorithm 1: Presented system: Breast Cancer Identification and Diagnosis |
Input: an image: mammographic. |
Output: the detection and classification of Breast Cancer: 1) BC and 2) MC or HB. |
1. Read an image from a file. |
2. In the preprocessing phase: Do the following: |
3. Remove any detected noise. |
4. Resize the input into a compatible size with AlexNet. |
5. Utilize the Gabor filter, DWT, and PCA. |
6. Transform the resultant image into a gray image. |
7. End of Preprocessing phase. |
8. For the Deep Learning phase (DCNN): Do the following: |
9. Create a Zero matrix with a size = size of the input image. |
10. For i =1: size of the input |
11. Perform a masking operation using the morphological operation to extract: Area, shape, diameter, and correlation of the potential area of Interest (PoI). |
12. Determine a dynamic threshold for every image. |
13. Invert the image to separate the foreground and the background. |
14. Compute variance, standard deviation, mean, and correlation for every PoI in each input. |
15. Extract the required features. |
16. End |
17. End of DCNN phase. |
18. For the classification phase: Do the following: |
19. Create a Binary image to detect and classify the disease with a size = 1024 × 1024 in every PoI. |
20. Find a mass area and draw a circle around it. |
21. Determine the number of detected areas and their drawn circles. |
22. For i = 1: 1024 |
23. For j = 1: 1024 |
24. Compute the number of white pixels z to compare it with the threshold. |
25. If z > threshold: |
26. Cancer is Detected. |
27. End |
28. Classify detected cancer: BC or MC or display a message saying that there is no cancer. |
29. End |
30. End |
31. End the classification phase. |
32. Find TP, TN, FP, and FN. |
33. Compute accuracy, precision, recall, specificity, and F-score. |
34. End of the algorithm. |
- The processing time is less than 13 s for every input.
- The achieved accuracy is more than 98.8%.
- The system is consistent and trustworthy.
- The system delivers favorable findings.
- The system outperforms other developed algorithms mentioned in the literature review section in all considered performance metrics.
- True Positive (TP): refers to the number of adequately detected and classified breast cancer images in the utilized datasets;
- False Positive (FP): represents the number of detected and classified breast cancers classified improperly;
- True Negative (TN): refers to the number of healthy breasts categorized accurately;
- False Negative (FN): indicates the number of adverse outcomes classified improperly;
- Precision (Prc): the ratio of the correctly identified types over the summation of the classes that are identified incorrectly plus the correctly identified types as demonstrated in the following equation:
- 6.
- Recall (Rcl): the ratio of the correctly identified classes over the summation of the actual images plus the number of antagonistic classes that are incorrectly classified, as depicted in (2) (in addition, this is sometimes called sensitivity as well):
- 7.
- Accuracy (Acr): this parameter indicates how well the proposed method performs, and it is calculated as follows:
- 8.
- Specificity (Spc): the ability of the proposed system to classify any sample that is not associated with any labeling data. It is calculated as follows:
- 9.
- F-Score: the harmonic mean of the recall and precision of the implemented system. Thus, the higher the value of the F-score is, the better the model is implemented, and it is evaluated as follows:
3. Results
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Acknowledgment
Conflicts of Interest
References
- Banumathy, D.; Khalaf, O.I.; Romero, C.A.T.; Raja, P.V.; Sharma, D.K. Breast calcifications and histopathological analysis on tumour detection by CNN. Comput. Syst. Sci. Eng. 2023, 44, 595–612. [Google Scholar] [CrossRef]
- Nagalakshmi, K.; Suriya, S.D. Performance Analysis of Breast Cancer Detection Method Using ANFIS Classification Approach. Comput. Syst. Sci. Eng. 2023, 44, 501–517. [Google Scholar] [CrossRef]
- Ulagamuthalvi, V.; Kulanthaivel, G.; Balasundaram, A.; Sivaraman, A.K. Breast Mammogram Analysis and Classification Using Deep Convolution Neural Network. Comput. Syst. Sci. Eng. 2022, 43, 275–289. [Google Scholar] [CrossRef]
- Eltahir, M.M.E.; Ahmed, T.M. Diagnosing Breast Cancer Accurately Based on Weighting of Heterogeneous Classification Sub-Models. Comput. Syst. Sci. Eng. 2022, 42, 1257–1272. [Google Scholar] [CrossRef]
- Hurtado, J.A.B.; Albarran, I.A.C.; Ayala, M.T.; Manzano, M.A.I.; Hernandez, L.A.M.; Perez-Ramirez, C.A. Diagnostic strategies for breast cancer detection: From image generation to classification strategies using artificial intelligence algorithms. Cancer 2022, 14, 1–24. [Google Scholar]
- Arooj, S.; Rahman, A.U.; Zubair, M.; Khan, M.F.; Alissa, K.; Mosavi, A. Breast Cancer Detection and Classification Empowered with Transfer Learning. Front. Public Health 2022, 10, 924432. [Google Scholar] [CrossRef] [PubMed]
- Rajakumari, R.; Kalaivani, L. Breast Cancer Detection and Classification Using Deep CNN Techniques. Intell. Autom. Soft Comput. 2022, 32, 1089–1107. [Google Scholar] [CrossRef]
- Mohiyuddin, A.; Basharat, A.; Ghani, U.; Peter, V.; Abbas, S.; Bin Naeem, O.; Rizwan, M. Breast Tumor Detection and Classification in Mammogram Images Using Modified YOLOv5 Network. Comput. Math. Methods Med. 2022, 2022, 1–16. [Google Scholar] [CrossRef] [PubMed]
- Thilagaraj, M.; Arunkumar, N.; Govindan, P. Classification of Breast Cancer Images by Implementing Improved DCNN with Artificial Fish School Model. Comput. Intell. Neurosci. 2022, 2022, 1–12. [Google Scholar] [CrossRef] [PubMed]
- Reshma, V.K.; Arya, N.; Ahmad, S.S.; Wattar, I.; Mekala, S.; Joshi, S.; Krah, D. Detection of Breast Cancer Using Histopathological Image Classification Dataset with Deep Learning Techniques. BioMed Res. Int. 2022, 2022, 1–13. [Google Scholar] [CrossRef] [PubMed]
- Sakib, S.; Yasmin, N.; Tanzeem, A.K.; Shorna, F.; Hasib, K.; Alam, S.B. Breast cancer detection and classification: A comparative analysis using machine learning algorithms. In Proceedings of the 3rd International Conference on Communication, Computing and Electronics Systems, Online, 15–16 September 2022; pp. 1–15. [Google Scholar]
- Patil, V.; Burud, S.; Pawar, G.; Rayajaghav, T. Breast cancer detection using MATLAB functions. Adv. Image Process. Pattern Recognit. 2020, 3, 1–6. [Google Scholar]
- Hantoro, Y.N. Comparative study of breast cancer diagnosis using data mining classification. Int. J. Eng. Res. Technol. 2020, 9, 790–795. [Google Scholar]
- Omondiagbe, D.A.; Veeramani, S.; Sidhu, A.S. Machine Learning Classification Techniques for Breast Cancer Diagnosis. IOP Conf. Series Mater. Sci. Eng. 2019, 495, 1–17. [Google Scholar] [CrossRef]
- Fathy, W.E.; Ghoneim, A.S. A deep learning approach for breast cancer mass detection. Int. J. Adv. Comput. Sci. Appl. 2019, 10, 175–182. [Google Scholar] [CrossRef] [Green Version]
- Hamad, Y.A.; Simonov, K.; Naeem, M.B. Breast cancer detection and classification using artificial neural networks. In Proceedings of the 1st Annual International Conference on Information and Sciences, Fallujah, Iraq, 20–21 November 2018; pp. 51–57. [Google Scholar]
- Tariq, N. Breast Cancer Detection using Artificial Neural Networks. J. Mol. Biomarkers Diagn. 2017, 9, 1–6. [Google Scholar] [CrossRef]
- Paramkusham, S.; Rao, K.M.M.; Rao, B.V.V.S.N.P. Early stage detection of breast cancer using novel image processing techniques, MATLAB and Labview Implementation. In Proceedings of the 15th International Conference on Advanced Computing Technologies, Rajampet, India, 21–22 September 2013; pp. 1–5. [Google Scholar]
- Cheng, H.D.; Shan, J.; Ju, W.; Guo, Y.; Zhang, L. Automated breast cancer detection and classification using ultrasound images: A survey. Pattern Recognit. 2010, 43, 299–317. [Google Scholar] [CrossRef] [Green Version]
- Karthik, R.; Menaka, R.; Kathiresan, G.S.; Anirudh, M.; Nagharjun, M. Gaussian Dropout Based Stacked Ensemble CNN for Classification of Breast Tumor in Ultrasound Images. IRBM, 2021; in press. [Google Scholar] [CrossRef]
Fold-1 | Fold-2 | Fold-3 | Fold-4 | Fold-5 | |
---|---|---|---|---|---|
Run-1 | Test | Train | Train | Train | Train |
Run-2 | Train | Test | Train | Train | Train |
Run-3 | Train | Train | Test | Train | Train |
Run-4 | Train | Train | Train | Test | Train |
Run-5 | Train | Train | Train | Train | Test |
Performance Metric | Evaluated Values n = 3400 |
---|---|
TP | 3186 B = 1408, M = 1778 |
TN | 185 |
FN | 19 |
FP | 10 |
Accuracy | 99.147% |
Precision | 99.687% |
Recall | 99.407% |
Specificity | 94.871% |
F-score | 99.547% |
Datasets | Healthy Breast (HB) | Benign Cancer (BC) | Malignant Cancer (MC) |
---|---|---|---|
BHI | 79 | 593 | 679 |
CBIS-DDSM | 64 | 719 | 828 |
BCW | 42 | 96 | 271 |
Predicted Class. | True Class | |||
Malignant | Benign | Healthy | ||
Malignant | 1778 = (98.36%) | 8 = (1.58%) | 4 = (2.02%) | |
Benign | 6 = (0.98%) | 1408 = (97.91%) | 3 = (1.63%) | |
Healthy | 3 = (0.66%) | 5 = (0.51%) | 185 = (96.35%) |
Works | Methodology | Accuracy | Precision | Recall | Specificity | F-Score |
---|---|---|---|---|---|---|
[1], 2023 | CNN | 97.74% | 97.955% | 95.908% | 98.335% | 97.863% |
[2], 2023 | ANFIS | 98.8% | Not mentioned | 97.9% | 98.5% | Not mentioned |
[3], 2022 | DCNN | 98% | 99.075% | 100% | 100% | 99.5% |
[4], 2022 | The weighting of heterogeneous sub-models | 81% | 81.14% | 81.23% | Not mentioned | 81.48% |
[7], 2022 | DCNN | 99% | Not mentioned | Not mentioned | Not mentioned | Not mentioned |
[8], 2022 | Modified yolov5 Network | 96.5% | Not mentioned | Not mentioned | Not mentioned | Not mentioned |
[9], 2022 | DCNN + Artificial Fish School Model | 98.66% | Not mentioned | 99.1% | 98.8% | Not mentioned |
[10], 2022 | Deep Learning Models | 92.44% | 86.89% | Not mentioned | Not mentioned | Not mentioned |
The Proposed System | DCNN + KNN + Bayes + DT | 99.147% | 99.687% | 99.407% | 94.871% | 99.547% |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alsheikhy, A.A.; Said, Y.; Shawly, T.; Alzahrani, A.K.; Lahza, H. Biomedical Diagnosis of Breast Cancer Using Deep Learning and Multiple Classifiers. Diagnostics 2022, 12, 2863. https://doi.org/10.3390/diagnostics12112863
Alsheikhy AA, Said Y, Shawly T, Alzahrani AK, Lahza H. Biomedical Diagnosis of Breast Cancer Using Deep Learning and Multiple Classifiers. Diagnostics. 2022; 12(11):2863. https://doi.org/10.3390/diagnostics12112863
Chicago/Turabian StyleAlsheikhy, Ahmed A., Yahia Said, Tawfeeq Shawly, A. Khuzaim Alzahrani, and Husam Lahza. 2022. "Biomedical Diagnosis of Breast Cancer Using Deep Learning and Multiple Classifiers" Diagnostics 12, no. 11: 2863. https://doi.org/10.3390/diagnostics12112863
APA StyleAlsheikhy, A. A., Said, Y., Shawly, T., Alzahrani, A. K., & Lahza, H. (2022). Biomedical Diagnosis of Breast Cancer Using Deep Learning and Multiple Classifiers. Diagnostics, 12(11), 2863. https://doi.org/10.3390/diagnostics12112863