A CAD System for Lung Cancer Detection Using Hybrid Deep Learning Techniques
Abstract
:1. Introduction
- Non-small cell lung cancer (NSCLC) is the most common variant, which grows and spreads slowly;
- Small cell lung cancer (SCLC) is caused by smoking and spreads faster than NSCLC.
1.1. Research Problem and Motivations
- Enhancing the economy of the country;
- Providing digital services and transforming public services into a digital world;
- Increasing safety and security;
- Providing a better life and improving the quality of life.
1.2. Research Contributions
1.3. Related Work
2. Materials and Methods
2.1. Problem Statement
2.2. Research Objectives
- To explore, study, and analyze current methods in lung cancer to mark and locate their vulnerabilities;
- To search for available lung cancer datasets, download them, and conduct an analysis;
- To implement a feasible model to identify and categorize lung cancer by incorporating the convolutional neural network (VGG-19) and LSTMs;
- To determine the number of identified cancer cells using the developed model;
- To evaluate numerous performance parameters to assess the proposed algorithm with other state-of-the-art approaches using four parameters: accuracy, precision, recall, and the F-score;
- To build a confusion matrix that characterizes how the proposed model categorizes a given test dataset appropriately and accurately. The proposed CAD system generally generates three confusion matrixes as three schemes are utilized.
2.3. Datasets
2.4. The Utilized Deep Learning Techniques (DLTs)
2.4.1. VGG-19
2.4.2. Long Short-Term Memory Networks (LSTMs)
2.5. The Proposed Methodology
- True Positive (TP): indicates the number of adequately identified types in the given dataset;
- False Positive (FP): determines the number of types that are mispredicted;
- True Negative (TN): gives the number of healthy lungs identified correctly;
- False Negative (FN): measures the number of negative samples identified incorrectly;
- Precision (PR): shows the ratio of the identified types over the summation of the classes that are identified incorrectly plus the actual classes that are correctly classified, as demonstrated in the equation below:PR = TP/(TP + FP)
- Recall (RE): computes the ratio of the identified classes over the summation of the actual images plus the number of negative types that are incorrectly classified, as depicted in (2):RE = TP/(TP + FN)
- Accuracy (Acc): this parameter indicates how the proposed approach performs well, and it is evaluated as follows:Acc = (TP + TN)/N
- 8.
- F-score: represents a harmonic mean of two performance metrics of the presented CAD algorithm: recall and precision. Therefore, the higher the value is, the better the model is developed. This metric is evaluated as follows:F-score = 2 × [(PR × RE)/(PR + RE)]
- It is easy to run and operate;
- Procedures are automated to minimize human intervention;
- It is a dependable and practical solution;
- No specific modules are mandatory.
Algorithm: Lung Cancer Detection and Classification |
Input: an image: CT Scan or X-ray. Output: the detection and classification of Lung Cancer: NSCLC and its subtypes.
|
3. Results
3.1. Scenario 1: Adenocarcinoma
3.2. Scenario 2: Large Cell Carcinoma
3.3. Scenario 3: Squamous Cell Carcinoma
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Hosseini, H.; Monsefi, R.; Shadroo, S. Deep Learning Applications for Lung Cancer Diagnosis: A Systematic Review. arXiv 2022, arXiv:2201.00227. [Google Scholar]
- Sousa, J.; Pereira, T.; Silva, F.; Silva, M.C.; Vilares, A.T.; Cunha, A.; Oliveira, H.P. Lung Segmentation in CT Images: A Residual U-Net Approach on A Cross-Cohort Dataset. Appl. Sci. 2022, 12, 1959. [Google Scholar] [CrossRef]
- Nazir, I.; Ul Haq, I.; Khan, M.M.; Qureshi, M.B.; Ullah, H.; Butt, S. Efficient Pre-Processing and Segmentation for Lung Cancer Detection Using Fused CT Images. Electronics 2021, 11, 34. [Google Scholar] [CrossRef]
- Dayma, M.M. Lung Cancer Detection Using MATLAB. IOSR J. Comput. Eng. (IOSR-JCE) 2021, 23, 35–40. [Google Scholar]
- Al-Yasriy, H.F.; Al-Husieny, M.S.; Mohsen, F.Y.; Khalil, E.A.; Hassan, Z.S. Diagnosis of Lung Cancer Based on CT Scans Using CNN. In Proceedings of the 2nd International Scientific Conference of Al-Ayen University (ISCAU-2020): IOP Conf. Series: Material Science and Engineering, Thi-Qar, Iraq, 15–16 July 2020; p. 10. [Google Scholar]
- Ahmed, B.T. Lung Cancer Prediction and Detection Using Image Processing Mechanisms: An Overview. Signal Image Process. Lett. (SIMPLE) 2019, 1, 20–31. [Google Scholar] [CrossRef]
- Hasan, R.; Al Kabir, M. Lung Cancer Detection and Classification Based on Image Processing and Statistical Learning. arXiv 2019, arXiv:1911.1065. [Google Scholar]
- Available online: https://healthcare.utah.edu/huntsmancancerinstitute/news/2019/11/even-non-smokers-can-get-lung-cancer.php (accessed on 5 March 2022).
- Nasser, I.M.; Abu-Naser, S.S. Lung Cancer Detection Using Artificial Neural Network. Int. J. Eng. Inf. Syst. 2019, 3, 17–23. [Google Scholar]
- Available online: https://www.lungevity.org/for-patients-caregivers/lung-cancer-101/types-of-lung-cancer (accessed on 5 March 2022).
- Melisa, B. Image Detection Using the VGG-19 Convolutional Neural Network. Available online: https://medium.com/mlearning-ai/image-detection-using-convolutional-neural-networks-89c9e21fffa3 (accessed on 22 November 2022).
- Khattar, A.; Quadri, S.M.K. Generalization of Convolutional Network Domain Adaptation Network for Classification of Disaster Images on Twitter. Multimed. Tools Appl. 2022, 81, 30437–30464. [Google Scholar] [CrossRef]
- Dolphin, R. LSTM Networks: A Detailed Explanation. Available online: https://towardsdatascience.com/lstm-networks-a-detailed-explanation-8fae6aefc7f9 (accessed on 26 December 2022).
- Yeturu, K. Machine learning algorithms, applications, and practices in data science. Handb. Stat. 2020, 43, 81–206. [Google Scholar]
- Bhatia, S.; Sinha, Y.; Goel, L. Lung Cancer Detection: A Deep Learning Approach. In “Soft Computing for Problem Solving,” Advances in Intelligent Systems and Computing; Bansal, J., Das, K., Nagar, A., Deep, K., Ojha, A., Eds.; Springer Nature: Singapore, 2019; Volume 817, pp. 699–705. [Google Scholar]
- Madan, B.; Panchal, A.; Chavan, D. Lung Cancer Detection Using Deep Learning. In Proceedings of the 2nd International Conference on Advances in Science and Technology (ICAST-2019), Makassar, Indonesian, 5–6 November 2019; p. 3. [Google Scholar]
- Available online: https://www.kaggle.com/datasets/mohamedhanyyy/chest-ctscan-images/ (accessed on 4 March 2022).
- Available online: https://www.kaggle.com/datasets/raddar/nodules-in-chest-xrays-lidcidri (accessed on 11 December 2022).
- Available online: https://luna16.grand-challenge.org/Download/ (accessed on 8 March 2023).
- Makaju, S.; Prasad, P.W.C.; Alsadoon, A.; Singh, A.K.; Elchouemi, A. Lung Cancer Detection Using CT Scan Images. Sci. Procedia Comput. Sci. 2018, 125, 107–114. [Google Scholar] [CrossRef]
- Mahersia, H.; Zaroug, M.; Gabralla, L. Lung Cancer Detection on CT Scan Images: A Review on the Analysis Techniques. Int. J. Adv. Res. Artif. Intell. (IJARAI) 2015, 4, 38–45. [Google Scholar] [CrossRef] [Green Version]
- Tun, K.M.M.; Khaing, A.S. Feature Extraction and Classification of Lung Cancer Nodule Using Image Processing Techniques. Int. J. Eng. Res. Technol. (IJERT) 2014, 3, 2204–2210. [Google Scholar]
- Kanitkar, S.; Thombare, N.D.; Lokhande, S.S. Lung Cancer Detection and Classification: A review. Int. J. Eng. Res. Technol. (IJERT) 2013, 2, 2312–2315. [Google Scholar]
Properties | Chest CT Scans [17] | LIDC-IDRI [18] | LUNA16 [19] |
---|---|---|---|
Number of images | 1000 | 463 | 888 |
Size | 125 MB | 4 GB | 66 GB |
Ground truth | Yes | Yes | Yes |
Type | CT scan | X-rays | CT scan |
Parameter | Used Value |
---|---|
Input size (n) | 224 × 224 |
Activation (ACT) | ReLU |
Kernel size (KS) | 3 ×3 |
Pool filtering size (np) | 2 × 2 |
Stride number (sn) | 4 |
Padding (pad) | Same |
Optimizer | Adam |
Parameter | Value |
---|---|
Number of cells (nc) | 50 |
Number of units in every cell (nu) | 64 |
Activation (ACT) | Tanh |
Optimizer (opt) | Adam |
Performance Metric | Evaluated Value: N = 850 Images |
---|---|
TP | 796 Adenocarcinoma = 245, LCC = 161, SCC = 233 |
TN | 17 |
FN | 29 |
FP | 8 |
Accuracy | 95.647% |
Precision | 99% |
Recall | 96.484% |
F-score | 97.726% |
Performance Metric | Evaluated Value: N = 850 Images |
---|---|
TP | 811 Adenocarcinoma = 273, LCC = 201, SCC = 337 |
TN | 24 |
FN | 9 |
FP | 6 |
Accuracy | 98.235% |
Precision | 99.266% |
Recall | 98.902% |
F-score | 99.084% |
Performance Metric | Evaluated Value: N = 850 Images |
---|---|
TP | 833 Adenocarcinoma = 307, LCC = 289, SCC = 237 |
TN | 11 |
FN | 2 |
FP | 1 |
Accuracy | 99.42%% |
Precision | 99.880% |
Recall | 99.760% |
F-score | 99.820% |
Works | Utilized Technology | Precision | Recall | Accuracy |
---|---|---|---|---|
Sousa et al., 2021 [2] | Fused image technique | N.M. | 89% | 99% |
Nazir et al., 2021 [3] | LP + ASR | 89% | N.M. | 99% |
Al-Yasriy et al., 2020 [5] | CNN | 95.714% | N.M. | 93.548% |
Hasan et al., 2019 [7] | Image processing | N.M. | N.M. | 72.2% |
Nasser and Abu-Naser, 2019 [9] | ANN | N.M. | N.M. | 96.67% |
Madan et al., 2019 [16] | XGBoost + RFA | N.M. | N.M. | 84% |
The proposed algorithm | VGG-19 and LSTMs | 99.42% | 99.880% | 99.760% |
Predicted Class | True Class | ||||
Class A 310 | Class B 289 | Class C 237 | Class D 14 | ||
Class A | 307 = (99.032%) | 0 | 0 | 1 = (7.143%) | |
Class B | 1 = (0.323%) | 286 = (98.962%) | 0 | 2 = (0.692%) | |
Class C | 0 | 0 | 237 = (100%) | 0 | |
Class D | 2 = (14.286%) | 0 | 0 | 12 = (85.714%) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alsheikhy, A.A.; Said, Y.; Shawly, T.; Alzahrani, A.K.; Lahza, H. A CAD System for Lung Cancer Detection Using Hybrid Deep Learning Techniques. Diagnostics 2023, 13, 1174. https://doi.org/10.3390/diagnostics13061174
Alsheikhy AA, Said Y, Shawly T, Alzahrani AK, Lahza H. A CAD System for Lung Cancer Detection Using Hybrid Deep Learning Techniques. Diagnostics. 2023; 13(6):1174. https://doi.org/10.3390/diagnostics13061174
Chicago/Turabian StyleAlsheikhy, Ahmed A., Yahia Said, Tawfeeq Shawly, A. Khuzaim Alzahrani, and Husam Lahza. 2023. "A CAD System for Lung Cancer Detection Using Hybrid Deep Learning Techniques" Diagnostics 13, no. 6: 1174. https://doi.org/10.3390/diagnostics13061174
APA StyleAlsheikhy, A. A., Said, Y., Shawly, T., Alzahrani, A. K., & Lahza, H. (2023). A CAD System for Lung Cancer Detection Using Hybrid Deep Learning Techniques. Diagnostics, 13(6), 1174. https://doi.org/10.3390/diagnostics13061174