An Efficient Machine Learning Approach for Diagnosing Parkinson’s Disease by Utilizing Voice Features
Abstract
:1. Introduction
1.1. Machine Learning-Based Detection of Parkinson’s Disease
1.2. Research Problem and Motivation
1.3. Contribution
- To identify which machine learning algorithms, such as SVM, KNN, naive Bayes, and ANN, offer the most accurate classifications and diagnosis of Parkinson’s disease.
- To develop statistical evaluations for the diagnosis of Parkinson’s disease in order to identify the frequency at which the best training and test results will be acquired, and consequently to assist in upcoming literature-based research.
- The proposed system has used an ANN classifier to attain the maximum classification accuracy when compared to the approaches used in earlier research.
- In order to improve the prediction of PD, a comprehensive methodology was employed to explore the effectiveness and efficiency of various feature selection approaches.
- The proposed model is examined with four machine learning methods, including SVM, naive Bayes, k-NN, and ANN, as well as with earlier and more current studies on PD detection.
1.4. Structure of Proposed Work
2. Related Works
3. Proposed Work
Feature Selection
4. Materials and Methods
4.1. Dataset
4.2. Parkinson’s Disease Diagnosis Based on Voice Analysis and Machine Learning
4.3. Classification of Parkinson’s Disease with ML Classifier
4.4. Building of Machine Learning Techniques with Classifier Evaluation Metrics
5. Experiments and Results
5.1. SVM-Classifier
5.2. Naive Bayes Classifier
5.3. Artificial Neural Network
- Identifying the responsibility and function of ANN in the detection of this disease.
- Making observations on labels and features of datasets.
- Grouping the types of the studied disease centered on their symptoms.
- Examining the accurate outcomes.
5.4. K-Nearest Neighbor
5.5. Summary of Evaluation Results
6. Comparative Study and Discussion
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- DeMaagd, G.; Philip, A. Parkinson’s Disease and Its Management: Part 1: Disease Entity, Risk Factors, Pathophysiology, Clinical Presentation, and Diagnosis. Pharm. Ther. 2015, 40, 504–532. [Google Scholar]
- Rizek, P.; Kumar, N.; Jog, M.S. An update on the diagnosis and treatment of Parkinson disease. CMAJ 2016, 188, 1157–1165. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Available online: https://www.who.int/news-room/fact-sheets/detail/parkinson-disease (accessed on 30 October 2022).
- De Rijk, M.C.; Launer, L.J.; Berger, K.; Breteler, M.M.; Dartigues, J.F.; Baldereschi, M.; Fratiglioni, L.; Lobo, A.; Martinez-Lage, J.; Trenkwalder, C.; et al. Prevalence of Parkinson’s disease in Europe: A collaborative study of population-based cohorts. Neuro-logic Diseases in the Elderly Research Group. Neurology 2000, 54 (Suppl. S5), S21–S23. [Google Scholar] [PubMed]
- Cantürk, İ.; Karabiber, F. A machine learning system for the diagnosis of Parkinson’s disease from speech signals and its application to multiple speech signal types. Arab. J. Sci. Eng. 2016, 41, 5049–5059. [Google Scholar] [CrossRef]
- Singh, N.; Pillay, V.; Choonara, Y.E. Advances in the treatment of Parkinson’s disease. Prog. Neurobiol. 2007, 81, 29–44. [Google Scholar] [CrossRef]
- Rana, A.; Rawat, A.S.; Bijalwan, A.; Bahuguna, H. Application of multi-layer (perceptron) artificial neural network in the diagnosis system: A systematic review. In Proceedings of the 2018 International Conference on Research in Intelligent and Computing in Engineering (RICE), San Salvador, El Salvador, 22–24 August 2018; pp. 1–6. [Google Scholar]
- Lakany, H. Extracting a diagnostic gait signature. Pattern Recognit. 2008, 41, 1627–1637. [Google Scholar] [CrossRef]
- Figueiredo, J.; Santos, C.P.; Moreno, J.C. Automatic recognition of gait patterns in human motor disorders using machine learning: A review. Med. Eng. Phys. 2018, 53, 1–12. [Google Scholar] [CrossRef]
- Hazan, H.; Hilu, D.; Manevitz, L.; Ramig, L.O.; Sapir, S. Early diagnosis of Parkinson’s disease via machine learning on speech data. In Proceedings of the 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel, Eilat, Israel, 14–17 November 2012; pp. 1–4. [Google Scholar] [CrossRef]
- Karan, B.; Sahu, S.S.; Mahto, K. Parkinson disease prediction using intrinsic mode function based features from speech signal. Biocybern. Biomed. Eng. 2019, 40, 249–264. [Google Scholar] [CrossRef]
- Frid, A.; Safra, E.J.; Hazan, H.; Lokey, L.L.; Hilu, D.; Manevitz, L.; Ramig, L.O.; Sapir, S. Computational diagnosis of Parkinson’s Disease directly from natural speech using machine learning techniques. In Proceedings of the 2014 IEEE International Conference on Software Science, Technology and Engineering, Washington, DC, USA, 11–12 June 2014; pp. 50–53. [Google Scholar]
- Rawat, A.S.; Rana, A.; Kumar, A.; Bagwari, A. Application of multi layer artificial neural network in the diagnosis system: A systematic review. IAES Int. J. Artif. Intell. 2018, 7, 138. [Google Scholar] [CrossRef] [Green Version]
- KarimiRouzbahani, H.; Daliri, M.R. Diagnosis of Parkinson’s Disease in Human Using Voice Signals. BCN 2011, 2, 12–20. [Google Scholar]
- Khamparia, A.; Gupta, D.; Nguyen, N.G.; Khanna, A.; Pandey, B.; Tiwari, P. Sound Classification Using Convolutional Neural Network and Tensor Deep Stacking Network. IEEE Access 2019, 7, 7717–7727. [Google Scholar] [CrossRef]
- Bourouhou, A.; Jilbab, A.; Nacir, C.; Hammouch, A. Comparison of classification methods to detect the parkinson disease. In Proceedings of the 2016 International Conference on Electrical and Information Technologies (ICEIT), Tangiers, Morocco, 4–7 May 2016; pp. 421–424. [Google Scholar]
- Sharma, A.; Giri, R.N. Automatic Recognition of Parkinson’s Disease via Artificial Neural Network and Support Vector Machine. Int. J. Innov. Technol. Explor. Eng. (IJITEE) 2014, 4, 7. [Google Scholar]
- Purwins, H.; Li, B.; Virtanen, T.; Schluter, J.; Chang, S.-Y.; Sainath, T.N. Deep Learning for Audio Signal Processing. IEEE J. Sel. Top. Signal Process. 2019, 13, 206–219. [Google Scholar] [CrossRef] [Green Version]
- Zhang, L.; Qu, Y.; Jin, B.; Jing, L.; Gao, Z.; Liang, Z. An Intelligent Mobile-Enabled System for Diagnosing Parkinson Disease: Development and Validation of a Speech Impairment Detection System. JMIR Public Health Surveill. 2020, 8, e18689. [Google Scholar] [CrossRef]
- Kadiri, S.R.; Kethireddy, R.; Alku, P. Parkinson’s Disease Detection from Speech Using Single Frequency Filtering Cepstral Coefficients. In Proceedings of the Interspeech 2020, Shanghai, China, 25–29 October 2020. [Google Scholar] [CrossRef]
- Pramanik, M.; Pradhan, R.; Nandy, P.; Bhoi, A.K.; Barsocchi, P. Machine Learning Methods with Decision Forests for Parkinson’s Detection. Appl. Sci. 2021, 11, 581. [Google Scholar] [CrossRef]
- Gunduz, H. Deep Learning-Based Parkinson’s Disease Classification Using Vocal Feature Sets. IEEE Access 2019, 7, 115540–115551. [Google Scholar] [CrossRef]
- Available online: https://www.dataversity.net/improving-clinical-insights-machine-learning/# (accessed on 27 August 2022).
- Kohlschein, C.; Schmitt, M.; Schuller, B.; Jeschke, S.; Werner, C.J. A machine learning based system for the automatic evaluation of aphasia speech. In Proceedings of the 2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom), Dalian, China, 12–15 October 2017; pp. 1–6. [Google Scholar] [CrossRef]
- Bertini, F.; Allevi, D.; Lutero, G.; Montesi, D.; Calzà, L. Automatic Speech Classifier for Mild Cognitive Impairment and Early Dementia. ACM Trans. Comput. Healthc. 2022, 3, 1–11. [Google Scholar] [CrossRef]
- Matcham, F.; On Behalf of the RADAR-CNS Consortium; Pietro, C.B.D.S.; Bulgari, V.; de Girolamo, G.; Dobson, R.; Eriksson, H.; Folarin, A.A.; Haro, J.M.; Kerz, M.; et al. Remote assessment of disease and relapse in major depressive disorder (RADAR-MDD): A multi-centre prospective cohort study protocol. BMC Psychiatry 2019, 19, 72. [Google Scholar] [CrossRef] [Green Version]
- Sakar, C.O.; Serbes, G.; Gunduz, A.; Tunc, H.C.; Nizam, H.; Sakar, B.E.; Tutuncu, M.; Aydin, T.; Isenkul, M.E.; Apaydin, H. A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable-factor wavelet transform. Appl. Soft Comput. 2019, 74, 255–263. [Google Scholar] [CrossRef]
- Yasar, A.; Saritas, I.; Sahman, M.A.; Cinar, A.C. Classification of Parkinson disease data with artificial neural networks. In Proceedings of the IOP Conference Series: Materials Science and Engineering, Wuhan, China, 10–12 October 2019; Volume 675, p. 012031. [Google Scholar]
- Avuçlu, E.; Elen, A. Evaluation of train and test performance of machine learning algorithms and Parkinson diagnosis with statistical measurements. Med. Biol. Eng. Comput. 2020, 58, 2775–2788. [Google Scholar] [CrossRef]
- Marar, S.; Swain, D.; Hiwarkar, V.; Motwani, N.; Awari, A. Predicting the occurrence of Parkinson’s Disease using various Classification Models. In Proceedings of the 2018 International Conference on Advanced Computation and Telecommunication (ICACAT), Bhopal, India, 28–29 December 2018; pp. 1–5. [Google Scholar]
- Nikookar, E.; Sheibani, R.; Alavi, S.E. An ensemble method for diagnosis of Parkinson’s disease based on voice measurements. J. Med. Signals Sens. 2019, 9, 221–226. [Google Scholar] [CrossRef] [PubMed]
- Tracy, J.M.; Özkanca, Y.; Atkins, D.C.; Ghomi, R.H. Investigating voice as a biomarker: Deep phenotyping methods for early detection of Parkinson’s disease. J. Biomed. Inform. 2019, 104, 103362. [Google Scholar] [CrossRef] [PubMed]
- Cibulka, M.; Brodnanova, M.; Grendar, M.; Grofik, M.; Kurca, E.; Pilchova, I.; Osina, O.; Tatarkova, Z.; Dobrota, D.; Kolisek, M. SNPs rs11240569, rs708727, and rs823156 in SLC41A1 Do Not Discriminate Between Slovak Patients with Idiopathic Parkinson’s Disease and Healthy Controls: Statistics and Machine-Learning Evidence. Int. J. Mol. Sci. 2019, 20, 4688. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Hsu, S.-Y.; Lin, H.-C.; Chen, T.-B.; Du, W.-C.; Hsu, Y.-H.; Wu, Y.-C.; Tu, P.-W.; Huang, Y.-H.; Chen, H.-Y. Feasible Classified Models for Parkinson Disease from 99mTc-TRODAT-1 SPECT Imaging. Sensors 2019, 19, 1740. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Drotár, P.; Mekyska, J.; Rektorová, I.; Masarová, L.; Smékal, Z.; Faundez-Zanuy, M. Evaluation of handwriting kinematics and pressure for differential diagnosis of Parkinson’s disease. Artif. Intell. Med. 2016, 67, 39–46. [Google Scholar] [CrossRef]
- Maass, F.; Michalke, B.; Willkommen, D.; Leha, A.; Schulte, C.; Tönges, L.; Mollenhauer, B.; Trenkwalder, C.; Rückamp, D.; Börger, M.; et al. Elemental fingerprint: Reassessment of a cerebrospinal fluid biomarker for Parkinson’s disease. Neurobiol. Dis. 2019, 134, 104677. [Google Scholar] [CrossRef]
- Mucha, J.; Mekyska, J.; Faundez-Zanuy, M.; Lopez-De-Ipina, K.; Zvoncak, V.; Galaz, Z.; Kiska, T.; Smekal, Z.; Brabenec, L.; Rektorova, I. Advanced Parkinson’s Disease Dysgraphia Analysis Based on Fractional Derivatives of Online Handwriting. In Proceedings of the 2018 10th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT), Moscow, Russia, 5–9 November 2018; pp. 1–6. [Google Scholar] [CrossRef]
- Wenzel, M.; Milletari, F.; Krüger, J.; Lange, C.; Schenk, M.; Apostolova, I.; Klutmann, S.; Ehrenburg, M.; Buchert, R. Automatic classification of dopamine transporter SPECT: Deep convolutional neural networks can be trained to be robust with respect to variable image characteristics. Eur. J. Pediatr. 2019, 46, 2800–2811. [Google Scholar] [CrossRef]
- Segovia, F.; Gorriz, J.M.; Ramirez, J.; Martinez-Murcia, F.J.; Castillo-Barnes, D. Assisted Diagnosis of Parkinsonism Based on the Striatal Morphology. Int. J. Neural Syst. 2019, 29, 1950011. [Google Scholar] [CrossRef] [Green Version]
- Ye, Q.; Xia, Y.; Yao, Z. Classification of Gait Patterns in Patients with Neurodegenerative Disease Using Adaptive Neuro-Fuzzy Inference System. Comput. Math. Methods Med. 2018, 2018, 9831252. [Google Scholar] [CrossRef]
- Klomsae, A.; Auephanwiriyakul, S.; Theera-Umpon, N. (2018). String grammar unsupervised possibilistic fuzzy c-medians for gait pattern classification in patients with neurodegenerative diseases. Comput. Intell. Neurosci. 2018, 2018, 1869565. [Google Scholar] [CrossRef] [Green Version]
- Felix, J.P.; Vieira, F.H.T.; Cardoso, A.A.; Ferreira, M.V.G.; Franco, R.A.P.; Ribeiro, M.A.; Araujo, S.G.; Correa, H.P.; Carneiro, M.L. A Parkinson’s Disease Classification Method: An Approach Using Gait Dynamics and Detrended Fluctuation Analysis. In Proceedings of the 2019 IEEE Canadian Conference of Electrical and Computer Engineering (CCECE), Edmonton, AB, Canada, 5–8 May 2019. [Google Scholar] [CrossRef]
- Andrei, A.-G.; Tautan, A.-M.; Ionescu, B. Parkinson’s Disease Detection from Gait Patterns. In Proceedings of the 2019 E-Health and Bioengineering Conference (EHB), Iasi, Romania, 21–23 November 2019; pp. 1–4. [Google Scholar] [CrossRef]
- Priya, S.J.; Rani, A.J.; Subathra, M.S.P.; Mohammed, M.A.; Damaševičius, R.; Ubendran, N. Local Pattern Transformation Based Feature Extraction for Recognition of Parkinson’s Disease Based on Gait Signals. Diagnostics 2021, 11, 1395. [Google Scholar] [CrossRef]
- Yurdakul, O.C.; Subathra, M.; George, S.T. Detection of Parkinson’s Disease from gait using Neighborhood Representation Local Binary Patterns. Biomed. Signal Process. Control 2020, 62, 102070. [Google Scholar] [CrossRef]
- Li, B.; Yao, Z.; Wang, J.; Wang, S.; Yang, X.; Sun, Y. Improved Deep Learning Technique to Detect Freezing of Gait in Parkinson’s Disease Based on Wearable Sensors. Electronics 2020, 9, 1919. [Google Scholar] [CrossRef]
- Rana, A.; Dumka, A.; Singh, R.; Panda, M.K.; Priyadarshi, N.; Twala, B. Imperative Role of Machine Learning Algorithm for Detection of Parkinson’s Disease: Review, Challenges and Recommendations. Diagnostics 2022, 12, 2003. [Google Scholar] [CrossRef]
- Masoudi-Sobhanzadeh, Y.; MotieGhader, H.; Masoudi-Nejad, A. FeatureSelect: A software for feature selection based on machine learning approaches. BMC Bioinform. 2019, 20, 107. [Google Scholar] [CrossRef]
- Rahmaninia, M.; Moradi, P. OSFSMI: Online stream feature selection method based on mutual information. Appl. Soft Comput. 2018, 68, 733–746. [Google Scholar] [CrossRef]
- Pourbahrami, S. Improving PSO global method for feature selection according to iterations global search and chaotic theory. arXiv 2018, preprint. arXiv:1811.08701. [Google Scholar]
- Yu, L.; Liu, H. Feature selection for high-dimensional data: A fast correlation-based filter solution. In Proceedings of the 20th International Conference on Machine Learning (ICML-03), Washington, DC, USA, 21–24 August 2003; pp. 856–863. [Google Scholar]
- Blum, A.L.; Langley, P. Selection of relevant features and examples in machine learning. Artif. Intell. 1997, 97, 245–271. [Google Scholar] [CrossRef]
- Raileanu, L.E.; Stoffel, K. Theoretical Comparison between the Gini Index and Information Gain Criteria. Ann. Math. Artif. Intell. 2004, 41, 77–93. [Google Scholar] [CrossRef]
- Jolliffe, I.T. Principal Component Analysis, 2nd ed.; Springer: New York, NY, USA, 2002. [Google Scholar]
- Miao, Y.; Lou, X.; Wu, H. The Diagnosis of Parkinson’s Disease Based on Gait, Speech Analysis and Machine Learning Techniques. In Proceedings of the 2021 International Conference on Bioinformatics and Intelligent Computing (BIC 2021). Association for Computing Machinery, New York, NY, USA, 22–24 January 2021; pp. 358–371. [Google Scholar]
- Little, M.; McSharry, P.; Hunter, E.; Spielman, J.; Ramig, L. Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. Nat. Preced. 2008, 1. [Google Scholar] [CrossRef]
- Lichman, M. UCI Machine Learning Repository; University of California, School of Information and Computer Science: Irvine, CA, USA; Available online: http://archive.ics.uci.edu/ml (accessed on 25 September 2022).
- Rewar, S. A systematic review on Parkinson’s disease (PD). Indian J. Res. Pharm. Biotechnol. 2015, 3, 176. [Google Scholar]
- Arora, S.; Venkataraman, V.; Zhan, A.; Donohue, S.; Biglan, K.; Dorsey, E.; Little, M. Detecting and monitoring the symptoms of Parkinson’s disease using smartphones: A pilot study. Park. Relat. Disord. 2015, 21, 650–653. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Miljkovic, D.; Aleksovski, D.; Podpečan, V.; Lavrač, N.; Malle, B.; Holzinger, A. Machine Learning and Data Mining Methods for Managing Parkinson’s Disease. In Machine Learning for Health Informatics; Springer: Cham, Switzerland, 2016; pp. 209–220. [Google Scholar] [CrossRef]
- Challa, K.N.R.; Pagolu, V.S.; Panda, G.; Majhi, B. An improved approach for prediction of Parkinson’s disease using machine learning techniques. In Proceedings of the 2016 International Conference on Signal Processing, Communication, Power and Embedded System (SCOPES), Odisha, India, 3–5 October 2016; pp. 1446–1451. [Google Scholar] [CrossRef] [Green Version]
- Lee, G.S.; Lin, S.H. Changes of rhythm of vocal fundamental frequency in sensorineural hearing loss and in Parkinson’s disease. Chin. J. Physiol. 2009, 52, 446–450. [Google Scholar] [CrossRef] [PubMed]
- Asmae, O.; Abdelhadi, R.; Bouchaib, C.; Sara, S.; Tajeddine, K. Parkinson’s Disease Identification using KNN and ANN Algorithms based on Voice Disorder. In Proceedings of the 2020 1st International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET), Meknes, Morocco, 16–19 April 2020; Institute of Electrical and Electronics Engineers (IEEE): New York, NY, USA, 2020; pp. 1–6. [Google Scholar]
- Wu, X.; Kumar, V.; Ross Quinlan, J.; Ghosh, J.; Yang, Q.; Motoda, H.; Steinberg, D. Top 10 algorithms in data mining. Knowl. Inf. Syst. 2008, 14, 1–37. [Google Scholar] [CrossRef] [Green Version]
- Ray, P.K.; Mohanty, A.; Panigrahi, T. Power quality analysis in solar PV integrated microgrid using independent component analysis and support vector machine. Optik 2019, 180, 691–698. [Google Scholar] [CrossRef]
- Lahmiri, S.; Shmuel, A. Detection of Parkinson’s disease based on voice patterns ranking and optimized support vector machine. Biomed. Signal Process. Control 2018, 49, 427–433. [Google Scholar] [CrossRef]
- Bhatia, A.; Sulekh, R. Predictive Model for Parkinson’s disease through Naïve Bayes Classification. Int. J. Comput. Sci. Commun. 2017, 9, 194–202. [Google Scholar]
- Rana, A.; Bahuguna, H.; Bijalwan, A. Artificial Neural Network based Diagnosis System. Int. J. Comput. Trends Technol. 2017, 3, 189–191. [Google Scholar]
- Alzubaidi, M.S.; Shah, U.; DhiaZubaydi, H.; Dolaat, K.; Abd-Alrazaq, A.A.; Ahmed, A.; Househ, M. The Role of Neural Network for the Detection of Parkinson’s Disease: A Scoping Review. Healthcare 2021, 9, 740. [Google Scholar] [CrossRef]
- Sakar, B.E.; Isenkul, M.E.; Sakar, C.O.; Sertbas, A.; Gurgen, F.; Delil, S.; Apaydin, H.; Kursun, O. Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings. IEEE J. Biomed. Health Inform. 2013, 17, 828–834. [Google Scholar] [CrossRef]
- Vadovský, M.; Paralič, J. Parkinson’s disease patients classification based on the speech signals. In Proceedings of the 2017 IEEE 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI), Herlany, Slovakia, 26–28 January 2017; pp. 000321–000326. [Google Scholar]
- Ouhmida, A.; Raihani, A.; Cherradi, B.; Terrada, O. A Novel Approach for Parkinson’s Disease Detection Based on Voice Classification and Features Selection Techniques. Int. J. Online Biomed. Eng. 2021, 17, 111–130. [Google Scholar] [CrossRef]
- Mabrouk, R.; Chikhaoui, B.; Bentabet, L. Machine Learning Based Classification Using Clinical and DaTSCAN SPECT Imaging Features: A Study on Parkinson’s Disease and SWEDD. IEEE Trans. Radiat. Plasma Med. Sci. 2018, 3, 170–177. [Google Scholar] [CrossRef]
- Benba, A.; Jilbab, A.; Hammouch, A. Using Human Factor Cepstral Coefficient on Multiple Types of Voice Recordings for Detecting Patients with Parkinson’s Disease. Irbm 2017, 38, 346–351. [Google Scholar] [CrossRef]
Reference | Feature | Machine Learning Algorithms Used | Objective | Tools Used | Source of Data | No. of Subjects | Outcomes |
---|---|---|---|---|---|---|---|
Sakar et al., 2019 [27] | Speech | Naïve Bayes, Logistic Regression, SVM (RBF and Linear), KNN, Random Forest, MLP | Classification of PD from HC | JupyterLab with python programming language | Collected from participants | 252, 188 PD + 64 HC | Highest accuracy obtained from SVM (RBF)—86% |
Yasar A. et al., 2019 [28] | Speech | Artificial Neural Network | Classification of PD from HC | MATLAB | Collected from participants | 80, 40 PD + 40 HC | Accuracy of ANN—94.93% |
Avuçlu, E., Elen, A, 2020 [29] | Speech | KNN, Random Forest, Naïve Bayes, SVM | Classification of PD from HC | JupyterLab with python programming language | UCI machine learning repository | 31, 23 PD + 8 HC | Accuracy from Naïve Bayes—70.26% |
Marar et al., 2018 [30] | Speech | Naïve Bayes, ANN, KNN, Random Forest, SVM, Logistic Regression, Decision Tree (DT) | Classification of PD from HC | R programming | Collected from participants | 31, 23 PD + 8 HC | Highest accuracy obtained from ANN—94.87% |
Sheibani R et al., 2019 [31] | Speech | Ensemble Based Method | Classification of PD from HC | JupyterLab with python programming language | UCI machine learning repository | 31, 23 PD + 8 HC | Accuracy obtained from ensemble learning—90.6%, |
John M. Tracy et al., 2020 [32] | Speech | Logistic Regression (L2-Regularized), Random Forest, Gradient Boosted Trees | Classification of PD from HC | Python | mPower database | 2289, 246 PD + 2023 HC | Highest accuracy obtained from gradient boosted trees Recall—79.7%, Precision—90.1%, F1-score—83.6% |
Cibulka et al., 2019 [33] | Handwriting Patterns | Random Forest | Classification of PD from HC | Not mentioned | Collected from participants | 270, 150 PD + 120 HC | Classification error for rs11240569, rs708727, rs823156 is 49.6%, 44.8%, 49.3% respectively. |
Hsu S-Y et al., 2019 [34] | Handwriting Patterns | SVM with RBF Kernel, Logistic Regression | Classification of PD from HC | Weka | PACS | 202, 94 Severe PD + 102 mild PD + 6 HC | Highest accuracy obtained from SVM-RBF 83.2% having sensitivity 82.8%, specificity 100% |
Drotár, P et al., 2016 [35] | Handwriting Patterns | K-NN, Ensemble AdaBoostClassifier, Support Vector Machine | Classification of PD from HC | Python [scikit-learn library] | PaHaW database | 37 PD and 38 HC | Accuracy—81.3% |
Fabian Maass et al., 2020 [36] | Handwriting Patterns | SVM | Classification of PD from HC | Weka | UCI machine learning repository | 157, 82 PD + 68 HC +7 Normal Pressure Hydrocephalus (NPH) | sensitivity-80%, and specificity—83% |
J. Mucha et al., 2018 [37] | Handwriting Patterns | Random Forest Classifier | Classification of PD from HC | Python Programming | PaHaW database | 69, 33 PD + 36 HC | Obtained classification accuracy-90% with sensitivity 89%, and specificity 91% |
Wenzel et al., 2019 [38] | Handwriting Patterns | CNN | Classification of PD from HC | MATLAB | PPMI database | 645, 438 PD + 207 HC | Accuracy-97.2% |
Segovia, F. et al., 2019 [39] | Handwriting Patterns | SVM with 10 Cross Validation | Classification of PD from HC | Python programming | Virgen De La Victoria Hospital, Malaga, Spain | 189, 95 PD + 94 HC | Accuracy-94.25% |
Ye, Q. et al., 2018 [40] | Gait | Least Square (LS)—SVM, Particle Swarm Optimization (PSO) | Classification of PD, ALS, HD from HC | Not mentioned | Neurology Outpatient Clinic at Massachusetts General Hospital, Boston, MA, USA | 64, 15 PD + 16 HC + 13 (Amyotrophic lateralsclerosis disease (ALS)) + 20 (Huntington’s disease (HD)) | Accuracy to diagnose PD from HC- 90.32%, Accuracy to diagnose HD from HC-94.44%, Accuracy to diagnose ALS from HC- 93.10% |
Klomsae, A et al., 2018 [41] | Gait | Fuzzy KNN | Classification of PD, ALS, HD from HC | Not mentioned | Neurology Outpatient Clinic at Massachusetts General Hospital, Boston, MA, USA | 64, 15 PD + 20 HD + 13 ALS + 16 HC | Accuracy to diagnose PD from HC- 96.43%, Accuracy to diagnose HD from HC-97.22%, Accuracy to diagnose ALS from HC-96.88% |
J. P. Félix et al., 2019 [42] | Gait | SVM, KNN, Naïve Bayes, LDA, Decision Tree | Classification of PD from HC | MATLAB R2017a | Neurology Outpatient Clinic at Massachusetts General Hospital, Boston, MA, USA | 31, 15 PD + 16 HC | Highest accuracy obtained from SVM, KNN, and decision tree- 96.8% |
Andrei et al., 2019 [43] | Gait | SVM | Classification of PD from HC | Not mentioned | Laboratory for Gait and Neurodynamics | 166, 93 PD + 73 HC | Accuracy-100% |
Priya SJ et al., 2021 [44] | Gait | ANN | Classification of PD from HC | MATLAB R2018b | Laboratory for Gait and Neurodynamics | 166, 93 PD + 73 HC | Accuracy-96.28% |
Oğul, et al., 2020 [45] | Gait | ANN | Classification of PD from HC | MATLAB | Laboratory for Gait and Neurodynamics | 166, 93 PD + 73 HC | Classification accuracy-98.3% |
Li B et al., 2020 [46] | Gait | Deep CNN | Classification of PD from HC | Not mentioned | Collected from participants | 20, 10 PD + 10 HC | Accuracy-91.9% |
Dataset Characteristic | Multivariate |
No. of Instances | 197 |
Attributes Characteristic | Real |
No. of Attributes | 23 |
Missing Values | N/A |
Made by | Max Little of the University of Oxford |
Associated Tasks | Classification |
Types of Classification | Binary {0 for healthy and 1 for PD patient} |
Name | Results |
---|---|
Accuracy Score of test data | 87.17% |
Accuracy Score of training data | 88.46% |
Execution Time | 0.03111 s |
F1-score | 66.19% |
MCC | 56.59% |
Name | Results |
---|---|
Accuracy Rate of test data | 74.11% |
Accuracy Rate of training data | 76.23% |
Execution Time | 0.0323 s |
F1-score | 86.74% |
MCC | 66.56% |
Title | Results |
---|---|
Accuracy Rate of test data | 96.7% |
Accuracy Rate of training data | 97.4% |
Execution Time | 0.025 s |
F1-Score | 87.01% |
MCC | 70.11% |
Name | Results |
---|---|
Accuracy Rate of test data | 87.17% |
Accuracy Rate of training data | 88.46% |
Execution Time | 0.03111 s |
F1-score | 71% |
MCC | 65.02% |
Performance Measure | ||||||
---|---|---|---|---|---|---|
Accuracy | F1-Score | MCC | Sensitivity | Specificity | ||
Training Dataset | Test Dataset | |||||
SVM | 88.46% | 87.17% | 66.19% | 56.59% | 62.5% | 93.54% |
Naïve Bayes | 76.23% | 74.11% | 86.74% | 66.56% | 84% | 79.76% |
KNN | 88.46% | 87.17% | 71% | 65.02% | 60.0% | 93.54% |
ANN | 97.4% | 96.7% | 87.01% | 70.11% | 92.42% | 91.25% |
Reference | Basis | Machine Learning Classifier | Accuracy | Sensitivity | Specificity |
---|---|---|---|---|---|
Sakar et al. [70] | Speech | SVM and KNN | 68.45% | 60 | 50 |
Vadovsk’y and Parali [71] | Speech | C4.5 + C5.0 + randomforest + CART | 66.5 | NA | NA |
Ouhmida, A. [72] | Speech | SVM, K-NN, Decision Tree | 98.26% (AUC) | NA | NA |
Mabrouk et al., [73] | Speech | Random forest, SVM, MLP, KNN | 78.4% (SVM), 82.2% (KNN) | NA | NA |
Benba et al. [74] | Speech | HFCC-SVM | 87.5% | 90% | 85% |
Proposed Work | Speech | SVM, naïve Bayes, KNN and ANN | 87.17%, 74.11%, 87.17%, and 96.7% | 62.5%, 84%, 60%, and 92.42% | 93.54%, 79.76%, 93.54%, and 91.25% |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Rana, A.; Dumka, A.; Singh, R.; Rashid, M.; Ahmad, N.; Panda, M.K. An Efficient Machine Learning Approach for Diagnosing Parkinson’s Disease by Utilizing Voice Features. Electronics 2022, 11, 3782. https://doi.org/10.3390/electronics11223782
Rana A, Dumka A, Singh R, Rashid M, Ahmad N, Panda MK. An Efficient Machine Learning Approach for Diagnosing Parkinson’s Disease by Utilizing Voice Features. Electronics. 2022; 11(22):3782. https://doi.org/10.3390/electronics11223782
Chicago/Turabian StyleRana, Arti, Ankur Dumka, Rajesh Singh, Mamoon Rashid, Nazir Ahmad, and Manoj Kumar Panda. 2022. "An Efficient Machine Learning Approach for Diagnosing Parkinson’s Disease by Utilizing Voice Features" Electronics 11, no. 22: 3782. https://doi.org/10.3390/electronics11223782
APA StyleRana, A., Dumka, A., Singh, R., Rashid, M., Ahmad, N., & Panda, M. K. (2022). An Efficient Machine Learning Approach for Diagnosing Parkinson’s Disease by Utilizing Voice Features. Electronics, 11(22), 3782. https://doi.org/10.3390/electronics11223782