Article

Enhancing Cancerous Gene Selection and Classification for High-Dimensional Microarray Data Using a Novel Hybrid Filter and Differential Evolutionary Feature Selection

1 Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, P.O. Box 344, Rabigh 21911, Saudi Arabia
2 Information Technology Department, Faculty of Computing and Information Technology, King Abdulaziz University, P.O. Box 344, Rabigh 21911, Saudi Arabia
3 Department of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, P.O. Box 344, Rabigh 21911, Saudi Arabia
* Authors to whom correspondence should be addressed.
Cancers 2024, 16(23), 3913; https://doi.org/10.3390/cancers16233913
Submission received: 18 September 2024 / Revised: 6 November 2024 / Accepted: 19 November 2024 / Published: 22 November 2024

Simple Summary

To improve cancer classification performance on high-dimensional microarray datasets, this work proposes combining filter feature selection techniques with the differential evolution (DE) algorithm. The genes (features) of high-dimensional microarray datasets are first scored by several common filter methods; only the highest-ranked features are kept, while superfluous and irrelevant ones are eliminated to decrease the dimensionality of the datasets. The retained features are then optimized further by DE, producing noticeably better classification results. This can lead to an outstanding improvement in cancer classification using only a small fraction of the features in the microarray datasets.

Abstract

Background: In recent years, microarray datasets have been used to store information about human genes and their expression levels in order to diagnose cancer successfully in its early stages. However, most microarray datasets contain thousands of redundant, irrelevant, and noisy genes, which poses a great challenge for effectively applying machine learning algorithms to these high-dimensional datasets. Methods: To address this challenge, this paper introduces a hybrid filter and differential evolution-based feature selection method that chooses only the most influential genes (features) of high-dimensional microarray datasets to improve cancer diagnosis and classification. The proposed approach is a two-phase hybrid feature selection model: the top-ranked features are first selected by several popular filter feature selection methods, and the most optimal features are then identified by differential evolution (DE) optimization. Several popular machine learning algorithms are then trained on the final training microarray datasets containing only the best features in order to produce outstanding cancer classification results. Four high-dimensional cancerous microarray datasets were used in this study to evaluate the proposed method: the Breast, Lung, Central Nervous System (CNS), and Brain cancer datasets. Results: The experimental results demonstrate that the classification accuracies achieved by the proposed hybrid filter-DE over the filter methods alone increased to 100%, 100%, 93%, and 98% on the Brain, CNS, Breast, and Lung datasets, respectively. Furthermore, the suggested DE-based feature selection removed around 50% of the features selected by the filter methods on these four cancerous microarray datasets. The average accuracy improvement percentages achieved by the proposed methods were up to 42.47%, 57.45%, 16.28%, and 43.57%, compared with 41.43%, 53.66%, 17.53%, and 61.70% in previous works, on the Brain, CNS, Lung, and Breast datasets, respectively. Conclusions: Compared to previous works, the proposed methods accomplished better improvement percentages on the Brain and CNS datasets, comparable improvement percentages on the Lung dataset, and smaller improvement percentages on the Breast dataset.

1. Introduction

Cancer research is widely acknowledged as a highly promising domain for using machine learning. Extensive endeavours have been undertaken to explore prospective approaches for detecting and treating cancer [1].
Cancer is a condition of uncontrolled cellular proliferation that results in the growth of a tumor in the form of a mass or lump. Lung, colon, breast, central nervous system (CNS), liver, kidney, prostate, and brain cancer are among the many types of cancer that can occur. In this research study, we examined four distinct cancer datasets: Lung, Breast, Brain, and Central Nervous System. Lung cancer is one of the most prevalent and deadly cancers worldwide [2]. It can arise in the primary airway, specifically within the lung tissue, resulting in the unregulated proliferation and growth of specific lung cells. Respiratory disorders, including emphysema, are linked to an increased risk of lung cancer development. Breast cancer is one of the most invasive malignancies, predominantly affecting women; it is considered the most severe cancer after lung cancer due to the elevated mortality rate among women [3,4]. The rapid development of abnormal brain cells, indicative of a brain tumor [5,6,7], is a significant health concern for adults, as it can result in severe impairment of organ function and even mortality. A malignant brain tumour rapidly grows and extends to adjacent brain regions. The Central Nervous System (CNS), consisting of the brain and spinal cord, is responsible for numerous biological functions. Spinal cord compression and spinal instability often involve the vertebral and spinal epidural spaces, which are common sites for cancer metastases. Metastases represent the most common type of CNS tumour in adults [8].
Cancer is regarded as one of the primary causes of death. In order to preserve the lives of patients, advanced technologies such as artificial intelligence and machine learning are used to detect cancer at an early stage and accurately predict its type. Cancer diagnosis is performed using several medical datasets, including microarray gene expression data, also known as microarray datasets. Microarray technology offers unique experimental capabilities that have been beneficial to cancer research, and microarray data can be used to evaluate a wide variety of cancer types. The high-dimensional data produced by DNA microarray experiments are known as gene expression data and are widely used to classify and detect malignant disorders [9]. Recent developments in artificial intelligence, specifically machine learning, have simplified data analysis, including the analysis of microarray data. The authors of [10] demonstrated that machine learning algorithms can be employed to analyze microarray datasets for cancer classification. Gene expression values in microarray datasets can thus serve as an effective tool for diagnosing cancer. However, the number of measured genes continues to grow, surpassing hundreds of thousands, while the available datasets remain limited in size, containing only small sets of samples. Therefore, one of the challenges in analyzing microarray datasets used for cancer classification is the curse of dimensionality. A further concern is that current microarray datasets contain numerous redundant and irrelevant features that have a detrimental impact on cancer classification results and computational expense [11]. The presence of duplicated and irrelevant features in very high-dimensional microarray datasets reduces the ability of machine learning techniques to achieve accurate cancer classification and prediction [12]. These characteristics diminish the efficiency of the prediction model and complicate the search for meaningful insights. Consequently, it is necessary to employ feature selection methods in order to enhance the accuracy of machine learning classifiers [13].
In order to enhance the effectiveness of widely used machine learning algorithms, many feature selection techniques have been employed to identify the most important features in malignant microarray datasets [14,15,16,17,18]. Even though filter feature selection approaches offer computational efficiency and the ability to reduce the dimensionality of microarray datasets, their accuracy results are limited since they evaluate features independently of classifiers. On the other hand, wrapper feature selection approaches interact with the classifier throughout the feature evaluation process, resulting in superior outcomes compared to the filter method. Nevertheless, the utilization of wrapper approaches on high-dimensional microarray datasets might be difficult and time-consuming.
In recent years, several evolutionary and bio-inspired algorithms [19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35] have been implemented in literature to obtain the highest level of accuracy in the gene selection challenge. Although feature selection methods based on evolutionary algorithms can overcome the limitations of filter and wrapper methods, they may result in greater computational times for certain machine learning algorithms. Due to the high dimensionality and large number of features in malignant microarray datasets, it is not feasible to initially employ evolutionary algorithms as feature selection approaches. It is essential to reduce the features of microarray cancer datasets using filter feature selection. Then, an evolutionary optimization algorithm can be utilized to optimize the features further to maximize cancer classification performance. This motivated us to suggest a novel hybrid filter-differential evolutionary feature selection method that combines the strengths of both filters and evolutionary techniques to generate effective solutions with improved cancer classification performance for high-dimensional microarray datasets.
Differential Evolution (DE) is one of the superior evolutionary optimization algorithms, inspired by the biological evolution of chromosomes in nature. DE converges well while being straightforward to implement, requiring only a few control parameters, and having low space complexity. These attractive advantages over other competitive optimization algorithms have earned DE widespread recognition for its exceptional efficacy in addressing various optimization challenges. Hence, this study aims to combine the superior performance of the DE optimization algorithm with filter selection methods to improve the classification accuracy on four microarray datasets by highlighting the most important and relevant genes. To the best of our knowledge, this is the first attempt at applying hybrid filter and DE-based gene selection and classification to DNA microarray data. In this paper, we propose a novel approach that combines filter methods with feature selection based on the differential evolution optimization algorithm to identify the most effective subset of features. Six common filter methods were applied in this study to assign a score to each feature in the microarray cancer datasets; they were used to reduce the dimensionality of the datasets by retaining only the highest-ranked features and removing superfluous and irrelevant ones. The DE algorithm was then used to optimize the reduced cancer datasets, resulting in significantly improved cancer classification. Our proposed approach improved cancer classification performance when applied to high-dimensional microarray datasets. The following is a summary of the most important contributions of this paper:
  • Feature reduction using filter algorithms: Even though microarray datasets are widely used in the scientific literature, a recent study [28] found that popular machine learning methods performed poorly at classifying high-dimensional microarray datasets such as the Brain, Breast, Lung, and CNS datasets, since these datasets have limited samples and thousands of redundant, irrelevant, and noisy genes (features). In this paper, we employed six well-known, fast, and effective filter methods—information gain (IG), information gain ratio (IGR), correlation (CR), Gini index (GIND), Relief (RELIEF), and Chi-squared (CHSQR)—to reduce the dimensionality of the microarray datasets by ranking the features and then selecting only the top 5% of ranked features in order to enhance cancer classification performance.
  • Optimal feature selection using DE: The performance improvement achieved by machine learning algorithms on high-dimensional microarray datasets using filter methods alone is inadequate, since filter methods assess each feature separately by measuring the correlation between that feature and the class label. Therefore, the DE optimization algorithm is employed in the second phase to capture the correlation between a set of the best features and the class label. This further optimizes the features obtained from the filter methods and enhances cancer classification performance. DE is able to remove almost 50% of the irrelevant features retained by the filter feature selection methods.
  • Maximizing cancer classification performance: The proposed hybrid filter-DE feature selection methods achieved the two main objectives stated in this study: reducing dimensionality and increasing cancer classification performance on the four microarray datasets used. The experimental results show that the proposed methods accomplished excellent performance with fewer important features: 100% classification accuracy with only 121 features on the Brain dataset, 100% with only 156 features on CNS, 98% with only 296 features on Lung, and 93% with only 615 features on Breast. The average accuracy improvement percentages achieved by the proposed methods were up to 42.47%, 57.45%, 16.28%, and 43.57%, compared with 41.43%, 53.66%, 17.53%, and 61.70% in previous works, on the Brain, CNS, Lung, and Breast datasets, respectively. Compared to previous works, the proposed methods accomplished better improvement percentages on the Brain and CNS datasets, comparable improvement percentages on the Lung dataset, and smaller improvement percentages on the Breast dataset.
The remainder of this paper is organized as follows. Section 2 discusses recent works related to cancerous gene selection and classification for high-dimensional microarrays. Section 3 describes the proposed methodology and elaborates on the phases of the proposed hybrid filter-DE feature selection methods. The experimental results and discussion of the proposed hybrid filter-DE are presented in Section 4. Finally, the paper is concluded, and future work is recommended, in Section 5.

2. Related Works

This section presents and investigates existing hybrid feature selection approaches that have recently been applied to cancer microarray datasets to improve cancer classification results.
Karthika et al. [20] employed a mixture model (MM) in addition to the Fast Fourier Transform (FFT) on microarray gene expression data for dimensionality reduction. To select effective features, they employed the Dragonfly optimization technique. Nonlinear regression, DT, RF, and SVM were used as classifiers, and the classifiers' performance was evaluated both with and without feature selection. Finally, hyper-parameter tuning techniques such as Adaptive Moment Estimation (Adam) and Random Adaptive Moment Estimation (RanAdam) were used to improve the classifiers, resulting in an accuracy of approximately 98% with the SVM classifier. This research did not address computational complexity or model validation techniques. In addition, the study has notable limitations, including population-specific findings, reliance on microarray gene expression (MAGE) data, and sensitivity to outliers.
Elbashir et al. [21] suggested a graph attention network (GAT) model that utilizes diverse mRNA and miRNA multi-omics data to predict the survival rate of non-small cell lung cancer (NSCLC). Chi-square analysis was used to select the most significant features for the model. They used the synthetic minority oversampling technique (SMOTE) to balance the dataset and evaluated the model, using the concordance index (C-index), on different sets of omics data. When using combined mRNA and miRNA data, they obtained the highest C-index (0.82) along with an accuracy of 0.75. A significant limitation of this research is that Chi-square approaches cannot be regarded as the ideal feature selection method for highly complicated and correlated biological data.
Zamri et al. [22] presented a two-stage feature selection model based on hybrid metaheuristic optimization. The SKF-MUT simulated Kalman filter was used in this study to pick microarray features that improve ANN classification accuracy. The experimental results were validated on eight binary and multiclass benchmark datasets; SKF-MUT effectively selected an appropriate number of features and achieved 95–100% classification accuracy. The significant limitations of this study include model evaluation relying on accuracy alone; other metrics such as precision, recall, F1-score, or AUC-ROC might better assess the model's performance. Further, the computational cost of feature selection was not discussed.
Ali et al. [23] presented a hybrid filter-genetic feature selection method to reduce microarray dataset dimensionality. The first phase of this work used three filter methods—information gain (IG), information gain ratio (IGR), and Chi-squared (CS)—to pick the most relevant microarray dataset features. The second phase used a genetic algorithm to optimize the features selected in the first phase. The proposed method was validated on breast, lung, CNS, and brain cancer microarray datasets. Experimental results indicated that the suggested model improved the performance of various common machine learning approaches in terms of accuracy, recall, precision, and F-measure, with reported accuracies ranging from 92 to 100%. The limitations of this work include the undiscussed computational cost of the feature selection process and the absence of statistical validation.
Elemam and Elshrkawey [24] introduced a two-stage hybrid feature selection method. They began with feature evaluation methods comprising chi-squared, F-statistics, and mutual information (MI) filters. In the second phase, they employed wrapper-based sequential forward selection with ML models such as SVM, DT, RF, and KNN classifiers to find the optimal set of features. The model was then rigorously tested and validated using lung cancer, ovarian cancer, leukemia, and SRBCT datasets. The results were impressive, with an accuracy rate of almost 100 percent and a minimal number of selected features. However, the study's performance was measured solely through accuracy, the issue of feature redundancy was not adequately addressed, and no statistical tests were conducted for model validation, which are the limitations of this work.
In a recent study, Abasabadi et al. [25] proposed a novel hybrid feature selection method to address the challenge of high dimensionality in microarray datasets. The methodology combines a filter approach (SLI-γ) with a genetic algorithm (GA). In the initial phase, 99% of irrelevant features were eliminated using SLI-γ. The second phase involved the GA optimization of the remaining relevant features to enhance classification accuracy. The results of this method were not only enhanced performance but also a significant reduction in execution time, which is a remarkable achievement. However, the inherent computational complexity associated with GA-based optimization remains a challenge, especially as the dimensionality of datasets increases.
Almutiri et al. [26] proposed a hybrid feature selection method, GI-SVM-RFE, to improve classification accuracy on high-dimensional microarray datasets. The methodology combines the Gini index and SVM-RFE to select informative genes recursively. The results showed enhanced classification accuracy, reported as 90.67%, compared to methods without feature selection or using only the Gini index or SVM-RFE. However, the model was not statistically validated.
Similarly, Xie et al. [27] proposed the Multi-Fitness RankAggreg Genetic Algorithm (MFRAG). The methodology employed a genetic algorithm framework to integrate nine feature selection techniques. It uses an ensemble model to assess fitness and guide the evolutionary process. The results indicated that MFRAG demonstrated exceptional performance, achieving an accuracy between 87 and 100 percent, with increased classification accuracy using fewer selected characteristics. The limitations of this study include the potential for overfitting despite the use of the ensemble method and the absence of statistical discussion of model validation.
Dash et al. [28] proposed a hybrid methodology for feature reduction utilizing harmony search and Pareto optimization. The authors employed the Harmony Search algorithm for gene selection (AHSGS) to identify the top 100 gene features, while utilizing bi-objective Pareto optimization to eliminate insignificant gene features. The model was assessed using four publicly available microarray datasets. In all instances, SVM surpassed the other classifiers, attaining nearly 100 percent accuracy, with the exception of the Colon dataset, where ANN reached 82 percent accuracy. The existing work exhibits notable limitations, particularly in the statistical analysis, as results concerning significance levels are absent. Furthermore, the authors did not address the criteria or methodology employed to determine the Harmony Memory Consideration Rate and the Pitch Adjusting Rate.
Almutiri et al. [29] suggested a fusion-based feature selection framework aimed at mitigating high dimensionality and enhancing classification performance in gene expression microarray data. The framework utilizes a three-layer approach. The first layer has independent feature selection methods for gene ranking and scoring. The second layer consists of a threshold-based filtering step and a final decision layer employing majority or consensus voting. Experiments were conducted on five microarray datasets using an SVM classifier. The results revealed enhanced classification accuracy, achieving up to 97% on the Prostate dataset, alongside dimensionality reduction in comparison to existing methods. The primary limitations of this study are threshold sensitivity and dependence on voting strategy.
Kilicarslan et al. [30] proposed a hybrid model to significantly improve cancer diagnosis. The methodology combined Relief and stacked autoencoders for dimension reduction; SVM and CNN were then used to improve classification accuracy. The proposed method achieved the highest classification accuracies (98.6%, 99.86%, and 83.95%, respectively) on three microarray datasets (Ovarian, Leukemia, and CNS), outperforming SVM and other tested approaches. The study highlighted the effectiveness of dimension reduction in enhancing classification accuracy. However, its most notable limitations are the limited comparison with other feature selection methods and the undiscussed hyperparameter optimization process for the CNN model.
Baliarsingh et al. [31] proposed a microarray-based hybrid cancer classification model. The methodology utilized ANOVA to select relevant genes; the enhanced Jaya (EJaya) algorithm and the forest optimization algorithm (FOA) were then utilized to find the best gene subset, and SVM was used for classification. The proposed method reduced the number of features and exceeded benchmark methods with classification accuracies from 96 to 100%. The significant limitation of this study is the use of a single classifier (SVM), which may not generalize across datasets. In addition, parameter tuning may also affect the performance of the EJaya and FOA algorithms.
Almugren and Alshamlan [32] evaluated and compared contemporary hybrid approaches that combine bio-inspired evolutionary algorithms for gene selection and cancer classification. The review covered various algorithms, with a focus on genetic algorithms (GA) as wrapper methods for gene selection. The results revealed that GA is the most extensively used approach and achieved the highest accuracy, ranging from 93 to 100%, with a minimal number of selected genes. In contrast, the Firefly algorithm has not been used as a wrapper approach. The limitation of the existing work is its inadequate investigation of alternative hybrid algorithms.
Sayed et al. [33] investigated the efficacy of a Nested Genetic Algorithm (Nested-GA) for feature selection in high-dimensional colon cancer microarray datasets. The methodology used a t-test to preprocess the data and a nested approach with two genetic algorithms: the outer genetic algorithm (OGA-SVM) operates on gene expression data, and the inner genetic algorithm (IGA-NNW) on DNA methylation data. Validation was performed using five cross-validation folds, ensuring a thorough examination of the results. Nested-GA outperformed KNN and RF on the colon cancer dataset with 99.9% classification accuracy. The main limitation of this study is that Nested-GA was compared to only a limited set of algorithms (KNN and RF); a more extensive comparison with additional contemporary methods could yield more significant insights into its performance.
Similarly, Ghosh et al. [34] introduced a novel two-stage hybrid model that integrates multiple filter methods with a genetic algorithm (GA) for cancer detection in microarray datasets. The methodology first creates an ensemble of filter methods such as ReliefF, chi-square, and symmetrical uncertainty by taking the union and intersection of their top-n ranked features; a GA is then used to refine the result of the first stage. The results showed that the model outperformed state-of-the-art methods, with an accuracy of about 100% and a smaller number of selected features across five cancer datasets: colon, lung, leukemia, SRBCT, and prostate. The limitation of this study is that the performance evaluation is based mainly on accuracy and feature count.
Hameed et al. [35] introduced a three-phase hybrid method to select and classify high-dimensional microarray data. To achieve this purpose, the author employed Pearson’s Correlation Coefficient (PCC) alongside Binary Particle Swarm Optimization (BPSO) or Genetic Algorithm (GA) and numerous classifiers. In the first phase, the methodology utilizes PCC as a filter for feature selection. Subsequently, the second phase involved the application of either BPSO or GA as wrapper methods. The data was classified using five distinct classifiers. The results showed improved classification accuracy, with BPSO outperforming GA in speed and effectiveness across multiple datasets and classifiers. Although the authors compared BPSO with GA, they did not study a broader range of optimization algorithms or hybrid approaches. This highlights the urgent need for a more comprehensive understanding of the best practices in feature selection and classification.
As can be observed from the existing works discussed above, filter methods have been utilized individually [18,24], combined with genetic algorithms [23,24,26,27,33,34,35], or combined with wrapper feature selection [29,32] in order to improve cancer classification on microarray datasets. In contrast, this study proposes integrating Differential Evolution (DE) with several popular filter methods to maximize cancer classification on microarray datasets. DE has several attractive advantages over other competitive optimization algorithms: it converges well while being straightforward to implement, requiring only a few control parameters, and having low space complexity. The proposed hybrid method successfully identified the most effective features using both the filter methods and DE, resulting in improved performance when classifying cancer on highly dimensional microarray datasets.

3. Materials and Methods

This section introduces a hybrid feature selection strategy that combines filter methods with differential evolutionary algorithms. This approach aims to effectively identify the most relevant features from high-dimensional microarray datasets to get excellent results in cancer classification. The methodology of the proposed hybrid filter and DE-based feature selection consists of five phases, as shown in Figure 1: microarray data collection, feature reduction using filter algorithms, feature selection using DE, training, and testing and evaluation of trained models.

3.1. Description of Cancerous Microarray Data Used

Many microarray datasets have been used in the literature. To assess the effectiveness of the suggested hybrid filter-DE feature selection method, we focus on four malignant microarray datasets: Brain cancer [36,37], Breast cancer [38], Lung cancer [39], and Central Nervous System (CNS) [39], since these are among the most common microarray datasets and have high-dimensional features. Furthermore, several recent works [20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35] reported that the high-dimensional features of these four cancerous microarray datasets caused low classification results. Table 1 shows the characteristics of the four cancerous microarray datasets used for assessing the proposed hybrid filter-DE feature selection method.

3.2. Feature Reduction Using Filter Algorithms

As stated in Section 3.1, the microarray datasets included in this study are characterized by a high number of dimensions and contain numerous duplicated and irrelevant features. It is impractical to train machine learning algorithms with all these features. Therefore, we use common filter feature selection methods to rank all features of the microarray cancer datasets. Subsequently, to reduce the high-dimensional datasets, only the top 5% of the best-ranked features are retained, and the remaining redundant and unnecessary features are removed. In this work, the features are ranked using six well-known, fast, and effective filter methods: correlation, information gain, information gain ratio, Relief, Chi-squared, and Gini index.

3.2.1. Correlation

The correlation-based feature selection (CFS) technique is a widely used filter algorithm [40] that relies on the correlation between features and the target class. CFS chooses only the features that have a high correlation with the target class and minimal correlation with each other. CFS evaluates a feature subset using Equation (1):
$$F_s = \frac{f\,\overline{v_{ca}}}{\sqrt{f + f(f-1)\,\overline{v_{aa}}}} \tag{1}$$
where $\overline{v_{ca}}$ represents the average correlation between the class and the features, $\overline{v_{aa}}$ represents the average correlation between two features, and f denotes the number of features. In this study, the correlations between features and the target class are computed using Pearson's correlation coefficient (PCC).
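As a concrete illustration of this scoring step, the following minimal NumPy sketch ranks features by the absolute value of their PCC with a numeric class label and keeps the top 5%, mirroring the first phase of the proposed method; the function names are ours, not taken from the paper's implementation.

```python
import numpy as np

def pcc_scores(X, y):
    """Absolute Pearson correlation between each feature column of X and label y."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    cov = Xc.T @ yc / len(y)
    denom = X.std(axis=0) * y.std() + 1e-12   # guard against constant features
    return np.abs(cov / denom)

def top_fraction(scores, frac=0.05):
    """Indices of the top 5% highest-scoring features (the paper's cutoff)."""
    k = max(1, int(len(scores) * frac))
    return np.argsort(scores)[::-1][:k]
```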

3.2.2. Information Gain

The information gain (IG) [41] is a well-known filter technique that has been effectively used to choose highly relevant features by employing the entropy concept to assess the importance of features. In IG, the worth of an attribute is determined by computing the quantity of information gained by the feature with respect to the target class. The IG method uses Equation (2) to compute the score IG (S, A) of a feature A:
$$IG(S, A) = Entropy(S) - \sum_{v \in Values(A)} \frac{|S_v|}{|S|}\, Entropy(S_v) \tag{2}$$
where $S_v$ represents the subset of S in which feature A has the specific value v.
Entropy(S) is calculated using Equation (3), based on the probability $P(c_j)$ of class $c_j$ in S:
$$Entropy(S) = -\sum_{j=1}^{c} P(c_j) \log_2 P(c_j) \tag{3}$$
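For concreteness, Equations (2) and (3) can be sketched in Python as follows, assuming the expression values of a gene have already been discretized into categorical bins (IG is defined over discrete attribute values); the helper names are illustrative.

```python
import numpy as np

def entropy(labels):
    """Entropy(S) of Equation (3)."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def information_gain(feature, labels):
    """IG(S, A) of Equation (2) for a discretized feature A."""
    ig = entropy(labels)
    for v in np.unique(feature):
        mask = feature == v
        ig -= mask.mean() * entropy(labels[mask])
    return ig
```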

3.2.3. Information Gain Ratio

Although IG is usually a good filter feature selection method for ranking features, it is biased towards features with a large number of distinct values. The information gain ratio (IGR) [42], which penalizes features with a large number of values, is used to address this drawback. In IGR, the information gain is divided by the split information, which measures the intrinsic information required to distinguish between the various values of that feature. IGR uses Equation (4) to calculate the split information of (S, A):
$$SplitInformation(S, A) = -\sum_{i=1}^{m} \frac{|S_i|}{|S|} \log_2 \frac{|S_i|}{|S|} \tag{4}$$
where the original dataset is denoted as S, each sub-dataset after splitting is represented as $S_i$, and m is the number of sub-datasets. The numbers of samples in S and $S_i$ are denoted as $|S|$ and $|S_i|$, respectively.
The score IGR(S, A) of feature A is calculated using Equation (5):
$$IGR(S, A) = \frac{IG(S, A)}{SplitInformation(S, A)} \tag{5}$$
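Building on the information_gain sketch above, the gain ratio of Equations (4) and (5) can be computed as follows (again assuming discretized feature values):

```python
import numpy as np

def gain_ratio(feature, labels):
    """IGR(S, A) = IG(S, A) / SplitInformation(S, A), Equations (4) and (5)."""
    _, counts = np.unique(feature, return_counts=True)
    frac = counts / counts.sum()
    split_info = -np.sum(frac * np.log2(frac))
    # the small epsilon guards against single-valued features (split_info = 0)
    return information_gain(feature, labels) / (split_info + 1e-12)
```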

3.2.4. Relief

The Relief (RELIEF) feature selection method [43] is another popular filter method that uses nearest neighbors to weight attributes, so it deals effectively with dependent features and noisy data. Relief randomly chooses a sample of the data and then finds its k-nearest neighbors from the same class and from each of the opposite classes. The Relief method uses Equation (6) to compute the score $S_i$ of the ith attribute:
$$S_i = \frac{1}{2} \sum_{b=1}^{l} \left( d(X_{ib} - X_{iM_b}) - d(X_{ib} - X_{iH_b}) \right) \tag{6}$$
where l is the number of randomly chosen samples from the dataset, $d(X_{ib} - X_{iM_b})$ denotes the distance between the ith attribute value of a randomly chosen sample $X_{ib}$ and that of the nearest sample $X_{iM_b}$ of the same class, and $d(X_{ib} - X_{iH_b})$ denotes the distance between the ith attribute value of $X_{ib}$ and that of the nearest sample $X_{iH_b}$ of a different class.
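A minimal Relief sketch in this spirit is given below; for simplicity it uses Manhattan distance and a single nearest hit and miss per sampled instance, following the standard Relief convention of rewarding features whose values differ more for the nearest miss (different class) than for the nearest hit (same class). The names and defaults are ours.

```python
import numpy as np

def relief_scores(X, y, n_iters=100, seed=0):
    """Single-neighbor Relief: weight features that separate the classes."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_iters):
        i = rng.integers(n)
        dist = np.abs(X - X[i]).sum(axis=1)     # Manhattan distance to all samples
        dist[i] = np.inf                        # never pick the sample itself
        hits = np.where(y == y[i])[0]
        misses = np.where(y != y[i])[0]
        hit = hits[np.argmin(dist[hits])]       # nearest neighbor of the same class
        miss = misses[np.argmin(dist[misses])]  # nearest neighbor of a different class
        w += np.abs(X[i] - X[miss]) - np.abs(X[i] - X[hit])
    return w / n_iters
```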

3.2.5. Chi-Squared

Chi-squared [44] is a popular statistical filter method based on calculating the dependence between features and the class; features are ranked by how strongly they are associated with the target class. The Chi-squared (CHSQR) test computes the sum of squared differences between the observed frequencies of each category and the expected frequencies under the assumption of no association. The significance of each feature is assessed by computing χ² with respect to the class, as shown in Equation (7):
$$\chi^2 = \sum_{i=1}^{r} \sum_{j=1}^{c} \frac{(O_{ij} - E_{ij})^2}{E_{ij}} \tag{7}$$
where c represents the number of classes, r stands for the number of bins used to discretize numerical attributes, and $O_{ij}$ and $E_{ij}$ stand for the observed and expected frequencies, respectively.
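In practice this scoring is available off the shelf; the sketch below uses scikit-learn's chi2 scorer with the paper's top-5% cutoff. Because chi2 requires non-negative inputs, the expression values are min-max scaled first; this scaling step is our assumption, not a detail from the paper.

```python
from sklearn.feature_selection import SelectPercentile, chi2
from sklearn.preprocessing import MinMaxScaler

def chi2_top_5_percent(X, y):
    """Keep the 5% of features with the highest chi-squared scores."""
    X_scaled = MinMaxScaler().fit_transform(X)          # make values non-negative
    selector = SelectPercentile(chi2, percentile=5).fit(X_scaled, y)
    return selector.transform(X_scaled), selector.get_support(indices=True)
```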

3.2.6. Gini Index

Another popular filter method used for feature ranking is the Gini index (GI) [45]. It computes and allocates a weight or score to each feature, indicating the feature's ability to distinguish instances of distinct classes. Equation (8) is used to calculate the Gini index of S:
$$Gini(S) = 1 - \sum_{j=1}^{c} P(c_j)^2 \tag{8}$$
where c denotes the number of classes and $P(c_j)$ refers to the probability of samples belonging to class $c_j$ in S.
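The Gini index of Equation (8), together with a per-feature score based on the impurity reduction of a partition (one common way to turn Equation (8) into a feature ranking; the exact weighting used in the paper is not spelled out), can be sketched as:

```python
import numpy as np

def gini(labels):
    """Gini(S) = 1 - sum_j P(c_j)^2, Equation (8)."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def gini_score(feature, labels):
    """Impurity reduction when partitioning S by a discretized feature;
    higher scores indicate a more class-discriminative feature."""
    score = gini(labels)
    for v in np.unique(feature):
        mask = feature == v
        score -= mask.mean() * gini(labels[mask])
    return score
```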

3.3. Differential Evolution Based Feature Selection

Machine learning algorithms can be effectively trained to produce enhanced classification results using reduced cancer datasets that only include the top-ranked features determined by the filter methods. However, the performance of the machine learning algorithms considering only the filter methods is still limited when they are applied on high-dimensional microarray datasets [46]. This is because the majority of filter methods ignore the correlation between groups of features and the class label. They assume that the features are independent and find the relationship between the individual feature and the class label. Furthermore, filter methods use certain criteria to assess the features without the use of a machine learning algorithm. Thus, in order to improve the effectiveness of the cancer classification, Differential Evolution (DE) is used in this paper to further optimize the chosen features that are selected by the filter approaches.
DE is a powerful global optimization approach developed by Storn and Price [47]; it belongs to the category of evolutionary algorithms that draw inspiration from the natural evolution of chromosomes. In recent years, DE has been applied successfully to many real applications and optimization problems, since it has several attractive advantages [46,48] over other competitive optimization algorithms: DE converges well while being straightforward to implement, requiring only a few control parameters, and having low space complexity.
Firstly, the initial population of the possible solutions in DE is randomly generated over the feasible region. Accordingly, the fitness function is used to evaluate the candidate solutions of the initial population. Then, DE generates a new solution by combining several solutions with the candidate solution. Like GA, the candidate solutions in DE population iteratively evolve to find better solutions through repeated generations of three main DE operators: mutation, crossover, and selection. However, the mutation, crossover, and selection operations in DE are conducted in different ways compared to GA operations.
Let $X_i = (x_{i1}, x_{i2}, x_{i3}, \ldots, x_{id}, \ldots, x_{iD})$ be the source vector of solution i, where i = {1, 2, …, P}, P is the population size, d = {1, 2, …, D}, and D is the dimensionality of the search space. The mutation, crossover, and selection in DE are performed as follows:
  • Mutation: For each solution vector $X_i$, Equation (9) is used to produce a mutant solution $V_i$:
$$V_i = X_{r1} + F\,(X_{r3} - X_{r2}) \tag{9}$$
where F is the mutation factor in the interval [0, 1], and $X_{r1}$, $X_{r2}$, $X_{r3}$ are three individuals (candidate solutions) randomly chosen from the population such that $r_1 \neq r_2 \neq r_3 \neq i$.
  • Crossover: The crossover operation is performed between the parent $X_i$ and its corresponding mutant solution $V_i$ in order to produce a trial solution $U_i = (u_{i1}, u_{i2}, u_{i3}, \ldots, u_{id}, \ldots, u_{iD})$, as shown in Equation (10):
$$u_{id} = \begin{cases} v_{id}, & \text{if } \delta \le CR \text{ or } d = d_{rand} \\ x_{id}, & \text{otherwise} \end{cases} \tag{10}$$
where CR represents the crossover rate, a user-predefined constant within the range [0, 1], δ is a random number in [0, 1], and $d_{rand}$ is a randomly selected index in [1, D].
  • Selection: The fitness of the trial vector $U_i$ is computed and then compared with the fitness of the source vector $X_i$. If $U_i$ is better than $X_i$, $X_i$ is replaced with $U_i$ in the population of the next generation. Otherwise, the population keeps the source vector $X_i$ for the next generation.
From generation to generation, the mutation, crossover and selection are applied to update and evolve the population in order to find the optimal solution until a stopping criterion is met.
DE was originally designed for solving optimization problems in continuous search spaces. Therefore, a binary version of DE (BDE) was developed to solve feature selection and other binary optimization problems. The steps of feature selection based on DE can be described as follows:
  • Initialization: In this step, the initial population of candidate individuals is generated randomly. Each individual represents a possible feature subset solution, encoded as a binary vector with m bits, where m is the number of available features. If a bit of an individual is 1, the corresponding feature is selected; otherwise, it is not. For example, five features (the 3rd, 5th, 7th, 8th, and 9th) are selected for the solution X = {0, 0, 1, 0, 1, 0, 1, 1, 1, 0}.
  • Fitness evaluations: In this step, the individuals (feature subset solutions) in the population are evaluated using the fitness function. To calculate the fitness value of a feature subset solution, the misclassification rate (1-classification rate) of a machine learning technique trained with that feature subset solution is calculated for each individual in the population. That means the training dataset with the features indicated by 1 value on feature subset solution is used to train the machine learning technique. Then the misclassification rate is computed as the fitness of that feature subset solution. The misclassification rate is the ratio of the number of incorrectly classified instances to the total number of instances. The fitness function in the proposed DE-based feature selection aims at minimizing the misclassification rate of machine learning techniques trained with the possible feature subset solutions in order to identify the optimal feature subset that can produce the best classification results.
  • Mutation process: For the current individual $X_i$, the mutation process starts by randomly choosing three individuals $X_{r1}$, $X_{r2}$, $X_{r3}$ from the population, where $r_1 \neq r_2 \neq r_3 \neq i$. In DE applied to the feature selection problem, the mutant solution $V_i$ is generated based on a difference vector, as shown in Equations (11) and (12):
$$difference\ vector_{id} = \begin{cases} 0, & \text{if } x_{r1d} = x_{r2d} \\ x_{r1d}, & \text{otherwise} \end{cases} \tag{11}$$
$$v_{id} = \begin{cases} 1, & \text{if } difference\ vector_{id} = 1 \\ x_{r3d}, & \text{otherwise} \end{cases} \tag{12}$$
  • Crossover process: Once the mutant individual is generated in the mutation process, the trial individual $U_i = (u_{i1}, u_{i2}, u_{i3}, \ldots, u_{id}, \ldots, u_{iD})$ is created by performing the crossover process between the source vector $X_i$ and its corresponding mutant vector $V_i$. The trial solution $U_i$ in DE-based feature selection is computed using the same equation as in the original DE, as shown in Equation (13):
$$u_{id} = \begin{cases} v_{id}, & \text{if } \delta \le CR \text{ or } d = d_{rand} \\ x_{id}, & \text{otherwise} \end{cases} \tag{13}$$
where CR represents the crossover rate, a user-predefined constant within the range [0, 1], δ is a random number in [0, 1], d = {1, 2, …, D}, D is the dimensionality of the search space, and $d_{rand}$ is a randomly selected index in [1, D].
  • Selection process: In this step, DE compares the fitness values (misclassification rates) produced by the source solution $X_i$ and the trial solution $U_i$. If the trial solution $U_i$ has a better fitness value (lower misclassification rate) than the source solution $X_i$, DE replaces $X_i$ with $U_i$ in the population for the next generation. Otherwise, the source solution $X_i$ is retained in the DE population for the next generation.
In the DE evolutionary process, mutation, crossover, and selection are repeatedly conducted until a stopping criterion is satisfied. Eventually, DE returns the best solution in the population, which represents the optimal feature subset to be used in the subsequent training and testing phases. Algorithm 1 presents the steps of the proposed hybrid filter and differential evolution-based feature selection suggested to enhance cancer classification on microarray data.
Algorithm 1: Hybrid filter and differential evolution-based feature selection
Input: F: original feature set; P: population size
Output: SF: the optimal selected features
Begin
1.  Compute scores of the features using the filter methods: information gain (IG), information gain ratio (IGR), correlation (CR), Gini index (GIND), Relief (RELIEF), and Chi-squared (CHSQR)
2.  Rank all features in F based on the scores computed by the filter methods
3.  Reduce the dimension of the training data by selecting only the top 5% of ranked features
4.  Set the crossover rate CR, the maximum number of generations Max_t, and the generation counter t = 0
5.  Generate and initialize P individuals of the population (feature subset solutions)
6.  While t < Max_t and the stopping criterion is not satisfied Do
7.    t = t + 1
8.    Compute the fitness of the individuals using the misclassification rate
9.    Best individual = the individual with the best fitness
10.   For each individual X_i Do
11.     Choose three individuals X_r1, X_r2, X_r3 randomly, where r1 ≠ r2 ≠ r3 ≠ i
12.     Generate a mutant solution V_i using Equations (11) and (12)
13.     Generate a trial vector U_i using Equation (13)
14.     Evaluate the fitness values of X_i and U_i
15.     If the fitness of U_i is better than the fitness of X_i
16.       X_i is replaced with U_i in the population, and U_i is used for the next generation
17.     Else
18.       X_i is kept for the next generation
19.     End If
20.   End For
21. End While
22. Extract the optimal selected features from the individual with the best fitness value
23. Return the optimal features SF
End Algorithm
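To make the evolutionary loop of Algorithm 1 concrete, the following is a minimal Python sketch of the binary-DE phase (steps 4-23), assuming the training matrix X and labels y already contain only the top 5% filter-ranked features. The kNN fitness classifier, the 5-fold fitness estimate, and all parameter defaults are illustrative choices, not the paper's exact implementation.

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

def fitness(mask, X, y):
    """Misclassification rate (1 - CV accuracy) of a kNN trained on the subset."""
    if not mask.any():
        return 1.0                                    # empty subset: worst fitness
    acc = cross_val_score(KNeighborsClassifier(),
                          X[:, mask.astype(bool)], y, cv=5).mean()
    return 1.0 - acc

def binary_de_select(X, y, pop_size=10, cr=0.9, max_gen=30, seed=0):
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    pop = rng.integers(0, 2, size=(pop_size, d))      # step 5: random binary masks
    fit = np.array([fitness(ind, X, y) for ind in pop])
    for _ in range(max_gen):                          # steps 6-21
        for i in range(pop_size):
            others = [j for j in range(pop_size) if j != i]
            r1, r2, r3 = rng.choice(others, size=3, replace=False)
            diff = np.where(pop[r1] == pop[r2], 0, pop[r1])   # Equation (11)
            mutant = np.where(diff == 1, 1, pop[r3])          # Equation (12)
            d_rand = rng.integers(d)
            delta = rng.random(d)
            trial = np.where((delta <= cr) | (np.arange(d) == d_rand),
                             mutant, pop[i])                  # Equation (13)
            f_trial = fitness(trial, X, y)
            if f_trial < fit[i]:                              # selection step
                pop[i], fit[i] = trial, f_trial
    return pop[np.argmin(fit)].astype(bool)           # mask of selected features
```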

3.4. Training of Machine Learning Techniques

As mentioned, the high-dimensional microarray datasets are reduced by applying the proposed hybrid filter-DE feature selection. In the first stage, we used filter methods to rank all features in the microarray datasets and then selected only the top 5% of features, while the remaining features were removed. In the second stage, we used DE to choose only the highly relevant features of the reduced microarray datasets. Using the final training microarray datasets with the most optimal features, we trained some well-known machine learning algorithms that are widely employed in the literature to classify cancer from high-dimensional microarray datasets: support vector machine (SVM), naïve Bayes (NB), k-nearest neighbour (kNN), decision tree (DT), and random forest (RF). To optimally design the kNN, NB, DT, RF, and SVM models, the best features selected by the proposed hybrid filter-DE feature selection are used as inputs to train these models. In addition, the best settings and parameters of all classifiers were selected on a trial-and-error basis in order to produce the best results. Furthermore, we trained these models using stratified cross-validation, which is especially useful for imbalanced datasets, as it ensures equal class representation in each fold.
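As an illustration of this training setup, the sketch below evaluates the five classifiers with stratified 10-fold cross-validation; the synthetic X_sel merely stands in for a dataset reduced to the hybrid filter-DE-selected features, and default classifier settings are used rather than the tuned values chosen in the paper by trial and error.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# stand-in for a reduced training set (few samples, many features)
X_sel, y = make_classification(n_samples=60, n_features=150, random_state=0)

models = {"SVM": SVC(), "NB": GaussianNB(), "kNN": KNeighborsClassifier(),
          "DT": DecisionTreeClassifier(random_state=0),
          "RF": RandomForestClassifier(random_state=0)}
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)  # preserves class ratios
for name, model in models.items():
    scores = cross_val_score(model, X_sel, y, cv=cv)
    print(f"{name}: mean accuracy = {scores.mean():.3f}")
```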

3.5. Testing and Evaluation of Machine Learning Techniques

After completing the training phase, the trained classification models based on the proposed hybrid filter-DE feature selection method are evaluated on new testing microarray datasets. Instead of using all the features of the testing microarray datasets, we reduce them to only the optimal features selected by the proposed hybrid filter-DE feature selection method. The reduced testing microarray datasets are then used as inputs to SVM, NB, kNN, DT, and RF to assess the cancer classification performance of these classifiers. In this paper, we used 10-fold cross-validation to evaluate the proposed method. We use popular evaluation metrics from the literature to evaluate the proposed hybrid filter-DE feature selection method: classification accuracy, recall, precision, and F-measure. The evaluation metrics used in this paper are defined based on the confusion matrix as follows:
Classification accuracy expressed in Equation (14) measures the total proportion of properly diagnosed samples.
$$Accuracy = \frac{TP + TN}{TP + FP + FN + TN} \times 100\% \tag{14}$$
Equation (15) calculates Recall measure, which is the proportion of positive samples that are properly diagnosed as the positive class.
$$Recall = \frac{TP}{TP + FN} \times 100\% \tag{15}$$
Precision is calculated in Equation (16) as the number of properly diagnosed positive samples divided by the total number of samples classified as positive. It is represented by the following formula:
$$Precision = \frac{TP}{TP + FP} \times 100\% \tag{16}$$
F-measure is a measure that combines Recall and Precision into a harmonic mean as expressed in Equation (17) to give a balanced assessment of the model’s performance.
$$F\text{-}measure = \frac{2 \times Precision \times Recall}{Precision + Recall} \tag{17}$$
where TP denotes the number of correctly classified positive samples, FP stands for the number of negative samples incorrectly classified as positive samples, TN stands for the number of correctly classified negative samples, and FN indicates the number of positive samples incorrectly classified as negative samples.
In the multi-class classification, these measures are computed using Equations (18)–(23). Recall for each class i is calculated using Equation (18), which represents the ratio of true positive predictions to the total actual samples of that class. The Overall Recall can be calculated using Equation (19).
$$Recall_i = \frac{TP_i}{TP_i + FN_i} \times 100\% \tag{18}$$
$$Overall\ Recall = \frac{1}{n} \sum_{i=1}^{n} Recall_i \tag{19}$$
where $TP_i$, $FP_i$, $TN_i$, and $FN_i$ are the true positives, false positives, true negatives, and false negatives for class i, respectively, and n is the number of classes.
As shown in Equation (20), Precision for each class i is the ratio of true positive predictions for that class to the total samples classified as that class. The Overall Precision can be calculated using Equation (21).
$$Precision_i = \frac{TP_i}{TP_i + FP_i} \times 100\% \tag{20}$$
$$Overall\ Precision = \frac{1}{n} \sum_{i=1}^{n} Precision_i \tag{21}$$
F-measure for each class i is calculated using Equation (22), which is the harmonic mean of precision and recall for that class. The Overall F-measure can be calculated using Equation (23).
$$F\text{-}measure_i = \frac{2 \times Precision_i \times Recall_i}{Precision_i + Recall_i} \tag{22}$$
$$Overall\ F\text{-}measure = \frac{1}{n} \sum_{i=1}^{n} F\text{-}measure_i \tag{23}$$
In addition to classification measures, the classification error can be measured for each class using False Positive Rate and False Negative Rate. False Positive Rate (FPR) for each class i is calculated using Equation (24), which represents the ratio of samples incorrectly classified as class i out of all samples that do not actually belong to class i. Equation (25) is used to calculate the Overall FPR. In contrast, the False Negative Rate (FNR) for each class i is calculated using Equation (26), which denotes the proportion of samples that actually belong to class i but they are incorrectly classified as a different class. Equation (27) is used to calculate the overall FNR.
$$FPR_i = \frac{FP_i}{FP_i + TN_i} \tag{24}$$
$$Overall\ FPR = \frac{1}{n} \sum_{i=1}^{n} FPR_i \tag{25}$$
$$FNR_i = \frac{FN_i}{FN_i + TP_i} \tag{26}$$
$$Overall\ FNR = \frac{1}{n} \sum_{i=1}^{n} FNR_i \tag{27}$$
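For reference, the per-class and macro-averaged quantities of Equations (18)-(27) can be computed from a confusion matrix as in the sketch below; the function and variable names are ours.

```python
import numpy as np
from sklearn.metrics import confusion_matrix

def multiclass_metrics(y_true, y_pred):
    """Per-class TP/FP/FN/TN and the macro-averaged metrics of Eqs. (18)-(27)."""
    cm = confusion_matrix(y_true, y_pred)     # rows: true class, cols: predicted
    tp = np.diag(cm)
    fp = cm.sum(axis=0) - tp
    fn = cm.sum(axis=1) - tp
    tn = cm.sum() - tp - fp - fn
    recall = tp / (tp + fn)                   # Equation (18) per class
    precision = tp / (tp + fp)                # Equation (20) per class
    f_measure = 2 * precision * recall / (precision + recall)   # Equation (22)
    fpr = fp / (fp + tn)                      # Equation (24)
    fnr = fn / (fn + tp)                      # Equation (26)
    # macro averages: Equations (19), (21), (23), (25), (27)
    return {"recall": recall.mean(), "precision": precision.mean(),
            "f_measure": f_measure.mean(), "fpr": fpr.mean(), "fnr": fnr.mean()}
```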

4. Experimental Environment and Results Discussion

This section presents the experimental environment and settings used to implement the suggested hybrid filter-DE techniques. In addition, it investigates the effectiveness of machine learning approaches after applying the recommended hybrid filter-DE feature selection strategy, compared to their standalone performance and their performance after applying only the filter methods. The comparison covers the performance of machine learning when considering all features, the features selected by the filter methods, and the features selected by the suggested hybrid filter-DE techniques. The best classification results of the filter approaches were accomplished by training the machine learning algorithms with the top 5% of ranked features on these four datasets.

4.1. Experimental Environment

This study utilized the RapidMiner tool (version 10.1) and the Anaconda package. RapidMiner offers a comprehensive collection of machine learning algorithms and a variety of strategies for data validation. The Anaconda package is employed to execute the differential evolution algorithm in the Python programming language, as well as for training, testing, and visualization. As shown in Table 2, we conducted many experiments and scenarios to determine the best crossover rate (CR) and population size (P) for producing the best classification results. The best DE parameters used with the proposed hybrid filter-DE feature selection are listed in Table 3.
In addition, several Python libraries, such as Scikit-learn and Matplotlib, have been utilized. The data was visualized using Matplotlib, and machine learning techniques were implemented using the Scikit-learn module. The computing environment utilized a Lenovo Laptop equipped with the Windows 10 operating system, an Intel Core i7 (RTX) CPU, and 32 GB of RAM.

4.2. Results Analysis of Brain Dataset

It is evident from Figure 2 that when the IG filter was applied, the classification accuracies of KNN (78.57%), NB (69.05%), DT (50%), RF (78.57%), and SVM (69%) improved to 81%, 88.1%, 61.9%, 90.5%, and 81%, respectively. When IGR was used, the accuracy results of KNN, NB, DT, RF, and SVM showed enhancement and reached 83.33%, 85.71%, 64.29%, 90.86%, and 66.67%, respectively. Furthermore, when Chi-square (CHSQR) was applied, the accuracy results of KNN, NB, DT, RF, and SVM were enhanced to 80.95%, 83.33%, 69.05%, 88.1%, and 66.67%, respectively. In addition, when applying correlation CR, the accuracy results showed some improvement to 88.1%, 71.43%, 61.1%, 88.1%, and 83.33%, respectively. In the case of the GIND filter, accuracy enhancement was observed as KNN (83.33%), NB (83.33%), DT (64.29%), RF (90.46%), and SVM (80.95%). Additionally, the relief filter method showed enhancement with KNN (88.1%), NB (80.95%), DT (61.09%), RF (92.86%), and SVM (71.90%).
The hybrid IG-DE method improved the results of classification accuracy of KNN (78.57%), NB (69.05%), DT (50%), RF (78.57%), and SVM (69%) to 92%, 100%, 92%, and 92%, respectively, whereas the hybrid IGR-DE method improved them to 92%, 100%, 100%, 92%, and 92%. The suggested hybrid CHSQR-DE approach improved classification accuracy of KNN, NB, DT, RF, and SVM to 92%, 100%, 100%, 92%, and 92%, whereas hybrid CR-DE improved them to 85%, 100%, 85%, 85%, and 85%. Furthermore, the proposed hybrid GIND-DE method improved accuracy results to 92%, 100%, 92%, and 92%, while the hybrid RELIEF-DE method improved accuracy results to 92%, 100%, 100%, 92%, and 92% when compared to all features’ accuracy results.
In terms of accuracy, IG-DE with NB, IGR-DE with NB and DT, CHSQR-DE with NB and DT, CR-DE with NB, GIND-DE with NB and DT, and RELIEF-DE with NB and DT achieved optimal performance of 100% accuracy using the top 5% of ranked features on this dataset.
Table 4 compares the number of selected genes and other measures of KNN, NB, DT, RF, and SVM when applying all features, filter feature selection, and the hybrid filter-DE feature selection methods. It is obvious from Table 4 that the machine learning algorithms with the proposed hybrid filter-DE methods accomplished much better performance in terms of accuracy, precision, recall, and F-measure compared to their performance with all features and with only the filter methods. Outstandingly, the proposed hybrid methods IG-DE, IGR-DE, CHSQR-DE, CR-DE, GIND-DE, and RELIEF-DE with NB achieved optimal 100% performance. Furthermore, CHSQR-DE with DT has an edge over all other hybrid methods in terms of the number of features selected: it performed excellently by selecting only 121 features out of 5597 (the smallest numbers of selected features are highlighted in grey). It is also evident from Table 4 that the filter feature selection strategy reduced the number of relevant features in the Brain dataset from 5597 to 280, and the suggested hybrid IG-DE, IGR-DE, CHSQR-DE, CR-DE, GIND-DE, and RELIEF-DE techniques on average retained only 142, 126, 134, 132, 144, and 138 of those 280 features, respectively.

4.3. Results Analysis of CNS Dataset

Figure 3 demonstrates that the classification accuracies of KNN (61.67%), NB (61.67%), DT (58.33%), and RF (53.33%) were enhanced by applying IG to 75%, 66.67%, 80%, and 80%, while they were enhanced by applying IGR to 68.33%, 78.33%, 68.33%, and 80%, respectively. Further, in the case of CHSQR, the observed enhancement reached 75%, 70%, 61.67%, and 83.33%. In the case of CR (correlation), the change was noted as 76.67%, 55.5%, and 77.33%, respectively. When GIND was applied, the enhanced results reached 75%, 80%, 68.33%, and 83.33%, and the Relief filter also showed enhancement, reaching 71.67%, 73.33%, 65%, and 75%. It was also observed that there was no enhancement in any case with SVM.
The hybrid IG-DE technique improved the classification accuracies of KNN, NB, DT, RF, and SVM to 94%, 100%, 94%, 89%, and 94%, respectively, whereas the hybrid CHSQR-DE method improved them to 83%, 100%, 89%, 83%, and 89%. The hybrid CR-DE technique improved the classification accuracies of KNN, NB, DT, RF, and SVM to 94%, 100%, 94%, 83%, and 94%, respectively, whereas the hybrid GIND-DE method improved them to 94%, 100%, 94%, 89%, and 89%. In addition, applying the suggested hybrid RELIEF-DE technique improved their performances to 83%, 100%, 94%, 89%, and 89%, respectively.
The smallest enhancements were observed with the proposed IGR-DE, which reached 72%, 94%, 94%, 83%, and 78% compared with KNN (61.67%), NB (61.67%), DT (58.33%), RF (53.33%), and SVM (65%). In terms of accuracy, IG-DE, CHSQR-DE, CR-DE, GIND-DE, and RELIEF-DE with NB achieved optimal performance of 100% accuracy using the top 5% of ranked features on this dataset. To further analyze the efficacy of the suggested technique, the results of the proposed approaches were compared against the filter feature selection methods as well as all features, as presented in Table 5. Table 5 compares the number of selected genes and the other performance measures of KNN, NB, DT, RF, and SVM when applying all features, the filter feature selection methods, and the hybrid filter-DE feature selection methods.
Table 5 clearly shows that the machine learning algorithms performed significantly better in terms of accuracy, precision, recall, and F-measure after applying the proposed hybrid filter-DE methods than with all features or with the filter methods alone. Outstandingly, the proposed IG-DE, CHSQR-DE, CR-DE, GIND-DE, and RELIEF-DE with NB achieved optimal 100% performance. Furthermore, it is obvious from Table 5 that IG-DE with NB has an edge over all other hybrid methods in terms of the features selected. IG-DE with NB performed excellently by selecting only 156 features out of a total of 6129. The smallest numbers of selected features are highlighted in grey. In addition, Table 5 shows that the filter feature selection strategy reduced the number of relevant features in the CNS dataset from 6129 to 306. Further, the suggested hybrid IG-DE, IGR-DE, CHSQR-DE, CR-DE, GIND-DE, and RELIEF-DE techniques selected only 163, 171, 177, 180, 174, and 178 relevant features on average from the 306 filtered features.

4.4. Results Analysis of Lung Dataset

Figure 4 shows that applying GIND improved the classification accuracies of KNN (92.61%), NB (90.15%), DT (84.43%), and RF (83.74%) to 93.6%, 95.07%, 91.13%, and 93.6%, respectively. Likewise, after applying the IG filter method, the accuracy results were enhanced to 92.61%, 95%, 86.7%, and 93.6%, respectively.
Further, when CHSQR was applied, the accuracy results showed some improvement, at 92.12%, 92.12%, 85.22%, and 92.61%, respectively. In the case of the IGR filter, enhancement was recorded with NB (93.6%), DT (86.21%), and RF (91.13%), while a slight decrease was observed for KNN. In addition, with the CR filter method, enhancement was recorded only with NB (94.09%) and RF (86.7%), while a slight decrease was observed for KNN and DT. The RELIEF filter method showed enhancement with NB (91.63%), DT (89.16%), and RF (92.12%), but it did not perform well with KNN, for which a small decrease in accuracy was observed. None of the six filter methods performed well with SVM, for which no enhancement in accuracy was recorded.
The proposed hybrid IG-DE method further enhanced the classification accuracies of KNN, NB, DT, RF, and SVM to 97%, 97%, 97%, 93%, and 98%, respectively, while the hybrid IGR-DE method enhanced them to 95%, 92%, 98%, 95%, and 97%, respectively. The CHSQR-DE method further enhanced the classification accuracies of DT, RF, and SVM to 97%, 93%, and 97%, respectively, while a very small decrease was observed for KNN and NB. Additionally, by applying the hybrid CR-DE method, the KNN, NB, DT, RF, and SVM results were enhanced to 95%, 93%, 93%, 93%, and 97%, respectively. Furthermore, by applying the hybrid GIND-DE method, the accuracy results of KNN, NB, DT, RF, and SVM were enhanced to 93%, 93%, 98%, 95%, and 95%, respectively, while the hybrid RELIEF-DE method enhanced them to 93%, 92%, 97%, 93%, and 93%, respectively. In terms of accuracy, IG-DE with SVM, IGR-DE with DT, and GIND-DE with DT achieved optimal performance of 98% accuracy using the top 5% of ranked features on this Lung dataset.
Table 6 compares the performance measures and the number of genes identified by the proposed methodology on the Lung dataset for the five classifiers (KNN, NB, DT, RF, and SVM) with all features, the filter feature selection methods, and the hybrid filter-DE feature selection approaches. Table 6 clearly shows that the machine learning algorithms performed significantly better in terms of accuracy, precision, recall, and F-measure after applying the proposed hybrid filter-DE methods than when using all features or only filter methods. From Table 6, it is clear that the proposed hybrid methods IG-DE with SVM (98%, 100%, and 98%), IGR-DE with DT (98%, 100%, and 98%), and Gini Index GIND-DE with DT (98%, 100%, and 98%) accomplished about the same performance in terms of accuracy, precision, and F-measure. In terms of selected features, as shown in Table 6, the hybrid GIND-DE with DT and IGR-DE with DT showed excellent performance using only 296 features out of a total of 12,600. In conclusion, the hybrid Gini Index and IGR filters with DE optimization and the DT classifier outperformed the other hybrid methods.
In addition, Table 6 reveals that the filter feature selection strategy reduced the number of relevant features in the Lung dataset from 12,600 to 630. Furthermore, the suggested hybrid IG-DE, IGR-DE, CHSQR-DE, CR-DE, GIND-DE, and RELIEF-DE techniques reduced the 630 features to just 300, 305, 318, 308, 294, and 295 relevant features on average, respectively. The smallest numbers of selected features are highlighted in grey.
As described in Table 1, the Lung dataset has five classes: normal tissue (17 samples), adenocarcinoma (139 samples), pulmonary carcinoid (20 samples), squamous carcinoma (21 samples), and small cell cancer (6 samples). That is, the Lung dataset contains 17 samples of normal tissue, while the remaining classes represent types of lung cancer. Therefore, since the classes in the Lung dataset contain different numbers of samples, the classification errors in terms of False Positive Rate (FPR) and False Negative Rate (FNR) were calculated for each class, in addition to the classification measures, as reported in Table 7.
The FPR and FNR were calculated for SVM, NB, KNN, DT, and RF after applying the proposed hybrid filter-DE feature selection methods: IG-DE, IGR-DE, CHSQR-DE, CR-DE, GIND-DE, and RELIEF-DE. Lower FPR and FNR values indicate less misclassification and better performance. As can be observed from Table 7, in most cases, the FPRs and FNRs of SVM, NB, KNN, DT, and RF after applying the proposed hybrid filter-DE feature selection methods were low, especially for the four types of lung cancer (adenocarcinoma, pulmonary carcinoid, squamous carcinoma, and small cell cancer). In particular, NB achieves a low FPR across most feature selection methods, especially with IG-DE and IGR-DE; however, it struggles with a higher FNR for adenocarcinoma using CR-DE, which affects its sensitivity for this class. DT maintains a low FPR, with IGR-DE showing excellent results, although the FNR for adenocarcinoma remains moderately high across methods; IGR-DE provides the best overall balance for DT performance. RF achieves a low FPR with IG-DE but shows a slightly higher FPR for normal tissue with methods such as CHSQR-DE; its FNR is highly variable, with challenges for adenocarcinoma and pulmonary carcinoid depending on the feature selection method. KNN has a consistently low FPR, particularly with IG-DE, although CHSQR-DE raises its FPR for normal tissue; its FNR is more variable, with pulmonary carcinoid being challenging under certain feature selection methods. SVM shows a high FPR for normal tissue using IG-DE but maintains a low FPR with other methods such as IGR-DE; its FNR is generally low, although pulmonary carcinoid can be challenging for GIND-DE and RELIEF-DE. We can conclude that NB and DT with IG-DE or IGR-DE feature selection offer the best balance of low FPR and FNR across most cancer types, ensuring minimal misclassification. Therefore, NB and DT with IG-DE or IGR-DE can be considered effective options for this lung cancer dataset.
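For reference, the per-class FPR and FNR reported in Table 7 can be derived from a multi-class confusion matrix in a one-vs-rest fashion, as in the sketch below. This is an illustrative Python rendering assuming scikit-learn; the labels and predictions are hypothetical, not the study's data.

```python
# A sketch of per-class FPR and FNR computation from a confusion matrix.
import numpy as np
from sklearn.metrics import confusion_matrix

def per_class_fpr_fnr(y_true, y_pred, labels):
    cm = confusion_matrix(y_true, y_pred, labels=labels)
    total = cm.sum()
    rates = {}
    for i, label in enumerate(labels):
        tp = cm[i, i]
        fn = cm[i, :].sum() - tp   # class-i samples predicted as another class
        fp = cm[:, i].sum() - tp   # other-class samples predicted as class i
        tn = total - tp - fn - fp
        rates[label] = {
            "FPR": fp / (fp + tn) if (fp + tn) else 0.0,
            "FNR": fn / (fn + tp) if (fn + tp) else 0.0,
        }
    return rates

# Illustrative labels and predictions (hypothetical, not the Lung results).
labels = ["normal", "adeno", "carcinoid", "squamous", "small-cell"]
y_true = ["adeno"] * 10 + ["normal"] * 5 + ["carcinoid"] * 3
y_pred = ["adeno"] * 9 + ["normal"] * 6 + ["carcinoid"] * 3
print(per_class_fpr_fnr(y_true, y_pred, labels))
```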

4.5. Results Analysis of Breast Dataset

Figure 5 shows that applying IG improved the classification accuracies of KNN (55.67%), NB (48.45%), DT (57.73%), RF (63.92%), and SVM (52.58%) to 71.13%, 55.67%, 67.01%, 86.6%, and 74.23%, respectively. Furthermore, using the IGR filter method increased the accuracy results to 64.95%, 54.64%, 61.86%, 87.63%, and 69.07%, respectively. Additionally, in the case of CHSQR, the observed enhancement over the all-features performance reached 72.16%, 72.33%, 68.04%, 81.44%, and 73.2%, respectively. In the case of CR (correlation), the enhancement reached 76.29%, 77.32%, 67.01%, 76.29%, and 74.23%, respectively. In the case of GIND (Gini Index), the enhancement over the all-features performance reached 73.2%, 56.64%, 70.1%, 82.47%, and 78.35%, respectively.
The RELIEF filter also showed enhancement, with accuracy results reaching 74.23%, 77.32%, 63.92%, 80.41%, and 76.29%. Among the filter methods, the highest accuracy achieved was 87.63% with IGR-RF, and the lowest was 54.64% with IGR-NB. The proposed hybrid IG-DE method further improved the accuracies of KNN, NB, DT, RF, and SVM to 80%, 53%, 87%, 70%, and 73%, respectively, and the hybrid IGR-DE method improved them to 70%, 53%, 87%, 70%, and 73%, respectively. The CHSQR-DE method further enhanced the classification accuracies of KNN, NB, DT, RF, and SVM to 77%, 70%, 93%, 80%, and 73%, respectively. By applying the hybrid CR-DE method, the KNN, NB, DT, RF, and SVM results were enhanced to 83%, 70%, 87%, 70%, and 73%, respectively. Using the hybrid GIND-DE method, the accuracies of KNN, NB, DT, RF, and SVM improved to 80%, 60%, 87%, 77%, and 73%, respectively, and using the hybrid RELIEF-DE method they improved to 80%, 77%, 90%, 77%, and 73%, respectively. The highest enhanced accuracy (93%) was observed with CHSQR-DE and the DT classifier, while the lowest (53%) was observed with IG-DE and IGR-DE using the NB classifier. In terms of accuracy, CHSQR-DE with the DT classifier achieved an optimal performance of 93% using the top 5% of ranked features on this Breast dataset.
Table 8 shows the performance measures and the number of genes identified by the proposed methodology on the Breast dataset using all features, the filter methods, and the proposed hybrid filter-DE methods. Table 8 clearly reveals that the machine learning algorithms performed significantly better in terms of accuracy, precision, recall, and F-measure after applying the proposed hybrid filter-DE methods than when using all features or only filter methods. Further, Table 8 shows that the suggested hybrid CHSQR-DE with DT outperformed the other hybrid approaches in terms of accuracy, precision, and F-measure, with 93%, 94%, and 93%, respectively.
Furthermore, RELIEF-DE with the DT classifier obtained accuracy, precision, and F-measure values of 90%, 90%, and 90%, respectively. In terms of selected features, as shown in Table 8, CHSQR-DE with DT showed the best performance using the smallest number of features (615) out of a total of 24,481. In conclusion, Chi-square with DE optimization and a DT classifier outperformed all other filter and hybrid methods. In addition, Table 8 indicates that the filter feature selection strategy reduced the number of relevant features in the Breast dataset from 24,481 to 1224. Further, applying the suggested hybrid CHSQR-DE with DT, CR-DE with KNN, CHSQR-DE with RF, and RELIEF-DE with NB decreased the 1224 features to 615, 583, 619, and 596 significant features, respectively. The smallest numbers of selected features are highlighted in grey.

4.6. Analysis of Computational Time Complexity

To study computational complexity, big O notation was used to analyze the computational time complexity of the proposed method. The computational time complexities of information gain (IG), information gain ratio (IGR), correlation (CR), Gini index (GIND), Relief (RELIEF), and Chi-squared (CHSQR) are O(F × N × log N), O(F × N × log N), O(F × N), O(F × N × log N), O(l × F × N), and O(F × N), respectively. Here, F represents the number of features, N denotes the total number of samples in the dataset, and l represents the number of samples chosen to evaluate the features in RELIEF.
The time complexity of one generation in DE comprises the time complexity of the mutation process, O(P × D); the crossover process, O(P × D); and the fitness evaluation and selection process, O(P × T). The combined complexity per generation is O(P × D) + O(P × D) + O(P × T) = O(P × (D + T)). Thus, the total time complexity of DE over all generations is O(G × P × (D + T)), where G is the total number of generations, P is the population size, D is the dimensionality of the problem (the number of features retained by the filter methods), and T is the time complexity of evaluating the fitness function.
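As an illustration of where these terms come from, the sketch below is our own simplified Python rendering of one DE generation, not the exact implementation used in this study; the binary-mask thresholding and the fitness_fn placeholder are assumptions. The comments mark the O(P × D) mutation and crossover work and the O(P × T) fitness evaluations.

```python
# A simplified sketch of one DE generation for feature selection.
import numpy as np

def de_generation(pop, fitness, fitness_fn, F=0.5, CR=0.9, rng=None):
    rng = rng or np.random.default_rng()
    P, D = pop.shape                        # population size x dimensionality
    for i in range(P):
        # Mutation: O(D) per individual, O(P x D) per generation.
        others = [j for j in range(P) if j != i]
        a, b, c = pop[rng.choice(others, size=3, replace=False)]
        mutant = np.clip(a + F * (b - c), 0.0, 1.0)
        # Crossover: O(D) per individual, O(P x D) per generation.
        mask = rng.random(D) < CR
        mask[rng.integers(D)] = True        # take at least one gene from mutant
        trial = np.where(mask, mutant, pop[i])
        # Fitness evaluation and selection: O(T) each, O(P x T) per generation.
        trial_fitness = fitness_fn(trial > 0.5)
        if trial_fitness >= fitness[i]:
            pop[i], fitness[i] = trial, trial_fitness
    return pop, fitness

# Toy run: P = 50 individuals over D = 280 filtered genes; the dummy
# fitness below simply rewards smaller gene subsets (hypothetical).
rng = np.random.default_rng(0)
pop = rng.random((50, 280))
dummy_fitness = lambda gene_mask: -gene_mask.sum()
fitness = np.array([dummy_fitness(ind > 0.5) for ind in pop], dtype=float)
pop, fitness = de_generation(pop, fitness, dummy_fitness, rng=rng)
```

In the actual approach, the fitness function would train and validate a classifier on the genes whose mask entries exceed the threshold, which is what makes T, and hence the O(P × T) term, dominate in practice.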
The overall time complexity of the proposed hybrid approach combines the computational time of the filter algorithms used to rank the features and reduce the dimensionality of the microarray datasets with the computational time of the DE optimization algorithm used in the second phase to identify the best features. The total computational time complexities of the proposed hybrid filter-DE feature selection approaches are summarized in Table 9.

4.7. Comparison of Proposed Hybrid Filter-DE with Previous Works

This section presents a comparison between the proposed hybrid filter-DE and previous research works that utilized hybrid feature selection techniques on microarray datasets.
This study aimed to reduce the dimensionality of four microarray datasets: Lung, CNS, Brain, and Breast. Hameed et al. [35] proposed the PCC-GA and PCC-BPSO approaches, which integrate Pearson's Correlation Coefficient (PCC) with GA or BPSO on microarray datasets. Almutiri et al. [29] enhanced cancer classification using fusion-based feature selection on four microarray datasets. In another work, the authors of [26] used a hybrid feature selection approach that combines the Gini index (GI) with support vector machines (SVMs) and recursive feature elimination (RFE) to evaluate these four microarray datasets.
Recently, Ali et al. [23] employed hybrid GA-based feature selection on the same four microarray datasets used in this study, integrating a genetic algorithm with filter methods. Table 10 shows the accuracy comparison of the proposed hybrid filter-DE methods against previous research works that used the same datasets. Table 10 reveals that the proposed approaches outperformed most of the previous works on most of the datasets used in this study. For the Brain dataset, it is evident from Table 10 that the suggested hybrid filter-DE feature selection approaches with NB performed well, with 100% accuracy; furthermore, most of the proposed hybrid filter-DE methods with DT also achieved 100% classification accuracy. Similarly, the classification accuracy results accomplished by the previous work [23] with RF were competitive with our proposed methods. For the CNS dataset, the classification accuracy results of most of the proposed hybrid filter-DE methods were better than the accuracy results achieved by other works; in particular, Table 10 reveals that the proposed IG-DE, CHSQR-DE, CR-DE, GIND-DE, and RELIEF-DE with NB performed well, with 100% accuracy. For the Lung dataset, the suggested IGR-DE and GIND-DE with DT performed well, with 98% accuracy; IG-GA [23] and PCC-BPSO [35] with NB also achieved comparable performance of about 98% classification accuracy. For the Breast dataset, it is evident from Table 10 that the proposed CHSQR-DE with DT and IGR-GA [23] with RF outperformed all other approaches, with 93% accuracy.
To show the improvement achieved, the improvement percentage of accuracy achieved by the proposed hybrid filter-DE methods is calculated using the following Formula (28):
$$ IP_{Acc} = \frac{Acc_{S\_F} - Acc_{All\_F}}{Acc_{All\_F}} \times 100 $$
where $IP_{Acc}$ is the improvement percentage of accuracy, $Acc_{S\_F}$ is the accuracy achieved with the features selected by the proposed hybrid filter-DE methods, and $Acc_{All\_F}$ is the accuracy achieved with all features.
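As a worked example (a small helper we add for illustration, not part of the original methodology), the 100% accuracy that NB reached on the Brain dataset from its 69.05% all-features baseline corresponds to an improvement of about 44.82%:

```python
# Formula (28) as a one-line helper; accuracies are given in percent.
def improvement_percentage(acc_selected, acc_all):
    return (acc_selected - acc_all) / acc_all * 100

# NB on Brain: 69.05% with all features -> 100% with hybrid filter-DE.
print(round(improvement_percentage(100, 69.05), 2))   # 44.82
```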
Table 11 compares the improvement percentages of accuracy achieved by the proposed hybrid filter-DE methods and by the previous studies. For the Brain dataset, the average improvement percentage of accuracy achieved by the proposed hybrid filter-DE methods was up to 42.47%, while the previous works achieved up to 41.43%. For CNS, the proposed hybrid filter-DE methods achieved up to 57.45%, while the previous works achieved up to 53.66%. For Lung, the proposed hybrid filter-DE methods achieved up to 16.28%, while the previous works achieved up to 17.53%. For Breast, the proposed hybrid filter-DE methods achieved up to 43.57%, while the previous works achieved up to 61.70%. We conclude that the proposed hybrid filter-DE methods accomplished better improvement percentages of accuracy than the previous works on the Brain and CNS datasets. For the Lung dataset, the improvement percentage achieved by the proposed hybrid filter-DE methods was competitive with that of the previous works, while for the Breast dataset it was lower than that of the previous works.

4.8. Statistical Significance Testing

We performed Wilcoxon signed-rank tests under the null hypothesis (H0) and the alternative hypothesis (H1). The null hypothesis states that there is no difference between the accuracies of the pairs, while the alternative hypothesis states that there is a significant difference between them. The Wilcoxon signed-rank test yields two values: the test statistic and the p-value. The p-value represents the likelihood of obtaining a result at least as extreme as the observed outcome under the assumption that the null hypothesis holds. The statistic is based on the ranks of the differences between the paired data; smaller statistic values indicate stronger evidence of differences between the paired samples. If p ≤ 0.05, there is significant evidence to reject the null hypothesis, indicating a significant difference between the samples. If p > 0.05, we do not reject the null hypothesis, indicating no significant difference in performance; that is, the two methods perform competitively.
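The sketch below shows how such a paired test can be run with SciPy (an assumption on tooling; the two accuracy lists are illustrative placeholders, not the measured values behind Table 12 and Table 13):

```python
# A sketch of the paired Wilcoxon signed-rank test on two accuracy lists.
from scipy.stats import wilcoxon

acc_method_a = [92, 100, 92, 92, 94, 100, 94, 89]   # illustrative pairs
acc_method_b = [90, 95, 92, 88, 94, 96, 93, 85]

statistic, p_value = wilcoxon(acc_method_a, acc_method_b)
print(f"statistic = {statistic}, p = {p_value:.3f}")
if p_value <= 0.05:
    print("Reject H0: the paired accuracies differ significantly.")
else:
    print("Fail to reject H0: the two methods perform competitively.")
```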

4.8.1. Statistical Significance Testing of Proposed Methods Compared with Each Other

From Table 12, we observed that the p-values of IG-DE vs. IGR-DE (p = 0.07), IG-DE vs. CR-DE (p = 0.08), CR-DE vs. GIND-DE (p = 0.16), and CR-DE vs. RELIEF-DE (p = 0.12) were close to the significance level (0.05). These marginal differences indicate that IG-DE and CR-DE performed competitively in these comparisons, with differences approaching, but not reaching, statistical significance.
The remaining comparisons, namely IG-DE vs. CHSQR-DE, IG-DE vs. GIND-DE, IG-DE vs. RELIEF-DE, IGR-DE vs. CR-DE, CHSQR-DE vs. CR-DE, CHSQR-DE vs. GIND-DE, CHSQR-DE vs. RELIEF-DE, and GIND-DE vs. RELIEF-DE, had high p-values (above 0.05). For example, the comparison between IG-DE and CHSQR-DE yielded a p-value of 0.90, while the comparison between GIND-DE and RELIEF-DE produced a p-value of 0.73. These high p-values mean that the null hypothesis (no difference in method performance) cannot be rejected; therefore, the performances of these pairs were not distinguishably different from each other.
Overall, the results are promising for these feature selection methodologies, with IGR-DE emerging as a frontrunner that outperformed GIND-DE, RELIEF-DE, and CHSQR-DE. The other methods (IG-DE, CHSQR-DE, CR-DE, GIND-DE, and RELIEF-DE) demonstrated similar performance, suggesting their potential interchangeability without significant differences in accuracy.

4.8.2. Statistical Significance Testing of Proposed Methods Against Other Existing Works

As shown in Table 13, the Wilcoxon signed-rank test showed that IGR-DE significantly outperformed PCC-GA [35] (p = 0.01) and GI-SVM-RFE [26] (p = 0.006). These p-values, both below 0.05, support rejecting the null hypothesis that there is no significant difference between the compared methods. These findings support our proposed IGR-DE approach: its performance differs significantly from, and is superior to, both GI-SVM-RFE [26] and PCC-GA [35].
Despite the marginal differences in significance level, IGR-DE demonstrated performance on par with other methods such as IG-GA [23] (p = 0.13), Fusion [29] (p = 0.10), and PCC-BPSO [35] (p = 0.11); p-values larger than 0.05 indicate that the proposed method is competitive with these previous research works. IGR-DE was also found to be competitive with, although not significantly different from, IGR-GA [23] (p = 1.0) and CS-GA [23] (p = 0.40).
The Wilcoxon signed-rank test revealed that IG-DE performed comparably to PCC-GA [35] (p = 0.09), PCC-BPSO [35] (p = 0.11), and GI-SVM-RFE [26] (p = 0.11), with p-values only slightly above the significance level. Although IG-DE showed no significant difference from IG-GA [23] (p = 0.75), Fusion [29] (p = 0.68), IGR-GA [23] (p = 0.59), and CS-GA [23] (p = 0.98), these p-values greater than 0.05 imply that the proposed approach performs competitively with the existing works.
The Wilcoxon signed-rank test showed that CHSQR-DE significantly outperformed PCC-GA [35] (p = 0.02) and PCC-BPSO [35] (p = 0.03). These p-values, both below 0.05, support rejecting the null hypothesis and validate the proposed CHSQR-DE methodology: its performance differs significantly from, and is superior to, PCC-GA [35] and PCC-BPSO [35]. CHSQR-DE showed no significant difference compared with other existing methods such as IG-GA [23] (p = 0.54), Fusion [29] (p = 0.56), IGR-GA [23] (p = 0.45), CS-GA [23] (p = 0.34), and GI-SVM-RFE [26] (p = 0.21), but it remained competitive because the p-values were higher than 0.05.
The analysis using the Wilcoxon signed-rank test revealed that CR-DE demonstrated a statistically significant improvement over PCC-GA [35] (p = 0.009) and PCC-BPSO [35] (p = 0.02). These low p-values support rejecting the null hypothesis and indicate that the proposed CR-DE significantly outperforms both PCC-GA [35] and PCC-BPSO [35]. CR-DE showed no significant difference compared with other previous research works such as IG-GA [23] (p = 0.23), Fusion [29] (p = 0.84), IGR-GA [23] (p = 0.27), CS-GA [23] (p = 0.29), and GI-SVM-RFE [26] (p = 0.46), but it remained competitive because the p-values were higher than 0.05.
The Wilcoxon signed-rank test showed that GIND-DE outperformed GI-SVM-RFE [26] with a p-value of 0.05, the threshold of significance; this suggests only a 5% probability that the observed difference in performance between GIND-DE and GI-SVM-RFE [26] is due to random variation, so the performance of GIND-DE differs significantly from that of GI-SVM-RFE [26]. The proposed GIND-DE approach came close to a significant difference from PCC-GA [35] (p = 0.07) and PCC-BPSO [35] (p = 0.09); these marginal p-values demonstrate that GIND-DE performed very competitively against PCC-GA [35] and PCC-BPSO [35], with differences approaching significance. GIND-DE showed no significant difference compared with the other existing methods, such as IG-GA [23] (p = 0.84), Fusion [29] (p = 0.56), IGR-GA [23] (p = 0.59), and CS-GA [23] (p = 0.75), but it remained competitive with them.
The Wilcoxon signed-rank test showed that RELIEF-DE outperformed PCC-GA [35] and PCC-BPSO [35] (p = 0.05 in both cases); these p-values at the significance threshold support rejecting the null hypothesis, validating the proposed RELIEF-DE methodology, whose performance differed significantly from that of PCC-GA [35] and PCC-BPSO [35]. The proposed RELIEF-DE approach was very close to differing significantly from GI-SVM-RFE [26] (p = 0.06), as the observed p-value is only marginally above 0.05. RELIEF-DE showed no significant difference compared with other previous research works such as IG-GA [23] (p = 0.64), Fusion [29] (p = 0.56), IGR-GA [23] (p = 0.49), and CS-GA [23] (p = 0.59); however, it remains competitive, as p-values higher than 0.05 indicate that the performance of the proposed method is on par with the existing works.

4.9. Discussion

Due to the curse of dimensionality in the four cancerous microarray datasets, the common machine learning algorithms trained with all features failed to produce outstanding classification results, as shown in Figure 2, Figure 3, Figure 4 and Figure 5 and Table 4, Table 5, Table 6, Table 7 and Table 8. To reduce the high dimensionality of the microarray datasets, the filter methods contributed to removing redundant and irrelevant features in order to improve the classification results. For the majority of the microarray datasets utilized in this work, Figure 2, Figure 3, Figure 4 and Figure 5 and Table 4, Table 5, Table 6, Table 7 and Table 8 demonstrate that the IG, IGR, and CHSQR filter methods improved the performance of SVM, NB, KNN, DT, and RF. However, filter approaches assess features without consulting a classifier and disregard the relationship between the feature sets and the class label. Therefore, SVM, NB, KNN, DT, and RF with filter techniques alone produced only modest classification improvements on the cancerous microarray datasets.
Figure 2, Figure 3, Figure 4 and Figure 5 and Table 4, Table 5, Table 6, Table 7 and Table 8 demonstrate that when the proposed hybrid filter-DE feature selection techniques were applied, SVM, NB, KNN, DT, and RF produced noticeably better cancer classification results than without feature selection or with only filter methods. Figure 6 illustrates that the best classification accuracies achieved by the proposed hybrid filter-DE over the filter methods increased to 100%, 100%, 93%, and 98% on the four microarray datasets: Brain, CNS, Breast, and Lung, respectively. This was expected, since the proposed hybrid filter-DE feature selection method was able to find a set of optimal features by considering the correlation between the sets of features and the class labels. In addition to enhancing the classification measures, Figure 7 and Table 4, Table 5, Table 6, Table 7 and Table 8 show that applying the suggested DE-based feature selection contributed to removing around 50% of the features retained by the filter methods for the four cancerous microarray datasets. This means that a smaller number of significant features selected by DE can be utilized for cancer classification on microarray datasets.
Compared with other existing hybrid feature selection works [23,24,26], the results in Table 10 demonstrate that, for the majority of the microarray datasets, the proposed hybrid filter and DE-based feature selection outperformed, or was competitive with, previous works that combined filter methods with a genetic algorithm, particle swarm optimization, or other fusion methods. It can be observed from Table 10 that NB, after applying the filter and DE-based feature selection, exceeded the competitor methods and accomplished exceptional performance on the Brain and CNS datasets. For the Lung and Breast datasets, DT with the proposed hybrid GIND-DE and CHSQR-DE outperformed other existing hybrid feature selection works, while the performances of the other machine learning classifiers were competitive with their performances under other existing hybrid feature selection works, as shown in Table 10.

5. Conclusions and Future Work

This study proposes the hybridization of filter and differential evolution (DE)-based feature selection methods to deal with the challenges associated with high-dimensional microarray datasets. The suggested hybrid filter-DE feature selection approach initially identifies the top-ranked five percent (5%) of significant features by means of IG, IGR, CHSQR, GIND, CR, and RELIEF in order to remove irrelevant and redundant features from high-dimensional microarray datasets. The filter feature selection approaches alone do not perform well in cancer classification because they evaluate the features independently of the machine learning algorithm. Therefore, in the next phase, the suggested approach further optimizes the reduced dataset by means of DE to enhance cancer classification. The experimental results showed that the proposed DE-based approaches effectively eliminated approximately 50% of the irrelevant features from the datasets initially refined using the filter methods. This process ensured that only essential features remained for optimizing cancer classification performance. In addition, the hybrid filter-DE feature selection approaches demonstrated superior performance compared with stand-alone classifiers and filter-only classifiers. Furthermore, the suggested hybrid DE method consistently outperformed other hybrid feature selection methods on most of the high-dimensional microarray datasets used in this study. Moving forward, selecting the optimal parameters using optimization methods may enhance the results. Furthermore, using resampling or class-weight learning techniques to tackle data imbalance in the microarray datasets could enhance the suggested hybrid filter-DE feature selection approach. In addition, future research endeavors will focus on enhancing cancer classification for high-dimensional microarray datasets by applying several filter feature selection approaches in combination with various evolutionary and swarm algorithms.

Author Contributions

Conceptualization, A.H. and W.A.; methodology, A.H. and W.A.; software, A.H. and W.A.; validation, A.H., W.A. and F.B.; formal analysis, A.H., W.A., F.B. and E.A.; investigation, A.H., W.A. and F.B.; resources, A.H.; data curation, A.H. and W.A.; writing, A.H., W.A., F.B. and A.A.; writing—review and editing, A.H., W.A., F.B., A.A. and E.A.; visualization, A.H.; supervision, A.H. and W.A.; project administration, A.H.; funding acquisition, A.H. All authors have read and agreed to the published version of the manuscript.

Funding

King Abdulaziz University, Institutional Funding Program for Research and Development, Ministry of Education.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Available upon request.

Acknowledgments

This research work was funded by the Institutional Fund Projects under grant no. (IFPIP:645-830-1443). The authors gratefully acknowledge the technical and financial support provided by the Ministry of Education and King Abdulaziz University, DSR, Jeddah, Saudi Arabia.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Kourou, K.; Exarchos, T.P.; Exarchos, K.P.; Karamouzis, M.V.; Fotiadis, D.I. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 2014, 13, 8–17. [Google Scholar] [CrossRef] [PubMed]
  2. Yu, K.H.; Lee, T.L.M.; Yen, M.H.; Kou, S.C.; Rosen, B.; Chiang, J.H.; Kohane, I.S. Reproducible machine learning methods for lung cancer detection using computed tomography images: Algorithm development and validation. J. Med. Internet Res. 2020, 22, 16709. [Google Scholar] [CrossRef] [PubMed]
  3. Felman, A. What to Know about Breast Cancer. Medical News Today. Available online: https://www.medicalnewstoday.com/articles/37136 (accessed on 20 December 2023).
  4. Malebary, S.J.; Hashmi, A. Automated Breast Mass Classification System Using Deep Learning and Ensemble Learning in Digital Mammogram. IEEE Access 2021, 9, 55312–55328. [Google Scholar] [CrossRef]
  5. Zahoor, M.M.; Qureshi, S.A.; Bibi, S.; Khan, S.H.; Khan, A.; Ghafoor, U.; Bhutta, M.R. A New Deep Hybrid Boosted and Ensemble Learning-Based Brain Tumor Analysis Using MRI. Sensors 2022, 22, 2726. [Google Scholar] [CrossRef]
  6. Hashmi, A.; Barukab, O. Dementia Classification Using Deep Reinforcement Learning for Early Diagnosis. Appl. Sci. 2023, 13, 1464. [Google Scholar] [CrossRef]
  7. Hashmi, A.; Osman, A.H. Brain Tumor Classification Using Conditional Segmentation with Residual Network and Attention Approach by Extreme Gradient Boost. Appl. Sci. 2022, 12, 10791. [Google Scholar] [CrossRef]
  8. Ostrom, Q.T.; Cioffi, G.; Gittleman, H.; Patil, N.; Waite, K.; Kruchko, C.; Barnholtz-Sloan, J.S. CBTRUS Statistical Report: Primary Brain and Other Central Nervous System Tumors Diagnosed in the United States in 2012–2016. Neuro Oncol. 2019, 21, v1–v100. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  9. Musheer, R.A.; Verma, C.K.; Srivastava, N. Novel machine learning approach for classification of high-dimensional microarray data. Soft Comput. 2019, 23, 13409–13421. [Google Scholar] [CrossRef]
  10. Singh, R.K.; Sivabalakrishnan, M. Feature Selection of Gene Expression Data for Cancer Classification: A Review. Procedia Comput. Sci. 2015, 50, 52–57. [Google Scholar] [CrossRef]
  11. Wang, L. Feature selection in bioinformatics. In Independent Component Analyses, Compressive Sampling, Wavelets, Neural Net, Biosystems, and Nanoengineering X; SPIE: Baltimore, MD, USA, 2012; Volume 8401, Available online: https://hdl.handle.net/10356/84511 (accessed on 20 December 2023).
  12. Song, Q.; Ni, J.; Wang, G. A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data. IEEE Trans. Knowl. Data Eng. 2013, 25, 1–14. [Google Scholar] [CrossRef]
  13. Saeys, Y.; Inza, I.; Larrañaga, P. A review of feature selection techniques in bioinformatics. Bioinformatics 2007, 23, 2507–2517. [Google Scholar] [CrossRef] [PubMed]
  14. Wang, A.; Liu, H.; Yang, J.; Chen, G. Ensemble feature selection for stable biomarker identification and cancer classification from microarray expression data. Comput. Biol. Med. 2022, 142, 105208. [Google Scholar] [CrossRef] [PubMed]
  15. De Souza, J.T.; De Francisco, A.C.; De Macedo, D.C. Dimensionality Reduction in Gene Expression Data Sets. IEEE Access 2019, 7, 61136–61144. [Google Scholar] [CrossRef]
  16. Bhui, N. Ensemble of Deep Learning Approach for the Feature Selection from High-Dimensional Microarray Data; Springer: Berlin/Heidelberg, Germany, 2022; pp. 591–600. [Google Scholar] [CrossRef]
  17. Alhenawi, E.; Al-Sayyed, R.; Hudaib, A.; Mirjalili, S. Feature selection methods on gene expression microarray data for cancer classification: A systematic review. Comput. Biol. Med. 2022, 140, 105051. [Google Scholar] [CrossRef]
  18. Abdulla, M.; Khasawneh, M.T. G-Forest: An ensemble method for cost-sensitive feature selection in gene expression microarrays. Artif. Intell. Med. 2020, 108, 101941. [Google Scholar] [CrossRef]
  19. Foster, K.R.; Koprowski, R.; Skufca, J.D. Machine learning, medical diagnosis, and biomedical engineering research—Commentary. Biomed. Eng. OnLine 2014, 13, 94. [Google Scholar] [CrossRef]
  20. MS, K.; Rajaguru, H.; Nair, A.R. Enhancement of Classifier Performance with Adam and RanAdam Hyper-Parameter Tuning for Lung Cancer Detection from Microarray Data—In Pursuit of Precision. Bioengineering 2024, 11, 314. [Google Scholar] [CrossRef]
  21. Elbashir, M.K.; Almotilag, A.; Mahmood, M.A.; Mohammed, M. Enhancing Non-Small Cell Lung Cancer Survival Prediction through Multi-Omics Integration Using Graph Attention Network. Diagnostics 2024, 14, 2178. [Google Scholar] [CrossRef]
  22. Zamri, N.A.; Aziz, N.A.A.; Bhuvaneswari, T.; Aziz, N.H.A.; Ghazali, A.K. Feature Selection of Microarray Data Using Simulated Kalman Filter with Mutation. Processes 2023, 11, 2409. [Google Scholar] [CrossRef]
  23. Ali, W.; Saeed, F. Hybrid Filter and Genetic Algorithm-Based Feature Selection for Improving Cancer Classification in High-Dimensional Microarray Data. Processes 2023, 11, 562. [Google Scholar] [CrossRef]
  24. Elemam, T.; Elshrkawey, M. A Highly Discriminative Hybrid Feature Selection Algorithm for Cancer Diagnosis. Sci. World J. 2022, 2022, 1056490. [Google Scholar] [CrossRef] [PubMed]
  25. Abasabadi, S.; Nematzadeh, H.; Motameni, H.; Akbari, E. Hybrid feature selection based on SLI and genetic algorithm for microarray datasets. J. Supercomput. 2022, 78, 19725–19753. [Google Scholar] [CrossRef] [PubMed]
  26. Saeed, F.; Almutiri, T. A Hybrid Feature Selection Method Combining Gini Index and Support Vector Machine with Recursive Feature Elimination for Gene Expression Classification. Int. J. Data Min. Model. Manag. 2022, 14, 41–62. [Google Scholar] [CrossRef]
  27. Xie, W.; Fang, Y.; Yu, K.; Min, X.; Li, W. MFRAG: Multi-Fitness RankAggreg Genetic Algorithm for biomarker selection from microarray data. Chemom. Intell. Lab. Syst. 2022, 226, 104573. [Google Scholar] [CrossRef]
  28. Dash, R. An Adaptive Harmony Search Approach for Gene Selection and Classification of High Dimensional Medical Data. J. King Saud Univ.-Comput. Inf. Sci. 2021, 33, 195–207. [Google Scholar] [CrossRef]
  29. Almutiri, T.; Saeed, F.; Alassaf, M.; Hezzam, E.A. A Fusion-Based Feature Selection Framework for Microarray Data Classification. In Proceedings of the International Conference of Reliable Information and Communication Technology, Online, 22–23 December 2021; pp. 565–576. [Google Scholar] [CrossRef]
  30. Kilicarslan, S.; Adem, K.; Celik, M. Diagnosis and classification of cancer using hybrid model based on ReliefF and convolutional neural network. Med. Hypotheses 2020, 137, 109577. [Google Scholar] [CrossRef] [PubMed]
  31. Baliarsingh, S.K.; Vipsita, S.; Dash, B. A new optimal gene selection approach for cancer classification using enhanced Jaya-based forest optimization algorithm. Neural Comput. Appl. 2020, 32, 8599–8616. [Google Scholar] [CrossRef]
  32. Almugren, N.; Alshamlan, H. A Survey on Hybrid Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification. IEEE Access 2019, 7, 78533–78548. [Google Scholar] [CrossRef]
  33. Sayed, S.; Nassef, M.; Badr, A.; Farag, I. A Nested Genetic Algorithm for feature selection in high-dimensional cancer Microarray datasets. Expert Syst. Appl. 2019, 121, 233–243. [Google Scholar] [CrossRef]
  34. Ghosh, M.; Adhikary, S.; Ghosh, K.K.; Sardar, A.; Begum, S.; Sarkar, R. Genetic algorithm based cancerous gene identification from microarray data using ensemble of filter methods. Med. Biol. Eng. Comput. 2019, 57, 159–176. [Google Scholar] [CrossRef]
  35. Hameed, S.S.; Muhammad, F.F.; Hassan, R.; Saeed, F. Gene Selection and Classification in Microarray Datasets using a Hybrid Approach of PCC-BPSO/GA with Multi Classifiers. J. Comput. Sci. 2018, 14, 868–880. [Google Scholar] [CrossRef]
  36. White Head Institute Center for Genomic Research Cancer Genomics. Available online: https://wi.mit.edu/our-research/cancer (accessed on 16 January 2024).
  37. Pomeroy, S.L.; Tamayo, P.; Gaasenbeek, M.; Sturla, L.M.; Angelo, M.; McLaughlin, M.E.; Kim, J.Y.H.; Goumnerova, L.C.; Black, P.M.; Lau, C.; et al. Prediction of central nervous system embryonal tumour outcome based on gene expression. Nature 2002, 415, 436–442. [Google Scholar] [CrossRef] [PubMed]
  38. Van’t Veer, L.J.; Dai, H.; Van De Vijver, M.J.; He, Y.D.; Hart, A.A.; Mao, M.; Peterse, H.L.; Van Der Kooy, K.; Marton, M.J.; Witteveen, A.T. Gene expression profiling predicts clinical outcome of breast cancer. Nature 2002, 415, 530–536. [Google Scholar] [CrossRef] [PubMed]
  39. Li, J.; Liu, H. Kent Ridge Biomedical Data Set Repository. Available online: https://leo.ugr.es/elvira/DBCRepository/ (accessed on 15 December 2023).
  40. Hall, M.A.; Smith, L.A. Feature Subset Selection: A Correlation based Filter Approach. In Proceedings of the International Conference on Neural Information Processing and Intelligent Information Systems, Dunedin, New Zealand, 24–28 November 1997. [Google Scholar]
  41. Lai, C.-M.; Yeh, W.-C.; Chang, C.-Y. Gene selection using information gain and improved simplified swarm optimization. Neurocomputing 2016, 218, 331–338. [Google Scholar] [CrossRef]
  42. Han, J.; Kamber, M. Data Mining Concepts and Techniques, 3rd ed.; Morgan Kaufmann Publishers: Waltham, MA, USA; Elsevier: Amsterdam, The Netherlands, 2001. [Google Scholar]
  43. Demšar, J. Algorithms for subsetting attribute values with Relief. Mach. Learn. 2010, 78, 421–428. [Google Scholar] [CrossRef]
  44. Jin, X.; Xu, A.; Bie, R.; Guo, P. Machine Learning Techniques and Chi-Square Feature Selection for Cancer Classification Using SAGE Gene Expression Profiles. In Proceedings of the Data Mining for Biomedical Applications: PAKDD 2006 Workshop (BioDM 2006), Singapore, 9 April 2006; pp. 106–115. [Google Scholar] [CrossRef]
  45. Hayes, A. Gini Index Explained and Gini Coefficients Around the World. Available online: https://www.investopedia.com/terms/g/gini-index.asp (accessed on 10 February 2024).
  46. Hancer, E.; Xue, B.; Zhang, M. Differential evolution for filter feature selection based on information theory and feature ranking. Knowl.-Based Syst. 2018, 140, 103–119. [Google Scholar] [CrossRef]
  47. Storn, R.; Price, K. Differential Evolution—A Simple and Efficient Heuristic for global Optimization over Continuous Spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
  48. Zorarpacı, E.; Özel, S.A. A hybrid approach of differential evolution and artificial bee colony for feature selection. Expert Syst. Appl. 2016, 62, 91–103. [Google Scholar] [CrossRef]
Figure 1. The methodology of improving cancer classification in Microarray data using the proposed hybrid filter and differential evolution-based feature selection.
Figure 2. Comparison of machine learning accuracy on the Brain dataset using all features, filter features, and hybrid filter-DE methods.
Figure 3. Comparison of machine learning accuracy on the CNS dataset using all features, filter features, and hybrid filter-DE methods.
Figure 4. Comparison of machine learning accuracy on the Lung dataset using all features, filter features, and hybrid filter-DE features.
Figure 5. Comparison of accuracy results on the Breast dataset using all features, filter approaches, and the suggested hybrid filter-DE method.
Figure 6. Comparison of the best accuracy results achieved by filter methods and the proposed hybrid filter-DE feature selection method.
Figure 7. Comparison of the best results of features selected by filter methods and the proposed hybrid filter-DE feature selection method.
Table 1. Characteristics of the four cancerous microarray datasets used for assessing the proposed hybrid filter-DE feature selection method.
Dataset | No. of Samples | No. of Features | No. of Classes | Details of Classes
Breast [38] | 97 | 24,481 | 2 | Tumour (46 samples) and Normal (51 samples)
Lung [39] | 203 | 12,600 | 5 | Normal tissue (17 samples), adenocarcinoma (139 samples), pulmonary carcinoid (20 samples), squamous carcinoma (21 samples), and small cell cancer (6 samples)
Brain [36,37] | 42 | 5597 | 5 | Medulloblastomas (10 samples), non-embryonal brain tumours (10 samples), normal human cerebella (4 samples), PNET (8 samples), and RERCNS (10 samples)
CNS [39] | 60 | 7129 | 2 | Tumour (39 samples) and Normal (21 samples)
Table 2. Some scenarios used to get the best crossover rate (CR) and population size (P).
Dataset | Model | DE Parameters | Accuracy
BREAST | DE-CR-KNN | P = 50, CR = 0.9 | 83
BREAST | DE-CR-KNN | P = 100, CR = 0.5 | 73
BREAST | DE-CR-KNN | P = 150, CR = 0.6 | 73
BREAST | DE-CHSQR-DT | P = 50, CR = 0.9 | 93
BREAST | DE-CHSQR-DT | P = 100, CR = 0.5 | 67
BREAST | DE-CHSQR-DT | P = 150, CR = 0.6 | 63
BREAST | DE-CHSQR-DT | P = 200, CR = 0.7 | 63
BRAIN | DE-IG-NB | P = 100, CR = 0.9 | 100
BRAIN | DE-IG-NB | P = 150, CR = 0.6 | 85
BRAIN | DE-IG-NB | P = 100, CR = 0.5 | 85
BRAIN | DE-IG-NB | P = 200, CR = 0.7 | 92
BRAIN | DE-IGR-NB | P = 50, CR = 0.9 | 100
BRAIN | DE-IGR-NB | P = 200, CR = 0.7 | 77
BRAIN | DE-IGR-NB | P = 150, CR = 0.6 | 77
BRAIN | DE-IGR-NB | P = 100, CR = 0.5 | 69
BRAIN | DE-IG-DT | P = 50, CR = 0.9 | 92
BRAIN | DE-IG-DT | P = 100, CR = 0.5 | 54
BRAIN | DE-IG-DT | P = 150, CR = 0.6 | 54
BRAIN | DE-IG-DT | P = 200, CR = 0.7 | 62
CNS | DE-IG-SVM | P = 50, CR = 0.9 | 94
CNS | DE-IG-SVM | P = 100, CR = 0.5 | 93
CNS | DE-IG-SVM | P = 150, CR = 0.6 | 94
CNS | DE-IG-SVM | P = 200, CR = 0.7 | 94
CNS | DE-GIND-NB | P = 50, CR = 0.9 | 100
CNS | DE-GIND-NB | P = 100, CR = 0.5 | 89
CNS | DE-GIND-NB | P = 150, CR = 0.6 | 83
CNS | DE-GIND-NB | P = 200, CR = 0.7 | 94
CNS | DE-RELIEF-DT | P = 50, CR = 0.9 | 94
CNS | DE-RELIEF-DT | P = 100, CR = 0.6 | 61
CNS | DE-RELIEF-DT | P = 150, CR = 0.7 | 78
CNS | DE-RELIEF-DT | P = 200, CR = 0.5 | 67
LUNG | DE-IGR-RF | P = 50, CR = 0.9 | 95
LUNG | DE-IGR-RF | P = 100, CR = 0.5 | 90
LUNG | DE-IGR-RF | P = 150, CR = 0.7 | 92
LUNG | DE-IGR-RF | P = 200, CR = 0.7 | 93
LUNG | DE-GIND-DT | P = 50, CR = 0.9 | 98
LUNG | DE-GIND-DT | P = 100, CR = 0.5 | 77
LUNG | DE-GIND-DT | P = 150, CR = 0.6 | 82
LUNG | DE-GIND-DT | P = 200, CR = 0.7 | 82
Table 3. Parameter details of DE used in the hybrid filter-DE feature selection approach across the experiment on all datasets.
DE Parameters | Value
Crossover rate (CR) | 0.9
Population size (P) | 50
No. of generations | 50 to 600
Step size for number of generations | 50
Table 4. Performance comparison of classifiers utilizing all features, filter methods, and the suggested hybrid filter-DE approaches on the Brain dataset. The background color shows the lowest number of features.
BrainALL-
FT
IGIG-DEIGRIGR-DECHSQRCHSQR-DECRCR-DEGINDGIND-DERELIEFRELIEF-DE
SVMAccuracy6980.959266.679266.679283.338580.959271.992
Recall58719256925693738571935292
Precision48.2866.189447.479445.189567.098866.189547.6994
F-measure52.768.519251.3892.9950.019369.9286.4768.519349.7592.99
No. of Features5597280129280142280137280131280140280149
NBAccuracy69.0588.110085.7110083.3310071.4310083.3310080.95100
Recall60.0580.51007810076.051006310073.0510072100
Precision58.6990.9110088.7210088.7710061.4310068.9510070100
F-measure59.5885.3910083.0210082.1810062.2110071.1510070.99100
No. of Features5597280153280153280137280134280146280149
KNNAccuracy78.5780.959283.339280.959288.18583.339288.192
Recall75.0580.593839380.593868783938893
Precision86.3684.339583.339587.059590.719285.859591.2195
F-measure80.5782.379385.589383.659388.298584.49389.5893
No. of Features5597280131280142280138280127280146280149
DTAccuracy5061.99264.2910069.0510061.098564.2910061.9100
Recall48.5659361.0510069100608767.510065100
Precision53.0661.739563.8310069.8910060.168565.7310064.95100
F-measure50.6863.329362.6410069.4410060.088266.610064.97100
No. of Features5597280149280145280121280134280139280122
RFAccuracy78.5790.489292.869288.19288.18590.469292.8692
Recall76.5889293928893838788939392
Precision80.2192.739494.369491.339589.529091.999594.3694
F-measure78.3190.39293.689289.639386.148489.959393.6892
No. of Features5597280149280149280138280134280150280122
Table 5. Performance comparison of classifiers utilizing all features, filters, and the suggested hybrid filter-DE approaches on the CNS dataset. The background color shows the lowest number of features.
CNSALL-
FT
IGIG-DEIGRIGR-DECHSQRCHSQR-DECRCR-DEGINDGIND-DERELIEFRELIEF-DE
SVMAccuracy65659465786589659465896589
Recall100100941007810083100941008910089
Precision65659565776593659565906590
F-measure78.7978.899410077.578.798678.799478.798878.7989.5
No. of Features6129306156306170306176306176306175306178
NBAccuracy61.677510078.33947010076.671008010073.33100
Recall66.6776.9210084.629271.7910079.4910082.0510076.92100
Precision72.2283.3310082.5968010083.7810086.4910081.08100
F-measure69.338010083.549375.6810081.5810084.2110078.95100
No. of Features6129306156306165306168306180306181306178
KNNAccuracy61.67759468.3372758376.6794759471.6783
Recall79.4982.059284.627287.188379.499279.499487.1883
Precision67.39809671.747177.278383.789681.589573.9183
F-measure72.9481.019377.656981.938381.589380.52948083
No. of Features6129306156306176306168306176306175306178
DTAccuracy58.3366.679468.339461.6789559468.33946594
Recall66.6776.929679.499667.399261.549679.499669.2396
Precision68.4273.179373.819379.498866.679373.81937593
F-measure67.53759476.549472.9488649476.549469.2394
No. of Features6129306179306174306181306177306181306185
RFAccuracy53.338089808383.338377.338383.33897589
Recall71.7989.748897.447582.227987.187989.748387.1883
Precision62.2281.48877.559094.878275.568285.379377.2793
F-measure66.6785.378886.367888.18080.958087.58687.1886
No. of Features6129306171306174306193306193306158306175
Table 6. Performance comparison of classifiers utilizing all features, filters, and the suggested hybrid filter-DE approaches on the Lung dataset. The background color shows the lowest number of features.
LungALL-
FT
IGIG-DEIGRIGR-DECHSQRCHSQR-DECRCR-DEGINDGIND-DERELIEFRELIEF-DE
SVMAccuracy68.4768.479868.479768.479768.479768.479568.4793
Recall20209720932093209220772073
Precision13.6913.6910013.699913.699913.699913.697913.6978
F-measure16.2516.259816.259516.259516.259516.257716.2575
No. of Features12,600630301630304630321630309630290630295
NBAccuracy90.15959793.69292.129094.099395.079391.6392
Recall79.0788.59793.649293.219090.379388.59388.5392
Precision88.2194.369788.99384.929289.079390.499483.4593
F-measure83.3991.349791.219288.879189.729389.489485.9192
No. of Features12,600630301630304630321630309630290630295
KNNAccuracy92.6192.619789.669592.129291.139593.69389.1693
Recall80.7387.919673.989280.368286.459484.798668.8573
Precision95.2289.019693.19595.099186.969393.389393.3878
F-measure87.3888.59682.459387.118586.79588.888779.2675
No. of Features12,600630301630304630321630309630290630295
DTAccuracy84.4386.79786.219885.229779.89391.139889.1697
Recall69.0773.699379.29672.49369.079584.699686.4593
Precision84.1580.639980.1310085.929975.929083.4910086.4399
F-measure75.87779579.669878.589572.339284.099886.4495
No. of Features12,600630296630296630317630311630296630304
RFAccuracy83.7493.69391.139592.619386.79393.69592.1293
Recall59.0197.158878.469283.478268.778882.999275.688
Precision93.5485.69494.459496.829496.759497.079595.9694
F-measure72.4391.019185.729389.658680.49189.489384.5791
No. of Features12,600630303630320630313630303630308630289
Table 7. The comparison of FPR and FNR for the suggested hybrid filter-DE approaches on the Lung dataset.
Classes in Lung Dataset
Normal
Tissue
Adeno-
Carcinoma
Pulmonary
Carcinoid
Squamous
Carcinoma
Small
Cell Cancer
Average
SVMIG-DEFPR0.530.000.000.0000.11
FNR0.000.000.000.1700.03
IGR-DEFPR0.110.000.000.0000.02
FNR0.000.400.000.0000.08
CHSQR-DEFPR0.110.000.000.0000.02
FNR0.000.200.000.1700.07
CR-DEFPR0.110.000.000.0000.02
FNR0.000.400.000.0000.08
GIND-DEFPR0.160.000.000.0000.03
FNR0.000.001.000.1700.23
RELIEF-DEFPR0.210.000.000.0000.04
FNR0.000.201.000.1700.27
NBIG-DEFPR0.050.020.000.0000.01
FNR0.020.200.000.0000.04
IGR-DEFPR0.050.020.000.0500.02
FNR0.100.200.000.0000.06
CHSQR-DEFPR0.050.020.020.0500.03
FNR0.120.200.000.0000.06
CR-DEFPR0.160.020.000.0000.04
FNR0.020.400.000.1700.12
GIND-DEFPR0.050.020.000.0400.02
FNR0.070.200.000.0000.05
RELIEF-DEFPR0.050.020.000.0500.02
FNR0.100.200.000.0000.06
KNNIG-DEFPR0.050.020.000.0000.01
FNR0.020.000.000.1700.04
IGR-DEFPR0.110.020.000.0000.02
FNR0.020.200.000.1700.08
CHSQR-DEFPR0.160.020.000.0200.04
FNR0.050.200.500.1700.18
CR-DEFPR0.050.020.000.0200.02
FNR0.480.200.000.0000.14
GIND-DEFPR0.110.040.000.0000.03
FNR0.050.000.500.1700.14
RELIEF-DEFPR0.210.000.000.0000.04
FNR0.000.201.000.1700.27
DTIG-DEFPR0.110.000.000.0000.02
FNR0.000.200.000.1700.07
IGR-DEFPR0.050.000.000.0000.01
FNR0.000.200.000.0000.04
CHSQR-DE FPR0.110.000.000.0000.02
FNR0.000.200.000.1700.07
CR-DEFPR0.050.040.000.0200.02
FNR0.070.200.000.0000.05
GIND-DEFPR0.050.000.000.0000.01
FNR0.000.200.000.0000.04
RELIEF-DEFPR0.110.000.000.0000.02
FNR0.000.200.000.1700.07
RFIG-DEFPR0.020.020.000.0000.01
FNR0.020.400.000.1700.12
IGR-DEFPR0.110.020.000.0000.02
FNR0.020.400.000.0000.08
CHSQR-DEFPR0.160.020.000.0000.03
FNR0.020.400.500.0000.18
CR-DEFPR0.160.020.000.0000.03
FNR0.020.400.000.1700.12
GIND-DEFPR0.110.020.000.0000.02
FNR0.020.200.000.1700.08
RELIEF-DEFPR0.160.020.000.0000.04
FNR0.020.400.000.1700.12
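The per-class rates in Table 7 follow from a one-vs-rest reading of the multi-class confusion matrix: for each class, FPR = FP/(FP + TN) and FNR = FN/(FN + TP), and the Average column is the mean over the five classes. A small sketch of that computation follows; the toy labels are hypothetical.

```python
# Sketch: per-class FPR and FNR (one-vs-rest) from a confusion matrix,
# the quantities reported in Table 7.
import numpy as np
from sklearn.metrics import confusion_matrix

def per_class_fpr_fnr(y_true, y_pred, labels):
    """Return {class: (FPR, FNR)} plus the across-class averages."""
    cm = confusion_matrix(y_true, y_pred, labels=labels)
    total = cm.sum()
    rates = {}
    for i, lab in enumerate(labels):
        tp = cm[i, i]
        fn = cm[i, :].sum() - tp        # class-i samples predicted as others
        fp = cm[:, i].sum() - tp        # other samples predicted as class i
        tn = total - tp - fn - fp
        rates[lab] = (fp / (fp + tn), fn / (fn + tp))
    avg = tuple(np.mean([r[k] for r in rates.values()]) for k in (0, 1))
    return rates, avg

# Toy usage with three of the five Lung classes (hypothetical labels):
y_true = ["normal", "adeno", "adeno", "carcinoid", "normal"]
y_pred = ["normal", "adeno", "normal", "carcinoid", "normal"]
print(per_class_fpr_fnr(y_true, y_pred, ["normal", "adeno", "carcinoid"]))
```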
Table 8. Classifier performance comparison on the Breast dataset using all features (ALL-FT), filters, and the proposed hybrid filter-DE methods. Bold marks the lowest number of selected features (highlighted with a background color in the original layout).

| Classifier | Metric | ALL-FT | IG | IG-DE | IGR | IGR-DE | CHSQR | CHSQR-DE | CR | CR-DE | GIND | GIND-DE | RELIEF | RELIEF-DE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SVM | Accuracy | 52.58 | 74.23 | 73 | 69.07 | 73 | 73.2 | 73 | 74.23 | 73 | 78.35 | 73 | 76.29 | 73 |
| | Recall | 100 | 80.39 | 73 | 76.47 | 73 | 70.59 | 73 | 70.59 | 73 | 80.39 | 73 | 72.55 | 73 |
| | Precision | 52.58 | 73.21 | 74 | 68.42 | 74 | 76.6 | 75 | 78.26 | 73 | 78.85 | 73 | 80.43 | 74 |
| | F-measure | 68.92 | 76.64 | 73 | 72.22 | 73.5 | 73.47 | 73 | 74.23 | 73 | 80.39 | 73 | 76.29 | 73 |
| | No. of Features | 24,481 | 1224 | 591 | 1224 | 606 | 1224 | 617 | 1224 | 610 | 1224 | 639 | 1224 | **581** |
| NB | Accuracy | 48.45 | 55.67 | 53 | 54.64 | 53 | 72.33 | 70 | 77.32 | 70 | 56.64 | 60 | 77.32 | 77 |
| | Recall | 76.47 | 96.08 | 50 | 90.2 | 50 | 74.51 | 70 | 78.43 | 70 | 96.08 | 57 | 78.43 | 77 |
| | Precision | 50.65 | 55.44 | 27 | 54.12 | 27 | 73.08 | 70 | 78.43 | 70 | 53.85 | 79 | 78.43 | 77 |
| | F-measure | 60.94 | 69.5 | 35 | 67.65 | 35 | 73.79 | 70 | 78.43 | 70 | 96.08 | 49 | 78.43 | 77 |
| | No. of Features | 24,481 | 1224 | 623 | 1224 | **572** | 1224 | **572** | 1224 | 610 | 1224 | 576 | 1224 | 596 |
| KNN | Accuracy | 55.67 | 71.13 | 80 | 64.95 | 70 | 72.16 | 77 | 76.29 | 83 | 73.2 | 80 | 74.23 | 80 |
| | Recall | 78.43 | 82.35 | 80 | 90.2 | 68 | 84.31 | 76 | 76.47 | 83 | 88.24 | 80 | 82.35 | 80 |
| | Precision | 55.56 | 68.85 | 80 | 61.33 | 82 | 69.35 | 77 | 78 | 83 | 69.23 | 80 | 72.41 | 80 |
| | F-measure | 65.04 | 75 | 80 | 73.02 | 65 | 76.11 | 76 | 77.23 | 83 | 77.59 | 80 | 77.06 | 80 |
| | No. of Features | 24,481 | 1224 | 622 | 1224 | 612 | 1224 | 584 | 1224 | 583 | 1224 | **581** | 1224 | 622 |
| DT | Accuracy | 57.73 | 67.01 | 87 | 61.86 | 87 | 68.04 | 93 | 67.01 | 87 | 70.1 | 87 | 63.92 | 90 |
| | Recall | 66.67 | 72.55 | 86 | 68.63 | 87 | 68.63 | 93 | 78.43 | 87 | 76.47 | 87 | 68.63 | 90 |
| | Precision | 58.62 | 62.27 | 88 | 62.5 | 87 | 70 | 94 | 65.57 | 87 | 69.64 | 87 | 64.81 | 90 |
| | F-measure | 62.39 | 69.81 | 86 | 65.42 | 87 | 69.31 | 93 | 71.43 | 87 | 72.9 | 87 | 66.67 | 90 |
| | No. of Features | 24,481 | 1224 | 599 | 1224 | 623 | 1224 | 615 | 1224 | 603 | 1224 | 605 | 1224 | **595** |
| RF | Accuracy | 63.92 | 86.6 | 70 | 87.63 | 70 | 81.44 | 80 | 76.29 | 70 | 82.47 | 77 | 80.41 | 77 |
| | Recall | 70.59 | 88.24 | 70 | 90.2 | 70 | 84.31 | 80 | 78.43 | 70 | 86.27 | 77 | 78.43 | 77 |
| | Precision | 64.29 | 86.54 | 70 | 86.89 | 72 | 81.13 | 80 | 76.92 | 70 | 81.48 | 77 | 83.33 | 77 |
| | F-measure | 67.29 | 87.38 | 69 | 88.46 | 69 | 82.69 | 80 | 77.67 | 70 | 86.27 | 76 | 80.81 | 77 |
| | No. of Features | 24,481 | 1224 | **595** | 1224 | 619 | 1224 | 619 | 1224 | 604 | 1224 | 620 | 1224 | 596 |
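As a worked example of the two-stage reduction visible in Tables 6 and 8: the filters first cut the Breast dataset from 24,481 genes to 1224 (and Lung from 12,600 to 630), and the DE phase then roughly halves those subsets again. For instance, SVM with IG-DE on Breast uses 591 of the 1224 filter-selected genes, a further reduction of (1224 − 591)/1224 ≈ 51.7%.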
Table 9. The computational time complexity of the proposed methods.

| The Proposed Hybrid Methods | Time Complexity |
|---|---|
| IG-DE, IGR-DE, GIND-DE | O(F × N × log N) + O(G × P × (D + T)) |
| CR-DE, CHSQR-DE | O(F × N) + O(G × P × (D + T)) |
| RELIEF-DE | O(l × F × N) + O(G × P × (D + T)) |
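In each row, the first term is the cost of the filter ranking and the second is the DE search, reading F as the number of genes, N the number of samples, l the RELIEF iteration count, G the number of DE generations, P the population size, D the dimensionality of a candidate solution, and T the cost of one classifier evaluation. The sketch below makes the nested-loop structure behind the O(G × P × (D + T)) term explicit; the stand-in fitness function and all parameter values are illustrative assumptions (in the actual method, fitness would train and score a classifier on the candidate gene subset).

```python
# DE-style search sketch making the O(G x P x (D + T)) term visible.
# The fitness stub and parameter values are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
D, P, G = 630, 20, 50          # dimension, population size, generations
F_SCALE, CR = 0.8, 0.9         # DE mutation factor and crossover rate

pop = rng.random((P, D))       # continuous encodings; > 0.5 means "keep gene"

def fitness(vec):
    mask = vec > 0.5
    # O(T) in the real method: train/score a classifier on X[:, mask].
    # Stand-in objective for illustration: reward keeping fewer genes.
    return -mask.sum() / D

for _ in range(G):                                   # O(G) generations
    for i in range(P):                               # O(P) members
        others = [j for j in range(P) if j != i]
        a, b, c = pop[rng.choice(others, 3, replace=False)]
        mutant = a + F_SCALE * (b - c)               # O(D) mutation
        cross = rng.random(D) < CR
        trial = np.where(cross, mutant, pop[i])      # O(D) crossover
        if fitness(trial) >= fitness(pop[i]):        # 2 x O(T) evaluations
            pop[i] = trial
```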
Table 10. The accuracy comparison of the proposed hybrid filter-DE methods with the previous studies.

| Dataset | Classifier | IG-DE | IGR-DE | CHSQR-DE | CR-DE | GIND-DE | RELIEF-DE | IG-GA [23] | IGR-GA [23] | CS-GA [23] | GI-SVM-RFE [26] | Fusion [29] | PCC-GA [35] | PCC-BPSO [35] |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BRAIN | KNN | 92 | 92 | 92 | 85 | 92 | 92 | 92.86 | 97.62 | 97.62 | 87.5 | 95 | 95.24 | 97.62 |
| | NB | 100 | 100 | 100 | 100 | 100 | 100 | 92.86 | 95.24 | 95.24 | 88 | N/A | 90.48 | 92.86 |
| | DT | 92 | 100 | 100 | 85 | 100 | 100 | 85.71 | 88.1 | 85.71 | 71.5 | N/A | N/A | N/A |
| | RF | 92 | 92 | 92 | 85 | 92 | 92 | 100 | 100 | 100 | 90 | 88.67 | 95.24 | 85.71 |
| | SVM | 92 | 92 | 92 | 85 | 92 | 92 | 85.71 | 97.62 | 97.62 | N/A | N/A | 97.62 | 97.62 |
| CNS | KNN | 94 | 72 | 83 | 94 | 94 | 83 | 93.33 | 83.33 | 88.33 | 81.67 | N/A | 96.67 | 93.55 |
| | NB | 100 | 94 | 100 | 100 | 100 | 100 | 90 | 88.33 | 83.33 | 85 | N/A | 90 | 91.94 |
| | DT | 94 | 94 | 89 | 94 | 94 | 94 | 93.33 | 93.33 | 88.33 | 75 | N/A | N/A | N/A |
| | RF | 89 | 83 | 83 | 83 | 89 | 89 | 91.67 | 90 | 88.33 | 83.33 | 76.48 | 85 | 91.94 |
| | SVM | 94 | 78 | 89 | 94 | 89 | 89 | 86.67 | 65 | 83.33 | N/A | 75 | 98.33 | 91.94 |
| LUNG | KNN | 97 | 95 | 92 | 95 | 93 | 93 | 97.04 | 96.06 | 95.57 | 92.62 | N/A | 97.54 | 96.06 |
| | NB | 97 | 92 | 90 | 93 | 93 | 92 | 98.52 | 97.54 | 97.04 | 91.17 | N/A | 97.04 | 98.03 |
| | DT | 97 | 98 | 97 | 93 | 98 | 97 | 96.55 | 96.06 | 96.55 | 88.71 | N/A | N/A | N/A |
| | RF | 93 | 95 | 93 | 93 | 95 | 93 | 96.06 | 95.57 | 96.06 | 93.64 | N/A | 96.06 | 96.06 |
| | SVM | 98 | 97 | 97 | 97 | 95 | 93 | 94.09 | 94.58 | 95.07 | N/A | N/A | 97.54 | 97.04 |
| BREAST | KNN | 80 | 70 | 77 | 83 | 80 | 80 | 89.69 | 86.6 | 84.54 | 87.67 | N/A | 86.6 | 87.63 |
| | NB | 53 | 53 | 70 | 70 | 60 | 77 | 57.37 | 62.89 | 79.38 | 90.67 | N/A | 85.57 | 88.66 |
| | DT | 87 | 87 | 93 | 87 | 87 | 90 | 86.6 | 90.72 | 84.54 | 72.22 | N/A | N/A | N/A |
| | RF | 70 | 70 | 80 | 70 | 77 | 77 | 89.69 | 93.81 | 85.57 | 88.67 | 84.65 | 84.54 | 85.57 |
| | SVM | 73 | 73 | 73 | 73 | 73 | 73 | 84.54 | 82.47 | 82.47 | N/A | 75.11 | 88.66 | 90.72 |
Table 11. The comparison of the improvement percentage of accuracy achieved by the proposed hybrid filter-DE methods and the previous studies.

| Dataset | Classifier | IG-DE | IGR-DE | CHSQR-DE | CR-DE | GIND-DE | RELIEF-DE | IG-GA [23] | IGR-GA [23] | CS-GA [23] | GI-SVM-RFE [26] | Fusion [29] | PCC-GA [35] | PCC-BPSO [35] |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BRAIN | KNN | 17.09 | 17.09 | 17.09 | 8.18 | 17.09 | 17.09 | 18.19 | 24.25 | 24.25 | 11.37 | 20.91 | 21.22 | 24.25 |
| | NB | 44.82 | 44.82 | 44.82 | 44.82 | 44.82 | 44.82 | 34.48 | 37.93 | 37.93 | 27.44 | N/A | 31.04 | 34.48 |
| | DT | 84.00 | 100.00 | 100.00 | 70.00 | 100.00 | 100.00 | 71.42 | 76.20 | 71.42 | 43.00 | N/A | N/A | N/A |
| | RF | 17.09 | 17.09 | 17.09 | 8.18 | 17.09 | 17.09 | 27.28 | 27.28 | 27.28 | 14.55 | 12.85 | 21.22 | 9.09 |
| | SVM | 33.33 | 33.33 | 33.33 | 23.19 | 33.33 | 33.33 | 24.22 | 41.48 | 41.48 | N/A | N/A | 41.48 | 41.48 |
| | Average | 39.27 | 42.47 | 42.47 | 30.88 | 42.47 | 42.47 | 35.12 | 41.43 | 40.47 | 24.09 | 16.88 | 28.74 | 27.32 |
| CNS | KNN | 52.42 | 16.75 | 34.59 | 52.42 | 52.42 | 34.59 | 51.34 | 35.12 | 35.12 | 32.43 | N/A | 56.75 | 51.69 |
| | NB | 62.15 | 52.42 | 62.15 | 62.15 | 62.15 | 62.15 | 45.94 | 43.23 | 35.12 | 37.83 | N/A | 45.94 | 49.08 |
| | DT | 61.15 | 61.15 | 52.58 | 61.15 | 61.15 | 61.15 | 60.00 | 60.00 | 42.86 | 28.58 | N/A | N/A | N/A |
| | RF | 66.89 | 55.63 | 55.63 | 55.63 | 66.89 | 66.89 | 71.89 | 68.76 | 56.25 | 56.25 | 43.41 | 59.38 | 72.40 |
| | SVM | 44.62 | 20.00 | 36.92 | 44.62 | 36.92 | 36.92 | 33.34 | 0.00 | 28.20 | N/A | 15.38 | 51.28 | 41.45 |
| | Average | 57.45 | 41.19 | 48.38 | 55.20 | 55.91 | 52.34 | 52.50 | 41.42 | 39.51 | 38.77 | 29.40 | 53.34 | 53.66 |
| LUNG | KNN | 4.74 | 2.58 | 0.66 | 2.58 | 0.42 | 0.42 | 4.78 | 3.73 | 3.20 | 0.01 | N/A | 5.32 | 3.73 |
| | NB | 7.60 | 2.05 | 0.17 | 3.16 | 3.16 | 2.05 | 9.28 | 8.20 | 7.64 | 1.13 | N/A | 7.64 | 8.74 |
| | DT | 14.89 | 16.07 | 14.89 | 10.15 | 16.07 | 14.89 | 14.36 | 13.77 | 14.36 | 5.07 | N/A | N/A | N/A |
| | RF | 11.06 | 13.45 | 11.06 | 11.06 | 13.45 | 11.06 | 14.71 | 14.13 | 14.71 | 11.82 | N/A | 14.71 | 14.71 |
| | SVM | 43.13 | 41.67 | 41.67 | 41.67 | 38.75 | 35.83 | 37.42 | 38.13 | 38.85 | N/A | N/A | 42.46 | 41.73 |
| | Average | 16.28 | 15.16 | 13.36 | 13.72 | 14.37 | 12.85 | 16.11 | 15.59 | 15.75 | 4.51 | N/A | 17.53 | 17.23 |
| BREAST | KNN | 43.70 | 25.74 | 38.32 | 49.09 | 43.70 | 43.70 | 61.11 | 55.56 | 51.86 | 57.48 | N/A | 55.56 | 57.41 |
| | NB | 9.39 | 9.39 | 44.48 | 44.48 | 23.84 | 58.93 | 18.41 | 29.80 | 63.84 | 87.14 | N/A | 76.62 | 82.99 |
| | DT | 50.70 | 50.70 | 61.09 | 50.70 | 50.70 | 55.90 | 50.01 | 57.15 | 46.44 | 25.10 | N/A | N/A | N/A |
| | RF | 9.51 | 9.51 | 25.16 | 9.51 | 20.46 | 20.46 | 40.32 | 46.76 | 33.87 | 38.72 | 32.43 | 32.26 | 33.87 |
| | SVM | 38.84 | 38.84 | 38.84 | 38.84 | 38.84 | 38.84 | 60.78 | 56.85 | 56.85 | N/A | 42.85 | 68.62 | 72.54 |
| | Average | 30.43 | 26.84 | 41.58 | 38.52 | 35.51 | 43.57 | 46.13 | 49.22 | 50.57 | 52.11 | 37.64 | 58.26 | 61.70 |
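The entries in Table 11 are relative gains over the all-features baseline and can be cross-checked against Table 6: for SVM on the Lung dataset, accuracy rises from 68.47% (ALL-FT) to 98% with IG-DE, i.e. (98 − 68.47)/68.47 × 100 ≈ 43.13%, which is exactly the LUNG/SVM/IG-DE entry above.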
Table 12. Comparison of the p-value of the proposed methods with each other.

| Proposed Methods | p-Value |
|---|---|
| IG-DE vs. IGR-DE | 0.07 |
| IG-DE vs. CHSQR-DE | 0.90 |
| IG-DE vs. CR-DE | 0.08 |
| IG-DE vs. GIND-DE | 0.59 |
| IG-DE vs. RELIEF-DE | 0.95 |
| IGR-DE vs. CHSQR-DE | 0.05 |
| IGR-DE vs. CR-DE | 0.58 |
| IGR-DE vs. GIND-DE | 0.02 |
| IGR-DE vs. RELIEF-DE | 0.02 |
| CHSQR-DE vs. CR-DE | 0.29 |
| CHSQR-DE vs. GIND-DE | 0.47 |
| CHSQR-DE vs. RELIEF-DE | 0.31 |
| CR-DE vs. GIND-DE | 0.16 |
| CR-DE vs. RELIEF-DE | 0.12 |
| GIND-DE vs. RELIEF-DE | 0.73 |
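Each p-value compares two methods' accuracies paired across the 20 dataset-classifier cells of Table 10. The sketch below illustrates the computation with a paired t-test, one common choice for such comparisons, which reproduces the 0.07 reported for IG-DE vs. IGR-DE:

```python
# Sketch: pairwise p-value between two proposed methods, pairing their
# accuracies across the 20 dataset x classifier cells of Table 10.
from scipy import stats

# Accuracy columns from Table 10
# (Brain, CNS, Lung, Breast; each in KNN, NB, DT, RF, SVM order):
ig_de  = [92, 100, 92, 92, 92,   94, 100, 94, 89, 94,
          97, 97, 97, 93, 98,    80, 53, 87, 70, 73]
igr_de = [92, 100, 100, 92, 92,  72, 94, 94, 83, 78,
          95, 92, 98, 95, 97,    70, 53, 87, 70, 73]

t_stat, p = stats.ttest_rel(ig_de, igr_de)
print(f"p-value = {p:.2f}")   # ~0.07, as in Table 12
```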
Table 13. Comparison of the p-value of the proposed methods against previous works.

| Proposed Method | Previous Work | p-Value |
|---|---|---|
| IG-DE | IG-GA [23] | 0.75 |
| | IGR-GA [23] | 0.59 |
| | Fusion [29] | 0.68 |
| | CS-GA [23] | 0.98 |
| | GI-SVM-RFE [26] | 0.11 |
| | PCC-GA [35] | 0.09 |
| | PCC-BPSO [35] | 0.11 |
| IGR-DE | IG-GA [23] | 0.13 |
| | IGR-GA [23] | 1.0 |
| | Fusion [29] | 0.10 |
| | CS-GA [23] | 0.40 |
| | GI-SVM-RFE [26] | 0.006 |
| | PCC-GA [35] | 0.01 |
| | PCC-BPSO [35] | 0.11 |
| CHSQR-DE | IG-GA [23] | 0.54 |
| | IGR-GA [23] | 0.45 |
| | Fusion [29] | 0.56 |
| | CS-GA [23] | 0.34 |
| | GI-SVM-RFE [26] | 0.21 |
| | PCC-GA [35] | 0.02 |
| | PCC-BPSO [35] | 0.03 |
| CR-DE | IG-GA [23] | 0.23 |
| | IGR-GA [23] | 0.27 |
| | Fusion [29] | 0.84 |
| | CS-GA [23] | 0.29 |
| | GI-SVM-RFE [26] | 0.46 |
| | PCC-GA [35] | 0.009 |
| | PCC-BPSO [35] | 0.02 |
| GIND-DE | IG-GA [23] | 0.84 |
| | IGR-GA [23] | 0.59 |
| | Fusion [29] | 0.56 |
| | CS-GA [23] | 0.75 |
| | GI-SVM-RFE [26] | 0.05 |
| | PCC-GA [35] | 0.07 |
| | PCC-BPSO [35] | 0.09 |
| RELIEF-DE | IG-GA [23] | 0.64 |
| | IGR-GA [23] | 0.49 |
| | Fusion [29] | 0.56 |
| | CS-GA [23] | 0.59 |
| | GI-SVM-RFE [26] | 0.06 |
| | PCC-GA [35] | 0.05 |
| | PCC-BPSO [35] | 0.05 |
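Read against the conventional α = 0.05 threshold, only a handful of these comparisons are statistically significant, most notably IGR-DE versus GI-SVM-RFE (p = 0.006) and PCC-GA (p = 0.01), and CR-DE versus PCC-GA (p = 0.009) and PCC-BPSO (p = 0.02); the large majority of differences between the proposed methods and the previous works do not reach significance.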