1. Introduction
The liver is an essential organ with a weight of nearly 1.5 kg and making up 2% of the mass of the whole body. The liver performs significant life-sustaining functions. The physiological roles of the liver are divided into those of hepatic sinusoids and hepatocytes. The liver is called the chemical plant of the body [
1]. According to a report by the WHO, 62.6% of deaths are due to liver diseases, of which 54.3% are due to cirrhosis; more than two-thirds of the world’s problems with the disease acute hepatitis occur in this particular area [
2]. According to a WHO report [
3], in the region of Asia, cirrhosis is the prominent cause of death due to liver diseases. In 2016, nearly 399,000 people died due to different liver diseases, e.g., cirrhosis, hepatocellular carcinoma, and hepatitis C. The WHO has made a strategy to declare liver disease a public health problem.
The last several decades have shown that abnormalities such as liver abscesses are sometimes severe and are subject to substantial epidemiological changes and risk factors [
4]. The most common liver abscesses are caused by blood infections, abdominal infections, infections due to injury, and bacterial or parasitic infections. To avoid complications in untreated patients, it is essential to recognize the abscesses’ severity, diagnosis, and treatment. The significant sources of pyogenic liver abscesses are biliary tract diseases [
5]. Hypoxia, ischemia, drug exposure, and infection cause damage to the liver. However, damaged cells cannot recover after a specific limit, leading to permanent damage. Sometimes, this generates scar tissue or non-functioning cells in the liver, which is called fibrosis [
6]. Until the 1970s, liver fibrosis and cirrhosis were considered irreparable. Cirrhosis was first discussed in the literature in 1819 by Laennec as a primary cancer that could be seen in the liver by its colors [
7]. Liver cirrhosis is the next stage of fibrosis caused by liver diseases such as liver injury, swelling, abnormal growth of non-functioning cells, and angiogenesis [
8]. Secondary liver cancer or a metastatic hepatic tumor is more noticeable than a primary tumor. About 70% to 80% of metastatic liver cancer cases remain limited to the liver [
9]. Liver metastases typically hypo-attenuate in un-enhanced CT. If there is concomitant hepatic steatosis, the lesions may be isolated or slightly hyper-attenuated.
Necrosis, also known as cell death or the death of bodily tissues, is when viable cells become nonviable, resulting in a suspension of the cell contents. This occurs when too little blood is supplied to the tissues. It is an irreversible process caused by injury, radiation, or chemical effects. Necrosis is found in chronic liver disease, and with the persistence of its underlying cause, it is followed by a progressive disease—fibrosis [
3]. Thus, the extent of necrosis is a part of the information collected from a biopsy. The extent of necrosis ranges from individual cells to massive hepatic cases. The pathologist’s role is to evaluate the pattern of tumors in the context of some morphological changes in order to suggest one or more possible underlying causes [
10]. In other cases, most of the liver’s vascular disorders are uncommon, except for portal vein thrombosis (PVT). Most patients can easily establish PVT diagnoses through a process called noninvasive imaging [
11]. After liver cirrhosis, the second leading cause of portal hypertension is PVT. Liver cirrhosis, hepatobiliary malignancy, inflammatory diseases, and thrombophilic disorders, including myeloproliferative diseases, are intensely concomitant with PVT development [
7].
In combination, homogeneous and heterogeneous patterns or frequency levels make a texture. Texture can be visualized, but it is more concrete to explain its statistical parameters. Liver texture demonstrates a similar nature. The micro-changes in intensity levels in a liver’s texture on the inside and along boundaries with other organs can produce severe problems in discriminating liver abnormalities [
8].
We searched Web of Science, the Google Scholar database, PubMed, and EMBASE to identify relevant articles published to date on liver disease diagnosis. We used keywords such as liver disease diagnosis using machine learning, texture analysis, computed tomography, computer-aided diagnosis, etc. We noticed early research work on the topic with limited datasets and methodologies. Most of these datasets are also not publicly available. For this reason, we collected our own dataset consisting of around 3000 CT images, which is the main contribution of this paper. Some relevant articles that we found on the topic are listed in [
12,
13,
14,
15,
16,
17,
18].
We noticed that the primary task in liver diagnosis and treatment is to visualize the texture of the patient. However, it is impossible to read all of the patient’s information in an image with a naked eye. Briefly, there can be many errors in the information collected for diagnoses from patients’ images due to homogeneity and heterogeneity in texture patterns. Different statistical methods and mathematical techniques can be adopted to attain accuracy in diagnostic imaging, and there are also various ways to extract texture features. The Haralick texture features include the Sum Average, Angular Second Moment, Sum Variance, Difference Variance, Inverse Difference Moment, Difference Entropy, and Information Measures of Correlation, Contrast, and Correlation. A Gray-Level Co-Occurrence Matrix (GLCM) was constructed in the proposed work. A GLCM contains second-order statistics for retrieving pixels’ information from a gray-level distribution within the ROIs, as introduced by Haralick et al. [
19].
The research work presented in this paper targets the assessment of the effectiveness of texture analysis through machine learning for differentiation among five different classes. We propose two methods for liver disease recognition using deep convolutional networks and conventional machine learning methods. Only two or three texture models of liver images have been considered to date. Hence, in our present article, we consider the quantitative evaluation of seven different kinds of texture models for liver CT images. We extracted 37 features from the models and then provided specially mined features for different feature selections. We also used the feature-ranking method to investigate the best settings.
2. Related Work
Liver disease classification using ML is not a new research area. Many good papers on the topic have been published by researchers. We searched Web of Science, the Google Scholar database, PubMed, and EMBASE to identify relevant articles published to date on liver disease diagnosis. We used keywords such as liver disease diagnosis using machine learning, texture analysis, computed tomography, computer-aided diagnosis, etc. We noticed early research work on the topic with limited datasets and methodologies. Most of these datasets are also not publicly available. For this reason, we collected our own dataset consisting of around 3000 CT images, which is the main contribution of this paper. Some relevant articles that we found on the topic are listed in [
12,
13,
14,
15,
16,
17,
18].
ML experts have explored neural networks, SVM, decision trees (DT), and other ML models to investigate this problem. This part of the paper discusses some of these methods that explore liver disease classification. The authors of [
20] implemented methods such as J48, NB, random tree (RT), and K-star. Another paper [
21] used algorithms such as logistic regression, RT, and SVM for classification. A back-propagation network (BPN) combined with a multilayer feed-forward deep neural network (MLFFDNN) was utilized by [
22]. XGBoost was used to estimate liver disease data, and the authors used L1 and L2 [
23] during their work. An imbalance in ILPD was handled through a specific method called the minority oversampling algorithm. The performance was assessed for balanced and unbalanced databases with SVM and KNN [
24].
Another method that explored particle swarm optimization (PSO) combined with SVM for feature selection was investigated in [
25]. Comparatively good results were reported with the use of an SVM classifier [
25]. Heuristic and nature-inspired meta-heuristic optimization algorithms (MHOAs) improved the performance of the method. The classification accuracy was furhter improved with methods such as grid search and the Nelder–Mead method [
26].
The Grasshopper [
27] and Firefly algorithms [
28] improved the accuracy in combination with SVM. However, these algorithms were still found to have some limitations. Another method that combined the Crow Search Optimization Algorithm (CSA) with SVM was reported in [
29]. According to the authors of the paper, the method proposed in [
29] maintained a balance between exploitation and exploration. The method proposed in [
29] had two parameters. The method was fast and straightforward to implement.
3. Liver CT Image Database
The significant contribution of the proposed paper is the collection of a database of liver images with normal and infected data. Very few tiny datasets have been reported in the literature regarding liver disease classification through machine learning. For this research, clinically verified CT imaging data were collected from the Bahawal Victoria Hospital, Bahawalpur, Pakistan. The CT machine used X-rays to acquire images with both spiral and sequential methods. Axial reconstruction minimized the averaging of the lesions as far as partial volume was concerned. Similarly, it also allowed hepatic enhancement scanning in three phases: the portal, arterial, and delayed phases. Moreover, issues regarding registration were minimized due to a process called single-hold breath.
A total of 3000 samples of 71 patients were selected and studied after discussion with the expert radiologists involved in the management. We kept the size of each image at 512 × 512. The scanner on which we obtained images was a 128-slice scanner with a 12-bit depth and a slice thickness varying in the range of 0.6–1 mm. We acquired images in the arterial, delayed, and portal venous phases. We studied five categories of liver images, namely, infected liver data, liver metastasis, tumor necrosis, vascular disorder, and normal liver images. A complete discussion was set up with the doctors’ and radiologists’ teams in each step. The dataset will be publicly available for research purposes after the publication of our paper.
The inclusion criteria were the collection of data from infected patients with hepatitis B and C, a metastatic tumor (secondary tumor), tumor necrosis, or vascular disorder because of the availability of said data types. Patients on ventilators and renal function tests or who were children were excluded from the current study. Due to the low socioeconomic status of the area, biopsy to the confirm clinical data was impossible for the patients. The gold standard for the final diagnosis was serum alpha-fetoprotein and triphasic multidetector computed tomography of the liver, in which non-ionic intravenous ultravist contrast was used to enhance the diseased pattern.
Table 1 shows the demographic data of the patients selected for the study.
Figure 1 and
Figure 2 show some images of our collected database.
Figure 1 shows images with some abnormalities at some stages.
Figure 2 compares normal case images with infected data.