Next Article in Journal
Applying MRI Intensity Normalization on Non-Bone Tissues to Facilitate Pseudo-CT Synthesis from MRI
Next Article in Special Issue
A [68Ga]Ga-DOTANOC PET/CT Radiomic Model for Non-Invasive Prediction of Tumour Grade in Pancreatic Neuroendocrine Tumours
Previous Article in Journal
The Role of Left-Atrial Mechanics Assessed by Two-Dimensional Speckle-Tracking Echocardiography to Differentiate Hypertrophic Cardiomyopathy from Hypertensive Left-Ventricular Hypertrophy
Previous Article in Special Issue
The Heterogeneity of Skewness in T2W-Based Radiomics Predicts the Response to Neoadjuvant Chemoradiotherapy in Locally Advanced Rectal Cancer
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Radiomics and Artificial Intelligence Analysis with Textural Metrics Extracted by Contrast-Enhanced Mammography in the Breast Lesions Classification

1
Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy
2
Department of Electrical Engineering and Information Technologies, Università degli Studi di Napoli Federico II, 80125 Naples, Italy
3
Senology Surgical Division, Istituto Nazionale Tumori–IRCCS-Fondazione G. Pascale, 80131 Naples, Italy
4
Pathology Division, Istituto Nazionale Tumori–IRCCS-Fondazione G. Pascale, 80131 Naples, Italy
*
Author to whom correspondence should be addressed.
Diagnostics 2021, 11(5), 815; https://doi.org/10.3390/diagnostics11050815
Submission received: 20 March 2021 / Revised: 26 April 2021 / Accepted: 27 April 2021 / Published: 30 April 2021
(This article belongs to the Special Issue Radiomics in Oncology)

Abstract

:
The aim of the study was to estimate the diagnostic accuracy of textural features extracted by dual-energy contrast-enhanced mammography (CEM) images, by carrying out univariate and multivariate statistical analyses including artificial intelligence approaches. In total, 80 patients with known breast lesion were enrolled in this prospective study according to regulations issued by the local Institutional Review Board. All patients underwent dual-energy CEM examination in both craniocaudally (CC) and double acquisition of mediolateral oblique (MLO) projections (early and late). The reference standard was pathology from a surgical specimen for malignant lesions and pathology from a surgical specimen or fine needle aspiration cytology, core or Tru-Cut needle biopsy, and vacuum assisted breast biopsy for benign lesions. In total, 104 samples of 80 patients were analyzed. Furthermore, 48 textural parameters were extracted by manually segmenting regions of interest. Univariate and multivariate approaches were performed: non-parametric Wilcoxon–Mann–Whitney test; receiver operating characteristic (ROC), linear classifier (LDA), decision tree (DT), k-nearest neighbors (KNN), artificial neural network (NNET), and support vector machine (SVM) were utilized. A balancing approach and feature selection methods were used. The univariate analysis showed low accuracy and area under the curve (AUC) for all considered features. Instead, in the multivariate textural analysis, the best performance considering the CC view (accuracy (ACC) = 0.75; AUC = 0.82) was reached with a DT trained with leave-one-out cross-variation (LOOCV) and balanced data (with adaptive synthetic (ADASYN) function) and a subset of three robust textural features (MAD, VARIANCE, and LRLGE). The best performance (ACC = 0.77; AUC = 0.83) considering the early-MLO view was reached with a NNET trained with LOOCV and balanced data (with ADASYN function) and a subset of ten robust features (MEAN, MAD, RANGE, IQR, VARIANCE, CORRELATION, RLV, COARSNESS, BUSYNESS, and STRENGTH). The best performance (ACC = 0.73; AUC = 0.82) considering the late-MLO view was reached with a NNET trained with LOOCV and balanced data (with ADASYN function) and a subset of eleven robust features (MODE, MEDIAN, RANGE, RLN, LRLGE, RLV, LZLGE, GLV_GLSZM, ZSV, COARSNESS, and BUSYNESS). Multivariate analyses using pattern recognition approaches, considering 144 textural features extracted from all three mammographic projections (CC, early MLO, and late MLO), optimized by adaptive synthetic sampling and feature selection operations obtained the best results (ACC = 0.87; AUC = 0.90) and showed the best performance in the discrimination of benign and malignant lesions.

1. Introduction

Breast cancer is the most common female disease in the word, diagnosed primarily in over-fifties women [1,2,3].
Mammography (MX), introduced in the 1960s, plays a pivotal role in cancer screening, detection, and follow-up, despite the availability of various other breast imaging modalities, such as ultrasound (US) and breast Magnetic Resonance Imaging (MRI), due to its properties and qualities [4,5].
Contrast-enhanced mammography (CEM) is a diagnostic technique that combines the advantages of standard full-field digital mammography (FFDM) with the intravenous administration of an iodinated contrast medium, highlighting neovascularity associated with actively growing malignancy. The use of contrast agents in cancer detection is based on the phenomenon that neoplasms induce angiogenesis for further tumor growth. Contrast medium can pass through the walls of new rapidly formed vessels into the (tumor) interstitium, causing enhancement.
In addition, dual-energy CEM is a quick and well-tolerated examination that eliminates breast density as a limiting factor when interpreting two-dimensional (2D) mammograms by utilizing a dual-energy acquisition system, which then generates a subtracted image to outline areas of enhancement.
Previous studies have evaluated the sensitivity of CEM compared to conventional digital MX, US, and MRI [6,7,8]. CEM sensitivity has been reported in the range 90–100% [9], much higher in comparison to MX and US alone [8]. CEM allows to identify additional occult cancers at mammography, to assess more accurately the disease extent, and to guide surgical and treatment planning [8,9,10,11].
However, several studies have reported that CEM sensitivity increases compared to conventional MX at the expense of specificity that ranges in the interval of 58–70% [12]. This issue could be resolved using quantitative feature extracted by CEM images.
Recent significant advancements within the field of medical image analysis relies on the application of Artificial Intelligence (AI) methods for the processing of large quantities of data from different imaging modalities [13,14].
Radiomics is the process of extracting quantitative properties, named features, from images. This feature extraction activity is typically realized by means of pattern recognition algorithms and provides, as a result, a set of numbers, each one representing a quantitative description of a specific either geometrical or physical property of the image portion under consideration. In oncological applications, examples of features are tumor size, shape, intensity, and texture, collectively providing a comprehensive tumor characterization, called the radiomics signature of the tumor [15,16,17].
The texture features analysis characterize intensity histogram and the relationships between different intensity levels within lesion; moreover, it describe spatial and spectral frequency patterns [18,19,20].
Several authors proposed radiomics analysis for breast cancer detection and classification, based on lesion heterogeneity information (textural features) and opportune classifiers [21,22,23,24,25,26,27,28,29,30,31,32].
This work has the objective to estimate the diagnostic accuracy of textural features extracted by dual-energy CEM images by carrying out univariate and multivariate statistical analyses, using artificial intelligence approaches in the classification of benign and malignant breast lesions.

2. Methods

2.1. Patient Selection

From October 2017 to April 2018, 80 consecutive patients with known unifocal and or multifocal/multicentric breast lesions (mean age ± standard deviation of 50.7 ± 11.6 years (range 26–78)) were enrolled in this prospective study according to regulations issued by the local Institutional Review Board. All women gave their written informed consent for research purposes. Inclusion criteria: patient with known breast lesions, histologically proven, and patient underwent dual-energy CEM examination in both craniocaudal (CC) and mediolateral oblique (MLO) projections.
Exclusion criteria were breast implants, pregnancy or possible pregnancy, a contraindication to the intravenous use of iodinated or gadolinium-chelated contrast agent (e.g., allergy to either agent or severely impaired renal function), inability to keep upright immobility during the examination, neoadjuvant chemotherapy treatment, hormone treatment (the most common forms of hormone therapy for breast cancer work by blocking estrogen hormones from attaching to receptors on cancer cells or by decreasing the body’s production of hormones) or radiation therapy at the time of imaging.
Overall, 104 suspected breast lesions of 80 enrolled patients were analyzed. A total of 19 patients were excluded: 6 for contraindication to the intravenous use of iodinated or gadolinium-chelated contrast agent; 4 for neoadjuvant chemotherapy treatment; 4 for breast implants; 2 for inability to keep upright immobility during the examination; and 3 for hormone treatment or radiation therapy.

2.2. Imaging Protocol

All CEM examinations were performed with a dual-energy mammography system (Hologic’s Selenia® Dimensions® Unit, Bedford, MA, USA). Approximately two minutes after an intravenous injection of 1.5 mL/kg body weight of iodinated contrast medium (Visipaque 320; GE Healthcare, Inc, Princeton, NJ, USA) at a rate of 2–3 mL/s, each woman was properly positioned in a CC view.
Two images were obtained per each breast through a low-energy exposure (26–31 kVp, according to breast thickness and density) and a high-energy exposure (45–49 kVp), straddling the k-edge of iodine. The low-energy image served as the equivalent of a two-dimensional FFDM examination, which was previously demonstrated [33,34,35]. The high-energy image is not diagnostic and is used for post-processing purposes. The total X-ray dose delivered to the patient for a pair of low- and high-energy images was estimated to be between 0.7 and 3.6 mGy, depending on breast thickness (30 to 80 mm) and tissue composition (0 to 100% glandular tissue); it is approximately 1.2-times the dose delivered in a standard single-view digital mammography [33,34,35].
The two previously acquired images are then digitally subtracted of each other to produce a resultant recombined image that highlights contrast enhancement uptake area and gives functional information [10], as it can be seen in Figure 1. Four and eight minutes after contrast agent administration, each breast were compressed in the MLO view: early-MLO and late-MLO view images were respectively acquired (Figure 1).

2.3. Histopathological Analysis

The reference standard was histopathologic examination. Overall, 104 samples of 80 patients were analyzed histopathologically: 65 surgical specimens for malignant lesions and 39 samples (surgical specimen or fine needle aspiration cytology (FNAC), core or Tru-Cut needle biopsy, and vacuum assisted breast biopsy (VABB)) for benign lesions. Histopathological examination was performed before CEM examination. This procedure did not change the shape of the lesion and therefore of the segmentation of the lesions.
Tumor stages were classified according to the system implemented by the American Joint Committee on Cancer staging. Ductal carcinoma in situ and invasive cancer tumors were counted as malignant lesions including, also, the papillary breast cancer that is a very rare type of invasive ductal breast cancer that accounts for fewer than 1% of all breast cancers.
All other results, including fibroadenoma, ductal hyperplasia, dysplasia, cysts, fibrosis and adenosis, were considered non-malignant lesions (Table 1).

2.4. Image Processing

Two expert radiologists, with 22 and 15 years of experience respectively, performed segmentation by drawing manually the regions of interest (ROIs) first separately and then together and in accordance with each other.
The margins of the breast lesions were defined on dual-energy subtracted (DES) images, where contrast uptake was emphasized, both in CC and in early and late MLO view. An example is shown in Figure 2.
DES images of each woman were processed in MATLAB (The MathWorks, Inc., Natik, MA, USA [36]) and for each ROI radiomics textural features were extracted. They provide information in the spatial distribution of intensity levels in a neighborhood.
The Texture Toolbox of MATLAB® realized by Vallières et al. [37], which includes 48 parameters, calculated according to the Image Biomarker Standardization Initiative [38], was considered. The package implements wavelet band-pass filtering, isotropic resampling, discretization length corrections and different quantization tools. The toolbox can be downloaded at https://it.mathworks.com/matlabcentral/fileexchange/51948-radiomics (accessed on 15 April 2019).
The textural features (Table 2) include both first-order features and second-order ones; an extra detailed description for each one has been provided in the Appendix A.

2.5. Statistical Analysis

The statistical analysis included univariate and multivariate approaches and were performed with the RStudio software [39].

2.5.1. Univariate Analysis

The intra-class correlation coefficient (ICC) was calculated to assess the robustness of manual segmentation for the radiomics features obtained considering the two volumes of interest segmented separately by two expert radiologists.
For two-groups comparisons, we used the non-parametric Wilcoxon–Mann–Whitney test for continuous variables. Receiver operating characteristic (ROC) analysis was performed.
To individuate the optimal cut-off value for each feature the Youden index was calculated. Area under ROC curve (AUC), sensitivity (SENS), specificity (SPEC), positive predictive value (PPV), negative predictive value (NPV), and accuracy (ACC) were obtained considering the optimal cut-off values identified maximizing the Youden index [40].
A p-value < 0.05 was considered as significant. However, false discovery rate (FDR) adjustment according to Benjamini and Hochberg for multiple testing was considered.

2.5.2. Multivariate Analysis

Multivariate analysis was carried out following the pattern recognition model and using linear classifier (linear discriminant analysis—LDA), decision tree (TREE), k-nearest neighbours (KNN), artificial neural network (NNET), and support vector machine (SVM) to assess the diagnostic accuracy using all extracted metrics of textural parameters. In recent years, several authors have deepened the subject in their work [18,41,42,43,44]. Moreover, a brief description of each classifier has already been discussed in a previous article [14].
Configuration settings for each classifier are provided in Table 3.
The classification analysis has been cross-validated using 10-fold cross validation (10-fold CV) and the Leave-one-out cross validation (LOOCV) approaches and median values of AUC, accuracy, sensitivity, specificity, PPV e NPV were obtained.
Each classifier received the same set or subset of robust features, identified by a feature selection with the least absolute shrinkage and selection operator (LASSO) method [45]. In the LASSO method, 10-fold cross-validation was used to select the optimal regularization parameter alpha, as the average of mean square error of each patient was the smallest. With the optimal alpha, features with a non-zero coefficient in LASSO were reserved. Note that the shrinkage requires the selection of a tuning parameter (lambda) that determines the amount of penalization. Feature selection was carried out considering both the λ value with the minimum Mean Squared Error (minMSE) and the largest λ value within one standard error of it (1SE) [46].
In addition, the presence of less-represented classes has led the use of two techniques to synthesize data and, therefore, to help balance the classes (malignant and benign) giving the possibility to better train the considered classifiers: the self-adaptive synthetic over-sampling (SASYNO) approach and the adaptive synthetic sampling (ADASYN) approach. This allowed to boost the performance both overall and in terms of confusion matrix [47,48,49,50,51].
A brief informal description of LASSO method parameters and of SASYNO and ADASYN functions has already been discussed previously [52].
The best model was chosen considering the highest area under the ROC curve and highest accuracy.

3. Results

The ICC for radiomics textural features was excellent (median value 0.92, range 0.87–0.98), for all extracted features by DCE-MRI.
Table 4 reports the diagnostic accuracy of textural parameters for CC and early and late MLO view in terms of AUC and p-value.
Figure 3, Figure 4 and Figure 5 show ROC curves trend of significant textural features: MAD, IQR, VARIANCE, COARSNESS, BUSYNESS, and STRENGHT for CC projection, VARIANCE, BUSYNESS and STRENGHT for early-MLO projection and GLV_RLRLM, GLV_GLSZM, COARSNESS, BUSYNESS, and STRENGHT for late-MLO projection.
Considering these results, the univariate analysis showed low accuracy and area under the curve for all considered features in the differentiation of benign and malignant lesions.
Figure 6, Figure 7 and Figure 8 show the boxplots, related to above-mentioned parameters, to separate benign from malignant lesions.
As regard multivariate analysis, only the most useful results for the purposes of this work will be reported. Table 5, Table 6, Table 7 and Table 8 report the performance achieved by best classifiers to discriminate benign from malignant lesions.
The best performance considering the CC view (ACC = 0.75; SENS = 0.75; SPEC = 0.74; PPV = 0.75; NPV = 0.74; AUC = 0.82) was reached with a DT trained with LOOCV and balanced data (with ADASYN function) and a subset of three features (by LASSO and λminMSE). The subset of three robust textural features includes MAD, VARIANCE and LRLGE.
The best performance considering the early-MLO (ACC = 0.77; SENS = 0.80; SPEC = 0.73; PPV = 0.75; NPV = 0.78; AUC = 0.83) was reached with a NNET trained with LOOCV and balanced data (with ADASYN function) and a subset of ten features (by LASSO and λminMSE). The subset of ten robust textural features includes MEAN, MAD, RANGE, IQR, VARIANCE, CORRELATION, RLV, COARSNESS, BUSYNESS, and STRENGTH.
The best performance considering the late-MLO (ACC = 0.73; SENS = 0.74; SPEC = 0.71; PPV = 0.71; NPV = 0.75; AUC = 0.82) was reached with an NNET trained with LOOCV and balanced data (with ADASYN function) and a subset of eleven features (by LASSO). The subset of eleven robust textural features includes MODE, MEDIAN, RANGE, RLN, LRLGE, RLV, LZLGE, GLV_GLSZM, ZSV, COARSNESS, and BUSYNESS.
However, the best result (ACC = 0.87; SENS = 0.86; SPEC = 0.87; PPV = 0.88; NPV = 0.86; AUC = 0.90), were obtained considering 144 textural features extracted from all three mammographic projections (CC, early MLO, and late MLO) at the same time with an SVM trained with LOOCV and with balanced data (with ADASYN function).
Figure 9 shows the ROC curve of the best classifiers.

4. Discussion and Conclusions

Numerous studies have demonstrated increased sensitivity and specificity of CEM compared MX, FFDM, Ultrasound, or Digital Breast Tomosynthesis (DBT). In addition, CEM rivals the sensitivity of more costly and time-consuming examinations, such as MRI and molecular breast imaging [9,11,14,53,54,55,56,57,58,59].
Studies of CEM focused on imaging analysis with an auxiliary system for reporting, and on radiomics features for benignant and malignant differentiation [60,61].
The radiomics analysis of tumor features extracted from CEM images combined with qualitative and quantitative information on morphology and functionality represents an important tool for breast tumor characterization [60,61,62,63]. A summary data was provided in the Table 9 in order to report the main findings using radiomic and artificial intelligence analysis on CEM.
Marino et al. [62] investigated the potential of CEM and radiomics analysis for the noninvasive differentiation of breast cancer invasiveness, hormone receptor status, and tumor grade. Radiomics features were derived from first-order histogram (HIS), co-occurrence matrix (COM), run length matrix (RLM), absolute gradient, autoregressive model, the discrete Haar wavelet transform (WAV), and lesion geometry. They reported that radiomics analysis with CEM has potential for noninvasive differentiation of tumors with different degrees of invasiveness, hormone receptor status, and tumor grade.
La Forgia et al. [63] evaluated radiomics features for predicting histological outcome and two cancer molecular subtypes, namely, Human Epidermal growth factor Receptor 2 (HER2)-positive and triple-negative suggesting an interesting role for radiomics in CEM to predict histological outcomes and particular tumors’ molecular subtype.
In this study, we aimed to evaluate radiomics analysis with texture features extracted by dual-energy CEM images in the classification of malignant and benign breast. We performed the evaluation considering both a univariate analysis and a multivariate analysis using pattern recognition approaches.
The univariate analysis on features extracted from CC view shows statistically positive results for MAD (AUC = 0.72), IQR (AUC = 0.71) and VARIANCE (AUC = 0.72) among first order gray-level statistics, in addition to COARSNESS, BUSYNESS and STRENGTH as regards Neighborhood Gray-Tone Difference Matrix. Considering these results, the univariate analysis showed low accuracy and area under the curve for all considered features in the differentiation of benign and malignant lesions.
As regards multivariate analysis, using the unbalanced dataset showed no significant results; in particular, specificity assumed low values. After a balancing operation (with ADASYN function) and a features selection operation (with LASSO approach) higher values of accuracy, specificity and AUC were measured. A TREE trained with balanced data and a subset of three robust textural predictors (MAD, VARIANCE and LRLGE) achieved the best performance with an accuracy of 0.75 and an AUC of 0.82.
As regards the early MLO view univariate analysis, higher AUC values occurred in correspondence with VARIANCE, BUSYNESS and STRENGTH (AUC = 0.70 in all three cases). With the multivariate analysis, a balancing operation (both with the ADASYN function and the SASYNO function) and a features selection operation allowed to reach good improvements. The best performance (accuracy of 0.77 and AUC of 0.83) was obtained with a NNET trained with balanced data (with ADASYN function) and a subset of robust textural features including MEAN, MAD, RANGE, IQR, VARIANCE, CORRELATION, RLV, COARSNESS, BUSYNESS, and STRENGTH.
With the univariate analysis on features extracted from late MLO view, significant results were measures in correspondence of COARSNESS (AUC = 0.74), BUSYNESS (AUC = 0.71), and STRENGTH (AUC = 0.70), in addition to GLV as regards both the Gray-Level Run-Length Matrix and the Gray-Level Size Zone Matrix (AUC values of 0.70 in either case).
As for the multivariate analysis, the best performance (accuracy of 0.73 and AUC of 0.82) using late MLO features was reached with an NNET trained with balanced data (with ADASYN function) and a subset of robust textural features including MODE, MEDIAN, RANGE, RLN, LRLGE, RLV, LZLGE, GLV_GLSZM, ZSV, COARSNESS and BUSYNESS.
Presently, the determination of the breast malignancies is performed with biopsy. Unfortunately, this examination is invasive and then, there is great interest in alternative, non-invasive and cheaper methods to derive the same information directly from the radiologic images. Another important observation of automated systems for radiomics analysis is that these are not affected by factors such as breast density or background parenchymal enhancement (BPE), which may be limiting breast lesions differentiation. Consequently, these systems also preserve their reliability in more complex breast investigations. Therefore, also that textural analyses is again little used in clinical routine practice; in this context, the radiomics analysis of tumor textural features extracted from CEM images represent an important robust tool for breast tumor characterization.
The small cohort of studied patients represents an initial finding to validate by increasing the sample size of the study in the future. The segmentation of the ROIs slice by slice was manual, and this can be time-consuming, can be operator variable, can be difficult in the segmentation of multicentric lesions or in cases of background parenchymal enhancement. In this study is not reported a comparison of accuracy in the breast lesions detection of CEM respect to FFDM. The future endpoint is to include in the analysis an automatic segmentation of the lesions. The radiomics analysis did not consider tumor histological differences, while the integration of texture metrics combined with histopathology results may provide other important prognostic information for the classification of malignant breast lesions both in early and late phase. This could improve the performance in the classification problem and allow classifying breast lesions according grading and histotype. We did not evaluate the possible impact on texture values considering the different impact of the “void sign” after VAAB vs. post FNAC. This could be evaluated in a future study.
In conclusion, multivariate analyses using pattern recognition approaches optimized by adaptive synthetic sampling and feature selection operations obtained the best results to separate benign and malignant lesions. Overall, textural features extracted from CC view images, showed the best performance. However, the best results (ACC = 0.87; AUC = 0.90) were reached by considering all textural features extracted from dual-energy CEM images, using an SVM trained with balanced data. These findings represented a preliminary experience that should be validate with a larger sample size including patients by different centers.

Author Contributions

Formal analysis, R.F., A.P. (Adele Piccirillo), M.S., V.G., M.R.R., and M.L.B.; Investigation, R.F., T.P., P.V., R.D.G., E.E., M.D.B., and A.P. (Antonella Petrillo); Methodology, R.F., V.G., M.R.R., T.P., M.L.B., P.V., R.D.G., E.E., M.D.B., and A.P. (Antonella Petrillo); Writing—original draft, R.F. and A.P. (Adele Piccirillo). All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This prospective study was performed according to regulations issued by the local Institutional Review Board.

Informed Consent Statement

All women gave their written informed consent for research purposes.

Data Availability Statement

All data are reported in the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Definition of Textural Features

First-Order Gray-Level Statistics
First order gray-level statistics describe the distribution of gray-values within the volume. Let X denote the 3-D image matrix with N voxels, P the first order histogram, P(i) the fraction of voxels with intensity level i and Nl the number of discrete intensity levels.
  • Mean, the mean gray-level of X .
    m e a n = 1 N i = 1 N X ( i )
  • Mode, the most frequent element(s) of array X .
  • Median, the sample median of X , or the 50th percentile of X .
  • Standard deviation (STD)
    S T D = ( 1 N 1 i = 1 N ( X ( i ) X ¯ ) 2 ) 1 / 2
  • Mean Absolute Deviation (MAD), the mean of the absolute deviation of all voxel intensities around the mean intensity value.
    M A D = 1 N i = 1 N | X ( i ) X ¯ |
  • Range, the range of intensity values of X .
    r a n g e = max ( X ) min ( X )
    where max ( X ) is the maximum intensity value of X and min ( X ) is the minimum intensity value of X .
  • Interquartile range (IQR), the interquartile range is defined as the 75th minus the 25th percentile of X .
  • Kurtosis
    k u r t o s i s = 1 N i = 1 N ( X ( i ) X ¯ ) 4 ( 1 N i = 1 N ( X ( i ) X ¯ ) 2 ) 2
    where X ¯ is the mean of X .
  • Variance, Variance is the square of the standard deviation.
    v a r i a n c e = 1 N 1 i = 1 N ( X ( i ) X ¯ ) 2
    where X ¯ is the mean of X .
  • Skewness
    s k e w n e s s = 1 N i = 1 N ( X ( i ) X ¯ ) 3 ( 1 N i = 1 N ( X ( i ) X ¯ ) 2 ) 3
    where X ¯ is the mean of X .
Gray-Level Co-Occurrence Matrix (GLCM)
A normalized GLCM is defined as P ( i , j ; δ , α ) , a metric with size N g × N g describing the second-order joint probability function of an image, where the ( i , j ) th element represents the number of times the combination of intensity levels i and j occur in two pixels in the image, that are separated by a distance of δ pixels in direction α , and N g is the maximum discrete intensity level in the image. Let:
-
P ( i , j ) be the normalized (i.e., P ( i , j ) = 1 ) co-occurrence matrix, generalized for any δ and α,
-
p x ( i ) = j = 1 N g P ( i , j ) ,
-
p y ( j ) = i = 1 N g P ( i , j ) ,
-
μ x be the mean of p x , where μ x = i = 1 N g j = 1 N g i P ( i , j ) ,
-
μ y be the mean of p y , where μ y = i = 1 N g j = 1 N g j P ( i , j ) ,
-
σ x be the standard deviation of p x , where σ x = i = 1 N g j = 1 N g P ( i , j ) ( i μ x ) 2 ,
-
σ y be the standard deviation of p y , where σ y = i = 1 N g j = 1 N g P ( i , j ) ( j μ y ) 2 .
  • Energy
    e n e r g y = i = 1 N g j = 1 N g [ P ( i , j ) ] 2
  • Contrast
    c o n t r a s t = i = 1 N g j = 1 N g | i j | 2 P ( i , j )
  • Entropy
    e n t r o p y = i = 1 N g j = 1 N g P ( i , j ) l o g 2 [ P ( i , j ) ]
  • Homogeneity
    h o m o g e n e i t y = i = 1 N g j = 1 N g P ( i , j ) 1 + | i j |  
  • Correlation
    c o r r e l a t i o n = i = 1 N g j = 1 N g i j P ( i , j ) μ x μ y σ x σ y
  • Sum Average
    s u m   a v e r a g e = 1 N g × N g i = 1 N g j = 1 N g [ i P ( i , j ) + j P ( i , j ) ]
  • Dissimilarity
    d i s s i m i l a r i t y = i = 1 N g j = 1 N g | i j | P ( i , j )  
  • Autocorrelation
    a u t o c o r r e l a t i o n = i = 1 N g j = 1 N g i j P ( i , j )  
Gray-Level Run-Length Matrix (GLRLM)
Run length metrics quantify gray level runs in an image. A gray level run is defined as the length in number of pixels, of consecutive pixels that have the same gray level value. In a gray level run length matrix p ( i , j | θ ) , the ( i , j ) th element describes the number of times j a gray level i appears consecutively in the direction specified by θ. Let:
-
p ( i , j ) be the ( i , j ) th entry in the given run-length matrix p, generalized for any direction θ,
-
N g the number of discrete intensity values in the image,
-
N r the maximum run length,
-
N s the total numbers of runs, where N s = i = 1 N g j = 1 N r p ( i , j )
-
p r the sum distribution of the number of runs with run length j , where p r ( j ) = i = 1 N g p ( i , j ) ,
-
p g the sum distribution of the number of runs with run length i , where p g ( i ) = j = 1 N r p ( i , j ) ,
-
N p the number of voxels in the image, where N p = j = 1 N r j p r ,
-
μ r the mean run length, where μ r = i = 1 N g j = 1 N r j p n ( i , j ) ,
-
μ g the mean gray level, where μ g = i = 1 N g j = 1 N r i p n ( i , j ) .
  • Short Run Emphasis (SRE)
    S R E = j = 1 N r p r j 2
  • Long Run Emphasis (LRE)
    L R E = j = 1 N r j 2 p r
  • Gray-Level Nonuniformity (GLN)
    G L N = i = 1 N g p g 2
  • Run-Length Nonuniformity (RLN)
    R L N = j = 1 N r p r 2
  • Run Percentage (RP)
    R P = N s N p
  • Low Gray-Level Run Emphasis (LGRE)
    L G R E = i = 1 N g p g i 2
  • High Gray-Level Run Emphasis (HGRE)
    H G R E = i = 1 N g i 2 p g
  • Short Run Low Gray-Level Emphasis (SRLGE)
    S R L G E = i = 1 N g j = 1 N r p ( i , j ) i 2 j 2
  • Short Run High Gray-Level Emphasis (SRHGE)
    S R H G E = i = 1 N g j = 1 N r p ( i , j ) i 2 j 2
  • Long Run Low Gray-Level Emphasis (LRLGE)
    L R L G E = i = 1 N g j = 1 N r p ( i , j ) j 2 i 2
  • Long Run High Gray-Level Emphasis (LRHGE)
    L R H G E = i = 1 N g j = 1 N r p ( i , j ) i 2 j 2
  • Gray-Level Variance (GLV)
    G L V = 1 N g × N r   i = 1 N g j = 1 N r ( i p ( i , j ) μ g ) 2
  • Run-Length Variance (RLV)
    R L V = 1 N g × N r   i = 1 N g j = 1 N r ( j p ( i , j ) μ r ) 2
Gray-Level Size Zone Matrix (GLSZM)
A gray level size-zone matrix describes the amount of homogeneous connected areas within the volume, of a certain size and intensity. The ( i , j ) th entry of the GLSZM p ( i , j ) is the number of connected areas of gray-level (i.e., intensity value) i and size j. GLSZM features therefore describe homogeneous areas within the tumor volume, describing tumor heterogeneity at a regional scale [5]. Let:
-
p ( i , j ) be the ( i , j ) th entry in the given GLSZM p ,
-
N g the number of discrete intensity values in the image,
-
N z the size of the largest, homogeneous region in the volume of interest,
-
N s the total number of homogeneous regions (zones), where N s = i = 1 N g j = 1 N z p ( i , j )
-
p z the sum distribution of the number of zones with size j , where p z ( j ) = i = 1 N g p ( i , j ) ,
-
p g the sum distribution of the number of zones with gray level i , where p g ( i ) = j = 1 N z p ( i , j ) ,
-
N p the number of voxels in the image, where N p = j = 1 N z j p r ,
-
μ r the mean zone size, where μ r = i = 1 N g j = 1 N z j p ( i , j | θ ) ,
-
μ g the mean gray level, where μ g = i = 1 N g j = 1 N z i p ( i , j | θ ) .
  • Small Zone Emphasis (SZE)
    S Z E = j = 1 N z p z j 2
  • Large Zone Emphasis (LZE)
    L Z E = j = 1 N z j 2 p z
  • Gray-Level Non-uniformity (GLN)
    G L N = i = 1 N g p g 2
  • Zone-Size Non-uniformity (ZSN)
    Z S N = i = 1 N g p z 2
  • Zone Percentage (ZP)
    Z P = N s N p
  • Low Gray-Level Zone Emphasis (LGZE)
    L G Z E = i = 1 N g p g i 2
  • High Gray-Level Zone Emphasis (HGZE)
    H G Z E = i = 1 N g i 2 p g
  • Small Zone Low Gray-Level Emphasis (SZLGE)
    S Z L G E = i = 1 N g j = 1 N z p ( i , j ) i 2 j 2
  • Small Zone High Gray-Level Emphasis (SZHGE)
    S Z H G E = i = 1 N g j = 1 N z p ( i , j ) i 2 j 2
  • Large Zone Low Gray-Level Emphasis (LZLGE)
    L Z L G E = i = 1 N g j = 1 N z p ( i , j ) j 2 i 2
  • Large Zone High Gray-Level Emphasis (LZHGE)
    L Z H G E = i = 1 N g j = 1 N z p ( i , j ) j 2 i 2
  • Gray-Level Variance (GLV)
    G L V = 1 N g × N z   i = 1 N g j = 1 N z ( i p ( i , j ) μ g ) 2
  • Zone-Size Variance (ZSV)
    Z S V = 1 N g × N z   i = 1 N g j = 1 N z ( j p ( i , j ) μ z ) 2
Neighborhood Gray Tone Difference Matrix (NGTDM)
The ith entry of the NGTDM s ( i | d ) is the sum of gray level differences of voxels with intensity i and the average intensity A i of their neighboring voxels within a distance d. Let:
-
n i be the number of voxels with gray level i ,
-
N = n i , the total number of voxels,
-
s ( i ) = { n i | i A i |   f o r   n i > 0 0   o t h e r w i s e , generalized for any distance d ,
-
N g be the maximum discrete intensity level in the image,
-
p ( i ) = n i N , the probability of gray level i ,
-
N p , the total number of gray levels present in the image.
  • Coarseness
    c o a r s e n e s s = [ ε + n = 1 N g p ( i ) s ( i ) ] 1
    where ε is a small number to prevent coarseness becoming infinite.
  • Contrast
    c o n t r a s t = ( 1 N p ( 1 N p ) i = 1 N g j = 1 N g p ( i ) p ( j ) ( i j ) 2 ) ( 1 N i = 1 N g s ( i ) )
  • Busyness
    b u s y n e s s = i = 1 N g p ( i ) s ( i ) i = 1 N g j = 1 N g | i p ( i ) j p ( j ) | , p ( i ) 0 ,   p ( j ) 0  
  • Complexity
    c o m p l e x i t y = i = 1 N g j = 1 N g | i j | p ( i ) s ( i ) + p ( j ) s ( j ) N ( p ( i ) + p ( j ) ) , p ( i ) 0 ,   p ( j ) 0
  • Strength
    s t r e n g t h = i = 1 N g j = 1 N g [ p ( i ) + p ( j ) ] ( i j ) 2 ε + n = 1 N g s ( i ) , p ( i ) 0 ,   p ( j ) 0  
    where ε is a small number to prevent strength becoming infinite.

References

  1. American Cancer Society. Cancer Facts & Figures 2021; American Cancer Society Inc.: Atlanta, GA, USA, 2021; Available online: www.cancer.org/content/dam/cancer-org/research/cancer-facts-and-statistics/annual-cancer-facts-and-figures/2021/cancer-facts-and-figures-2021.pdf (accessed on 15 February 2021).
  2. American Cancer Society. How Common Is Breast Cancer? American Cancer Society Inc.: Atlanta, GA, USA, 2021; Available online: www.cancer.org/cancer/breast-cancer/about/how-common-is-breast-cancer.html (accessed on 15 February 2021).
  3. AIOM. I Numeri del Cancro in Italia; AIOM: Milano, Italy, 2020. [Google Scholar]
  4. Patel, B.K.; Lobbes, M.; Lewin, J. Contrast Enhanced Spectral Mammography: A Review. Semin. Ultrasound CT MRI 2018, 39, 70–79. [Google Scholar] [CrossRef] [PubMed]
  5. Heywang-Köbrunner, S.; Viehweg, P.; Heinig, A.; Küchler, C. Contrast-enhanced MRI of the breast: Accuracy, value, controversies, solutions. Eur. J. Radiol. 1997, 24, 94–108. [Google Scholar] [CrossRef]
  6. Dromain, C.; Balleyguier, C.; Muller, S.; Mathieu, M.C.; Rochard, F.; Opolon, P.; Sigal, R. Evaluation of tumor angiogenesis of breast carcinoma using contrast-enhanced digital mammography. AJR Am. J. Roentgenol 2006, 187, 528–537. [Google Scholar] [CrossRef] [PubMed]
  7. Dromain, C.; Balleyguier, C.; Adler, G.; Garbay, J.R.; Delalogeet, S. Contrast-enhanced digital mammography. Eur. J. Radiol. 2009, 69, 34–42. [Google Scholar] [CrossRef] [PubMed]
  8. Li, L.; Roth, R.; Germaine, P.; Ren, S.; Lee, M.; Hunter, K.; Tinney, E.; Liao, L. Contrast-enhanced spectral mammography (CESM) versus breast magnetic resonance imaging (MRI): A retrospective comparison in 66 breast lesions. Diagn. Interv. Imaging 2017, 98, 113–123. [Google Scholar] [CrossRef] [PubMed]
  9. Fallenberg, E.M.; Dromain, C.; Diekmann, F.; Engelken, F.; Krohn, M.; Singh, J.M.; Ingold-Heppner, B.; Winzer, K.J.; Bick, U.; Renz, D.M. Contrast-enhanced spectral mammography versus MRI: Initial results in the detection of breast cancer and assessment of tumour size. Eur. Radiol. 2014, 24, 256–264. [Google Scholar] [CrossRef]
  10. Lewin, J.M.; Isaacs, P.K.; Vance, V.; Larke, F.J. Dual-energy contrast-enhanced digital subtraction mammography: Feasibility. Radiology 2003, 229, 261–268. [Google Scholar] [CrossRef]
  11. Jochelson, M.S.; Dershaw, D.D.; Sung, J.S.; Heerdt, A.S.; Thornton, C.; Moskowitz, C.S.; Ferrara, J.; Morris, E.A. Bilateral contrast-enhanced dual-energy digital mammography: Feasibility and comparison with conventional digital mammography and MR imaging in women with known breast carcinoma. Radiology 2013, 266, 743–751. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Tagliafico, A.S.; Bignotti, B.; Rossi, F.; Signori, A.; Sormani, M.P.; Valdora, F.; Calabrese, M.; Houssami, N. Diagnostic performance of contrast-enhanced spectral mammography: Systematic review and meta-analysis. Breast 2016, 28, 13–19. [Google Scholar] [CrossRef] [PubMed]
  13. Liney, G.P.; Sreenivas, M.; Gibbs, P.; Garcia-Alvarez, R.; Turnbull, L.W. Breast lesion analysis of shape technique: Semiautomated vs. manual morphological description. J. Magn. Reason. Imaging 2006, 23, 493–498. [Google Scholar] [CrossRef] [PubMed]
  14. Fusco, R.; Sansone, M.; Filice, S.; Carone, G.; Amato, D.M.; Sansone, C.; Petrillo, A. Pattern Recognition Approaches for Breast Cancer DCE-MRI Classification: A Systematic Review. J. Med. Biol. Eng. 2016, 36, 449–459. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Nie, K.; Chen, J.H.; Yu, H.J.; Chu, Y.; Nalcioglu, O.; Su, M.Y. Quantitative analysis of lesion morphology and texture features for diagnostic prediction in breast MRI. Acad. Radiol. 2008, 15, 1513–1525. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Crivelli, P.; Ledda, R.E.; Parascandolo, N.; Fara, A.; Soro, D.; Conti, M. A New Challenge for Radiologists: Radiomics in Breast Cancer. BioMed Res. Int. 2018, 2018, 6120703. [Google Scholar] [CrossRef] [Green Version]
  17. Zheng, Y.; Englander, S.; Baloch, S.; Zacharaki, E.I.; Fan, Y.; Schnall, M.D.; Shen, D. STEP: Spatiotemporal enhancement pattern for MR-based breast tumor diagnosis. Med. Phys. 2009, 36, 3192–3204. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Zhang, Y.; Zhu, Y.; Zhang, K.; Liu, Y.; Cui, J.; Tao, J.; Wang, Y.; Wang, S. Invasive ductal breast cancer: Preoperative predict Ki-67 index based on radiomics of ADC maps. Radiol. Med. 2020, 125, 109–116. [Google Scholar] [CrossRef] [PubMed]
  19. Do Nascimento, M.Z.; Martins, A.S.; Neves, L.A.; Ramos, R.P.; Flores, E.L.; Carrijo, G.A. Classification of masses in mammographic image using wavelet domain features and polynomial classifier. Expert Syst. Appl. 2013, 40, 6213–6221. [Google Scholar] [CrossRef]
  20. Lambin, P.; Rios-Velazquez, E.; Leijenaar, R.; Carvalho, S.; van Stiphout, R.G.; Granton, P.; Zegers, C.M.; Gillies, R.; Boellard, R.; Dekker, A.; et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur. J. Cancer 2012, 48, 441–446. [Google Scholar] [CrossRef] [Green Version]
  21. Sinha, S.; Lucas-Quesada, F.A.; DeBruhl, N.D.; Sayre, J.; Farria, D.; Gorczyca, D.P.; Bassett, L.W. Multifeature analysis of Gd-enhanced MR images of breast lesions. J. Magn. Reson. Imaging 1997, 7, 1016–1026. [Google Scholar] [CrossRef]
  22. Vomweg, T.W.; Buscema, P.M.; Kauczor, H.U.; Teifke, A.; Intraligi, M.; Terzi, S.; Heussel, C.P.; Achenbach, T.; Rieker, O.; Mayer, D.; et al. Improved artificial neural networks in prediction of malignancy of lesions in contrast-enhanced MR-mammography. Med. Phys. 2003, 30, 2350–2359. [Google Scholar] [CrossRef]
  23. Sathya, D.J.; Geetha, K. Mass classification in breast DCE-MR images using an artificial neural network trained via a bee colony optimization algorithm. Science 2013, 39, 294. [Google Scholar] [CrossRef] [Green Version]
  24. Sathya, J.; Geetha, K. Experimental Investigation of Classification Algorithms for Predicting Lesion Type on Breast DCE-MR Images. Int. J. Comput. Appl. 2013, 82, 1–8. [Google Scholar] [CrossRef]
  25. Fusco, R.; Sansone, M.; Petrillo, A.; Sansone, C. A Multiple Classifier System for Classification of Breast Lesions Using Dynamic and Morphological Features in DCE-MRI. Comput. Vis. 2012, 7626, 684–692. [Google Scholar] [CrossRef] [Green Version]
  26. Degenhard, A.; Tanner, C.; Hayes, C.; Hawkes, D.J.O.; Leach, M. The UK MRI Breast Screening Study Comparison between radiological and artificial neural network diagnosis in clinical screening. Physiol. Meas. 2002, 23, 727–739. [Google Scholar] [CrossRef] [PubMed]
  27. Haralick, R.M.; Shanmugam, K.; Dinstein, I. Textural Features for Image Classification. IEEE Trans. Syst. Man. Cybern. 1973, 610–621. [Google Scholar] [CrossRef] [Green Version]
  28. Fusco, R.; Sansone, M.; Sansone, C.; Petrillo, A. Segmentation and classification of breast lesions using dynamic and textural features in Dynamic Contrast Enhanced-Magnetic Resonance Imaging. In Proceedings of the 25th IEEE International Symposium on Computer-Based Medical Systems (CBMS), Rome, Italy, 20–22 June 2012; pp. 1–4. [Google Scholar]
  29. Abdolmaleki, P.; Buadu, L.D.; Naderimansh, H. Feature extraction and classification of breast cancer on dynamic magnetic resonance imaging using artificial neural network. Cancer Lett. 2001, 171, 183–191. [Google Scholar] [CrossRef]
  30. Agner, S.C.; Soman, S.; Libfeld, E.; McDonald, M.; Thomas, K.; Englander, S.; Rosen, M.A.; Chin, D.; Nosher, J.; Madabhushi, A. Textural Kinetics: A Novel Dynamic Contrast-Enhanced (DCE)-MRI Feature for Breast Lesion Classification. J. Digit. Imaging 2010, 24, 446–463. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  31. Levman, J.; Leung, T.; Causer, P.; Plewes, D.; Martel, A.L. Classification of dynamic contrast-enhanced magnetic resonance breast lesions by support vector machines. IEEE Trans. Med. Imaging 2008, 27, 688–696. [Google Scholar] [CrossRef] [Green Version]
  32. Levman, J.; Martel, A.L. Computer-aided diagnosis of breast cancer from magnetic resonance imaging examinations by custom radial basis function vector machine. In Proceedings of the 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology, Buenos Aires, Argentina, 30 August–4 September 2010; Volume 2010, pp. 5577–5580. [Google Scholar]
  33. Francescone, M.A.; Jochelson, M.S.; Dershaw, D.D.; Sung, J.S.; Hughes, M.C.; Zheng, J.; Moskowitz, C.; Morris, E.A. Low energy mammogram obtained in contrast-enhanced digital mammography (CEM) is comparable to routine full-field digital mammography (FFDM). Eur. J. Radiol. 2014, 83, 1350–1355. [Google Scholar] [CrossRef]
  34. Fallenberg, E.M.; Dromain, C.; Diekmann, F.; Renz, D.M.; Amer, H.; Ingold-Heppner, B.; Neumann, A.U.; Winzer, K.J.; Bick, U.; Hamm, B.; et al. Contrast-enhanced spectral mammography: Does mammography provide additional clinical benefits or can some radiation exposure be avoided? Breast Cancer Res. Treat. 2014, 146, 371–381. [Google Scholar] [CrossRef]
  35. Lalji, U.C.; Jeukens, C.R.; Houben, I.; Nelemans, P.J.; van Engen, R.E.; van Wylick, E.; Beets-Tan, R.G.; Wildberger, J.E.; Paulis, L.E.; Lobbes, M.B. Evaluation of low-energy contrast-enhanced spectral mammography images by comparing them to full-field digital mammography using EUREF image quality criteria. Eur. Radiol. 2015, 25, 2813–2820. [Google Scholar] [CrossRef]
  36. The MathWorks Inc. Available online: https://www.mathworks.com/ (accessed on 15 February 2021).
  37. Vallières, M.; Freeman, C.R.; Skamene, S.; El Naqa, I. A radiomics model from joint FDG-PET and MRI texture features for the prediction of lung metastases in soft-tissue sarcomas of the extremities. Phys. Med. Biol. 2015, 60, 5471–5496. [Google Scholar] [CrossRef] [PubMed]
  38. Zwanenburg, A.; Vallières, M.; Abdalah, M.A.; Aerts, H.J.W.L.; Andrearczyk, V.; Apte, A.; Ashrafinia, S.; Bakas, S.; Beukinga, R.J.; Boellaard, R.; et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Im-age-based Phenotyping. Radiology 2020, 295, 328–338. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  39. R-Tools Technology Inc. Available online: https://www.r-tt.com/ (accessed on 15 October 2020).
  40. Youden, W.J. Index for rating diagnostic tests. Cancer 1950, 3, 32–35. [Google Scholar] [CrossRef]
  41. Fusco, R.; Sansone, M.; Filice, S.; Granata, V.; Catalano, O.; Amato, D.M.; Di Bonito, M.; D’Aiuto, M.; Capasso, I.; Rinaldo, M.; et al. Integration of DCE-MRI and DW-MRI Quantitative Parameters for Breast Lesion Classification. BioMed Res. Int. 2015, 1–12. [Google Scholar] [CrossRef] [PubMed]
  42. Bishop, C.M. Pattern Recognition and Machine Learning; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
  43. Cover, T.; Hart, P. Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 1967, 13, 21–27. [Google Scholar] [CrossRef]
  44. Tibshirani, R. The lasso Method for Variable Selection in the Cox Model. Statist. Med. 1997, 16, 385–395. [Google Scholar] [CrossRef] [Green Version]
  45. Tibshirani, R. Regression Shrinkage and Selection Via the Lasso. J. R. Stat. Soc. Ser. B Statist. Methodol. 1996, 58, 267–288. [Google Scholar] [CrossRef]
  46. Bruce, P.; Bruce, A. Practical Statistics for Data Scientists; O’Reilly Media, Inc.: Newton, MA, USA, 2017. [Google Scholar]
  47. James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning; Springer: New York, NY, USA, 2013; Volume 112, p. 18. [Google Scholar]
  48. Wang, Q.; Luo, Z.; Huang, J.; Feng, Y.; Liu, Z. A Novel Ensemble Method for Imbalanced Data Learning: Bagging of Extrapolation-SMOTE SVM. Comput. Intell. Neurosci. 2017, 2017, 1827016. [Google Scholar] [CrossRef]
  49. Gu, X.; Angelov, P.P.; Soares, E.A. A self-adaptive synthetic over-sampling technique for imbalanced classification. Int. J. Intell. Syst. 2020, 35, 923–943. [Google Scholar] [CrossRef]
  50. Chen, Z.; Lin, T.; Xia, X.; Xu, H.; Ding, S. A synthetic neighborhood generation-based ensemble learning for the imbalanced data classification. Appl. Intell. 2017, 48, 2441–2457. [Google Scholar] [CrossRef]
  51. He, H.; Bai, Y.; Garcia, E.A.; Li, S. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In Proceedings of the 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), Hong Kong, China, 1–6 June 2008; pp. 1322–1328. [Google Scholar]
  52. Fusco, R.; Piccirillo, A.; Sansone, M.; Granata, V.; Vallone, P.; Barretta, M.L.; Petrosino, T.; Siani, C.; Di Giacomo, R.; Di Bonito, M.; et al. Radiomic and Artificial Intelligence Analysis with Textural Metrics, Morphological and Dynamic Perfusion Features Extracted by Dynamic Contrast-Enhanced Magnetic Resonance Imaging in the Classification of Breast Lesions. Appl. Sci. 2021, 11, 1880. [Google Scholar] [CrossRef]
  53. Hruska, C.B.; Phillips, S.W.; Whaley, D.H.; Rhodes, D.J.; O’Connor, M.K. Molecular breast imaging: Use of a dual-head dedicated gamma camera to detect small breast tumors. AJR 2008, 191, 1805–1815. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Hruska, C.B.; Boughey, J.C.; Phillips, S.W.; Rhodes, D.J.; Wahner-Roedler, D.L.; Whaley, D.H.; Degnim, A.C.; O’Connor, M.K. Scientific impact recognition award: Molecular breast imaging: A review of the Mayo Clinic experience. Am. J. Surg. 2008, 196, 470–476. [Google Scholar] [CrossRef] [Green Version]
  55. Hruska, C.B.; O’Connor, M.K. Quantification of lesion size, depth, and uptake using a dual-head molecular breast imaging system. Med. Phys. 2008, 35, 1365–1376. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  56. Dromain, C.; Thibault, F.; Diekmann, F.; Fallenberg, E.M.A.; Jong, R.; Koomen, M.; Hendrick, R.E.; Tardivon, A.; Toledano, A. Dual-energy contrast-enhanced digital mammography: Initial clinical results of a multireader, multicase study. Breast Cancer Res. 2012, 14, 94. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Siegmann, K.C.; Krämer, B.; Claussen, C. Current status and new developments in breast MRI. Breast Care 2011, 6, 87–92. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  58. Bernardi, D.; Belli, P.; Benelli, E.; Brancato, B.; Bucchi, L.; Calabrese, M.; Carbonaro, L.A.; Caumo, F.; Cavallo-Marincola, B.; Clauser, P.; et al. Digital breast tomosynthesis (DBT): Recommendations from the Italian College of Breast Radiologists (ICBR) by the Italian Society of Medical Radiology (SIRM) and the Italian Group for Mammography Screening (GISMa). Radiol. Med. 2017, 122, 723–730. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  59. Bucchi, L.; Belli, P.; Benelli, E.; Bernardi, D.; Brancato, B.; Calabrese, M.; Carbonaro, L.A.; Caumo, F.; Cavallo-Marincola, B.; Clauser, P.; et al. Recommendations for breast imaging follow-up of women with a previous history of breast cancer: Position paper from the Italian Group for Mammography Screening (GISMa) and the Italian College of Breast Radiologists (ICBR) by SIRM. Radiol. Med. 2016, 121, 891–896. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  60. Losurdo, L.; Fanizzi, A.; Basile, T.M.A.; Bellotti, R.; Bottigli, U.; Dentamaro, R.; Didonna, V.; Lorusso, V.; Massafra, R.; Tamborra, P.; et al. Radiomics Analysis on Contrast-Enhanced Spectral Mammography Images for Breast Cancer Diagnosis: A Pilot Study. Entropy 2019, 21, 1110. [Google Scholar] [CrossRef] [Green Version]
  61. Fanizzi, A.; Losurdo, L.; Basile, T.M.A.; Bellotti, R.; Bottigli, U.; Delogu, P.; Diacono, D.; Didonna, V.; Fausto, A.; Lombardi, L.; et al. Fully automated support system for diagnosis of breast cancer in contrast-enhanced spectral mammography images. J. Clin. Med. 2019, 8, 891. [Google Scholar] [CrossRef] [Green Version]
  62. Marino, M.A.; Pinker, K.; Leithner, D.; Sung, J.; Avendano, D.; Morris, E.A.; Jochelson, M. Contrast-Enhanced Mammography and Radiomics Analysis for Noninvasive Breast Cancer Characterization: Initial Results. Mol. Imaging Biol. 2020, 22, 780–787. [Google Scholar] [CrossRef] [PubMed]
  63. La Forgia, D.; Fanizzi, A.; Campobasso, F.; Bellotti, R.; Didonna, V.; Lorusso, V.; Moschetta, M.; Massafra, R.; Tamborra, P.; Tangaro, S.; et al. Radiomic Analysis in Contrast-Enhanced Spectral Mammography for Predicting Breast Cancer Histological Outcome. Diagnostics (Basel) 2020, 10, 708. [Google Scholar] [CrossRef] [PubMed]
Figure 1. CEM subtracted images of a multifocal IDC in a 53-years-old woman with dense breast: (A) CC view, (B) early-MLO view, (C) late-MLO view.
Figure 1. CEM subtracted images of a multifocal IDC in a 53-years-old woman with dense breast: (A) CC view, (B) early-MLO view, (C) late-MLO view.
Diagnostics 11 00815 g001
Figure 2. Manual segmentation (red line) of an ILC in a 58-years-old woman with dense breast in (A) CC view, (B) early MLO view and (C) late MLO view.
Figure 2. Manual segmentation (red line) of an ILC in a 58-years-old woman with dense breast in (A) CC view, (B) early MLO view and (C) late MLO view.
Diagnostics 11 00815 g002
Figure 3. Receiver Operating Characteristic (ROC) curves of significant textural features with AUC values for CC view.
Figure 3. Receiver Operating Characteristic (ROC) curves of significant textural features with AUC values for CC view.
Diagnostics 11 00815 g003
Figure 4. Receiver Operating Characteristic (ROC) curves of significant textural features with AUC values for early-MLO view.
Figure 4. Receiver Operating Characteristic (ROC) curves of significant textural features with AUC values for early-MLO view.
Diagnostics 11 00815 g004
Figure 5. Receiver Operating Characteristic (ROC) curves of significant textural features with AUC values for late-MLO view.
Figure 5. Receiver Operating Characteristic (ROC) curves of significant textural features with AUC values for late-MLO view.
Diagnostics 11 00815 g005
Figure 6. Boxplot of significant textural features for CC view.
Figure 6. Boxplot of significant textural features for CC view.
Diagnostics 11 00815 g006
Figure 7. Boxplot of significant textural features for early-MLO view.
Figure 7. Boxplot of significant textural features for early-MLO view.
Diagnostics 11 00815 g007
Figure 8. Boxplot of significant textural features for late-MLO view.
Figure 8. Boxplot of significant textural features for late-MLO view.
Diagnostics 11 00815 g008
Figure 9. ROC curve of the best classifier in (A) a TREE using a subset of three robust textural features extracted by CC view; in (B) a NNET using a subset of ten robust textural features extracted by early-MLO view; in (C) an NNET using a subset of eleven robust textural features extracted by late-MLO view; in (D) a SVM using the all-balanced textural features set extracted from all three mammographic projections (CC, early MLO, and late MLO).
Figure 9. ROC curve of the best classifier in (A) a TREE using a subset of three robust textural features extracted by CC view; in (B) a NNET using a subset of ten robust textural features extracted by early-MLO view; in (C) an NNET using a subset of eleven robust textural features extracted by late-MLO view; in (D) a SVM using the all-balanced textural features set extracted from all three mammographic projections (CC, early MLO, and late MLO).
Diagnostics 11 00815 g009
Table 1. Number and corresponding percentage of the total benign or malignant breast lesions.
Table 1. Number and corresponding percentage of the total benign or malignant breast lesions.
Benign (39 Lesions)NumberPercentage Value (%)
Fibrosis820.51
Ductal hyperplasia1025.64
Fibroadenoma1333.33
Dysplasia410.26
Adenosis410.26
Malignant (65 Lesions)NumberPercentage Value (%)
Infiltrating lobular carcinoma812.31
Infiltrating ductal carcinoma3652.17
Ductal carcinoma in situ1218.46
Intraductal papilloma with DCIS component23.08
Tubular carcinoma34.61
Papillary carcinoma 23.08
Mucinous Carcinoma23.08
Table 2. Extracted textural features.
Table 2. Extracted textural features.
Feature NameAcronym
1st Order Gray-Level Statistics
Mean-
Mode-
Median-
Standard DeviationSTD
Median Absolute DeviationMAD
Range-
Kurtosis-
Interquartile RangeIQR
Variance-
Skewness-
Gray-Level Co-Occurrence Matrix (GLCM)
Energy-
Contrast-
Entropy-
Homogeneity-
Correlation-
Sum Average-
Dissimilarity-
Autocorrelation-
Gray-Level Size Zone Matrix (GLSZM)
Small Zone EmphasisSZE
Large Zone EmphasisLZE
Gray-Level Non-uniformityGLN_ GLSZM
Zone-Size Non-uniformityZSN
Zone Percentage (ZP)ZP
Low Gray-Level Zone EmphasisLGZE
High Gray-Level Zone EmphasisHGZE
Small Zone Low Gray-Level EmphasisSZLGE
Small Zone High Gray-Level EmphasisSZHGE
Large Zone Low Gray-Level EmphasisLZLGE
Large Zone High Gray-Level EmphasisLZHGE
Gray-Level VarianceGLV_GLSZM
Zone-Size VarianceZSV
Neighborhood Gray-Tone Difference Matrix (NGTDM)
Coarseness-
Busyness-
Complexity-
Strength-
Table 3. Configuration settings for each classifier.
Table 3. Configuration settings for each classifier.
ClassifierConfiguration Settings
LDACovariance structure: full; optimizer options: hyperparameter options disabled
Decision treeFine Tree; maximum number of splits: 100; split criterion: Gini’s diversity index; surrogate decision splits: off; optimizer options: hyperparameter options disabled
Artificial neural networkHidden = 3; linear output to TRUE; optimizer options: hyperparameter options disabled
K-nearest neighborsFine KNN; number of neighbors: 100; distance metric: Euclidean; distance weight: equal; standardize data: true; optimizer options: hyperparameter options disabled
Support vector machineLinear SVM; kernel function: linear; kernel scale: automatic; box constraint level: 1; multiclass method: one-vs-one; standardize data: true; optimizer options; hyperparameter options disabled
Table 4. List of most significant textural features with the corresponding area under curve (AUC) and p-value.
Table 4. List of most significant textural features with the corresponding area under curve (AUC) and p-value.
Mammography ProjectionTextural ParametersAUC Valuesp-Value
CC viewMAD0.720.000
IQR0.710.000
VARIANCE0.720.000
COARSNESS0.730.000
BUSYNESS0.730.000
STRENGTH0.740.000
early MLO viewVARIANCE0.700.001
BUSYNESS0.700.001
STRENGTH0.700.000
late MLO viewGLV_GLRLM0.700.001
GLV_GLSZM0.700.001
COARSNESS0.740.000
BUSYNESS0.710.000
STRENGTH0.700.001
Table 5. Performance of the best classifier using textural features extracted from CC view.
Table 5. Performance of the best classifier using textural features extracted from CC view.
ClassifierCross-ValidationACCSENSSPECPPVNPVAUC
Performance for classifiers trained with balanced data (with ADASYN function) and a set of 3 textural features (by LASSO and λ1SE).
TREELOOCV0.750.750.740.750.740.82
SVMLOOCV0.760.770.740.760.750.81
Performance for classifiers trained with balanced data (with ADASYN function) and subset of 34 textural features (AUC ≥ 0.60).
LDALOOCV0.790.720.850.840.750.80
Table 6. Performance of the best classifier using textural features extracted from early-MLO view.
Table 6. Performance of the best classifier using textural features extracted from early-MLO view.
ClassifierCross-ValidationACCSENSSPECPPVNPVAUC
Performance for classifiers trained with balanced data (with SASYNO function) and a set of 23 robust textural features (by LASSO and λminMSE).
SVM10-fold CV0.790.800.770.780.790.80
Performance for classifiers trained with balanced data (with SASYNO function) and a subset of 15 robust textural features (AUC ≥ 0.60).
KNN10-fold CV0.780.770.770.770.770.81
Performance for classifiers trained with balanced data (with ADASYN function) and a set of 10 robust textural features (by LASSO and λminMSE).
KNN10-fold CV0.780.750.800.790.760.81
NNETLOOCV0.770.80.730.750.780.83
Table 7. Performance of the best classifier using textural features extracted from late-MLO view.
Table 7. Performance of the best classifier using textural features extracted from late-MLO view.
ClassifierCross-ValidationACCSENSSPECPPVNPVAUC
Performance for classifiers trained with balanced data (with ADASYN function) and all 48 textural features.
LDA10-fold CV0.780.710.840.810.760.79
Performance for classifiers trained with balanced data (with ADASYN function) and a subset of 11 robust textural features (by LASSO).
NNET10-fold CV0.740.770.700.700.770.78
LOOCV0.730.740.710.710.750.82
Table 8. Performance of the best classifier using textural features extracted from all three mammographic projection (CC, early MLO and late MLO).
Table 8. Performance of the best classifier using textural features extracted from all three mammographic projection (CC, early MLO and late MLO).
ClassifierCross-ValidationACCSENSSPECPPVNPVAUC
Performance for classifiers trained with balanced data (with ADASYN function) and all 144 features.
SVMLOOCV0.870.860.870.880.860.90
Performance for classifiers trained with balanced data (with SASYNO function) and a subset of 18 robust textural features (by LASSO and λminMSE).
LDA10-fold CV0.850.830.860.860.840.89
LOOCV0.850.850.860.860.850.89
Performance for classifiers trained with balanced data (with ADASYN function) and set of 13 robust textural features (by LASSO and λ1SE).
LDA10-fold CV0.840.830.850.840.830.89
LOOCV0.840.850.830.830.840.89
Table 9. Summary table of main findings using radiomic and artificial intelligence analysis on CEM.
Table 9. Summary table of main findings using radiomic and artificial intelligence analysis on CEM.
ReferenceExtracted FeaturesUsed ClassifierResults
[60]Statistical measures on original ROIs, gradiented images, Haar decompositions of the same original ROIs, and on gray-level co-occurrence matrices of each sub-ROI obtained by Haar transform.SVM classifierThe features extracted from each sub-ROI decomposed by two levels of Haar transform were predictive only when they were all used without any selection, reaching the best mean accuracy of about 80%.
[61]Statistical FeaturesRandom Forest classifierThe present method resulted highly performing in the prediction of benign/malignant ROIs with median values of sensitivity and specificity of 87.5% and 91.7%, respectively. The classification model outperformed the human reader, by increasing the specificity over 8%.
[62]Radiomic features were derived from first-order HIS, COM, RLM, absolute gradient, autoregressive model, the discrete Haar WAV, and lesion geometry. Fisher, probability of error and average correlation (POE + ACC), and mutual information (MI) coefficients informed feature selection.Linear discriminant analysis followed by k-nearest neighbour classification.Radiomics analysis achieved the highest accuracies of 87.4% for differentiating invasive from non-invasive cancers based on COM + HIS/MI, 78.4% for differentiating HR positive from HR negative cancers based on COM + HIS/Fisher, 97.2% for differentiating HER2-positive/HR-negative from HER2-negative/HR-positive cancers based on RLM + WAV/MI, 100% for differentiating triple-negative from triple-positive breast cancers mainly based on COM + WAV + HIS/POE + ACC, and 82.1% for differentiating triple-negative from HR-positive cancers mainly based on WAV + HIS/Fisher. Accuracies for differentiating grade 1 vs. grades 2 and 3 cancers were 90% for invasive cancers (based on COM/MI) and 100% for non-invasive cancers (almost entirely based on COM/MI).
[63]Fourteen statistical featuresMultivariate Linear Discriminant AnalysisThe highest performances were obtained for discriminating HER2+/HER2− (90.87%), ER+/ER− (83.79%) and Ki67+/Ki67− (84.80%).
Our Study48 textural parameters were extracted by manually segmenting regions of interest.Linear classifier, decision tree, k-nearest neighbours, artificial neural network and support vector machine were utilized.Multivariate analyses considering 144 textural features extracted from all three mammographic projections (CC, early MLO and late MLO) optimized by adaptive synthetic sampling and feature selection operations obtained the best results (ACC = 0.87; AUC = 0.90) and showed the best performance in the discrimination of benign and malignant lesions.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fusco, R.; Piccirillo, A.; Sansone, M.; Granata, V.; Rubulotta, M.R.; Petrosino, T.; Barretta, M.L.; Vallone, P.; Di Giacomo, R.; Esposito, E.; et al. Radiomics and Artificial Intelligence Analysis with Textural Metrics Extracted by Contrast-Enhanced Mammography in the Breast Lesions Classification. Diagnostics 2021, 11, 815. https://doi.org/10.3390/diagnostics11050815

AMA Style

Fusco R, Piccirillo A, Sansone M, Granata V, Rubulotta MR, Petrosino T, Barretta ML, Vallone P, Di Giacomo R, Esposito E, et al. Radiomics and Artificial Intelligence Analysis with Textural Metrics Extracted by Contrast-Enhanced Mammography in the Breast Lesions Classification. Diagnostics. 2021; 11(5):815. https://doi.org/10.3390/diagnostics11050815

Chicago/Turabian Style

Fusco, Roberta, Adele Piccirillo, Mario Sansone, Vincenza Granata, Maria Rosaria Rubulotta, Teresa Petrosino, Maria Luisa Barretta, Paolo Vallone, Raimondo Di Giacomo, Emanuela Esposito, and et al. 2021. "Radiomics and Artificial Intelligence Analysis with Textural Metrics Extracted by Contrast-Enhanced Mammography in the Breast Lesions Classification" Diagnostics 11, no. 5: 815. https://doi.org/10.3390/diagnostics11050815

APA Style

Fusco, R., Piccirillo, A., Sansone, M., Granata, V., Rubulotta, M. R., Petrosino, T., Barretta, M. L., Vallone, P., Di Giacomo, R., Esposito, E., Di Bonito, M., & Petrillo, A. (2021). Radiomics and Artificial Intelligence Analysis with Textural Metrics Extracted by Contrast-Enhanced Mammography in the Breast Lesions Classification. Diagnostics, 11(5), 815. https://doi.org/10.3390/diagnostics11050815

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop