1. Introduction
Carotid atherosclerosis and the rupture of atheromatic plaques represent a complex occurrence influenced by numerous factors, involving both biological components and biomechanical forces acting on the arterial wall. Specific to each patient, factors like arterial shape, peak systolic blood velocity, pressure, plaque makeup, and structure are crucial in determining the mechanical strains experienced by the arterial wall. High wall shear stress (WSS) and increased plaque structural stress (PSS) due to unstable plaque components are recognized as factors contributing to plaque rupture. Moreover, various measures of plaque accumulation, such as volume, cross-sectional area, or thickness of plaque elements, as well as the presence, size, or volume of specific plaque tissues, have been linked to past, present, and recurrent symptoms or events, including stroke.
In the current literature, there are several studies that explore the relationship between plaque characteristics and the risk of plaque rupture, as well as the biomechanical stresses associated with this risk. Takaya et al. [
1] conducted a prospective study on asymptomatic patients with carotid stenosis, finding significant associations between baseline MRI-detected plaque features and subsequent cerebrovascular events. Similarly, Zhao et al. [
2] identified associations between carotid plaque characteristics and acute cerebral infarct sizes.
Gupta et al. [
3] conducted a meta-analysis confirming that certain plaque components, including intraplaque hemorrhage (IPH), lipid-rich necrotic core (LRNC), and thin/ruptured fibrous cap (FC), are predictors of future stroke or transient ischemic attack (TIA). Esposito-Bauer et al. [
4] and Selwaness et al. [
5] also found associations between plaque composition and cerebrovascular events. Furthermore, studies by Sun J. et al. [
6], Sun B. et al. [
7], Xia et al. [
8], and Cui et al. [
9] explored various aspects of plaque characteristics and their associations with cardiovascular outcomes. Notably, IPH, LRNC, and FC rupture were strongly associated with adverse outcomes in these studies.
On the biomechanical side, Cheng et al. [
10] and Li et al. [
11] investigated stress distributions within plaques, finding higher circumferential stress in ruptured plaques. Groen et al. [
12] and Kock et al. [
13] used imaging data to simulate longitudinal fibrous cap stresses, identifying potential indicators of plaque vulnerability. Moreover, Tang et al. [
14] and Teng et al. [
15] employed finite element analysis (FEA) and computational fluid dynamics (CFD) to assess plaque wall stress and flow-induced stresses, revealing associations with plaque rupture. Other studies, including those by Ohayon et al. [
16], Huang et al. [
17], and Wang et al. [
18], further explored stress distributions within plaques using FSI simulations based on imaging data. Furthermore, studies by Tuenter et al. [
19], Costopoulos et al. [
20], and Doradla et al. [
21] investigated associations between wall shear stress (WSS), plaque composition, and plaque vulnerability, providing insights into the biomechanical mechanisms underlying plaque rupture.
Curcio et al. [
22] introduced a patient-specific computational approach to assess the vulnerability of carotid artery plaques. By employing geometric modeling and structural simulation techniques, they identified regions of elevated stress within the artery wall, particularly around plaques, indicating potential rupture areas. In plaques with lipid composition and heterogeneity, maximum stresses were concentrated within the fibrous cap. Jansen et al. [
23] developed a tissue-engineered model of atherosclerotic plaque caps, incorporating microcalcifications to evaluate their influence on mechanical properties. Their model confirmed the adverse role of microcalcifications in increasing the risk of cap rupture and offers insights into tissue mechanics involving soft tissue calcification. Additionally, Zhang et al. [
24] devised a 3D carotid plaque radiomics model using high-resolution magnetic resonance imaging (HRMRI), which outperformed traditional models in identifying vulnerable plaques, demonstrating superior performance in both training and test cohorts. These studies collectively advance the understanding and assessment of carotid artery plaque vulnerability through computational, tissue-engineering, and radiomics approaches, offering valuable insights for enhanced clinical management. Overall, these studies contribute to our understanding of the complex interplay between plaque characteristics, biomechanical stresses, and the risk of plaque rupture, offering valuable insights for risk assessment and management in patients with atherosclerotic disease.
Significant advancements in using machine learning (ML) models for predicting cerebrovascular events such as stroke, transient ischemic attacks (TIA), Amaurosis Fugax, etc. related to carotid artery disease have been witnessed in reviewing current literature. Studies have employed diverse ML techniques, integrating both clinical data and imaging inputs to enhance predictive accuracy. For instance, Wu et al. [
25] and Weng et al. [
26] utilized clinical and imaging data, demonstrating improved model performance in predicting major adverse cardiovascular and cerebrovascular events. Moreover, research incorporating radiomics [
27], underscores the potential of advanced imaging features in refining ML predictions.
Additionally, the integration of explainable ML models [
28], provides significant insights into early screening and risk assessment for carotid atherosclerosis. Other notable contributions include hybrid modeling approaches combining clinical and imaging data, enhancing the prediction of stroke risk [
29]. Despite the limited focus on simulation-based inputs specifically for carotid artery disease, the existing literature underscores the transformative potential of ML in predicting cerebrovascular events, paving the way for more personalized and precise healthcare interventions.
In this context, we propose a machine learning (ML) model for predicting forthcoming cerebrovascular events which combines as input hemodynamic parameters obtained through finite element analysis with non-imaging characteristics deriving from eCRF recordings and demographic data, a combination that constitutes its uniqueness and novelty compared to other proposed models deriving from current literature. This model provides a detailed understanding of the intricate connection between biomechanical stresses and plaque characteristics, making it an essential tool for assessing the probability of an impending cerebrovascular event.
2. Materials and Methods
2.1. Dataset Description
For this study, we utilized one hundred and thirty-four (134) asymptomatic cases with >50% carotid stenosis. The data were collected and anonymized from five (5) different clinical centers and were obtained under a data protection agreement fulfilling all the ethical and legal requirements for data sharing posed by the General Data Protection Regulation in a third-level care setting. These cases were examined using a 1.5-T whole-body system (Signa HDx, GE Healthcare, Waukesha, WI, USA) equipped with a bilateral four-channel phased-array carotid coil (Machnet BV, Eelde, The Netherlands). The subjects provided written informed consent to participate in the TAXINOMISIS study (
www.clinicaltrials.gov; accessed on 15 May 2024; ID: NCT03495830) protocol, which received approval from the local ethics committee. The median age of the subjects was 69.4 years, and the vast majority suffered from hypertension (83%) and hypercholesterolemia (77.4%).
Table 1 depicts the demographics data for the 134 subjects that were included in the present study.
Notably, 115 cases presented with no cerebrovascular events during the 3-year period, whereas 19 cases exhibited at least one cerebrovascular-related event such as stroke, transient ischemic attacks (TIA), myocardial infarctions (MI), or intracranial hemorrhages. As expected, the dataset was severely imbalanced due to the fact that the TAXINOMISIS study consisted of asymptomatic cases, which rarely trigger any events in a period of 3 years.
2.2. Methodology
2.2.1. 3D Reconstruction
The process of reconstructing a 3D model of the carotid artery relies on a sequence of magnetic resonance images (MRI), involving ToF, T1w, T2w, and PD series. To briefly outline, the ToF sequence is employed to reconstruct the arterial lumen, while the fusion of T1w, T2w, and PD series facilitates the reconstruction of the arterial wall model and the model depicting plaque components. These three models are subsequently aligned in a later phase to generate the final arterial model. This reconstruction method is grounded in an innovative approach [
30], which comprises three distinct steps:
Region of interest segmentation: The first step involves segmenting the regions of interest, including the lumen, outer wall, and plaque components. To accomplish this, three deep learning models were developed. Specifically, 485 tuples of ToF, T1w, T2w, and PD images from 42 different patients were annotated by two experts, resulting in a training dataset. This dataset was then utilized to train three UNET models, each tailored to segmenting a specific region of interest.
3D level set: Following segmentation, a morphological operator is applied to the 3D volume composed of stacked 2D segmented frames. This operation generates the 3D surface model representing the segmented regions.
3D meshing: Subsequently, the marching cubes algorithm is employed on the 3D surface model to produce the final reconstructed arterial model.
By employing this comprehensive approach, the reconstruction process yields detailed and accurate representations of the carotid artery, enabling further analysis and study of arterial pathologies.
2.2.2. 3D Blood-Flow Simulations
The 3D reconstructed luminal carotid geometries underwent transient blood-flow simulations, utilizing patient-specific boundary conditions derived from the respective carotid UltraSound (US) screening. These conditions were comprised of flow velocity profiles obtained for a minimum of three consecutive cardiac cycles for each artery. In our simulations, blood flow was modeled using the Navier—Stokes and continuity equations, represented as Equations (1) and (2), respectively.
Here,
denotes the blood velocity vector and
stands for the stress tensor, defined by Equation (3) where
δij′ represents the Kronecker delta,
μ is the blood dynamic viscosity,
p indicates the blood pressure, and
εij signifies the strain tensor as calculated by Equation (4).
The blood was assumed to follow Newtonian behavior with a density of 1050 kg/m3 and dynamic viscosity of 0.0035 Pa·s. All simulations were conducted using ANSYS® v16.2 with a maximum element size set to 0.16 mm, consisting solely of tetrahedra. The mesh size was determined through a comprehensive sensitivity analysis, with a convergence criterion of 10−4 and an iteration limit of 150 for each time step.
In our effort to replicate patient-specific blood-flow dynamics accurately, we employ an approach that hinges on utilizing the derived mass flow rate profiles from Carotid Ultrasonography (US) images. These tailored profiles are applied to both the inlet, representing the proximal boundary of the Common Carotid Artery (CCA), and the outlet of the External Carotid Artery (ECA). This methodological choice is substantiated by the patient-specific US data, which provide detailed flow velocity profiles for at least two of the three arterial branches: CCA, Internal Carotid Artery (ICA), and/or ECA.
Each US image serves as a visual repository, providing insights into intricate velocity measurements and corresponding waveforms for multiple cardiac cycles across various arteries. The patient-specific flow velocity diagram is created using only the peak systolic velocity (PSV) and the end diastolic velocity (EDV) values using a dedicated in-house developed MATLAB script.
Additionally, patient-specific cardiac cycle durations are computed based on pulse rate information, incorporating temporal dynamics into the simulations. This comprehensive methodology not only leverages the richness of the US data but also precisely integrates this information, ensuring the utmost accuracy in capturing the nuances of patient-specific blood-flow dynamics. The velocity diagram is then transformed into a mass flow rate diagram, using Equation (5).
The symbol
represents the mass flow rate,
denotes the blood density,
signifies the blood velocity, and
represents the cross-sectional area at the velocity measurement location. Regarding the internal carotid artery (ICA), a zero-pressure boundary condition is applied, with a detailed explanation provided in the results section for the rationale behind this choice. Additionally, a no-slip boundary condition is enforced at the arterial wall boundary, ensuring zero velocity at the wall, while a no-penetration condition is imposed, preventing fluid from passing through the boundary.
Figure 1 depicts the boundary conditions for an indicative case of the utilized dataset. Briefly, the entire modeling process consists of the following steps: (a) 3D reconstruction of the lumen, the arterial wall and the plaque constituents deriving from the MRI data and 2D cross-section reconstruction for the site of maximum plaque burden; (b) flow velocity profiles extraction for the CCA and ECA deriving from the US data and transformation to the respective mass flow rate profiles for both, respectively; (c) 3D CFD blood-flow simulation using the 3D model of the lumen, accompanied by the mass flow rate profiles for the CCA and ECA and an a zero-pressure value for the ICA, serving as boundary conditions for the FEM simulation; (d) 2D structural analysis, using the 2D model for the site of maximum plaque burden and the maximum pressure at this specific site, as calculated by (c); (e) calculation of the following hemodynamic parameters: Peak TAESS, P
ECA/P
CCA, P
ICA/P
CCA, vessel average TAESS, vessel average OSI, normalized area of low TAESS, Normalized area of high OSI, plaque structural stress (PSS); (f) input of the aforementioned parameters in the proposed ML model; (g) event risk score calculation. The brief description of the entire pipeline is depicted in
Figure 2.
2.2.3. Structural Analysis
After having performed the 3D blood-flow simulation in order to calculate the hemodynamic parameters to be fed into the machine learning model, as well as the pressure distribution throughout the entire cardiac cycle, the next step is to perform a 2D structural analysis of the cross-section which exhibits the highest plaque burden value that has already been calculated by the 3D reconstruction module. The 2D pixel coordinates of the lumen, the arterial wall, and the plaque components are transformed to cartesian coordinates and the final 2D geometry is then discretized into quadrilateral elements with a maximum face size of 0.05 mm. furthermore, an inflation process is performed at the interface between the plaque component and the arterial wall, as well as at the borders of the arterial wall and the lumen (
Figure 3). Regarding the utilized boundary conditions for the 2D-analysis, the outer perimeter of the external wall is assumed to act as a frictionless support, aiming to prevent any potential displacement of the adventitia. Simultaneously, the plaque components are bonded with the arterial wall throughout the simulation. The luminal boundary emerges as a crucial interface, serving as the platform onto which the previously computed maximum site-specific pressure, derived from the aforementioned 3D blood-flow simulation, is applied as a loading condition. The pressure values obtained at the cross-section, which aligns with the segmented slice, are crucial for guiding the further stages of analysis. This thorough approach ensures the model’s integrity and allows for a detailed understanding of the real-world complexities involved in simulating the biomechanical behavior of arterial structures.
Following this, the structural analysis progresses by assigning specific material properties to the arterial wall and the plaque being examined within the selected cross-section. For both the arterial wall and the fibrous plaque, hyperelastic material properties are assigned. Regarding the arterial wall, we employed the Mooney–Rivlin approach [
31], defined by the parameters C
10 = 0.07 MPa, C
20 = 3.2 MPa, C
21 = 0.0716 MPa, while setting the remaining parameters as C
ij = 0 systematically. To address incompressibility, the parameter d of the arterial wall was set at d = 1.0 × 10
−5 Pa
−1. The material properties attributed to the plaque components were determined through the analysis of the stretch ratio to Cauchy stress diagrams which were created experimentally [
32]. These diagrams formed the basis for calculating and extracting the essential 5-parameter Mooney–Rivlin constants. The resulting values, crucial for accurately characterizing the mechanical behavior of the plaque components, are shown in
Table 2.
Figure 4 depicts the pipeline for the calculation of PSS values, beginning with the calculation of the maximum pressure during the cardiac cycle at the site of the cross-section of interest, followed by the 2D structural simulation using the aforementioned calculated pressure as a loading boundary condition.
Figure 2.
Overall pipeline of the proposed approach. The red circle in green box within the US image indicated the cardiac cycle from which the diagram next to it derives. The literature values indicated in the blue box with
1 are derived from [
32].
Figure 2.
Overall pipeline of the proposed approach. The red circle in green box within the US image indicated the cardiac cycle from which the diagram next to it derives. The literature values indicated in the blue box with
1 are derived from [
32].
Figure 3.
2D mesh for the arterial model with the plaque component indicated in yellow. An inflation process was followed in the two luminal borders, as well as at the borders of the fibrous plaque.
Figure 3.
2D mesh for the arterial model with the plaque component indicated in yellow. An inflation process was followed in the two luminal borders, as well as at the borders of the fibrous plaque.
Figure 4.
PSS and deformation calculations for a representative case (right carotid). (A) The initial 3D blood-flow simulation is used to depict the exact location at which the structural analysis was performed. The location exhibited the highest plaque burden value in the entire arterial length. (B) PSS calculated at 57.4 kPa, at the ICA, between the fibrous plaque and the arterial lumen. (C) Maximum deformation calculated at the inner border of the ECA (0.14 mm).
Figure 4.
PSS and deformation calculations for a representative case (right carotid). (A) The initial 3D blood-flow simulation is used to depict the exact location at which the structural analysis was performed. The location exhibited the highest plaque burden value in the entire arterial length. (B) PSS calculated at 57.4 kPa, at the ICA, between the fibrous plaque and the arterial lumen. (C) Maximum deformation calculated at the inner border of the ECA (0.14 mm).
2.2.4. Hemodynamic Features Used as Input for the ML Model
A total of 134 cases were analyzed to compute the necessary hemodynamic parameters for the event prediction model. This analysis included fluid-related hemodynamic parameters, as well as structural-derived parameters. Specific emphasis is given to deriving the highest plaque structural stress (PSS) value from each case via 2D structural analysis [
33,
34]. This value, acting as a biomechanical marker of vulnerability, is crucial for our predictive modeling, especially for the plaque rupture model. Apart from the PSS values, the 3D simulation produces metrics such as the Time-Averaged Wall Shear Stress (
TAWSS) (Equation (6)), the average Oscillatory Shear Index (
OSI) values (Equation (7)) and distributions throughout the artery, the pressure drop between the inlet (CCA boundary), and the two outlets (ICA and ECA boundaries), respectively:
where
T is a full cardiac cycle and
τw is the instantaneous WSS. The OSI can be considered as the fraction of the angle and the magnitude change between the instantaneous wall shear stress and the time-averaged wall shear stress. The values that can be assigned to it range from 0–0.5. While 0.5 is a value that represents an unstable, oscillatory flow with a WSS of 0 Pa, 0 denotes a wholly unidirectional WSS. Areas with high OSI values are typically more likely to form plaque and exhibit endothelial dysfunction [
35,
36]. Atherogenesis has also been shown to be more likely to manifest at very low TAWSS values (0.4 Pa), whereas high TAWSS values (>40 Pa) may lead to endothelial dysfunction and subsequent damage, raising the likelihood of thrombosis [
37,
38].
2.2.5. Problem Definition
The development of the final model for predicting cerebrovascular events is framed as an advanced multivariate binary classification challenge, focusing on data from the electronic Case Report Form (eCRF) repository of our dataset. This classification delineates between class 0 (cases without events) and class 1 (cases with cerebrovascular events).
Figure 5 depicts the ML model development pipeline which was followed. Given the imbalanced nature of the dataset (115 cases with no events and 19 cases with cerebrovascular events), with a significantly lower number of positive instances compared to negative instances, we applied several preprocessing steps to ensure robust model training and evaluation.
Table 3 shows the features used by the machine learning (ML) event prediction model which combines imaging information, both 2D and 3D simulation-based features, as well as eCRF-based data.
Categorical features in the dataset were encoded using one-hot encoding to transform them into a numerical format suitable for machine learning algorithms. This step ensures that the model can process categorical data efficiently.
The dataset was split into training and testing subsets using a 70–30 split. This division was performed using stratified sampling to maintain the same proportion of positive and negative instances in both the training and testing sets, preserving the original class distribution. The training dataset included 80 class 0 instances and 13 class 1 instances, whereas the test dataset included the remaining 35 class 0 instances and the remaining six class 1 instances, respectively.
Two imbalance-handling approaches were implemented, (a) SMOTE Oversampling for Class 1 and (b) Downsampling for Class 0, respectively.
The Synthetic Minority Oversampling Technique (SMOTE) [
39] was employed to address class imbalance by generating synthetic samples for the minority class. This technique interpolates between existing minority class samples to create new instances, thereby balancing the class distribution without reducing the number of majority class instances. SMOTE was applied to the training set, generating synthetic samples to balance the classes. After applying SMOTE (Equation (8)), the values in the specified binary columns were rounded and clipped to ensure they remained binary (0 or 1):
- b.
Downsampling Class 0 (Majority class)
Random downsampling without replacement was applied based on the “RandomUnderSampler” approach [
40] to address class imbalance by reducing the number of majority class instances to equally match the number of minority class instances. This approach prevents the model from being biased towards the majority class by preserving the same number of instances in both classes. The “RandomUnderSampler” was applied to the training dataset to yield equally balanced populations as in:
The Gradient Boosting Tree (GBT) classifier was chosen to solve a binary classificaiton problem (“0”: asbence of events, “1”: presence of events) based on its effectiveness in handling complex, non-linear relationships and robustness to overfitting. We performed 5-fold stratified cross-validation on the training set to evaluate the model’s performance in both approaches (SMOTE approach and downsampling approach). The Gradient Boosting Tree (GBT) classifier is a powerful ensemble learning algorithm which has been designed to optimize predictive performance by combining multiple weak learners, typically decision trees, into a strong learner. It involves the construction of a sequence of models, where each model focuses on correcting the errors of its predecessor. The models are finally combined to produce the predictive model. The learning process is briefly described below.
A base model
is initially defined as follows:
where
L is the loss function,
are the true values, and
c is a constant. An iterative boosting is then applied for
m = 1, 2…,
M stages and the pseudo-residuals, say
, are estimated as in:
where
is the gradient of the loss function with respect to the predictions, and
is the model from the
m−1 stage. A weak learner is then applied to the pseudo-residuals, yielding:
where
is the weak learn at stage m and
is the squared error between the pseudo-residuals and the weak learner’s predictions. The model at stage
m, say
, is then incrementally updated through the following rule:
where
is the learning rate which reflects the contribution of each weak learner. The final model (i.e., the model after
M stages,
) is defined as the sum of the base model with the contributions from all the weak learners:
We employed everything in Python 3.11 using the “GradientBoostingClassifier” from the scikit-learn package, where the number of estimators (i.e., number of boosting stages) was set to 100, the learning rate to 0.1, the loss was set to the logarithmic loss, and the max. depth of the individual regression estimators to 3. The quality of each split was measured using the mean squared error with improvement score by Friedman.
3. Results
3.1. Internal Validation of Both Approaches on the Training Dataset
The utilized model performance metrics for the comparison of the two imbalance handling approaches are the balanced accuracy, the negative predictive value, the positive predictive value, the area under the receiver operating curve (ROC AUC), and the sensitivity and specificity. The values of the adopted performance metrics and their mean value and the 5-fold standard deviation are given in
Table 4 for the SMOTE approach and in
Table 5 for the downsampling approach.
The results of the 5-fold cross-validation scheme revealed a slight superiority of the SMOTE approach over the downsampling approach, something that was somehow expected due the nature of the SMOTE process, which increases the minority class instances to match the instances of the majority class regarding the training dataset. The downsampling approach, on the other hand, produced relatively satisfactory internal validation results, even though the minority class instances within the training data were very few. Given the relatively small number of positive instances in our dataset, using five folds ensures that each fold contains a sufficient number of samples from both classes, maintaining the class distribution and providing more stable and reliable performance estimates compared to higher fold values like 10-fold, which might result in folds with very few positive instances. This approach also mitigates the risk of overfitting and ensures that the performance metrics we obtain are more reflective of the model’s ability to generalize to unseen data, making it a robust choice for validating our predictive models.
SHapley Additive exPlanations (SHAP) analysis was performed to explain the predictions made by the proposed model by calculating the influence of each feature to the overall event prediction [
41]. The most influential features of the proposed ML model using the SMOTE approach are presented in
Figure 6. The mean calculated SHAP values for all the utilized features were calculated to reflect the global feature importance.
These features represent the most significant predictors of the event outcome in the dataset, with the PSS (plaque structural stress) being the most influential, followed by the presence of hypercholesterolemia and the areas of high OSI.
3.2. Testing of Both Approaches on the Test Dataset
The performance of the SMOTE approach on the test dataset demonstrated its effectiveness in addressing class imbalance and improving the model’s ability to predict cerebrovascular events. The SMOTE-enhanced model achieved a classification outcome that indicated balanced sensitivity and specificity, ensuring that both positive and negative instances were accurately identified. Specifically, the confusion matrix for the SMOTE approach revealed 32 true negatives, three false positives, two false negative, and four true positives. This translated to a high specificity rate, indicating a strong ability to correctly identify non-events, while maintaining a reasonable sensitivity rate for detecting actual events. The ROC AUC score further supported the robustness of this model, showcasing its inherent ability in distinguishing between the positive and negative classes.
In contrast, the “RandomUnderSampler” approach, also exhibited promising results on the test dataset. The confusion matrix for this method showed 28 true negatives, seven false positives, zero false negatives, and six true positives.
Figure 7 depicts the normalized confusion matrices for the two approaches. The ROC AUC score was also competitive, reflecting its balanced performance across both classes. Overall, both approaches demonstrated their respective strengths, with the SMOTE approach presenting slightly superior results in all metrics.
The performance of the GBT classifier for both class imbalance approaches is depicted in
Table 6.
4. Discussion
In this work, we developed a robust predictive model for cerebrovascular events by integrating finite element-derived simulation results with imaging and non-imaging clinical data. Our methodology involved comprehensive data preprocessing, including the encoding of categorical variables. To address the significant class imbalance in our dataset (115 class 0 vs. 19 class 1 instances), we employed two distinct techniques: “RandomUnderSampler” and Synthetic Minority Oversampling Technique (SMOTE). The “RandomUnderSampler” method undersamples (or downsamples) the majority class to create a more balanced training set, while SMOTE generates synthetic samples for the minority class, enhancing the model’s ability to learn from both classes. We then trained a GBT classifier, leveraging its strength in handling complex, non-linear relationships within the data.
The performance of our predictive model was initially evaluated on the transformed train datasets using a 5-fold cross-validation strategy to ensure reliable and generalizable results. This strategy maintained the class distribution within each fold, providing robust performance estimates. The results indicated that both the “RandomUnderSampler” and SMOTE approaches effectively improved the model’s sensitivity and specificity. As indicated in the generated results, the SMOTE approach was the selection of preference, since it produced a more balanced performance, compared to the “RandomUnderSampler” approach. Both methods achieved balanced accuracy and high ROC AUC scores, confirming the model’s capability to distinguish between positive and negative instances. These results underscore the effectiveness of our approach in creating a reliable predictive model for cerebrovascular events. The final GBT model (SMOTE-enabled) exhibited a balanced accuracy of 88% along with an AUC of 0.91. The performance of our approach compared to other ML-based models from literature is presented in
Table 7.
The integration of finite element-derived simulation results as input features represents a significant innovation in our predictive modeling framework. By combining these sophisticated simulation outputs with imaging and non-imaging data, we have enriched the feature space, enabling the model to capture more nuanced patterns associated with cerebrovascular events. This multidisciplinary approach leverages the strengths of advanced computational techniques and clinical insights, providing a more comprehensive understanding of the factors contributing to cerebrovascular risks. The finite element simulations offer detailed biomechanical insights that are not typically available from clinical data alone, thus enhancing the model’s predictive power.
Our method’s innovative aspect lies in this unique combination of data types, which has not been extensively explored in previous studies. The inclusion of simulation-derived features allows for a more detailed characterization of the vascular environment, potentially leading to earlier and more accurate predictions of cerebrovascular events. This approach not only demonstrates the feasibility of integrating diverse data sources but also sets a precedent for future studies aiming to leverage advanced simulations in clinical predictive models. By incorporating these elements, our work provides a novel framework that could be adapted and expanded for other clinical applications, ultimately contributing to more personalized and precise healthcare interventions.
However, the main limitation of our work lies in the rather small dataset that was available. The dataset was relatively imbalanced, a fact that can obviously be attributed to the fact that cerebrovascular events are less than common in asymptomatic patients with carotid artery disease. This was partially tackled by using a 5-fold cross-validation technique, as well as by trying an oversampling (SMOTE) technique for the minority class and calculating the results of the newly acquired enhanced dataset, as well as by applying a downsampling approach for the majority class to match the size of the minority class and recalculating the newly acquired dataset. Obviously, in the presence of a higher number of recorded events which would lead to a more balanced dataset, a different validation strategy would have been followed, having an obviously larger test subset. Furthermore, the proposed method relies on the accuracy of the 3D reconstruction of the carotid vasculature, as well as on the accuracy of the plaque characterization process performed. Both aspects are highly dependent on the quality of the acquired MRI images, a fact that can easily affect the calculated hemodynamic parameters which, in turn, affect the risk of cerebrovascular events that is calculated through the ML model.
The clinical impact of our work is of great importance, since it covers a sensitive subset of carotid artery disease patients who are asymptomatic but also have carotid artery stenoses of >50%. Our model addresses a critical gap in current clinical practice by providing a risk stratification beyond the conventional stenosis threshold of 80% used for surgical interventions. Specifically, it can identify patients with stenoses less than 80% who are at high risk of cerebrovascular events, thereby enabling earlier intervention and potentially preventing strokes that might otherwise be missed. Conversely, the model can also discern patients with stenoses greater than 80% who are at low risk, thus preventing unnecessary surgical procedures and their associated risks. By refining the decision-making process for intervention, our model has the potential to improve patient outcomes, reduce healthcare costs, and contribute to more personalized and effective management of carotid artery disease.
Our future work will focus on acquiring significantly larger datasets to enhance the robustness and generalizability of our predictive model for cerebrovascular events. These expanded datasets will aim to include a diverse range of cases, particularly those involving lipid-rich or calcified plaques, which are critical factors in the development and progression of cerebrovascular disease. By incorporating these additional cases, we can further refine our model to capture the full spectrum of clinical scenarios, thereby improving its predictive accuracy and reliability. Moreover, larger datasets will allow for more sophisticated machine learning techniques, such as deep learning, to be explored and potentially implemented. These techniques can leverage the increased data volume to identify even more subtle and complex patterns, ultimately leading to more precise and personalized predictions. Additionally, future studies will investigate the integration of other emerging data sources, such as genetic and molecular profiles, to further enhance the predictive power and clinical applicability of our model.
5. Conclusions
Our main goal was to develop a predictive ML model for identifying the likelihood of future cerebrovascular incidents, with a focus on asymptomatic cases. To accomplish this, we utilized a dataset that included standard non-imaging information from electronic Case Report Forms (eCRFs), imaging data from ultrasound (US) and magnetic resonance imaging (MRI), hemodynamic and structural parameters obtained from 2D and 3D finite element method (FEM) simulations. Despite the challenge of working with an imbalanced dataset, a natural outcome due to the rarity of events among asymptomatic patients, our methodology produced encouraging outcomes. The peculiarities of asymptomatic cases, mainly the reduced frequency of varied plaque compositions like lipid-rich plaques or hemorrhages, presented a constraint in data diversity. However, our model showcased notable accuracy and specificity in forecasting cerebrovascular events, affirming its value for clinical application.
Our workflow, which includes 3D reconstruction, blood-flow simulation, 2D structural analysis, and risk assessment, was proven to be efficient. This streamlined approach requires limited computational efforts and time, completing in just 15–20 min on average, although this may vary with the quality of imaging and the artery length examined. This efficiency, combined with the model’s accuracy, underscores our method’s practicality and effectiveness.
Author Contributions
Conceptualization, P.K.S.; methodology, P.K.S., V.T., V.P. (Vasileios Pezoulas) and D.P.; validation, P.K.S., I.K., F.S. and D.P.; formal analysis, D.I.F.; resources, I.K., G.C., G.G. and F.S.; data curation, V.K.; writing—original draft preparation, P.K.S.; writing—review and editing, P.K.S., E.F., V.P. (Vassiliki Potsika) and D.I.F.; supervision, D.I.F.; project administration, D.I.F. and V.P. (Vassiliki Potsika). All authors have read and agreed to the published version of the manuscript.
Funding
This work has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement No. 755320, as part of the TAXINOMISIS project (a multidisciplinary approach for the stratification of patients with carotid artery disease).
Institutional Review Board Statement
This study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of the University Medical Centre Utrecht, The Netherlands: ethics approval acquired with Reference No. 18/855; Klinikum rechts der Isar der Technischen Universitat Munchen, Germany: ethics approval acquired with Reference No. 186/18 S; Faculty of Medicine, University of Belgrade, Serbia: ethics approval acquired with Reference No. 29/12-26; National and Kapodistrian University of Athens, Greece: ethics approval acquired with Reference No. 60/23-4-2018; IRCCS Azienda Ospedaliera Universitaria San Martino, Genova, Italy: ethics approval acquired with Reference No. 242/2018; Fundacio privada Clinic per a la recerca Biomedica, Hospital Clinic de Barcelona, Spain: ethics approval acquired with Reference No. HCB/2018/0645.
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
The data used in this study can be available upon request from the corresponding author.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Takaya, N.; Yuan, C.; Chu, B.; Saam, T.; Underhill, H.; Cai, J.; Tran, N.; Polissar, N.L.; Isaac, C.; Ferguson, M.S.; et al. Association between carotid plaque characteristics and subsequent ischemic cerebrovascular events: A prospective assessment with MRI–initial results. Stroke 2006, 37, 818–823. [Google Scholar] [CrossRef] [PubMed]
- Zhao, H.; Zhao, X.; Liu, X.; Cao, Y.; Hippe, D.S.; Sun, J.; Li, F.; Xu, J.; Yuan, C. Association of carotid atherosclerotic plaque features with acute ischemic stroke: A magnetic resonance imaging study. Eur. J. Radiol. 2013, 82, e465–e470. [Google Scholar] [CrossRef] [PubMed]
- Gupta, A.; Baradaran, H.; Schweitzer, A.D.; Kamel, H.; Pandya, A.; Delgado, D.; Dunning, A.; Mushlin, A.I.; Sanelli, P.C. Carotid plaque MRI and stroke risk: A systematic review and meta-analysis. Stroke 2013, 44, 3071–3077. [Google Scholar] [CrossRef] [PubMed]
- Esposito-Bauer, L.; Saam, T.; Ghodrati, I.; Pelisek, J.; Heider, P.; Bauer, M.; Wolf, P.; Bockelbrink, A.; Feurer, R.; Sepp, D.; et al. MRI plaque imaging detects carotid plaques with a high risk for future cerebrovascular events in asymptomatic patients. PLoS ONE 2013, 8, e67927. [Google Scholar] [CrossRef] [PubMed]
- Selwaness, M.; van den Bouwhuijsen, Q.; van Onkelen, R.S.; Hofman, A.; Franco, O.H.; van der Lugt, A.; Wentzel, J.J.; Vernooij, M. Atherosclerotic plaque in the left carotid artery is more vulnerable than in the right. Stroke 2014, 45, 3226–3230. [Google Scholar] [CrossRef] [PubMed]
- Sun, J.; Zhao, X.Q.; Balu, N.; Neradilek, M.B.; Isquith, D.A.; Yamada, K.; Cantón, G.; Crouse, J.R., 3rd; Anderson, T.; Huston, J.; et al. Carotid Plaque Lipid Content and Fibrous Cap Status Predict Systemic CV Outcomes: The MRI Substudy in AIM-HIGH. JACC Cardiovasc. Imaging 2017, 10, 241–249. [Google Scholar] [CrossRef]
- Sun, B.; Li, X.; Liu, X.; Ge, X.; Lu, Q.; Zhao, X.; Pu, J.; Xu, J.; Zhao, H. Association between carotid plaque characteristics and acute cerebral infarction determined by MRI in patients with type 2 diabetes mellitus. Cardiovasc. Diabetol. 2017, 16, 111. [Google Scholar] [CrossRef]
- Xia, J.; Yin, A.; Li, Z.; Liu, X.; Peng, X.; Xie, N. Quantitative Analysis of Lipid-Rich Necrotic Core in Carotid Atherosclerotic Plaques by In Vivo Magnetic Resonance Imaging and Clinical Outcomes. Med. Sci. Monit. 2017, 23, 2745–2750. [Google Scholar] [CrossRef]
- Cui, Y.; Qiao, H.; Ma, L.; Lu, M.; Yang, J.; Yao, G.; Cai, J.; Zhao, X. Association of Age and Size of Carotid Artery Intraplaque Hemorrhage and Minor Fibrous Cap Disruption: A High Resolution Magnetic Resonance Imaging Study. J. Atheroscler. Thromb. 2018, 25, 1222–1230. [Google Scholar] [CrossRef]
- Cheng, G.C. Distribution of circumferential stress in ruptured and stable atherosclerotic lesions. A structural analysis with histopathological correlation. Circulation 1993, 87, 1179–1187. [Google Scholar] [CrossRef]
- Liu, J.; Loree, H.M.; Kamm, R.D.; Fishbein, M.C.; Lee, R.T. Semi-automatic carotid intraplaque hemorrhage detection and quantification on Magnetization-Prepared Rapid Acquisition Gradient-Echo (MP-RAGE) with optimized threshold selection. J. Cardiovasc. Magn. Reson. 2016, 18, 41. [Google Scholar] [CrossRef] [PubMed]
- Groen, H.C.; Gijsen, F.J.; van der Lugt, A.; Ferguson, M.S.; Hatsukami, T.S.; van der Steen, A.F.; Yuan, C.; Wentzel, J.J. Plaque rupture in the carotid artery is localized at the high shear stress region: A case report. Stroke 2007, 38, 2379–2381. [Google Scholar] [CrossRef] [PubMed]
- Kock, S.A. Mechanical stresses in carotid plaques using MRI-based fluid-structure interaction models. J. Biomech. 2008, 41, 1651–1658. [Google Scholar] [CrossRef] [PubMed]
- Tang, D.; Nygaard, J.V.; Eldrup, N.; Fründ, E.T.; Klaerke, A.; Paaske, W.P.; Falk, E.; Yong Kim, W. Sites of rupture in human atherosclerotic carotid plaques are associated with high structural stresses: An in vivo MRI-based 3D fluid-structure interaction study. Stroke 2009, 40, 3258–3263. [Google Scholar] [CrossRef]
- Teng, Z.; Canton, G.; Yuan, C.; Ferguson, M.; Yang, C.; Huang, X.; Zheng, J.; Woodard, P.K.; Tang, D. 3D critical plaque wall stress is a better predictor of carotid plaque rupture sites than flow shear stress: An in vivo MRI-based 3D FSI study. J. Biomech. Eng. 2010, 132, 031007. [Google Scholar] [CrossRef]
- Ohayon, J.; Finet, G.; Gharib, A.M.; Herzka, D.A.; Tracqui, P.; Heroux, J.; Rioufol, G.; Kotys, M.S.; Elagha, A.; Pettigrew, R.I. Necrotic core thickness and positive arterial remodeling index: Emergent biomechanical factors for evaluating the risk of plaque rupture. Am. J. Physiol. Heart Circ. Physiol. 2008, 295, H717–H727. [Google Scholar] [CrossRef]
- Huang, Y.; Teng, Z.; Sadat, U.; Graves, M.J.; Bennett, M.R.; Gillard, J.H. The influence of computational strategy on prediction of mechanical stress in carotid atherosclerotic plaques: Comparison of 2D structure-only, 3D structure-only, one-way and fully coupled fluid-structure interaction analyses. J. Biomech. 2014, 47, 1465–1471. [Google Scholar] [CrossRef]
- Wang, J.; Paritala, P.K.; Mendieta, J.B.; Komori, Y.; Raffel, O.C.; Gu, Y.; Li, Z. Optical coherence tomography-based patient-specific coronary artery reconstruction and fluid-structure interaction simulation. Biomech. Model. Mechanobiol. 2020, 19, 7–20. [Google Scholar] [CrossRef]
- Tuenter, A.; Selwaness, M.; Arias Lorza, A.; Schuurbiers, J.C.H.; Speelman, L.; Cibis, M.; van der Lugt, A.; de Bruijne, M.; van der Steen, A.F.W.; Franco, O.H.; et al. High shear stress relates to intraplaque haemorrhage in asymptomatic carotid plaques. Atherosclerosis 2016, 251, 348–354. [Google Scholar] [CrossRef]
- Costopoulos, C.; Huang, Y.; Brown, A.J.; Calvert, P.A.; Hoole, S.P.; West, N.E.J.; Gillard, J.H.; Teng, Z.; Bennett, M.R. Plaque Rupture in Coronary Atherosclerosis Is Associated with Increased Plaque Structural Stress. JACC Cardiovasc. Imaging 2017, 10, 1472–1483. [Google Scholar] [CrossRef]
- Doradla, P.; Otsuka, K.; Nadkarni, A.; Villiger, M.; Karanasos, A.; Zandvoort, L.J.C.V.; Dijkstra, J.; Zijlstra, F.; Soest, G.V.; Daemen, J.; et al. Biomechanical Stress Profiling of Coronary Atherosclerosis: Identifying a Multifactorial Metric to Evaluate Plaque Rupture Risk. JACC Cardiovasc. Imaging 2020, 13, 804–816. [Google Scholar] [CrossRef]
- Curcio, N.; Rosato, A.; Mazzaccaro, D.; Nano, G.; Conti, M.; Matrone, G. 3D patient-specific modeling and structural finite element analysis of atherosclerotic carotid artery based on computed tomography angiography. Sci. Rep. 2023, 13, 19911. [Google Scholar]
- Jansen, I.; Crielaard, H.; Wissing, T.; Bouten, C.; Gijsen, F.; Akyildiz, A.C.; Farrell, E.; van der Heiden, K. A tissue-engineered model of the atherosclerotic plaque cap: Toward understanding the role of microcalcifications in plaque rupture. APL Bioeng. 2023, 7, 036120. [Google Scholar] [CrossRef] [PubMed]
- Zhang, X.; Hua, Z.; Chen, R.; Jiao, Z.; Shan, J.; Li, C.; Li, Z. Identifying vulnerable plaques: A 3D carotid plaque radiomics model based on HRMRI. Front. Neurol. 2023, 14, 1050899. [Google Scholar] [CrossRef] [PubMed]
- Wu, X.; Hu, J.; Zhang, J. Machine learning-based model for predicting major adverse cardiovascular and cerebrovascular events in patients aged 65 years and older undergoing noncardiac surgery. BMC Geriatr. 2023, 23, 819. [Google Scholar] [CrossRef]
- Weng, S.; Chen, J.; Ding, C.; Hu, D.; Liu, W.; Yang, Y.; Peng, D. Utilizing machine learning algorithms for the prediction of carotid artery plaques in a Chinese population. Front. Physiol. 2023, 14, 1295371. [Google Scholar] [CrossRef]
- Xia, H.; Yuan, L.; Zhao, W.; Zhang, C.; Zhao, L.; Hou, J.; Luan, Y.; Bi, Y.; Feng, Y. Predicting transient ischemic attack risk in patients with mild carotid stenosis using machine learning and CT radiomics. Front. Neurol. 2023, 14, 1105616. [Google Scholar] [CrossRef]
- Yun, K.; He, T.; Zhen, S.; Quan, M.; Yang, X.; Man, D.; Zhang, S.; Wang, W.; Han, X. Development and validation of explainable machine-learning models for carotid atherosclerosis early screening. J. Transl. Med. 2023, 21, 353. [Google Scholar] [CrossRef]
- Bin, C.; Li, Q.; Tang, J.; Dai, C.; Jiang, T.; Xie, X.; Qiu, M.; Chen, L.; Yang, S. Machine learning models for predicting the risk factor of carotid plaque in cardiovascular disease. Front. Cardiovasc. Med. 2023, 10, 1178782. [Google Scholar] [CrossRef] [PubMed]
- Tsakanikas, V.D.; Siogkas, P.K.; Mantzaris, M.D.; Potsika, V.T.; Kigka, V.I.; Exarchos, T.P.; Koncar, I.B.; Jovanovic, M.; Vujcic, A.; Ducic, S.; et al. A deep learning oriented method for automated 3D reconstruction of carotid arterial trees from MR imaging. In Proceedings of the 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada, 20–24 July 2020; pp. 2408–2411. [Google Scholar]
- Torii, R.; Wood, N.B.; Hadjiloizou, N.; Dowsey, A.W.; Wright, A.R.; Hughes, A.D.; Davies, J.; Francis, D.P.; Mayet, J.; Yang, G.Z.; et al. Fluid–structure interaction analysis of a patient-specific right coronary artery with physiological velocity and pressure waveforms. Commun. Numer. Methods Eng. 2009, 25, 565–580. [Google Scholar] [CrossRef]
- Teng, Z.; Zhang, Y.; Huang, Y.; Feng, J.; Yuan, J.; Lu, Q.; Sutcliffe, M.P.F.; Brown, A.J.; Jing, Z.; Gillard, J.H. Material properties of components in human carotid atherosclerotic plaques: A uniaxial extension study. Acta Biomater. 2014, 10, 5055–5063. [Google Scholar] [CrossRef] [PubMed]
- Gu, S.Z.; Ahmed, M.E.; Huang, Y.; Hakim, D.; Maynard, C.; Cefalo, N.V.; Coskun, A.U.; Costopoulos, C.; Maehara, A.; Stone, G.W.; et al. Comprehensive biomechanical and anatomical atherosclerotic plaque metrics predict major adverse cardiovascular events: A new tool for clinical decision making. Atherosclerosis 2024, 390, 117449. [Google Scholar] [CrossRef] [PubMed]
- Costopoulos, C.; Maehara, A.; Huang, Y.; Brown, A.J.; Gillard, J.H.; Teng, Z.; Stone, G.W.; Bennett, M.R. Heterogeneity of Plaque Structural Stress Is Increased in Plaques Leading to MACE: Insights From the PROSPECT Study. JACC Cardiovasc. Imaging 2020, 13, 1206–1218. [Google Scholar] [CrossRef]
- Ku, D.N.; Giddens, D.P.; Zarins, C.K.; Glagov, S. Pulsatile flow and atherosclerosis in the human carotid bifurcation. Positive correlation between plaque location and low oscillating shear stress. Arteriosclerosis 1985, 5, 293–302. [Google Scholar] [CrossRef]
- Davies, P.F. Hemodynamic shear stress and the endothelium in cardiovascular pathophysiology. Nat. Clin. Pract. Cardiovasc. Med. 2009, 6, 16–26. [Google Scholar] [CrossRef] [PubMed]
- Malek, A.M.; Alper, S.L.; Izumo, S. Hemodynamic shear stress and its role in atherosclerosis. JAMA 1999, 282, 2035–2042. [Google Scholar] [CrossRef]
- Fry, D.L. Certain histological and chemical responses of the vascular interface to acutely induced mechanical stress in the aorta of the dog. Circ. Res. 1969, 24, 93–108. [Google Scholar] [CrossRef]
- Chawla, N.V.; Bowyer, N.V.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar]
- Rawat, S.; Mishra, A. Review of Methods for Handling Class-Imbalanced in Classification Problems. arXiv 2022, arXiv:2211.05456. [Google Scholar]
- Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; Curran Associates Inc.: Long Beach, CA, USA, 2017; pp. 4768–4777. [Google Scholar]
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).