1. Introduction
Diabetes mellitus (DM) is a significant global health issue, affecting 422 million people worldwide, with most residing in low- and middle-income nations. DM is directly responsible for 1.6 million deaths annually. One of its major complications is diabetic retinopathy (DR), the most common cause of vision loss in people of working age [
1]. DR has two stages: Non-Proliferative (NPDR) and Proliferative (PDR), both linked to hyperglycemia-induced retinal microvascular injury. Several metabolic pathways, including the polyol pathway, the accumulation of advanced glycation end products (AGEs), the protein kinase C (PKC) pathway, and the hexosamine pathway, are involved in this injury [
2].
Cardiovascular risk factors such as hypertension play a crucial role in the development and progression of DR [
3]. Hypertension increases retinal blood flow, exacerbating endothelial dysfunction and accelerating capillary damage in the retinal microvasculature. Elevated systolic and diastolic blood pressures are strongly correlated with the incidence and progression of DR, particularly PDR and diabetic macular edema (DME), independent of glycemic control [
4,
5].
Dyslipidemia, characterized by elevated levels of low-density lipoprotein cholesterol (LDL-C) and triglycerides, contributes to DR through mechanisms such as oxidative stress and endothelial dysfunction [
6]. High levels of oxidized LDL (oxLDL) are cytotoxic to retinal endothelial cells, interacting with receptors like LOX-1 and triggering apoptosis. This process compromises the blood–retinal barrier, leading to increased vascular permeability, plasma leakage, and retinal edema [
6,
7]. Obesity, linked to DR through insulin resistance, systemic inflammation, and endothelial dysfunction, exacerbates the hyperglycemic environment and accelerates microvascular complications. Dysregulated adipokines and impaired nitric oxide (NO) production contribute to vascular permeability and neovascularization, worsening both NPDR and PDR [
8,
9]. Smoking is also an independent risk factor for DR, primarily due to its contribution to vascular inflammation, oxidative stress, and impaired endothelial function [
10]. Nicotine reduces the bioavailability of NO, increasing vascular resistance and blood pressure in the retinal microvasculature, leading to capillary damage and retinal ischemia [
11]. Smoking also increases the production of reactive oxygen species (ROS), amplifying oxidative stress and triggering inflammatory pathways [
10]. It accelerates retinal microvascular damage and reduces the response to treatments like laser photocoagulation and anti-VEGF therapy [
12].
The independent role of these cardiovascular risk factors in DR is further supported by large-scale epidemiological studies, which have consistently shown that tight control of blood pressure and lipid levels, alongside glycemic control, significantly reduces the risk of DR progression. For instance, the UK Prospective Diabetes Study (UKPDS) [
13] and the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial [
14] have provided evidence that multifactorial risk reduction strategies are essential in preventing vision loss due to DR.
Predictive models are crucial in healthcare, especially for managing chronic conditions like DR, where early detection can reduce the risk of severe complications. By integrating clinical variables such as BMI, blood pressure, and lipid levels, these models can identify individuals at higher risk of developing DR before symptoms appear [
15,
16]. This predictive capability allows healthcare providers to take proactive measures, such as adjusting treatment plans or recommending lifestyle changes. These models are also vital for efficient allocation of healthcare resources. They help prioritize high-risk patients for regular screening and early interventions, preventing progression to more severe stages that could lead to vision loss. Various ocular diseases, whether genetic [
17], metabolic [
2], or vascular [
18], contribute to significant visual dysfunction. Predictive models offer insights that go beyond traditional clinical assessments by leveraging advanced algorithms and patient data. As the prevalence of diabetes rises globally [
19], predictive models play an increasingly important role in identifying at-risk populations in regard to DR. This strategy is key in preventing blindness and reducing the burden of diabetic complications on healthcare systems [
15].
The aim of this study is to evaluate the role of cardiovascular risk factors as independent predictors of DR in patients with T2DM. Furthermore, we seek to develop a predictive model that can effectively classify the presence and severity of DR based on these risk factors, providing insights into early detection and intervention strategies to prevent disease progression and vision loss.
3. Results
The study population comprised 377 type II diabetic patients (
Table 2) with a mean age of 55.15 years and a median age of 55 years. The most frequent age mode was 66 years, with ages spanning from 18 to 86 years. In terms of gender, 47.2% of the patients were male (N = 178) and 52.8% were female (N = 199), indicating a slight predominance of females in the sample. The average age for males was 53.99 years, while, for females, it was 56.16 years. Both genders had the same most frequent age of 66 years, and the age range for both males and females was 18 to 86 years. The analysis revealed that there is no statistically significant difference in the ages between male and female patients, as indicated by an independent t-test (t = −0.24,
p = 0.81).
In this study, the 377 patients were classified into five categories depending on their retinal fundus changes. The majority of patients, 220 individuals (58.4%), showed No DR. A significant portion, 60 patients (15.9%), were classified as having Mild DR, characterized by early signs of damage to the retina. Moderate DR was observed in 40 patients (10.6%), indicating more advanced damage but without severe visual impairment. Severe DR was present in 30 patients (8%), marking a higher level of retinal damage and an increased risk of vision loss. Lastly, Proliferative DR, the most advanced form of the condition, involving neovascularization, was identified in 27 patients (7.1%).
Table 3 includes measurements such as age, BMI, HbA1c, blood pressure, cholesterol levels, and the duration of diabetes. Notably, patients with DR tend to have higher values for BMI, HbA1c, fasting glucose, and duration of diabetes, which are all known risk factors for the development and progression of DR.
An analysis was conducted to examine the relationship between age and the presence of DR. The dataset was stratified into three age groups: <40 years, 40–59 years, and 60+ years. A Chi-Square test was performed to assess whether age significantly correlated with the presence of DR. The resulting p-value was 0.062, indicating that the correlation between age group and DR presence is not statistically significant at the conventional threshold of 0.05. However, the p-value suggests a trend toward higher DR prevalence in older age groups, though this relationship is not strong enough to reach statistical significance in this sample. This is supported by the Logistic Regression, which showed a non-significant result for this variable as well. A Chi-Square test for independence was also conducted to examine the relationship between gender and DR stages. The p-value > 0.05 indicated that the differences observed are not statistically significant. Smoking status does not appear to be a significant factor in the progression of DR, with an analysis that resulted in a p-value > 0.05.
In the analysis comparing lipid profiles between diabetic patients with and without DR, no statistically significant differences were observed. For LDL levels, the t-statistic was 0.12 with a
p-value of 0.90, indicating no meaningful difference between the two groups. Similarly, while HDL levels tended to be lower in the DR group, the t-statistic of −1.49 and the
p-value of 0.14 suggest that this difference was not statistically significant. Finally, for triglycerides, the t-statistic of 0.99 and
p-value of 0.32 confirmed no significant difference in triglyceride levels between the groups. These findings suggest that variations in lipid levels may not be critical factors in differentiating patients with or without DR in this dataset (
Figure 1).
Similarly, both systolic and diastolic blood pressure displayed moderate correlations with DR severity (
Figure 2), with coefficients of 0.25 and 0.29, respectively. In the multivariate regression, these factors showed positive associations with more advanced stages of DR. Although the results were not statistically significant across all stages, the data suggest that elevated blood pressure is likely to increase the likelihood of more severe DR stages, such as Severe and Proliferative DR. These findings indicate that blood pressure is a moderate independent risk factor for DR progression, particularly in its association with advanced stages of the disease.
The analysis revealed that BMI has a moderate positive correlation with DR severity, with a correlation coefficient of 0.30. In the regression model, BMI demonstrated a positive coefficient in predicting higher DR stages. Although these coefficients were not statistically significant at all stages, the positive direction of the relationship suggests that a higher BMI may contribute to the progression of DR. This consistent association across stages supports the notion that BMI acts as a moderate independent risk factor for the development and severity of DR.
A Random Forest model was applied to predict the binary classification of No DR versus the presence of any DR stage. The model achieved an overall accuracy of 59.8%, correctly classifying over half of the patients. It performed particularly well in identifying patients with DR, showing a precision of 63% and a recall of 62% for this group, demonstrating its moderate ability to differentiate between patients who have DR and those who do not. The AUC (Area Under the Curve) for the Random Forest model in predicting DR versus No DR was 0.62. We used a 5-fold cross-validation to evaluate this model’s performance. The dataset was split into five parts, with the model trained on four parts and tested on the fifth. The Random Forest model achieved the following average results: an accuracy of 60.1% ± 3.2%, a precision of 61.3% ± 2.7%, a recall of 59.4% ± 3.5%, and an ROC-AUC of 0.62 ± 0.03
Among the most important features were BMI, which emerged as the strongest predictor in the model. Both DBP and SBP followed closely, indicating that blood pressure plays a critical role in the development of DR. HDL cholesterol also showed significant predictive value in the model despite not being statistically significant in basic tests. This suggests that HDL may contribute to the risk of DR when considered in combination with other factors. Additionally, total cholesterol, triglycerides, and LDL cholesterol played a meaningful role in the model’s predictions, though their importance was comparatively lower than that of BMI and blood pressure.
In addition to Random Forest, we applied XGBoost to evaluate its performance in predicting DR using cardiovascular risk factors. The model was evaluated using 5-fold cross-validation, with performance metrics being calculated across each fold. The average performance metrics for XGBoost showed an accuracy of 64.3% ± 2.7%, a precision of 63.5% ± 3.1%, a recall of 61.9% ± 2.9%, and an ROC-AUC of 0.68 ± 0.04. Compared to the Random Forest model, which achieved an ROC-AUC of 0.62, XGBoost demonstrated improved performance with an ROC-AUC of 0.68, indicating better discriminatory ability between patients. In addition, we also applied SVMs and Logistic Regression to evaluate their performance. For the SVM model, we observed an accuracy of 61.8% ± 3.0%, a precision of 60.7% ± 3.2%, a recall of 58.9% ± 3.4%, and an ROC-AUC of 0.65 ± 0.03. The SVM model showed moderate discriminatory ability but did not outperform XGBoost. However, it still performed better than Random Forest in terms of ROC-AUC. The Logistic Regression model achieved an accuracy of 59.7% ± 2.9%, precision of 58.5% ± 3.1%, recall of 57.1% ± 3.5%, and ROC-AUC of 0.60 ± 0.04. While Logistic Regression is a simpler model, its lower ROC-AUC compared to the other methods indicates that it may not capture the non-linear relationships present in the data as effectively. These findings suggest that XGBoost’s mechanism and SVM’s handling of complex patterns offer more accurate predictions in our case, making them better suited for predicting DR.
After tuning, the XGBoost model showed a notable improvement, with an ROC-AUC of 0.72 ± 0.03 compared to its pre-tuning score of 0.68. Similarly, the Random Forest model improved to an ROC-AUC of 0.65 ± 0.04. The SVM model also showed slight gains, with an ROC-AUC of 0.67 ± 0.03 after tuning the regularization parameter and kernel type.
The confusion matrix (
Figure 3) indicates that the model correctly classified 167 “No DR” cases and 122 “DR” cases while it incorrectly predicted 53 cases as “DR” when they were “No DR” (false positives) and 35 cases as “No DR” when they were actually “DR” (false negatives).
To address the need of enhancing the model’s interpretability, especially for clinical application, we conducted a feature importance analysis (
Figure 4) which revealed that SBP, DBP, HDL cholesterol, and BMI were the most influential predictors in the model.
Moreover, to further improve the transparency of the model, we applied SHAP (Shapley Additive exPlanations) values. The global feature importance analysis revealed that SBP had the highest average SHAP value of 0.35, indicating it was the strongest positive predictor of DR. HDL cholesterol, on the other hand, had an average SHAP value of −0.27, showing its protective effect, as higher HDL levels reduced the predicted risk of DR. BMI also played an important role, with an average SHAP value of 0.19, further reinforcing its contribution to DR risk.
Focusing on individual features, we observed that patients with SBP levels above 140 mmHg had SHAP values ranging from 0.30 to 0.50, indicating a significant increase in DR risk. For example, a patient with SBP of 150 mmHg had a SHAP value of 0.45, highlighting its strong influence on DR predictions. In contrast, patients with HDL levels below 40 mg/dL had consistently negative SHAP values of between −0.20 and −0.35, with a patient with a HDL level of 35 mg/dL showing a SHAP value of −0.32. Furthermore, the SHAP analysis showed a significant interaction between SBP and BMI. For patients with both high BP and obesity (BMI > 30), the combined SHAP values reached up to 0.55, indicating a compounded risk of DR.
These enhancements not only improved the model’s clinical utility but also ensured that clinicians can more easily understand the contribution of each risk factor, making it possible to develop more personalized prevention and intervention strategies for patients at risk of DR.
The calibration plots showed that XGBoost was well calibrated, with predicted probabilities closely matching observed outcomes. Random Forest exhibited slight overconfidence at higher probabilities. The Hosmer–Lemeshow test results indicated no significant lack of fit for either model, with XGBoost yielding a p-value of 0.28 and Random Forest a p-value of 0.32.
A Random Forest model was also developed to predict the stages of DR based on the same cardiovascular risk factors (
Figure 5). The model classified patients into four categories: No DR, Mild, Moderate, and Severe. The model demonstrated promising results, achieving an accuracy of 43.1% in predicting the DR stages. Most notably, the model showed strong performance in correctly identifying patients without DR, with a precision of 51% and a recall of 77% for the No DR class, indicating that it effectively differentiated between those without DR and those with more advanced stages. The weighted average precision and recall were also favorable, suggesting that the model was able to handle a range of DR stages.
A XGBoost model was also developed to predict the stages of DR based on the same cardiovascular risk factors. This model demonstrated notable improvements, achieving an overall accuracy of 54.8% in predicting the stages of DR. The model performed particularly well in distinguishing patients without DR, with a precision of 63% and a recall of 85% for the No DR class, significantly improving the ability to correctly identify those without DR compared to the Random Forest model. This indicates that the XGBoost model was more adept at differentiating between patients with no signs of DR and those with more advanced stages. The weighted average precision and recall for all classes were 58% and 65%, respectively, suggesting that the model managed to handle various stages of DR with a better balance across the categories. Notably, the model also showed improved performance in predicting Mild and Moderate stages of DR, with a precision of 56% for the Mild category and 60% for Moderate cases, reflecting its potential for more accurate staging.
The XGBoost model showed a strong performance in classifying patients across the different stages of DR (
Figure 5), though some misclassifications occurred, primarily between neighboring stages. For the No DR category, the model correctly classified 176 patients, while 44 patients were misclassified as having Mild DR. In the Mild DR category, the model correctly classified 29 patients, but 13 were misclassified as No DR and 17 were classified as having Moderate DR. For Moderate DR, the model correctly identified 30 patients, while seven were misclassified as Mild DR and two were misclassified as having Severe DR. In the Severe DR category, 23 patients were correctly classified, with six being misclassified as having Moderate DR. Lastly, for Proliferative DR, the model correctly classified 21 patients, but five were misclassified as having Severe DR. Misclassifications mostly occurred between adjacent DR stages, indicating that the model was effective in distinguishing between distinct stages, although there was some overlap between similar severity levels.
A Random Forest model was trained to predict the presence of PDR versus all other stages of DR. The model achieved an overall accuracy of 90.4%, correctly classifying a high proportion of cases. However, it primarily predicted non-PDR cases and struggled to identify actual cases of PDR. This outcome is likely due to the low prevalence of PDR in the dataset. These results suggest that, while the model effectively identifies non-PDR, its ability to detect PDR is limited by the low prevalence of this condition in the dataset.
4. Discussion
Risk factors for diabetic retinopathy encompass a wide range of physiological and lifestyle factors, including chronic hyperglycemia, hypertension, and dyslipidemia. Other significant risk factors include a longer duration of diabetes, poor glycemic control (high HbA1c levels), and obesity. Additionally, factors such as smoking, genetic predisposition, and coexisting cardiovascular conditions increase the likelihood of DR development [
21]. Moreover, age and pregnancy can exacerbate DR risk, particularly in patients with existing diabetes [
21,
22].
Numerous research has looked into the risk factors for DR in various patient samples or groups. Previous research has demonstrated that the multiple factors influencing the onset of diabetes and DR, including the length of diabetes, blood glucose level, HbA1c, and high blood pressure, are what contribute to the multifaceted nature of DR [
19,
23]. Tsao et al. found that the administration of insulin and the length of diabetes were characteristics that could be used to identify individuals at high risk of DR by comparing many machine learning approaches [
23]. Glucose levels is clearly not the sole factor that determines the incidence and course of DR, even though it is a crucial one [
24]. An earlier investigation revealed a connection between glucose variability and the onset and course of diabetes; however, this association became less significant when hemoglobin A1c was taken into account [
25].
Lately, a significant amount of research work has gone into creating machine learning algorithm-based predictive models for DR [
21]. However, these models are not well suited for usage in settings with limited medical resources. Li et al. conducted a baseline analysis of sixty factors in their study, and the results indicated that insulin treatment also scored highly and that HbA1c was the most significant risk factor for DR; the risk increased with a HbA1c score of over 8% and insulin treatment being required. Age was also shown to be a protective factor for the onset and development of DR [
15]. Research on the correlation between PDR and age has yielded conflicting results. The results are not always consistent, but some data indicate a possible increased risk of severe forms of the disease among younger people [
26]. For instance, a review found that the risk of developing severe fibrovascular proliferation was considerably higher in those under 45. In a comparable direction, Klein et al. discovered that patients under 30 were more likely to develop PDR [
27]. On the other hand, a prospective study found that the frequency of DR increased with age, while there was no statistically significant difference between the age groups [
28]. Additionally, a national population-based cohort study revealed that people with DM who had had the disease for less than five years had a lower prevalence of DR and that regular screening had no effect on the detection rates of young patients (less than 45 years of age).
The rising incidence of obesity and insulin resistance has led to a large increase in the prevalence of diabetes mellitus and the diseases it is associated with. Despite being a significant risk factor for DR in people with type 2 diabetes, blood pressure and blood lipid levels may not always correlate. Concurrently, a significant body of research demonstrated that, in both non-diabetic and diabetic patients, central obesity, insulin resistance, and dyslipidemia are linked to retinal lesions [
9]. Li et al. recruited 2305 patients and examined the relationship between obesity and any type of DR, diabetic macular edema (DME), and vision-threatening DR (VTDR). Their investigation yielded the following conclusions: only 29.6% of diabetic patients had normal weight, whereas 67.8% of diabetic patients were overweight or obese; second, the waist to height ratio was shown to be a substantial risk factor for VTDR, whereas obesity (BMI > 25.0 kg/m
2) was shown to be a significant protective factor [
29]. A prior meta-analysis assessing the relationship between central obesity and DR in the Chinese population found that, in contrast to the waist–hip ratio, abdominal obesity, as measured by waist circumference, is linked to an increased risk of DR. They did not, however, compare the outcomes across other ethnic groups. Furthermore, their investigation did not investigate if there is a similar correlation between abdominal obesity and DR in regard to varying severity [
29,
30].
Although there has been a considerable amount of clinical research performed on the relationship between DR and abdominal obesity, the findings have been inconsistent. Prior research found contradictory findings about the relationship between obesity and DR. The most common risk factor for DR was found to be obesity (BMI > 30 kg/m
2) in cross-sectional research involving 501 persons with T1DM [
31]. In Asian T2DM patients, a greater BMI seems to have an inhibitory impact on DR risk [
32]. Prior cross-sectional investigations have likewise revealed an insignificant correlation between DR and BMI [
33]. Several studies have shown that a higher BMI is associated with a higher risk of developing DR, particularly more severe forms. Some studies suggest that, for each unit increase in BMI, the risk of DR progression increases, although the exact mechanisms may vary among individuals.
Blood pressure plays a significant role in the progression of DR, with hypertension being a key modifiable risk factor [
34]. Elevated blood pressure can exacerbate retinal blood vessel damage by increasing pressure on the fragile capillaries, leading to microaneurysms, hemorrhages, and worsening ischemia, all of which contribute to the progression of DR. The association between elevated blood pressure and diabetes mellitus was validated by research on epidemiology. Extensive, prospective studies such as UKPDS (The UK Prospective Diabetes Study) and ABCD (Appropriate Blood Pressure Control in Diabetes), including T2DM patients, indicate that strict blood pressure management can halt the onset and advancement of DR [
35,
36]. The study findings of Bulum et al. [
4] imply that SBP and DBP serve as distinct risk factors for DR in T2DM patients. Even after accounting for the length of diabetes, gender, and HbA1c, there is a substantial correlation between the existence of DR and SBP in patients with type 2 diabetes regardless of whether their blood pressure is normal or elevated [
37]. Regardless of the existence of cardiovascular and renal disease, SBP is also linked to both early and advanced DR [
38]. In patients with T2DM, blood pressure variability, in addition to blood pressure degree, is a risk factor for DR [
39]. While most research revealed a connection between DR and SBP, several studies found a relationship between DR and solely DBP [
40].
Lipid levels and dyslipidemia have shown mixed results in their correlation with diabetes and the progression of DR. While some studies suggest that elevated LDL cholesterol and triglycerides, as well as low HDL cholesterol, may contribute to retinal damage, others have found the association to be less clear, indicating that the role of dyslipidemia in DR progression remains uncertain and may depend on additional risk factors [
15,
16,
41,
42]. In a comparable direction, there is an ongoing debate on the significance of lipoprotein(a) in the etiology of DR. The link between apolipoproteins and DR was the subject of a recent investigation; however, clinical practice has not yet made extensive use of these novel laboratory markers [
43].
In our study, cardiovascular risk factors were evaluated for their association with DR. The most significant predictors of DR were found to be BMI, SBP and DBP. No significant relation was obtained between age, gender, and smoking status.
4.1. Comparison of Different Models
One of the key strengths of the XGBoost model is its ability to handle complex, non-linear relationships in the data. DR is influenced by a variety of interrelated cardiovascular and metabolic factors, and XGBoost’s boosting approach allows for these interactions to be effectively captured and modeled. This flexibility makes it superior to linear models, such as Logistic Regression, which may oversimplify these relationships. XGBoost also offers excellent performance in handling imbalanced datasets. DR data, especially when subclassified, tends to be imbalanced, with a larger proportion of patients falling into the No DR or early-stage categories. XGBoost effectively mitigates this challenge through class weighting and boosting iterations, resulting in more accurate predictions for under-represented classes, such as Moderate and Severe DR, compared to models like Random Forest or SVMs, which may struggle with such imbalances. Another major advantage is XGBoost’s scalability and efficiency. Given that the dataset for this study contained hundreds of patients, the model was able to efficiently handle large volumes of data without significant computational strain thanks to its ability to parallelize the boosting process. This advantage is particularly important when dealing with clinical datasets that continue to grow in size. However, Random Forest offers higher interpretability and requires less fine-tuning, making it a more user-friendly model in cases where interpretability and ease of use are prioritized over predictive performance. Furthermore, tuning options for hyperparameters in XGBoost, such as learning rate, max depth, and regularization parameters, allowed for optimal model performance. The hyperparameter tuning process, including the adjustment of learning rate and class weights, enabled the model to generalize well across stages of DR without overfitting. In comparison, traditional models offer less flexibility in parameter optimization, which can limit their effectiveness.
Despite its strengths, the XGBoost model has some limitations. Its low interpretability can make it difficult for clinicians to understand the direct impact of individual risk factors on predictions, which is crucial in clinical settings. Tools like SHAP can enhance interpretability but add complexity compared to more transparent models like Logistic Regression. Additionally, XGBoost requires extensive tuning to achieve optimal performance, making it more challenging to implement than simpler models like Random Forest. Finally, while XGBoost excels in performance, it is computationally expensive and may require more resources and time, especially with large datasets, limiting its practicality in some clinical environments.
4.2. Clinical Application
One of the primary advantages of the XGBoost model is its ability to accurately predict the early onset of DR, particularly in patients with no visible signs of the disease. This predictive capability enables clinicians to intervene earlier, potentially delaying or preventing the progression of DR. For instance, patients identified as being at high risk of DR can be referred for more frequent retinal screening and aggressive management of modifiable risk factors, such as hypertension and dyslipidemia. Early intervention can help reduce the risk of progression to advanced stages of DR, where treatment options become more limited and complications more severe.
Furthermore, the model’s ability to distinguish between different stages of DR—No DR, Mild, Moderate, Severe, and Proliferative DR—provides valuable insights for stratifying patients according to their disease severity. This stratification allows clinicians to tailor treatment plans to individual patient needs. For example, patients predicted to have moderate or severe DR may benefit from more intensive therapeutic approaches, such as the early initiation of anti-VEGF therapy or laser treatment, to prevent further retinal damage and preserve vision.
Additionally, the XGBoost model incorporates cardiovascular risk factors that are routinely measured in clinical practice, making it highly adaptable for use in real-world healthcare settings. The simplicity of using widely available clinical data ensures that the model can be easily integrated into existing patient management workflows without the need for specialized or complex diagnostic procedures. This enhances its applicability in primary care settings, where early intervention is critical for preventing disease progression in diabetic patients. Another key aspect of the model’s utility is its ability to aid in decision-making regarding patient follow-up. By predicting the likelihood of progression to more advanced stages of DR, the model can inform decisions on how often patients should be monitored.
Finally, the model’s potential extends beyond prediction to guiding treatment strategies. With accurate identification of patients at various stages of DR, clinicians can recommend interventions based on individual risk profiles. For instance, controlling blood pressure and optimizing lipid profiles in patients identified as being at risk of DR progression can prevent further retinal damage and improve long-term outcomes.