The Identification of Diabetes Mellitus Subtypes Applying Cluster Analysis Techniques: A Systematic Review
Abstract
:1. Introduction
2. Methods
2.1. Search Strategy and Selection Criteria
2.2. Methods of the Review
2.3. Data Extraction
3. Results
3.1. Sample Characteristics
3.2. Cluster Analysis
3.2.1. Data Standardization
3.2.2. Variables Selected for Cluster Analysis
3.2.3. Methods of Clustering and Dimensionality Reduction
3.2.4. Cluster Validation on an Independent Sample
3.2.5. Main Results
4. Discussion
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
Appendix A
Study (Author, Year) | Country | Study Design | Source of the Data | Population Size and Characteristics | Diagnostic Criteria of Diabetes | Variables for Cluster Analysis | Number of Clusters and Characteristics | |
---|---|---|---|---|---|---|---|---|
1. | Ahlqvist et al. (2018) [7] | Denmark | Observational retrospective study | Steno Diabetes Center Copenhagen database | N = 2290. The majority were Caucasians. There were more male smokers and ex-smokers. Males had a higher level of HbA1c, BP, weight and TG but lower BMI and cholesterol levels at baseline. | Health records with patients who had clinically diagnosed type 2 diabetes. | 1. HbA1c 2. Age at diagnosis 3. Diabetes duration 4. BMI 5. HOMA2-IR 6. HOMA2-β 7. GAD65 autoantibody titre. | Cluster 1 (SAID, n = 577): characterized by early-onset disease, relatively low BMI, poor metabolic control, insulin deficiency, and presence of GADA; Cluster 2 (SIDD, n = 1575): GADA negative but otherwise similar to cluster 1: low age at onset, relatively low BMI, low insulin secretion (low HOMA2-B index), and poor metabolic control. Cluster 3 (SIRD, n = 1373): characterized by insulin resistance (high HOMA2-IR index) and high BMI. Cluster 4 (MOD, n = 1942): characterized by obesity but not by insulin resistance. Cluster 5 (MARD, n = 3513): similar to cluster 4, only modest metabolic derangements. |
2. | Tanabe et al. (2020) [8] | Japan | Observational retrospective study | Fukushima chronic kidney disease(CKD)cohort (January 2003–March 2017) and Fukushima Diabetes, Endocrinology and Metabolism(DEM)cohort (January 2003–November 2019) | 1255 of 1520 (917 patients from CKD cohort and 603 from DEM cohort) T2DM patients included in cluster analysis | ICD-10 codes E10–14 or FPG ≥ 126 mg/dL, RPG ≥ 200 mg/dL, in a patient with classic symptoms of hyperglycemia or hyperglycemic crisis A1c ≥ 6.5% | 1.GADA levels; 2.Age at diagnosis; 3.BMI; 4.HbA1c; 5. HOMA2-B; 6.HOMA2-IR | cluster 1 (SAID, 68 (5.4%)): was positive for islet-related autoantibodies and was young at onset, had an increased risk of diabetic retinopathy, after adjusting for modifiable risk factors; cluster 2 (SIDD, 238 (19.0%)): had a severe insulin deficiency and the highest A1c; cluster 3 (SIRD, 90 (7.2%)): was the highest in BMI, HOMA 2-IR, and HOMA2-B and had an increased risk of DKD; cluster 4 (MOD, 363 (28.9%)):had a higher BMI and was slightly younger than the MARD subgroup; cluster 5 (MARD, 496 (39.5%). |
3. | Zaharia et al. (2019) [9] | Germany | Observational retrospective study | T1DM and T2DM diabetes patients from prospective German Diabetes Study (01/2009 and 1/2015) | 1105 patients with known disease duration of less than 12 months, aged 18–69 years | American Diabetes Association criteria | 1. Age; 2. BMI; 3.Glycaemia, 4. HOMA-IR; 5. HOMA-B; 6. GADA levels. | cluster 1 SAID (N = 247): GADA positive, were more likely to be of a younger age, had relatively low BMI, poor glycemic control and overt insulin deficiency. 158 (67.0%) received insulin on diagnosis cluster 2 SIDD (N = 28): showed similarities with patients with SAID, but GADA negative; had the highest prevalence of confirmed diabetic sensorimotor polyneuropathy and cardiac autonomic neuropathy; 12 (44.0%) were treated with insulin on diagnosis; cluster 3 SIRD (N = 121): had high BMI and whole-body adipose-tissue insulin resistance, had the highest sensitivity for C-reactive protein, high hepatocellular lipid content and fatty liver index, low eGFR levels; cluster 4 MOD (N = 323): had obesity and substantial adipose tissue insulin resistance, high sensitivity for C-reactive protein, but they had moderate whole-body insulin resistance; cluster 5 MARD (N = 386): older than those in other clusters and showed only minor metabolic abnormalities. |
4. | Amato et al. (2016) [10] | Italy | Cross-sectional study | Outpatient clinic at Unit of Endocrinology, Diabetology and Metabolism, University of Palermo | N = 96. Caucasian patients with type 2 diabetes within 6 months of onset, age range 51–75 years. | Health records about known type 2 diabetes for <6 months and in stable treatment for the last 3 months with metformin | (1) glucagon-like peptide-1 (GLP-1) (2) glucose- dependent insulinotropic polypeptide (GIP) (3) ghrelin | Cluster 1 (n = 63): significantly lower levels of GLP-1, GIP and ghrelin compared to cluster 2 (n = 33), and higher levels of HbA1c and fasting plasma glucose. Regarding the clinical and anamnestic characteristics of the patients, there were not any significant differences between the two clusters, except for a greater prevalence of patients practicing physical activity in Cluster 2. |
5. | Arif et al. (2014) [11] | UK | Cross-sectional study | Several university and regional hospitals in UK took part in the research | N = 33. Children with newly diagnosed type 1 diabetes (5–16 years), unaffected siblings of patients with type 1 diabetes (6–16 years). | Test of blood autoimmune response phenotypes by combinatorial, multiparameter analysis of autoantibodies and autoreactive T-cell responses | For Autoimmune Inflammatory Phenotypes in Children With Newly Diagnosed Type 1 Diabetes group: 1. interferon-g 2. interleukin 10 (Il-10) 3. antigen-specific autoantibodies (Aabs) 4. proinsulin 5. insulin 6. Islet antigen antibodies (IA-2Ab) 7. GAD65 antibody 8. zinc transporter 8 antibody | Cluster 1 (n = 15): a combination of islet AAbs and IFN-g responses to all antigens. Have a significantly higher frequency of IL-10 response to GAD, insulin, proinsulin. There are also differences in the frequency of islet AAbs between clusters. AAbs against IA-2 and ZnT8 are significantly less frequent in the IL10–dominated cluster-1. Two children had no islet AAbs present at diagnosis, five had only a single AAb, and eight had two or more AAbs. Cluster 2 (n = 18): The frequency of multiple AAbs was significantly higher, all 18 children had two or more IL-10 responses to all antigens. |
6. | Pes et al. (2016) [12] | Italy | Cross-sectional study | Diabetic Unit, Department of Internal Medicine, University of Sassari, November 2005–December 2010 | N = 238. Patients with a Latent autoimmune diabetes in adults. Patients were of Sardinian origin for at least 2 generations, with 35 and older age. | International Diabetes Federation worldwide consensus | 1. Gender 2. Body mass index 3. Total cholesterol 4. Triglycerides 5. Sistolic blood pressure 6. Diastolic blood pressure 7. anti-glutamic acid decarboxylase (GAD) autoantibody 8.Anti-Islet Antigen-2 9. Anti-thyroid peroxidase 10. Cumulative genetic score 11. Insulin-free period | PC 1 (explained 18.0% of total variance): the dominant variables were: BMI, triglycerides, systolic and diastolic blood pressure and duration of insulin-free time period, showed a mild beta-cells failure. PC 2 (explained 15.0% of total variance): genetic variables such as Class II HLA, CTLA-4 as well as anti-GAD65, anti-IA-2 and anti-TPO antibody titers, and the insulin-free time period predominated, showed a faster beta-cells failure. PC 3 (explained 12.0% of total variance): gender and triglycerides predominated, showed a slower beta-cells failure. PC 4 (explained 12.0% of total variance): cholesterol predominated, showed a slower beta-cells failure. |
7. | Hammer et al. (2003) [13] | Australia | Cross-sectional study | Survey | 396 T2DM patients from 8555 surveyed. Two groups of the population of western Sydney. With diabetes (mean age 59.5 years), and without (44.6 years). | Self-reported | Self-reported symptoms: Upper GI/Dysmotility 1. Bloating 2. Food staying in stomach 3. Pain 4. Heartburn 5. Early satiety 6. Dysphagia Diarrhea 7. Urgency 8. Loose/watery stools 9. Less than 3 bowel movement/day 10. Fecal incontinence Constipation 11. Hard/lumpy stools 12. Blockage in the anus 13. Less than 3 bowel movement/week 14. Constipation/diarrhea Nausea/Vomiting 15. Vomiting 16. Nausea | The cluster analysis of the four latent symptom factors produced a five-cluster solution: Health group (5205) and four diseased clusters (396). The disease clusters were each defined by higher-than-average scores on a single symptom and were labeled according to that symptom. 1. Health. 2. Upper GI/Dysmotility (44.8% of the total variance): Poor glycemic control increased threefold compared to Health cluster. 3. Diarrhea (10.4% of the total variance)a: Poor glycemic control increased sevenfold compared to the Health cluster. 4. Constipation (7.8% of the total variance): Poor glycemic control increased fivefold compared to Health cluster. 5. Nausea/Vomiting (6·3% of the total variance): Poor glycemic control increased sixfold compared to Health cluster. |
8. | Kahkoska et al. (2020) [14] | USA, Denmark, Germany | Observational retrospective study | Cardiovascular Outcome Trials’ data | N = 20,274. Participants recently enrolled from three randomized, double blind, controlled, parallel-group multinational CVOTs in adults with long-standing T1 and T2 diabetes. The mean age was 64 years or older and the mean duration of diabetes was 12 years or longer. | DEVOTE: if patients got treatment against diabetes LEADER, SUSTAIN: glycated hemoglobin level of 7.0% or more. | 1.HbA1c 2.BMI at baseline, 3.Age at T2DM diagnosis | Cluster A (n = 3767): SIDD. Worse degree of glycemic control. Cluster B (n = 4810):SIRD. Greater baseline BMI. Cluster C (n = 4131): MOD. Greater baseline BMI and the lowest age of T2DM diagnosis. Cluster D (n = 7431):MARD. The highest age of T2DM diagnosis. |
9. | Karpati et al. (2018) [15] | Israel | Retrospective cohort study | Clalit Health Services healthcare data warehouse | N = 85,783. participants had 3–7 years duration of type 2 diabetes. 60,423 from total number had valid HbA1c measures. The mean age of the study cohort was 63.6 years, 52.6% of the patients were female. | HbA1c tests, glucose tests, diagnoses, and diabetes medications were analyzed. | 1. change in HbA1c values from t1 to t4 2. mean of the absolute first differences in HbA1c values 3. the ratio of the maximum absolute second difference to mean absolute first difference of HbA1c values | 1. Stable cluster (n = 45,679) had 20.2% no treatment compared to the 8.0% in both the descending and ascending clusters; 2. Descending cluster (n = 6084) had the highest proportion of patients treated with insulin (and a possible additional non-insulin medication) (16.7% vs. 11.9% for the ascending cluster and 4.4% for the stable cluster, p < 0.001), had high proportion of micro- and macrovascular complications (28.0% and 16.4%) compared to the stable cluster; 3. Ascending cluster (n = 8660) had the highest proportion of patients being treated only with non-insulin hypoglycemic medication (79.5% in the ascending cluster vs. 75.3% for the descending and stable clusters, p < 0.001), showed frequent hypoglycemic events and high mortality (15.3%), had high proportion of micro- and macrovascular complications (28.8% and 15.4%) compared to the stable cluster; 4. Undefined cluster (n = 25,360) showed relatively low levels of micro and macrovascular complications, but had higher mortality rates (14.8%). |
10. | Safai et al. (2018) [16] | Sweden | Observational retrospective study | Data from five cohorts: All New Diabetics in Scania (ANDIS), the Scania Diabetes Registry (SDR), All New Diabetics in Uppsala (ANDIU), Diabetes Registry Vaasa (DIREVA), and Malmö Diet and Cancer CardioVascular Arm (MDC-CVA). | N = 14,755. The results of 8980 patients from the ANDIS cohort were used for clustering. Patients from 5 databases with all types of diabetes. | Based on National Diabetes Registry | 1. BMI 2. age at onset of diabetes 3. HOMA2-B 4. HbA1c 5. HOMA2-I 6. Presence or absence of GADA was included as a binary variable. | 1. Autoimmune β-cell failure cluster (n = 65), characterized by patients with a positive GAD65 autoantibody titer. They also had the lowest TG level. 2. Insulin resistance with short disease duration cluster (n = 490), characterized by patients being diagnosed with type 2 diabetes relatively recently and having the highest HOMA2- β. 3. Non-autoimmune βcell failure cluster (n = 510), patients in sub-group 3 were the youngest at diabetes diagnosis but otherwise resembled sub-group 1 apart from the lack of positive GAD65 autoantibody titer. Increased risk for retinopathy. 4. Insulin resistance with long disease duration cluster (n = 727). Cluster 4 and 2 were very alike with a high age at diagnosis, similar BMI, better glycemic regulation, a relatively preserved β-cell function and at the same time a relatively high HOMA2-IR. The most important variable separating these two subgroups was the duration of diabetes. 5. Presence of metabolic syndrome cluster (n = 498), characterized by having the highest BMI compared to the other groups. It also consisted of those with the highest fasting glucose, HbA1c, C-peptide, HOMA2-IR and TG level. |
11. | Zou et al. (2019) [17] | US and China | Cross-sectional population-based study | Data were taken from the 2007–2008 China National Diabetes and Metabolic Disorders Study (CNDMDS) and the 1988–94 National Health and Nutrition Examination Survey (NHANES III) | 2316 participants from CNDMDS and 685 from NHANES III, (overall 3001) | WHO criteria | 1. Age at diagnosis; 2.BMI; 3. HbA1c (or alternatively mean plasma glucose); 4.HOMA2-B 5.HOMA2-IR | cluster 1 (MARD, 1045 (45.1%) of 2316 CNDMDS participants and 311 (45.4%) of 685 NHANES III) modest metabolic derangements in blood glucose, BMI, insulin resistance and β-cell function in both populations. cluster 2 (MOD, 759 (32.7%) of 2316 CNDMDS participants and 222 (32.4%) of 685 NHANES III): highest BMI, yet average blood glucose, β-cell function, and insulin resistance in both populations; cluster 3 (SIDD, 312 (13∙5%) of 2316 CNDMDS participants and 98 (14∙3%) of 685 NHANES III): had the lowest insulin secretion and highest blood glucose concentration; cluster 4 (SIRD, 200 (8∙6%) of 2316 CNDMDS participants and 54 (7∙9%) of 685 NHANES III): had the highest insulin resistance and best beta cell function. |
12. | Dennis et al. (2019) [18] | UK | Observational retrospective study | ADOPT trial, April, 2000, and June, 2002, followed up until June, 2006; For validation: RECORD cardiovascular outcomes trial, 2011 and 2003 followed up a minimum 5 years and a median 6 years | ADOPT trial (n = 4351, newly diagnosed T2DM patients aged 30–75 years); RECORD trail (n = 4447, 40–75 aged participants with established T2DM). | ADOPT trial: fasting plasma glucose 7–13 mmol/L, and no evidence of renal impairment; RECORD trial: HbA1c 7.0–9.0% (53–75 mmol/mol), BMI greater than 25.0 kg/m2 and no evidence of renal impairment | 1. GADA levels; 2. Age at diagnosis; 3. BMI; 4. HbA1c; 5. HOMA-2b; 6. HOMA-IR. | Cluster 1 (SAID): 4.0%; Cluster 2 (SIDD): 20.0%; Cluster 3 (SIRD): 20.0%, had high BMI, HOMA-B and HOMA-IR, were at an older age; Cluster 4 (MODD): 22.0%, had the highest BMI Cluster 5 (MARD): 34.0%. In ADOPT trial clusters 1 (SAID), 2 (SIDD), and 4 (MOD) had higher rate of HbA1c progression, while only cluster 4 (MOD) in RECORD trial. After adjustment to baseline UACR, time to albuminuria was shorter for cluster 3 (SIRD) vs. cluster 2 (SIDD) in ADOPT, but not RECORD. |
13. | Li et al. (2015) [19] | USA | Cross-sectional study | Data from electronic medical records (EMRs) and genotype data (eMERGE) | From 11,210 genotyped outpatient cohort 2551 T2DM patients were included in the cluster analysis | ICD-9-CM diagnosis codes, laboratory tests (LONIC), prescribed medications (RxNorm) | Variables with at least 50% of patients who had the values, resulting in 73 variables to perform the analysis were selected | Patients in subtype 1 (762) were the youngest (59.76 ± 0.45 years) and were notable for features classically associated with T2DM, such as the highest BMI (33.07 ± 0.29 kg/m2) and highest serum glucose concentrations at point-of-care testing (POCT) (193.69 ± 11.45 mM). Although these patients had better kidney function compared to those in the other two subtypes. They were characterized by T2DM complications as diabetic nephropathy and diabetic retinopathy and ACE gene. Patients in subtype 2 (617) had the lowest weight (85.17 ± 1.14 kg) compared with those in the other subtypes. Subtype 2 was enriched for cancer malignancy and cardiovascular diseases. Patients in subtype 3 (1096) had the highest SBP (135.7 ± 0.7 mmHg), serum chloride levels (102.03 ± 0.11 mEq/liter), and troponin I levels (0.36 ± 0.09 mg/liter) and were more often prescribed ARB/ACEI (62.96%) for the treatment of hypertension and statins (56.0%) for cholesterol reduction. They were associated most strongly with cardiovascular diseases, neurological diseases, allergies, and HIV infections and FHIT gene. |
14. | Ahlqvist et al. (2017) [20] | Sweden | Observational retrospective study | Swedish ANDIS (All New Diabetics in Scania) cohort; For replication: The Scania Diabetes Registry (SDR); ANDIU (All new diabetics in Uppsala); DIREVA (Diabetes Registry Vaasa) MDC-CVA (Malmö Diet and Cancer) | ANDIS (N = 8980, aged 0–96 years, within a median of 40 days after diagnosis. ); SDR (N = 1466); ANDIU (N = 844); DIREVA (N = 3485); MDC-CVA (N = 3300) | Based on the National Diabetes Registry | 1.GAD-antibodies 2.BMI 3.HbA1c 4.HOMA2-B 5.HOMA2-IR 6. Age at onset | Cluster 1 (SAID, 6.4%); was characterized by early onset, relatively low BMI, poor metabolic control, insulin deficiency, and presence of GADA, frequent ketoacidosis (30.5%); Cluster 2 (SIDD, 17.5%): was GADA negative but otherwise similar to SAID, frequent ketoacidosis (25.1%) and early signs of diabetic retinopathy; Cluster 3 (SIRD, 15.3%): was characterized by insulin resistance (high HOMA2-IR) and high BMI, had the highest prevalence of non-alcoholic fatty liver disease and high risk for CKDs; Cluster 4 (MODD, 21.6%): was characterized by obesity but not by insulin resistance; Cluster 5 (MARD, 39.1%): were older, but showed, as cluster 4, only modest metabolic derangements. |
Appendix B
Study (Author, Year) | Data Standardization | Methods of Clustering and Dimensionality Reduction | Methods for the Determination of the Number of Clusters (Direct or Statistical) | Clusters Validation on Independent Sample | |
---|---|---|---|---|---|
1. | Ahlqvist et al. (2018) [7] | Yes | Hierarchical clustering, k-means for GADA-negative individuals | Silhouette width method | Yes |
2. | Tanabe et al. (2020) [8] | No | Hierarchical clustering, k-means for GADA-negative individuals | Silhouette width method | No |
3. | Zaharia et al. (2019) [9] | No | Hierarchical clustering | Silhouette width method | No |
4. | Amato et al. (2016) [10] | Yes | Hierarchical clustering | Fixed number of clusters | No |
5. | Arif et al. (2014) [11] | No | Agglomerative hierarchical clustering | Number of clusters were determined based on hierarchical clustering with Ward′s method | No |
6. | Pes et al. (2016) [12] | No | PCA | Number of clusters were determined based on PCA (absolute factor loadings ≥ 0.4) | No |
7. | Hammer et al. (2003) [13] | No | K-means, PCA | Number of clusters were determined based on PCA | No |
8. | Kahkoska et al. (2020) [14] | Yes | Hierarchical clustering | Silhouette width method | No |
9. | Karpati et al. (2018) [15] | No | K-means | “NbClust” algorithm that selected optimal method for determination of numbers of clusters | Yes |
10. | Safai et al. (2018) [16] | Yes | k-means (Hartigan and Wong algorithm in R) and PCA for confirmation | Based on within the cluster sums of squares against the number of clusters | No |
11. | Zou et al. (2019) [17] | No | K-means | Silhouette width method | No |
12. | Dennis et al. (2019) [18] | Yes | K-means | Silhouette width method | Yes |
13. | Li et al. (2015) [19] | Yes | Topology-based approach | Based on patient-patient network using cosine distance metric | Yes |
14. | Ahlqvist et al. (2017) [20] | Yes | K-means | Silhouette width method | Yes |
References
- Guariguata, L.; Whiting, D.R.; Hambleton, I.; Beagley, J.; Linnenkamp, U.; Shaw, J.E. Global estimates of diabetes prevalence for 2013 and projections for 2035. Diabetes Res. Clin. Pract. 2014, 103, 137–149. [Google Scholar] [CrossRef] [PubMed]
- Bommer, C.; Heesemann, E.; Sagalova, V.; Manne-Goehler, J.; Atun, R.; Bärnighausen, T.; Vollmer, S. The global economic burden of diabetes in adults aged 20–79 years: A cost-of-illness study. Lancet Diabetes Endocrinol. 2017, 5, 423–430. [Google Scholar] [CrossRef]
- Nwaneri, C.; Cooper, H.; Bowen-Jones, D. Mortality in type 2 diabetes mellitus: Magnitude of the evidence from a systematic review and meta-analysis. Br. J. Diabetes Vasc. Dis. 2013, 13, 192–207. [Google Scholar] [CrossRef]
- Group IDFDA. Update of mortality attributable to diabetes for the IDF Diabetes Atlas: Estimates for the year 2013. Diabetes Res. Clin. Pract. 2015, 109, 461–465. [Google Scholar] [CrossRef] [PubMed]
- Rawshani, A.; Rawshani, A.; Franzén, S.; Sattar, N.; Eliasson, B.; Svensson, A.-M.; Gudbjörnsdottir, S. Risk factors, mortality, and cardiovascular outcomes in patients with type 2 diabetes. N. Engl. J. Med. 2018, 379, 633–644. [Google Scholar] [CrossRef]
- Hardin, D.S.; Rohwer, R.D.; Curtis, B.H.; Zagar, A.; Chen, L.; Boye, K.S.; Lipkovich, I.A. Understanding heterogeneity in response to antidiabetes treatment: A post hoc analysis using SIDES, a subgroup identification algorithm. J. Diabetes Sci. Technol. 2013, 7, 420–430. [Google Scholar] [CrossRef] [Green Version]
- Ahlqvist, E.; Storm, P.; Käräjämäki, A.; Martinell, M.; Dorkhan, M.; Carlsson, A.; Wessman, Y. Novel subgroups of adult-onset diabetes and their association with outcomes: A data-driven cluster analysis of six variables. Lancet Diabetes Endocrinol. 2018, 6, 361–369. [Google Scholar] [CrossRef] [Green Version]
- Tanabe, H.; Saito, H.; Kudo, A.; Machii, N.; Hirai, H.; Maimaituxun, G.; Kazama, J. Factors Associated with Risk of Diabetic Complications in Novel Cluster-Based Diabetes Subgroups: A Japanese Retrospective Cohort Study. J. Clin. Med. 2020, 9, 2083. [Google Scholar] [CrossRef]
- Zaharia, O.P.; Strassburger, K.; Strom, A.; Bönhof, G.J.; Karusheva, Y.; Antoniou, S.; Hwang, J.H. Risk of diabetes-associated diseases in subgroups of patients with recent-onset diabetes: A 5-year follow-up study. Lancet Diabetes Endocrinol. 2019, 7, 684–694. [Google Scholar] [CrossRef]
- Amato, M.C.; Pizzolanti, G.; Torregrossa, V.; Panto, F.; Giordano, C. Phenotyping of type 2 diabetes mellitus at onset on the basis of fasting incretin tone: Results of a two-step cluster analysis. J. Diabetes Investig. 2016, 7, 219–225. [Google Scholar] [CrossRef]
- Arif, S.; Leete, P.; Nguyen, V.; Marks, K.; Nor, N.M.; Estorninho, M.; Dunger, D.B. Blood and islet phenotypes indicate immunological heterogeneity in type 1 diabetes. Diabetes 2014, 63, 3835–3845. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Pes, G.M.; Delitala, A.P.; Errigo, A.; Delitala, G.; Dore, M.P. Clustering of immunological, metabolic and genetic features in latent autoimmune diabetes in adults: Evidence from principal component analysis. Intern. Emerg. Med. 2016, 11, 561–567. [Google Scholar] [CrossRef] [PubMed]
- Hammer, J.; Howell, S.; Bytzer, P.; Horowitz, M.; Talley, N.J. Symptom clustering in subjects with and without diabetes mellitus: A population-based study of 15,000 Australian adults. Am. J. Gastroenterol. 2003, 98, 391–398. [Google Scholar] [PubMed]
- Kahkoska, A.R.; Geybels, M.S.; Klein, K.R.; Kreiner, F.F.; Marx, N.; Nauck, M.A.; Buse, J.B. Validation of distinct type 2 diabetes clusters and their association with diabetes complications in the DEVOTE, LEADER and SUSTAIN-6 cardiovascular outcomes trials. Diabetes Obes. Metab. 2020, 22, 1537–1547. [Google Scholar] [CrossRef]
- Karpati, T.; Leventer-Roberts, M.; Feldman, B.; Cohen-Stavi, C.; Raz, I.; Balicer, R. Patient clusters based on HbA1c trajectories: A step toward individualized medicine in type 2 diabetes. PLoS ONE 2018, 13, e0207096. [Google Scholar] [CrossRef]
- Safai, N.; Ali, A.; Rossing, P.; Ridderstråle, M. Stratification of type 2 diabetes based on routine clinical markers. Diabetes Res. Clin. Pract. 2018, 141, 275–283. [Google Scholar] [CrossRef]
- Zou, X.; Zhou, X.; Zhu, Z.; Ji, L. Novel subgroups of patients with adult-onset diabetes in Chinese and US populations. Lancet Diabetes Endocrinol. 2019, 7, 9–11. [Google Scholar] [CrossRef] [Green Version]
- Dennis, J.M.; Shields, B.M.; Henley, W.E.; Jones, A.G.; Hattersley, A.T. Disease progression and treatment response in data-driven subgroups of type 2 diabetes compared with models based on simple clinical features: An analysis using clinical trial data. Lancet Diabetes Endocrinol. 2019, 7, 442–451. [Google Scholar] [CrossRef] [Green Version]
- Li, L.; Cheng, W.-Y.; Glicksberg, B.S.; Gottesman, O.; Tamler, R.; Chen, R.; Dudley, J.T. Identification of type 2 diabetes subgroups through topological analysis of patient similarity. Sci. Transl. Med. 2015, 7, 311ra174. [Google Scholar] [CrossRef] [Green Version]
- Ahlqvist, E.; Storm, P.; Käräjämäki, A.; Martinell, M.; Dorkhan, M.; Carlsson, A.; Wessman, Y. Clustering of adult-onset diabetes into novel subgroups guides therapy and improves prediction of outcome. BioRxiv 2017, 186387. [Google Scholar] [CrossRef] [Green Version]
- Kodama, K.; Tojjar, D.; Yamada, S.; Toda, K.; Patel, C.J.; Butte, A.J. Ethnic differences in the relationship between insulin sensitivity and insulin response: A systematic review and meta-analysis. Diabetes Care 2013, 36, 1789–1796. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Arnold, L.W.; Wang, Z. The HbA1c and all-cause mortality relationship in patients with type 2 diabetes is J-shaped: A meta-analysis of observational studies. Rev. Diabet. Stud. RDS 2014, 11, 138. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Voorham, J.; Denig, P. Groningen initiative to analyse type 2 diabetes treatment. Computerized extraction of information on the quality of diabetes care from free text in electronic patient records of general practitioners. J. Am. Med. Inform. Assoc. 2007, 14, 349–354. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Udler, M.S.; Kim, J.; Von Grotthuss, M.; Bonàs-Guarch, S.; Cole, J.B.; Chiou, J.; Glaser, B. Type 2 diabetes genetic loci informed by multi-trait associations point to disease mechanisms and subtypes: A soft clustering analysis. PLoS Med. 2018, 15, e1002654. [Google Scholar] [CrossRef] [Green Version]
- Gordon, M.M.; Moser, A.M.; Rubin, E. Unsupervised analysis of classical biomedical markers: Robustness and medical relevance of patient clustering using bioinformatics tools. PLoS ONE 2012, 7, e29578. [Google Scholar]
- Barai, A.; Dey, L. Outlier detection and removal algorithm in k-means and hierarchical clustering. World J. Comput. Appl. Technol. 2017, 5, 24–29. [Google Scholar] [CrossRef]
- Fong, S.; Deb, S.; Yang, X.-S.; Zhuang, Y. Towards enhancement of performance of K-means clustering using nature-inspired optimization algorithms. Sci. World J. 2014, 2014, 564829. [Google Scholar] [CrossRef] [Green Version]
- Xu, R.; Wunsch, D. Survey of clustering algorithms. IEEE Trans. Neural Netw. 2005, 16, 645–678. [Google Scholar] [CrossRef] [Green Version]
- Van Rooden, S.M.; Heiser, W.J.; Kok, J.N.; Verbaan, D.; Van Hilten, J.J.; Marinus, J. The identification of Parkinson’s disease subtypes using cluster analysis: A systematic review. Mov. Disord. 2010, 25, 969–978. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Sarría-Santamera, A.; Orazumbekova, B.; Maulenkul, T.; Gaipov, A.; Atageldiyeva, K. The Identification of Diabetes Mellitus Subtypes Applying Cluster Analysis Techniques: A Systematic Review. Int. J. Environ. Res. Public Health 2020, 17, 9523. https://doi.org/10.3390/ijerph17249523
Sarría-Santamera A, Orazumbekova B, Maulenkul T, Gaipov A, Atageldiyeva K. The Identification of Diabetes Mellitus Subtypes Applying Cluster Analysis Techniques: A Systematic Review. International Journal of Environmental Research and Public Health. 2020; 17(24):9523. https://doi.org/10.3390/ijerph17249523
Chicago/Turabian StyleSarría-Santamera, Antonio, Binur Orazumbekova, Tilektes Maulenkul, Abduzhappar Gaipov, and Kuralay Atageldiyeva. 2020. "The Identification of Diabetes Mellitus Subtypes Applying Cluster Analysis Techniques: A Systematic Review" International Journal of Environmental Research and Public Health 17, no. 24: 9523. https://doi.org/10.3390/ijerph17249523
APA StyleSarría-Santamera, A., Orazumbekova, B., Maulenkul, T., Gaipov, A., & Atageldiyeva, K. (2020). The Identification of Diabetes Mellitus Subtypes Applying Cluster Analysis Techniques: A Systematic Review. International Journal of Environmental Research and Public Health, 17(24), 9523. https://doi.org/10.3390/ijerph17249523