Next Article in Journal
Targeting Breast Cancer: The Familiar, the Emerging, and the Uncharted Territories
Previous Article in Journal
The Avidity of Autoreactive Alpha-Synuclein Antibodies in Leucine-Rich Repeat Kinase 2 Mutation Carriers Is Not Altered Compared to Healthy Controls or Patients with Parkinson’s Disease
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Fidelity of Rheumatoid Arthritis Multivariate Diagnostic Biomarkers Using Discriminant Analysis and Binary Logistic Regression

1
Department of Biomedical Sciences, University of Missouri-Kansas City School of Medicine, Kansas City, MO 64108, USA
2
Central Research Laboratory, Center for Science and Medical Studies for Girls, King Saud University, Riyadh 11495, Saudi Arabia
3
Department of Zoology, College of Science, Center for Science and Medical Studies for Girls, King Saud University, Riyadh 11495, Saudi Arabia
4
Department of Statistics and Operations Research, College of Sciences, King Saud University, Riyadh 11495, Saudi Arabia
5
Rheumatology Unit, Department of Medicine, College of Medicine, King Saud University, Riyadh 11495, Saudi Arabia
6
College of Medicine, King Saud Bin Abdulaziz University for Health Sciences, Riyadh 11495, Saudi Arabia
7
Biochemistry Department, College of Science, Center for Science and Medical Studies for Girls, King Saud University, Riyadh 11495, Saudi Arabia
*
Author to whom correspondence should be addressed.
Biomolecules 2023, 13(9), 1305; https://doi.org/10.3390/biom13091305
Submission received: 16 July 2023 / Revised: 22 August 2023 / Accepted: 23 August 2023 / Published: 25 August 2023

Abstract

:
Rheumatoid arthritis (RA) is an autoimmune inflammatory disease that causes multi-articular synovitis. The illness is characterized by worsening inflammatory synovitis, which causes joint swelling and pain. Synovitis erodes articular cartilage and marginal bone, resulting in joint deterioration. This bone injury is expected to be permanent. Cytokines play a prominent role in the etiology of RA and could be useful as early diagnostic biomarkers. This research was carried out at Riyadh’s King Khalid University Hospital (KKUH). Patients were enrolled from the Rheumatology unit. Seventy-eight RA patients were recruited (67 (85.9%) females and 11 (14.1%) males). Patients were selected for participation by convenience sampling. Demographic data were collected, and disease activity measurements at 28 joints were recorded using the disease activity score (DAS-28). Age- and sex-matched controls from the general population were included in the study. A panel of 27 cytokines, chemokines, and growth factors was determined in patient and control sera. Binary logistic regression (BLR) and discriminant analysis (DA) were used to analyze the data. We show that multiple cytokine biomarker profiles successfully distinguished RA patients from healthy controls. IL-17, IL-4, and RANTES were among the most predictive variables and were the only biomarkers incorporated into both BLR and DA predictive models for pooled participants (men and women). In the women-only models, the significant cytokines incorporated in the model were IL-4, IL-17, MIP-1b, and RANTES for the BLR model and IL-4, IL-1Ra, GM-CSF, IL-17, and eotaxin for the DA model. The BLR and DA men-only models contained one cytokine each, eotaxin for BLR and platelet-derived growth factor-bb (PDGF-BB) for DA. We show that BLR has a higher fidelity in identifying RA patients than DA. We also found that the use of gender-specific models marginally improves detection fidelity, indicating a possible benefit in clinical diagnosis. More research is needed to determine whether this conclusion will hold true in various and larger patient populations.

1. Introduction

Rheumatoid arthritis (RA) is a chronic autoimmune inflammatory disease characterized by multi-articular synovitis. The condition is distinguished by increasing inflammatory synovitis characterized by joint swelling and discomfort. Synovitis leads to joint degeneration by causing irreversible articular cartilage and marginal bone erosion [1].
Despite the plethora of therapeutic options and treatment algorithms currently available for RA, the accurate prediction of optimum treatment selection, course of disease, and mortality risk for individual patients remains elusive. Currently, RA clinical parameters and acute phase reactants (APRs) represent the core dataset that often guides therapeutic decisions for such a complex disorder as RA [2,3].
Circulating and/or synovial cytokines are regarded as excellent biomarkers for monitoring disease onset, development, and progression, either directly using cytokine multiplex immunoassays or indirectly by gene profiling approaches [4,5], because of the crucial role that cytokine networks play in maintaining inflammatory responses in the rheumatoid joint. However, only a small number of studies have employed circulating cytokine profiles to predict RA severity and outcome or to offer insights into its immune pathogenesis [4,5].
Interleukin-4 (IL-4) has recently been shown to play a function in the pathogenesis of rheumatism. IL-4 regulates the activation, differentiation, proliferation, and survival of several T-cell types. It was also shown to possess immunomodulatory properties in B cells, mast cells, macrophages, and a variety of other cell types [6]. IL-17A primarily acts on synoviocytes and osteoblasts to mediate cartilage and bone degradation in RA. In vitro experiments reveal that IL-17A causes synoviocyte activation (e.g., the production of IL-6 and IL-8), migration, and invasion, all of which increase cartilage degradation [7].
A negative correlation was suggested between genetically determined serum levels of macrophage inflammatory protein-1 (MIP-1b) and a lower risk of RA [8]. Interestingly, eotaxin 1 (CCL11) interacts with CCR3 to cause the migration of various leukocyte types. Fibroblast-like synoviocytes (FLSs), significant CCR3-expressing cells, are pathogenic effectors in RA. Wakabayashi et al. [9] reported that CCL11 levels in serum and synovial fluids (SFs) from RA patients were significantly higher than those in serum from healthy controls and SFs from osteoarthritis patients.
Wang et al. [10] emphasized the importance of platelet-derived growth factor (PDGF-BB) as a biomarker in RA. Fibroblast-like synoviocytes (FLSs) in the synovium of patients with RA can promote cartilage and bone destruction by producing matrix metalloproteinases and receptor activators of NF-κB ligands, thereby representing an important therapeutic target for RA. Furthermore, PDGF-BB plays an important role in RA-FLS proliferation and migration [10]. These results suggest that PDGF-BB might contribute to RA pathogenesis.
RA synovial fibroblasts (RASFs) release chemokines in response to interleukin-1 (IL-1) and tumor necrosis factor-alpha (TNF-α), which attract inflammatory cells to the site of inflammation [11]. Among these chemokines is the regulated upon activation, normal T expressed and presumably secreted (RANTES)/CC ligand 5 (CCL5), which is a potent CC chemokine demonstrated to be crucial in the pathophysiology of RA [12].
Granulocyte-macrophage colony-stimulating factor (GM-CSF) was first described as a growth factor that induced the differentiation and proliferation of myeloid progenitors in the bone marrow. It is interesting to highlight that mice with faulty GM-CSF are unable to develop arthritis in the collagen-induced arthritis (CIA) animal model, and antibodies against GM-CSF stop the disease from progressing and reduce proinflammatory cytokines in the joints [13].
This information sparked our interest in measuring a panel of cytokines in the plasma of RA patients and comparing them to those of healthy controls. Our goal is to help the early diagnosis of RA and identify therapeutic targets that are directly related to the severity of RA.

2. Materials and Methods

2.1. Patients and Controls

This study was carried out at King Khalid University Hospital (KKUH) in Riyadh. Ethical approval was obtained from the Ethical Committee (IRB No. E-14-1194). A convenient sampling criterion was developed to recruit consecutive patients from the Rheumatology unit. Seventy-eight RA patients were recruited (females: 67 (85.9%) and males: 11 (14.1%)). Demographics and disease activity measures using the disease activity score at 28 joints (DAS-28) were recorded. RA patients were grouped into high, moderate, and low based on pain, fatigue, inflammation, and remission groups. DAS-28 ≤ 2.6 corresponds to remission; >2.6 and ≤3.2 corresponds to low disease severity; DAS-28 ≥ 3.2 and ≤5.1 corresponds to moderate disease severity, while DAS ≥ 5.1 represents high disease severity [14]. Table 1 demonstrates the clinical and demographic data of RA patients and controls. Inclusion criteria were age ≥ 18 years and fulfilling the 2010 American College of Rheumatology (ACR) and the European League Against Rheumatism (EULAR) classification criteria [14] with disease duration ≥6 months. Exclusion criteria included systemic lupus erythematosus, systemic sclerosis, inflammatory myopathies, end-stage organ dysfunction, current or present diagnosis of malignancy, and pregnancy. Seventy-eight age- and sex-matched controls from the general population were included in the study. Written informed consent was obtained from each participant following the national protocol. Patients’ demographics, disease duration, seropositivity, and current therapies were extracted from medical records.

2.2. Sample Acquisition and Preparation

Blood was drawn by venipuncture, collected in EDTA tubes, and kept in ice boxes for transportation. The samples were immediately centrifuged, and plasma samples were separated according to instructions given by the Bio-Plex ProTM Human Cytokine Grp 1 Panel 27-Plex kit (BIO-RAD, Hercules, CA, USA). Aliquots of the plasma were stored at −80 °C until required for analysis.
The samples were thawed directly before the Bio-Plex assay using the Bio-Plex TM 200 system (BIO-RAD). Multiplex assays were chosen because they offer several advantages, including the ability to execute several reactions on a single sample and deliver extra information from the sample in a quick and efficient manner.
The Bio-Plex® system is built around three core technologies. The first xMAP technology uses up to 100 unique fluorescently dyed beads, which permit the simultaneous detection of up to 100 different types of molecules in a single well of a 96-well microplate. The second is a flow cytometer with two lasers and associated optics to measure the different molecules bound to the surface of the beads. The third is a high-speed digital signal processor that efficiently manages fluorescent data.
The assay uses magnetic beads with anti-cytokine immunoglobulins to assess concentrations (pg/mL) of many cytokines simultaneously. The principle of the assay is similar to that of the sandwich ELISA, where the capture antibodies, directed to the target biomarker, are covalently coupled to the beads. The coupled beads react with the sample containing the biomarker of interest. After a series of washes to remove the unbound protein, a biotinylated detection antibody is added to create a sandwich complex. The final detection complex is formed with the addition of a streptavidin–phycoerythrin (SA-PE) conjugate. Phycoerythrin serves as a fluorescent indicator or reporter.
The determination of cytokines, chemokines, and growth factors was carried out in the patients and control samples in 4 runs. Each run consisted of a blank (sample diluent only), 8 standards (obtained from Bio-Rad), 20 RA patient samples, and 20 normal subjects (control) samples, all run in duplicates. The samples were processed following the manufacturer’s instructions (Bio-Plex Pro™ Human Cytokine Assays, Bio-Rad Laboratories), and the levels of the studied parameters were read using Bio-Plex Manager 5 Software.

2.3. Statistical Analysis

2.3.1. Hierarchical Clustering

To perform hierarchical clustering, a similarity coefficient and a dendrogram (tree)-building algorithm are needed. The similarity coefficient is used to calculate all possible pairwise similarities in the dataset, while the algorithm constructs a tree with similar subjects clustered in common branches. Similarity in the current study was calculated using Canberra distances [15,16] (Equation (1)), and the Unweighted Pair Group Method with Arithmetic Mean algorithm [17] was used for building the trees. Hierarchical clustering was performed using Bionumerics version 6.6 (Applied Maths, Austin, TX, USA).
D = 1 n i = 1 n X i Y i X i + Y i
where “D” is the Canberra distance metric, “n” is the number of variables, “i” is the ith variable, and “X” and “Y” are two participants.

2.3.2. Binary Logistic Regression

Binary logistic regression (BLR) was used to predict RA among participants. BLR uses a linear combination of multiple independent variables to predict a dichotomous categorical variable [18]. The BLR model is trained using the natural log of odds, and the odds can be calculated from the known classification of participants in the dataset using Equations (2) and (3). The regression process is based on the relationship between the natural log of odds and the predictor variables. The significance of the model is tested using a chi-square test with p-values < 0.05 considered significant. The model is further evaluated using the Hosmer–Lemeshow test and Nagelkerke’s pseudo-R2. The Hosmer–Lemeshow test predicts classification accuracy, which is assumed by retaining the hypothesis of perfect prediction with p-values > 0.05 [19]. Nagelkerke’s pseudo-R2 can take values between 0 and 1 and indicates the proportion of explained variance in the dependent variable [20]. The importance of each independent variable to the model was evaluated in multiple ways. First, the Wald chi-square test was used to test the significance of the variable’s contribution to the model, with p-values < 0.05 indicating significance. Second, the amount of contribution of each variable was evaluated using the variable’s coefficient and odds ratio. The coefficient is the weight assigned to each independent variable in the logistic regression equation, with more important variables having greater coefficients. The odds ratio is the predicted variation in the dependent variable caused by an independent variable’s unit change with all other independent variables remaining unchanged.
Equation (2) is the probability of falling in a target group (Pt) (e.g., rheumatoid arthritis):
P t = Number   of   participants   in   the   target   group Total   number   of   participants
Equation (3) is the relationship between the odds of falling in a target group, the probability of falling in that group (Pt), and the probability of falling in the alternate group (Pa):
Odds = P t P a = Number   of   participants   in   the   target   group Total   number   of   participants Number   of   participants   in   the   alternate   group Total   number   of   participants = Number   of   participants   in   the   target   group Number   of   participants   in   the   alternate   group
Finally, a model’s performance was empirically tested by determining its success rate in correctly classifying participants into either the RA group or the control group. To avoid potentially overinflating the rate of correct classification due to attempting to classify the same subjects used to train the model, the model was recalculated before attempting to classify each of the participants with the participant being classified excluded. BLR was performed using IBM SPSS version 26 (IBM Corporation, Armonk, NY, USA).

2.3.3. Discriminant Analysis

Discriminant analysis (DA) combines correlated variables into new vectors known as canonical discriminant functions. The discriminant function is a selection process that aims to maximize group separation [18]. The discriminant models were evaluated by the Wilks’ λ statistic, which represents the proportion of unexplained variance and ranges from 0 to 1. Therefore, better models have lower Wilks’ λ values [18]. The significance of the model was evaluated with a chi-square test with p-values lower than 0.05 indicating significance [18]. DA results were plotted using GraphPad Prism version 6 for Windows (GraphPad Software, San Diego, CA, USA, www.graphpad.com). Since we had two groups, there was only one discriminant function (number of discriminant functions = number of groups − 1). To plot the data with the sole canonical discriminant function represented on the x axis, we used arbitrary units on the y axis to clearly illustrate group segregation. Discriminant functions were evaluated according to the corresponding eigenvalue, which reflects the amount of variance explained [21], and canonical correlation, which reflects the discriminant function’s correlation with the groups [22]. The significance of independent variables’ contributions was evaluated using one-way ANOVA with a significance threshold of p < 0.05. Individual variables were also evaluated using the Wilks’ λ statistic, where a value of 0 implied that all variance in the independent variable could be explained by differences in group membership and a value of 1 implied that none of the variance could be explained by group membership [18]. The direct contribution of the variable to the discriminant model was reflected in the standardized canonical discriminant function coefficient [23], which was the main criterion in evaluating the importance of individual independent variables to the discriminant models. The rate of correct classification was tested as described for BLR. DA was performed using IBM SPSS version 26 (IBM Corporation, Armonk, NY, USA).

2.3.4. Receiver Operating Characteristic Curve (ROC) and Area under the Curve (AUC)

The ROC curve is generated by plotting the rate of true positives (sensitivity) against the rate of false positives (1 − specificity) for all possible threshold values. For a test with 100% sensitivity and 100% specificity, the AUC is equal to 1, while an absolutely useless test has an AUC of 0.5. The asymptotic significance of the AUC is evaluated by testing the null hypothesis stating that the test has an AUC of 0.5. The null hypothesis is rejected when the adjusted p-value is <0.05. We also evaluated the models and individual biomarkers in terms of sensitivity and specificity at thresholds that correspond to approximately 80–90% sensitivity. ROC/AUC analyses were performed using IBM SPSS version 26 (IBM Corporation, Armonk, NY, USA).

2.3.5. Other Statistical Testing

Correlations between RA probabilities, which were calculated by either BLR or DA, and clinical findings such as ESR, DAS28ESR, and anti-cyclic citrullinated peptide (anti-CCP) were estimated using Spearman correlations. Spearman correlations were performed using GraphPad Prism version 6 for Windows (GraphPad Software, San Diego, CA, USA, www.graphpad.com).

3. Results

Hierarchical clustering was employed to investigate unsupervised group partitioning among participants (Figure 1). A clearly discernable partitioning between RA participants and controls was observed whether the analysis was applied to all participants in one group or separately applied to men and women. We then proceeded to construct models designed to optimize the separation between patients and controls, for which we used BLR and DA.

3.1. Constructing Models to Distinguish RA Patients from Normal Controls Using BLR and DA

BLR and DA models were constructed from the data of all participants, only men, or only women. All six models were statistically significant, as indicated by p-values < 0.05 (Table 2 and Table 3). BLR models explained 53.5%, 63.2%, and 60.1% of data variance (Nagelkerke pseudo-R2) for the all-participants, men, and women models, respectively (Table 2). The Hosmer–Lemeshow p-value was significant (0.019) for the all-participants model, indicating that the model’s fit to the data is suboptimal. Conversely, gender-specific models were both well calibrated, as indicated by Hosmer–Lemeshow p-values > 0.05 (Table 2). As for the discriminant models, the women’s model was superior to the other two in multiple ways. First, the women’s model was the most correlated with group membership, as indicated by its canonical correlation (Table 3). Canonical correlation is the best measure of model fitness given the binary nature of the grouping variable (i.e., RA versus control). In addition, the eigenvalue and proportion of variance that can be explained by group membership (Wilks’ lambda − 1) were both highest in the women’s model (Table 3).
Since we had two groups, the RA and control groups, only one discriminant function was extracted. Plotting participants from both groups on the discriminant function of the all-participants model showed partial separation between groups (Figure 2). Participants were also plotted according to the probability of falling in the RA group calculated from the BLR models, which showed better separation compared to DA, especially among female participants (Figure 2).
The results of both BLR and DA pointed to IL-4 as the most important variable for the all-participants and women’s models. Specifically, in the BLR models of all participants and women, IL-4 had the greatest regression weights (1.387 and 1.548, respectively), and in the discriminant models, it had the greatest standardized canonical discriminant function coefficient (SCDFC) values (1.290 and 1.252) and the highest proportion of variance explained by group membership (i.e., the lowest Wilks’ lambda values: 0.743 and 0.710, respectively). Furthermore, the BLR models for all participants and women indicated that a unit increase in IL-4 results in a 4-fold or more increase in the odds of having RA (odds ratios of 4 and 4.7 for the all-participants and women models, respectively) (Table 4).
It is noteworthy that, regardless of whether BLR or DA was used, the variables in the model and their order of importance were identical in the all-participants models. In addition to IL-4, IL-17, and RANTES appeared to significantly improve the models, which is likely to augment the predictive power of the models. The women’s models also contained more variables in addition to IL-4. The BLR model had IL-17, MIP-1b, and RANTES in descending order of importance, while the discriminant model had IL-1Ra, GM-CSF, IL-17, eotaxin, and RANTES (Table 4). PDGF-BB was the only variable in the discriminant model for men and was the second most important factor in the BLR model after eotaxin (Table 4). It is noteworthy that both of the men’s models (BLR and DA) were weak. For example, the BLR model had variable weights close to 0 and odds ratios approaching 1 (Table 4), which meant that variable contributions were minimal and changes in the predictors did not result in sizable changes in the odds of having rheumatoid arthritis, respectively.

3.2. Testing the Predictive Power of the Models

To directly test the predictive power of each model, we compared the rates of correct classification of the six models. We show that BLR models outperformed their corresponding DA models, slightly for the all-participants and female models and dramatically for the male models (Figure 3).
We then wanted to compare the predictive power of each model to the predictive power of each of its constituent variables, which was accomplished using ROC curves and AUCs. Using the AUCs, we were also able to evaluate the models and compare them to each other using a technique that is computationally independent of BLR and DA. As expected from the quality of the models described above, the all-participants and women models were superior to the men’s models. Although the AUCs obtained by the men’s models, BLR and DA, were comparable to those obtained by the other models, the specificity was generally inferior to other models at thresholds corresponding to 82–91% sensitivity (Figure 4 and Table 5). At 90.9% sensitivity, the specificity of the men’s BLR and DA models were both equal to 45.5%. At 81.8% sensitivity, these two models had 100% and 63.5% specificity, respectively. Despite the perfect specificity obtained using the men’s BLR model at the lower sensitivity of 81.8%, which is the highest among all other models, such results should be interpreted with caution given the model’s quality and very small sample size. That is also in addition to the low specificity obtained with men’s models, including the BLR model at a sensitivity of 90.9%. For example, the BLR models of all participants and women showed specificities of 82.1% (sensitivity 89.6%) and 83.3% (sensitivity 85.9%), respectively (Table 5). A similar trend was also observed for the DA models (Table 5). The obtained ROC-AUCs, specificity, and sensitivity of the measured panel of cytokines are comparable to those recorded in other studies, which confirms the possibility of using our models as predictive biomarkers for the early diagnosis of RA (Table 5).

3.3. Estimated Probability of RA Correlates with Clinical Findings

The probability of RA can be calculated from both BLR and DA models. To test the clinical applicability of our models, we asked whether these probabilities correlated with common clinical findings in RA patients, namely ESR, DAS28ESR, and anti-CCP. Since we had clinical data for 57 RA patients, 55 women and 2 men, we performed the analysis on the women group and the entire group of 57 patients. With only two men, we could not perform a correlation analysis on the men group. We show that the probabilities of RA performed on the same population using BLR or DA were highly correlated (r = 0.987 and 0.824 for all patients and female patients, respectively) (Figure 5 and Figure S1). The RA probabilities among the entire group correlated the most with ESR (r = 0.350 and 0.339 using DA and BLR, respectively), although the correlation was weak. Probabilities calculated using DA also significantly, but weakly, correlated with DAS28ESR (r = 0.262) (Figure 5). The probability of RA among female patients correlated significantly, but also weakly, with anti-CCP (r = 0.355 and 0.328 using DA and BLR, respectively); when calculated using DA, it also correlated with ESR (r = 0.324) (Figure S1).

4. Discussion

This study demonstrated the significance of using a panel of cytokines as a cost-effective multivariate biomarker for the early detection of RA. Early diagnosis and intervention are needed to help avoid the serious complications of RA and improve patients’ quality of life [14]. Although further research is needed to confirm the utility of our methodology for clinical practice, our data suggest that our approach is promising enough to warrant further investigations. We show that the data have enough variance contributing to group separation to noticeably segregate patients from controls using hierarchical clustering—an unsupervised partitioning method—as well as using methods that maximize group separation such as BLR and DA. We have also shown clinically acceptable sensitivity and specificity, as well as rates of correct classification, with the all-participants and women models. We have also shown that the use of multivariate biomarker profiles improves sensitivity and specificity over the use of individual biomarkers; the latter is typically insufficient to predict the prognosis of RA [24].
The role of cytokines, including the ones making up our BLR and DA models, in RA pathogenesis has been well documented in the literature [25]. Cytokines are particularly important for the emergence of both arthritic and systemic RA symptoms, as well as for the conversion of acute inflammation into a chronic process [26,27]. Bone homeostasis is maintained by bone resorption and formation by osteoclasts and osteoblasts, respectively [28]. The osteoclast, which plays a critical role in the initiation of joint and bone erosion in RA, is a known target of numerous cytokines, including ones that are osteoclastogenic (e.g., IL-1 and IL-17) and others that are anti-osteoclastogenic (e.g., IL-4) [28]. Therefore, the use of cytokine biomarker profiles to enable the accurate and early diagnosis of RA is rational.
Our results underscore the importance of a few cytokines in separating RA from healthy controls. IL-4 was the most important among these cytokines, which is consistent with multiple previous reports. RA patients were found to have elevated IL-4 mRNA levels in whole blood and elevated IL-4 protein in mononuclear and whole blood cells [29,30]. RA patients were also shown to have elevated IL-4 levels in synovial fluid and plasma [31,32]. Additionally, Kokkonen et al. [33] demonstrated elevated IL-4 levels in RA patients prior to the onset of the disease. We have also shown that IL-17 was one of the most influential biomarkers in separating RA patients and controls. The role of IL-17 in RA pathogenesis has been supported by multiple previous reports. IL-17 was found in the synovial fluids and membranes of RA patients, implicating Th17 cells in RA pathogenesis [34]. IL-17 is a proinflammatory cytokine that promotes bone resorption, which further implicates it in RA pathogenesis [34]. Since IL-17 promotes inflammation, blood coagulation, and thrombosis in arteries and cardiomyocytes [35], inhibiting IL-17 expression or reducing its interaction with its cognate receptor could become beneficial in treating cardiovascular disease in RA patients [36]. In fact, inhibiting IL-17 reduced inflammation and joint erosion in animal models [37]. Furthermore, Jain et al. [38] and Magda et al. [39] found a substantial association between IL-17 and DAS-28, which is consistent with IL-17 being a valuable biomarker of disease activity in RA. RANTES has also been linked to RA. Yao et al. [40] and Yang et al. [41] found that plasma RANTES levels in RA were more strongly correlated with disease activity than CRP or ESR. They also showed that low RANTES plasma levels were conversely correlated with the duration of clinical remission in RA [40,41], indicating that plasma RANTES can be utilized as a predictor of RA activity.
The involvement of PDGF-BB is supported by numerous studies that showed elevated PDGF signaling, proliferation, and accumulation of fibroblasts that express the PDGF receptor (PDGFR) in the synovial fluid of RA patients. An active form of PDGFR known as phosphorylated PDGFR (pPDGFR) has been found in abnormally high concentrations in the synovium of RA patients [42,43,44,45]. The DA model of males contained PDGF-BB as a sole biomarker, indicating that there was no other biomarker that could have significantly improved the model’s predictive power. Since the DA model contains PDGF-BB as a sole biomarker, the AUC of the model (0.818) was equal to that of PDGF-BB alone. However, the men’s models were weak, as discussed in the results section, a fact that could have resulted from the small sample size of the men’s group. Another possible explanation is the difference in disease severity between men and women. Women with RA typically have a more aggressive disease, higher levels of disease activity, and higher incidences of disability along with less improvement in disability and the highest radiographic progression using the Sharp van der Heijde Score (SHS), primarily because of increased joint space narrowing [46,47]. This suggests a potential benefit in using gender-specific models to improve detection reliability, as recently recommended by Pertsinidou et al. [48] Considering the high AUCs of DA and PDGF-BB, however, it is tempting to speculate that the low qualities of the men’s models were more likely caused by the small sample size than by an intrinsic difficulty in the distinction between male RA patients and controls due the former’s relatively mild symptoms.
To further support our approach, we show correlations between the probabilities of having RA, as calculated by BLR and DA, and some of the most commonly used laboratory markers of RA, namely ESR, DAS28ESR, and anti-CCP. As shown in the results section, these were weak correlations that were inconsistent across models. Two of the most influential cytokines in the best two models we had, the all-participants model and the women’s models, were IL-4 and IL-17. In our study, IL-4 weakly correlated with ESR and anti-CCP, but not DAS28ESR, while IL-17 did not significantly correlate with any of the three laboratory tests. Both agreeing and conflicting results can be found in the literature. For example, one study reported that IL-17, but not IL-4, was moderately correlated with DAS28ESR and ESR [49], while another reported that IL-17 did not correlate with either DAS28 [50] or ESR [50,51]. These discrepancies may have resulted from differences between study populations (e.g., regarding age, disease severity, disease stage, proportion of males and females, genetic background, comorbidities, or medications), technical differences (e.g., regarding medical practices or data collection and analysis), or a combination of both. Furthermore, as we attempt to reconcile existing data and conceptualize future experiments aiming to decisively resolve such discrepancies, it is prudent to comprehend the highly complex nature of the mechanisms that govern such relationships. For instance, ESR increases during the acute phase response chiefly due to enhanced rouleau formation, which is dependent on the concentration of acute phase proteins and immunoglobulins in circulation, as well as the shape and density of erythrocytes in the bloodstream [52]. Fibrinogen is one of the acute phase proteins [52], meaning elevated fibrinogen levels in plasma should enhance rouleau formation and increase ESR. However, fibrinogen and ESR are elevated in RA [53], but in vitro experiments suggest that IL-4, which is also elevated in RA, inhibits the production of fibrinogen [54]. Taken together, although we could not fully explain the inconsistent, weak correlations between the probability of RA prediction based on our algorithms and conventional laboratory markers of RA, such findings do not surprise us given the complexity of the underlying mechanisms. Furthermore, the use of multivariate biomarker profiles may overcome the variability of individual biomarkers.

5. Conclusions

We demonstrated in this study that a cytokine-based, multiparametric biomarker profile successfully distinguished RA patients from healthy controls in a pool of 78 participants. This conclusion shows that the panel could be used as a diagnostic tool. We demonstrated that using BLR led to a higher fidelity of RA patients’ identification than using DA. Using gender-specific models slightly improved detection fidelity, indicating a potential benefit in clinical diagnostics. Further research is needed to show whether these findings will remain applicable using different and larger patient populations.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biom13091305/s1, Figure S1: Estimated probability of RA diagnosis correlates with erythrocyte sedimentation rate (ESR), disease activity score-28 with erythrocyte sedimentation rate (DAS28ESR), and cyclic citrullinated peptide antibody (Anti-CCP). Analysis done on female RA patients with available clinical data (n = 55). DA-Prob: probability estimated using discriminant analysis, BLR-Prob: probability estimated using binary logistic regression.

Author Contributions

Conceptualization, M.A.O. (Mohammed A. Omair), M.D., N.O. and A.W.; methodology, M.A.O. (Mohammed A. Omair), M.D., N.O. and A.W.; software, W.M.H., M.D., N.O., E.A., S.A. and A.W.; validation, M.D., N.O., E.A., S.A. and A.W.; formal analysis, W.M.H., M.A.O. (Maha A. Omair) M.D., N.O. and A.W.; investigation, M.D., N.O. and A.W.; resources, M.D., N.O. and A.W.; data curation, M.D., N.O., A.I. and A.W.; writing—original draft preparation, M.A.O. (Mohammed A. Omair), M.D., N.O. and A.W.; writing—review and editing, M.A.O. (Mohammed A. Omair), M.D., A.E.-A., N.O., R.S.B. and A.W.; visualization, M.A.O. (Maha A. Omair), M.D., N.O. and A.W.; supervision, M.D., A.W. and M.A.O. (Mohammed A. Omair); project administration, M.A.O. (Mohammed A. Omair), M.D., N.O. and A.W.; funding acquisition, M.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Deanship of Scientific Research at King Saud University, Saudi Arabia Research Group No. RG-1440-077.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of King Khalid University Hospital (KKUH (IRB No. E-14-1194).

Informed Consent Statement

Written informed consent was obtained from each patient and control following the national protocol.

Data Availability Statement

Presented in the manuscript.

Acknowledgments

We extend our appreciation and thanks to the Deanship of Scientific Research at King Saud University, Saudi Arabia Research Group No. RG-1440-077, for providing the funding support. We are immensely grateful to the patients and controls for their participation.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Azizi, G.; Jadidi-Niaragh, F.; Mirshafiey, A. Th17 Cells in Immunopathogenesis and treatment of rheumatoid arthritis. Int. J. Rheum. Dis. 2013, 16, 243–253. [Google Scholar] [CrossRef]
  2. Emery, P.; Gabay, C.; Kraan, M.; Gomez-Reino, J. Evidence-based review of biologic markers as indicators of disease progression and remission in rheumatoid arthritis. Rheumatol. Int. 2007, 27, 793–806. [Google Scholar] [CrossRef] [PubMed]
  3. Wells, G.; Becker, J.C.; Teng, J.; Dougados, M.; Schiff, M.; Smolen, J.; Aletaha, D.; van Riel, P.L. Validation of the 28-joint Disease Activity Score (DAS28) and European League Against Rheumatism response criteria based on C-reactive protein against disease progression in patients with rheumatoid arthritis, and comparison with the DAS28 based on erythrocyte sedimentation rate. Ann. Rheum. Dis. 2009, 68, 954–960. [Google Scholar] [CrossRef]
  4. Khan, I.H.; Krishnan, V.V.; Ziman, M.; Janatpour, K.; Wun, T.; Luciw, P.A.; Tuscano, J. A comparison of multiplex suspension array large-panel kits for profiling cytokines and chemokines in rheumatoid arthritis patients. Cytom. B Clin. Cytom. 2009, 76, 159–168. [Google Scholar] [CrossRef]
  5. de Jager, W.; Prakken, B.; Rijkers, G.T. Cytokine multiplex immunoassay: Methodology and (clinical) applications. Methods Mol. Biol. 2009, 514, 119–133. [Google Scholar] [CrossRef] [PubMed]
  6. Dong, C.; Fu, T.; Ji, J.; Li, Z.; Gu, Z. The role of interleukin-4 in rheumatic diseases. Clin. Exp. Pharmacol. Physiol. 2018, 45, 747–754. [Google Scholar] [CrossRef] [PubMed]
  7. Gaffen, S.L. The role of interleukin-17 in the pathogenesis of rheumatoid arthritis. Curr. Rheumatol. Rep. 2009, 11, 365–370. [Google Scholar] [CrossRef]
  8. Qian, Y.; He, Z.; Zhao, S.S.; Liu, B.; Chen, Y.; Sun, X.; Ye, D.; Jiang, X.; Zheng, H.; Wen, C.; et al. Genetically Determined Circulating Levels of Cytokines and the Risk of Rheumatoid Arthritis. Front. Genet. 2022, 13, 802464. [Google Scholar] [CrossRef]
  9. Wakabayashi, K.; Isozaki, T.; Tsubokura, Y.; Fukuse, S.; Kasama, T. Eotaxin-1/CCL11 is involved in cell migration in rheumatoid arthritis. Sci. Rep. 2021, 11, 7937. [Google Scholar] [CrossRef]
  10. Wang, B.H.; Lu, Y.H.; Wu, L.F.; Lu, X.; Guo, W.; Deng, F.Y.; Lei, S.F. Evaluation of plasma cytokine protein array profile: The highlighted PDGF-BB in rheumatoid arthritis. Clin. Rheumatol. 2020, 39, 3323–3330. [Google Scholar] [CrossRef]
  11. Mor, A.; Abramson, S.B.; Pillinger, M.H. The fibroblast-like synovial cell in rheumatoid arthritis: A key player in inflammation and joint destruction. Clin. Immunol. 2005, 115, 118–128. [Google Scholar] [CrossRef] [PubMed]
  12. Patel, D.D.; Zachariah, J.P.; Whichard, L.P. CXCR3 and CCR5 ligands in rheumatoid arthritis synovium. Clin. Immunol. 2001, 98, 39–45. [Google Scholar] [CrossRef] [PubMed]
  13. Cook, A.D.; Braine, E.L.; Campbell, I.K.; Rich, M.J.; Hamilton, J.A. Blockade of collagen-induced arthritis post-onset by antibody to granulocyte-macrophage colony-stimulating factor (GM-CSF): Requirement for GM-CSF in the effector phase of disease. Arthritis Res. 2001, 3, 293–298. [Google Scholar] [CrossRef] [PubMed]
  14. Chu, C.R.; Williams, A.A.; Coyle, C.H.; Bowers, M.E. Early diagnosis to enable early treatment of pre-osteoarthritis. Arthritis Res. Ther. 2012, 14, 212. [Google Scholar] [CrossRef]
  15. Lance, G.N.; Williams, W.T. Computer programs for hierarchical polythetic classification (“similarity analyses”). Comput. J. 1966, 9, 5. [Google Scholar] [CrossRef]
  16. Lance, G.N.; Williams, W.T. Mixed-Data Classificatory Programs I—Agglomerative Systems. Autralian Comput. J. 1967, 1, 6. [Google Scholar]
  17. Michener, S. A statistical method for evaluating systematic relationships. Univ. Kans. Sci. Bull. 1958, 38, 30. [Google Scholar]
  18. Warner, R.M. Applied Statistics: From Bivariate through Multivariate Techniques; Sage Publications, Inc.: Los Angeles, CA, USA; London, UK; New Delhi, India; Singapore, 2008. [Google Scholar]
  19. Hosmer, D.W.; Lemesbow, S. Goodness of fit tests for the multiple logistic regression model. Commun. Stat.-Theory Methods 1980, 9, 1043–1069. [Google Scholar] [CrossRef]
  20. Nagelkerke, N.J.D. A note on the general definition of the coefficient of determination. Biometrika 1991, 78, 691–692. [Google Scholar] [CrossRef]
  21. Büyüköztürk, Ş.; Çokluk-Bökeoğlu, Ö. Discriminant Function Analysis: Concept and Application. Eurasian J. Educ. Res. 2008, 33, 73–92. [Google Scholar]
  22. Lutz, J.G.; Eckert, T.L. The Relationship Between Canonical Correlation Analysis and Multivariate Multiple Regression. Educ. Psychol. Meas. 1994, 54, 666–675. [Google Scholar] [CrossRef]
  23. Nordlund, D.J.; Nagel, R. Standardized Discriminant Coefficients Revisited. J. Educ. Stat. 1991, 16, 8. [Google Scholar] [CrossRef]
  24. Burska, A.; Boissinot, M.; Ponchel, F. Cytokines as biomarkers in rheumatoid arthritis. Mediat. Inflamm. 2014, 2014, 545493. [Google Scholar] [CrossRef]
  25. Jung, S.M.; Kim, K.W.; Yang, C.W.; Park, S.H.; Ju, J.H. Cytokine-mediated bone destruction in rheumatoid arthritis. J. Immunol. Res. 2014, 2014, 263625. [Google Scholar] [CrossRef]
  26. Srirangan, S.; Choy, E.H. The role of interleukin 6 in the pathophysiology of rheumatoid arthritis. Ther. Adv. Musculoskelet. Dis. 2010, 2, 247–256. [Google Scholar] [CrossRef] [PubMed]
  27. Skundric, D.S.; Zhou, W.; Cruikshank, W.W.; Dai, R. Increased levels of bioactive IL-16 correlate with disease activity during relapsing experimental autoimmune encephalomyelitis (EAE). J. Autoimmun. 2005, 25, 206–214. [Google Scholar] [CrossRef] [PubMed]
  28. Amarasekara, D.S.; Yun, H.; Kim, S.; Lee, N.; Kim, H.; Rho, J. Regulation of Osteoclast Differentiation by Cytokine Networks. Immune Netw. 2018, 18, e8. [Google Scholar] [CrossRef] [PubMed]
  29. Haddad, A.; Bienvenu, J.; Miossec, P. Increased production of a Th2 cytokine profile by activated whole blood cells from rheumatoid arthritis patients. J. Clin. Immunol. 1998, 18, 399–403. [Google Scholar] [CrossRef]
  30. van Roon, J.A.; Verhoef, C.M.; van Roy, J.L.; Gmelig-Meyling, F.H.; Huber-Bruning, O.; Lafeber, F.P.; Bijlsma, J.W. Decrease in peripheral type 1 over type 2 T cell cytokine production in patients with rheumatoid arthritis correlates with an increase in severity of disease. Ann. Rheum. Dis. 1997, 56, 656–660. [Google Scholar] [CrossRef]
  31. Iwaszko, M.; Bialy, S.; Bogunia-Kubik, K. Significance of Interleukin (IL)-4 and IL-13 in Inflammatory Arthritis. Cells 2021, 10, 3000. [Google Scholar] [CrossRef]
  32. Schlaak, J.F.; Pfers, I.; Meyer Zum Buschenfelde, K.H.; Marker-Hermann, E. Different cytokine profiles in the synovial fluid of patients with osteoarthritis, rheumatoid arthritis and seronegative spondylarthropathies. Clin. Exp. Rheumatol. 1996, 14, 155–162. [Google Scholar]
  33. Kokkonen, H.; Soderstrom, I.; Rocklov, J.; Hallmans, G.; Lejon, K.; Rantapaa Dahlqvist, S. Up-regulation of cytokines and chemokines predates the onset of rheumatoid arthritis. Arthritis Rheum. 2010, 62, 383–391. [Google Scholar] [CrossRef]
  34. Khan, S.; Greenberg, J.D.; Bhardwaj, N. Dendritic cells as targets for therapy in rheumatoid arthritis. Nat. Rev. Rheumatol. 2009, 5, 566–571. [Google Scholar] [CrossRef]
  35. Robert, M.; Miossec, P. Effects of Interleukin 17 on the cardiovascular system. Autoimmun. Rev. 2017, 16, 984–991. [Google Scholar] [CrossRef]
  36. Wang, J.; He, L.; Li, W.; Lv, S. A Role of IL-17 in Rheumatoid Arthritis Patients Complicated With Atherosclerosis. Front. Pharmacol. 2022, 13, 828933. [Google Scholar] [CrossRef] [PubMed]
  37. van den Berg, W.B.; Miossec, P. IL-17 as a future therapeutic target for rheumatoid arthritis. Nat. Rev. Rheumatol. 2009, 5, 549–553. [Google Scholar] [CrossRef] [PubMed]
  38. Jain, M.; Attur, M.; Furer, V.; Todd, J.; Ramirez, R.; Lock, M.; Lu, Q.A.; Abramson, S.B.; Greenberg, J.D. Increased plasma IL-17F levels in rheumatoid arthritis patients are responsive to methotrexate, anti-TNF, and T cell costimulatory modulation. Inflammation 2015, 38, 180–186. [Google Scholar] [CrossRef] [PubMed]
  39. Atwa, S.E.; Azab, M.M.; Mohamed, M.S. Serum Interleukin-17 Level in Patients with Rheumatoid Arthritis and its Relation to Disease Activity. Zagazig Univ. Med. J. 2019, 26, 87–93. [Google Scholar] [CrossRef]
  40. Yao, T.C.; Kuo, M.L.; See, L.C.; Ou, L.S.; Lee, W.I.; Chan, C.K.; Huang, J.L. RANTES and monocyte chemoattractant protein 1 as sensitive markers of disease activity in patients with juvenile rheumatoid arthritis: A six-year longitudinal study. Arthritis Rheum. 2006, 54, 2585–2593. [Google Scholar] [CrossRef]
  41. Yang, M.H.; Wu, F.X.; Xie, C.M.; Qing, Y.F.; Wang, G.R.; Guo, X.L.; Tang, Z.; Zhou, J.G.; Yuan, G.H. Expression of CC chemokine ligand 5 in patients with rheumatoid arthritis and its correlation with disease activity and medication. Chin. Med. Sci. J. 2009, 24, 50–54. [Google Scholar] [CrossRef]
  42. Charbonneau, M.; Lavoie, R.R.; Lauzier, A.; Harper, K.; McDonald, P.P.; Dubois, C.M. Platelet-Derived Growth Factor Receptor Activation Promotes the Prodestructive Invadosome-Forming Phenotype of Synoviocytes from Patients with Rheumatoid Arthritis. J. Immunol. 2016, 196, 3264–3275. [Google Scholar] [CrossRef]
  43. Saito, Y.; Chikenji, T.; Ozasa, Y.; Fujimiya, M.; Yamashita, T.; Gingery, A.; Iba, K. PDGFR Signaling Mediates Hyperproliferation and Fibrotic Responses of Subsynovial Connective Tissue Cells in Idiopathic Carpal Tunnel Syndrome. Sci. Rep. 2017, 7, 16192. [Google Scholar] [CrossRef]
  44. Ying, H.Z.; Chen, Q.; Zhang, W.Y.; Zhang, H.H.; Ma, Y.; Zhang, S.Z.; Fang, J.; Yu, C.H. PDGF signaling pathway in hepatic fibrosis pathogenesis and therapeutics (Review). Mol. Med. Rep. 2017, 16, 7879–7889. [Google Scholar] [CrossRef]
  45. Uezumi, A.; Fukada, S.; Yamamoto, N.; Ikemoto-Uezumi, M.; Nakatani, M.; Morita, M.; Yamaguchi, A.; Yamada, H.; Nishino, I.; Hamada, Y.; et al. Identification and characterization of PDGFRalpha+ mesenchymal progenitors in human skeletal muscle. Cell Death Dis. 2014, 5, e1186. [Google Scholar] [CrossRef]
  46. Nilsson, J.; Andersson, M.L.E.; Hafstrom, I.; Svensson, B.; Forslind, K.; Ajeganova, S.; Leu Agelii, M.; Gjertsson, I. Influence of Age and Sex on Disease Course and Treatment in Rheumatoid Arthritis. Open Access Rheumatol. 2021, 13, 123–138. [Google Scholar] [CrossRef]
  47. Rydell, E.; Forslind, K.; Nilsson, J.A.; Karlsson, M.; Akesson, K.E.; Jacobsson, L.T.H.; Turesson, C. Predictors of radiographic erosion and joint space narrowing progression in patients with early rheumatoid arthritis: A cohort study. Arthritis Res. Ther. 2021, 23, 27. [Google Scholar] [CrossRef]
  48. Pertsinidou, E.; Manivel, V.A.; Westerlind, H.; Klareskog, L.; Alfredsson, L.; Mathsson-Alm, L.; Hansson, M.; Saevarsdottir, S.; Askling, J.; Ronnelid, J. Rheumatoid arthritis autoantibodies and their association with age and sex. Clin. Exp. Rheumatol. 2021, 39, 879–882. [Google Scholar] [CrossRef]
  49. Pavlovic, V.; Dimic, A.; Milenkovic, S.; Krtinic, D. Serum levels of IL-17, IL-4, and INFgamma in Serbian patients with early rheumatoid arthritis. J. Res. Med. Sci. 2014, 19, 18–22. [Google Scholar]
  50. Sag, S.; Sag, M.S.; Tekeoglu, I.; Kamanli, A.; Nas, K.; Acar, B.A. Relationship of hematologic markers with IL-17 and IL-1 beta in patients with rheumatoid arthritis. J. Back. Musculoskelet. Rehabil. 2018, 31, 703–707. [Google Scholar] [CrossRef]
  51. Liu, D.F.; Yan, J.; Guo, M.Y.; Wang, C.; Hu, Y.H.; Yang, M.; Yun, M.D.; Luo, Y.; Zhang, J.; Li, H. Correlation study between interleukin-17 and ESR and CRP in serum and the synovial fluid of rheumatoid arthritis patients of accumulated dampness-heat obstruction in joints syndrome. Zhongguo Zhong Xi Yi Jie He Za Zhi 2014, 34, 272–275. [Google Scholar]
  52. Tishkowski, K.; Gupta, V. Erythrocyte Sedimentation Rate. In StatPearls; StatPearls Publishing: St. Petersburg, FL, USA, 2023. [Google Scholar]
  53. Babikir, M.B.O. Association of Fibrinogen, Erythrocyte Sedimentation Rate, and C-Reactive Protein Levels with Rheumatoid Arthritis. Open Access Libr. J. 2017, 4, 75369. [Google Scholar] [CrossRef]
  54. Vasse, M.; Paysant, J.; Soria, J.; Collet, J.P.; Vannier, J.P.; Soria, C. Regulation of fibrinogen biosynthesis by cytokines, consequences on the vascular risk. Haemostasis 1996, 26 (Suppl. 4), 331–339. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Unsupervised group partitioning using hierarchical clustering. Tree branches are colored according to participant status with RA in red and controls in green. The heat map shows the relative plasma concentration of analytes with elevated levels in red and low levels in green.
Figure 1. Unsupervised group partitioning using hierarchical clustering. Tree branches are colored according to participant status with RA in red and controls in green. The heat map shows the relative plasma concentration of analytes with elevated levels in red and low levels in green.
Biomolecules 13 01305 g001
Figure 2. Separation of RA and controls. Participants were separated on one axis (the x axis) using either discriminant analysis or binary logistic regression. The variables used in this analysis included age, gender, and 27 plasma cytokines and growth factors. In the discriminant analysis plot (top), the x axis represents the first canonical discriminant function, and participants are graphed using their discriminant scores. In the binary logistic regression plot (bottom), the x axis represents the probability of having rheumatoid arthritis. Sample size: n = 78 each, control and RA; n = 22 (50% RA) males and 134 (50% RA) female participants.
Figure 2. Separation of RA and controls. Participants were separated on one axis (the x axis) using either discriminant analysis or binary logistic regression. The variables used in this analysis included age, gender, and 27 plasma cytokines and growth factors. In the discriminant analysis plot (top), the x axis represents the first canonical discriminant function, and participants are graphed using their discriminant scores. In the binary logistic regression plot (bottom), the x axis represents the probability of having rheumatoid arthritis. Sample size: n = 78 each, control and RA; n = 22 (50% RA) males and 134 (50% RA) female participants.
Biomolecules 13 01305 g002
Figure 3. Rate of correct assignment (RCA). RCAs were determined using algorithms driven from the data collected from all participants (n = 78 each, control and rheumatoid arthritis), male participants (n = 11 each, control and rheumatoid arthritis), or female participants (n = 67 each, control and rheumatoid arthritis). Algorithm derivation was accomplished using either discriminant analysis or binary logistic regression. Algorithms were used to assign participants to either the RA group or control group within the corresponding participant population from which they were driven.
Figure 3. Rate of correct assignment (RCA). RCAs were determined using algorithms driven from the data collected from all participants (n = 78 each, control and rheumatoid arthritis), male participants (n = 11 each, control and rheumatoid arthritis), or female participants (n = 67 each, control and rheumatoid arthritis). Algorithm derivation was accomplished using either discriminant analysis or binary logistic regression. Algorithms were used to assign participants to either the RA group or control group within the corresponding participant population from which they were driven.
Biomolecules 13 01305 g003
Figure 4. Receiver operating characteristic (ROC) curves for all participants, males, and females.
Figure 4. Receiver operating characteristic (ROC) curves for all participants, males, and females.
Biomolecules 13 01305 g004
Figure 5. Estimated probability of RA diagnosis correlates with erythrocyte sedimentation rate (ESR), disease activity score-28 with erythrocyte sedimentation rate (DAS28ESR), and cyclic citrullinated peptide antibody (Anti-CCP). Analysis performed on male and female rheumatoid arthritis patients with available clinical data (n = 57). DA-Prob: probability estimated using discriminant analysis, BLR-Prob: probability estimated using binary logistic regression.
Figure 5. Estimated probability of RA diagnosis correlates with erythrocyte sedimentation rate (ESR), disease activity score-28 with erythrocyte sedimentation rate (DAS28ESR), and cyclic citrullinated peptide antibody (Anti-CCP). Analysis performed on male and female rheumatoid arthritis patients with available clinical data (n = 57). DA-Prob: probability estimated using discriminant analysis, BLR-Prob: probability estimated using binary logistic regression.
Biomolecules 13 01305 g005
Table 1. Clinical and demographic data of patients and controls.
Table 1. Clinical and demographic data of patients and controls.
VariableNumber
Mean Age (years)49.5 + 13.8
Female gender (%)58 (87.8)
Duration of disease
Early (≤2 years)
Established (>2 years)
6
51
Seropositivity (%)
(0 seronegative, 1 positive to RF or anti-CCP, 2 double positive)
For RF and anti-CCP the cutoff for positive was 15 U/mL and 5 IU/mL, respectively.
Rheumatoid factor (positive)36 (63.15)
Anti-cyclic citrullinated peptide49 (85.96)
015 (26.31)
116 (28.07)
235 (61.4)
Medications
Drug-free remission7 (12.28)
MTX monotherapy11 (19.29)
MTX combination base2 (3.5)
Other csDMARDs5 (8.77)
bDMARDs/tsDMARDs41 (71.92)
Controls
Mean age (years)47 + 14.4
Female gender (%)
Female
68 (87.2)
N.B. Clinical data were available for 57 out of 78 RA patients.
Table 2. Fit of the binary regression models.
Table 2. Fit of the binary regression models.
Measure of Model Fit or SignificanceAll ParticipantsMale ParticipantsFemale Participants
Chi-square p-value2.98 × 10−170.000861.55 × 10−16
Nagelkerke pseudo-R20.5350.6320.601
Hosmer–Lemeshow p-value0.0190.6090.258
Table 3. Fit of the discriminant models.
Table 3. Fit of the discriminant models.
Measure of Model Fit or SignificanceAll ParticipantsMale ParticipantsFemale Participants
Canonical correlation0.5640.5240.688
Eigenvalue0.4650.3780.899
Wilks’ lambda0.6820.7260.527
p-value1.37 × 10−120.0129.80 × 10−16
Table 4. Relative importance of individual biomarkers (variables) to each of the multivariate biomarker profiles (models). Individual biomarkers are ranked in descending order of their relative importance. In DA, the standardized canonical discriminant function coefficient (SCDFC) is the best indicator of relative importance, which is the basis of ranking on the left side of this table. Another measure of variable contribution to the discriminant model is Wilks’ lambda (λ), which indicates the amount of variance not explained by group membership. Smaller values of λ indicate better ability to discriminate between groups. In BLR models, the raw score regression coefficients (B) and odds ratios (Exp(B)) are used to rank the relative importance of individual biomarkers in a multivariate model. The p-values are obtained using one-way ANOVA to evaluate the equality of group means with values less than 0.05 considered significant.
Table 4. Relative importance of individual biomarkers (variables) to each of the multivariate biomarker profiles (models). Individual biomarkers are ranked in descending order of their relative importance. In DA, the standardized canonical discriminant function coefficient (SCDFC) is the best indicator of relative importance, which is the basis of ranking on the left side of this table. Another measure of variable contribution to the discriminant model is Wilks’ lambda (λ), which indicates the amount of variance not explained by group membership. Smaller values of λ indicate better ability to discriminate between groups. In BLR models, the raw score regression coefficients (B) and odds ratios (Exp(B)) are used to rank the relative importance of individual biomarkers in a multivariate model. The p-values are obtained using one-way ANOVA to evaluate the equality of group means with values less than 0.05 considered significant.
RankBinary Logistic Regression ModelsDiscriminant Analysis Models
All ParticipantsMaleFemaleAll ParticipantsMaleFemale
1IL-4
B = 1.387
Exp(B) = 4.001
p = 3.97 × 10−8
Eotaxin
B = 0.037
Exp(B) = 1.037
p = 0.288
IL-4
B = 1.548
Exp(B) = 4.703
p = 6.79 × 10−8
IL-4
SCDFC = 1.290
λ = 0.743
p = 1.46 × 10−11
PDGF-BB
SCDFC = 1.000
λ = 0.726
p = 0.012
IL-4
SCDFC = 1.252
λ = 0.710
p = 1.87 × 10−11
2IL-17
B = −0.027
Exp(B) = 0.974
p = 8.09 × 10−4
PDGF-BB
B = 0.007
Exp(B) = 1.007
p = 0.070
IL-17
B = −0.038
Exp(B) = 0.963
p = 1.23 × 10−4
IL-17
SCDFC = −0.479
λ = 0.902
p = 7.09 × 10−5
n/aIL-1Ra
SCDFC = 0.783
λ = 0.900
p = 1.91 × 10−4
3RANTES
B = 0.000
Exp(B) = 1.000
p = 5.85 × 10−4
n/aMIP-1b
B = 0.054
Exp(B) = 1.055
p = 0.017
RANTES
SCDFC = −0.439
λ = 0.967
p = 0.024
n/aGM-CSF
SCDFC = −0.598
λ = 0.943
p = 0.005
4n/an/aRANTES
B = 0.000
Exp(B) = 1.000
p = 1.25 × 10−4
n/an/aIL-17
SCDFC = −0.535
λ = 0.909
p = 3.85 × 10−4
5n/an/an/an/an/aEotaxin
SCDFC = −0.533
λ = 0.915
p = 6.20 × 10−4
6n/an/an/an/an/aRANTES
SCDFC = −0.334
λ = 0.958
p = 0.017
Table 5. Predictive power evaluation using the area under a receiver operating characteristic curve (AUC) method.
Table 5. Predictive power evaluation using the area under a receiver operating characteristic curve (AUC) method.
Patient PopulationModel/VariableAUCp-ValueSensitivitySpecificity
AllBLR0.8821.76 × 10−1685.9%83.3%
DA0.8774.14 × 10−1685.9%75.6%
IL-40.8252.37 × 10−1285.9%73.1%
IL-170.7421.85 × 10−785.9%56.4%
RANTES0.4630.42185.9%32.1%
MalesBLR0.9010.00190.9%
81.1%
45.5%
100%
DA0.8180.01190.9%
81.1%
45.5%
63.5%
PDGF-BB0.8180.09190.9%
81.1%
45.5%
63.5%
Eotaxin0.7520.04590.9%
81.1%
36.4%
54.5%
FemalesBLR0.9037.85 × 10−1689.6%82.1%
DA0.9131.61 × 10−1689.6%80.6%
IL-40.8258.83 × 10−1188.1%73.1%
IL-170.7431.00 × 10−689.6%52.2%
MIP-1b0.7093.10 × 10−589.6%50.7%
IL-1Ra0.8606.33 × 10−1389.6%64.2%
Eotaxin0.7477.66 × 10−789.6%50.7%
RANTES0.5620.21989.6%35.8%
GM-CSF0.5970.05289.6%41.8%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hassan, W.M.; Othman, N.; Daghestani, M.; Warsy, A.; Omair, M.A.; Alqurtas, E.; Amin, S.; Ismail, A.; El-Ansary, A.; Bhat, R.S.; et al. The Fidelity of Rheumatoid Arthritis Multivariate Diagnostic Biomarkers Using Discriminant Analysis and Binary Logistic Regression. Biomolecules 2023, 13, 1305. https://doi.org/10.3390/biom13091305

AMA Style

Hassan WM, Othman N, Daghestani M, Warsy A, Omair MA, Alqurtas E, Amin S, Ismail A, El-Ansary A, Bhat RS, et al. The Fidelity of Rheumatoid Arthritis Multivariate Diagnostic Biomarkers Using Discriminant Analysis and Binary Logistic Regression. Biomolecules. 2023; 13(9):1305. https://doi.org/10.3390/biom13091305

Chicago/Turabian Style

Hassan, Wail M., Nashwa Othman, Maha Daghestani, Arjumand Warsy, Maha A. Omair, Eman Alqurtas, Shireen Amin, Abdulaziz Ismail, Afaf El-Ansary, Ramesa Shafi Bhat, and et al. 2023. "The Fidelity of Rheumatoid Arthritis Multivariate Diagnostic Biomarkers Using Discriminant Analysis and Binary Logistic Regression" Biomolecules 13, no. 9: 1305. https://doi.org/10.3390/biom13091305

APA Style

Hassan, W. M., Othman, N., Daghestani, M., Warsy, A., Omair, M. A., Alqurtas, E., Amin, S., Ismail, A., El-Ansary, A., Bhat, R. S., & Omair, M. A. (2023). The Fidelity of Rheumatoid Arthritis Multivariate Diagnostic Biomarkers Using Discriminant Analysis and Binary Logistic Regression. Biomolecules, 13(9), 1305. https://doi.org/10.3390/biom13091305

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop