1. Introduction
Rheumatoid arthritis (RA) represents a persistent autoimmune disease characterized by synovial joint inflammation, resulting in manifestations such as pain, swelling, stiffness, and the prospect of consequential joint deformities. Unlike osteoarthritis, which results from the wear and tear of joints, RA involves the immune system mistakenly attacking healthy joint tissues. RA is a multifactorial autoimmune disease influenced by a combination of genetic, environmental, and lifestyle factors [
1]. Understanding these risk factors is crucial for identifying individuals with high susceptibility and enhancing preventive measures. Many studies have investigated the relationship between RA and cancer [
2,
3,
4,
5,
6,
7,
8]. A possible link between cancer and RA was initially documented in 1978 in a study that revealed an elevated incidence of lymphoma in individuals diagnosed with RA [
9]. Recently, a large observational study demonstrated that patients with RA have a heightened risk of developing all types of cancers compared to the general population, indicating a relative excess risk of 20% [
6]. Additionally, this study revealed that certain cancers were more prevalent among individuals with RA, including lung, bladder, kidney, upper urinary tract, melanoma, ovarian, cervical, and prostate cancers. There was also an over-representation of certain lymphomas, including diffuse large B-cell lymphoma, follicular lymphoma, Hodgkin’s lymphoma, marginal zone lymphoma, and multiple myeloma [
6].
Head and neck cancer (HNC) is a malignant tumor of the head and neck region. These tumors typically originate in areas such as the oral cavity, throat, larynx, pharynx, nasal cavity, paranasal sinuses, and nearby lymph nodes. The common subtypes include oral, pharyngeal, and laryngeal cancers. The advancement of diagnostic technologies in cancer and the refinements in pivotal therapeutic modalities for individuals afflicted with HNC, encompassing surgical interventions, radiation therapy, and chemotherapy, have not only prolonged the overall survival rates of cancer patients but have also concomitantly escalated the frequency of delayed complications. Thus, HNC has exhibited associations with various autoimmune rheumatic diseases [
10,
11,
12]. Moreover, one recent study showed that individuals with autoimmune rheumatic diseases, inclusive of conditions affecting the thyroid, oral cavity, and nasopharynx, exhibited a conspicuously heightened risk of developing HNC in comparison to the general populace [
13]. One investigation conducted within the European population revealed an augmented susceptibility to HNC among individuals diagnosed with RA [
14]. Nevertheless, to date, there exists a dearth of research scrutinizing the occurrence and risk of RA as a complication in cancer patients, particularly among those afflicted with HNC.
Examining the reciprocal connection between HNC and an increased risk of RA poses challenges owing to the diversity of rheumatic clinical manifestations, various cancer sites susceptible to risk, and exposures related to treatments in both oncology and rheumatology. Hence, within the purview of this investigation, we systematically scrutinized the prevalence and prospective risk of RA among individuals diagnosed with HNC by leveraging a comprehensive dataset representative of the entire national populace. This dataset provides a broad spectrum of diseases, allowing for a comprehensive exploration of the potential interrelations between these specific health conditions. Rigorous control measures were implemented to account for potential confounding variables such as clinical status and demographic factors.
2. Materials and Methods
2.1. Cohort Dataset and Ethical Considerations
Researchers have constructed a nationwide population-based sample cohort dataset using health insurance claims data collected from the National Health Insurance Service, provided through a de-identification process, and conducted a longitudinal study. Therefore, this study was based on a cohort dataset with a relatively large sample size. As Koreans are compulsorily assigned a unique identification number at birth, medical claims data registered in the cohort dataset cannot be duplicated or missing.
The healthcare provider database also contains information on the type, staffing, and equipment of healthcare facilities. The national database contains information on various medical uses, including the date and time of death, visits to hospitals and outpatient medical facilities, and medication history. The cohort dataset used in this study consisted of a representative sample of 1,025,340 adults, which is approximately 2.2% of the population of the Republic of Korea. Its excellent reliability has already been proven through a reliability verification study and several previous publications [
15,
16,
17,
18].
However, the disease diagnosis status in the claims data may not accurately reflect a patient’s health status, depending on the situation. For example, errors may occur when determining whether drug prescriptions are covered by insurance or due to errors or omissions when entering the diagnosis code. Therefore, to improve diagnostic accuracy, careful judgment regarding the use of diagnostic codes is needed among the researcher group.
This investigation received approval from the Institutional Review Board (IRB) at Hallym Medical University, specifically the Chuncheon Sacred Hospital, with the designated protocol code 2021-08-006. Notably, the IRB review process resulted in a waiver of the requirement for written informed consent. This decision was predicated on the absence of identifiable clues or explicit personal information within the cohort database, thereby precluding the need for individual consent. To uphold privacy standards following South Korean regulations, the cohort data were presented to the principal investigator in a de-identified secondary data format.
Due to the stringent policies of the Korean National Health Insurance Service, the datasets produced and scrutinized in this study are not accessible to the public. Nevertheless, the authors affirm that comprehensive data supporting the conclusions drawn in this study are encapsulated within this article. It is imperative to note that this research adhered to the ethical guidelines outlined in the Declaration of Helsinki throughout its execution.
2.2. Longitudinal Study Using Retrospective Cohort Design
To investigate the risk of subsequent RA development in patients with HNC, we selected the matched-target cohort (HNC) and comparison cohort (non-HNC) groups. We compared the incidence and risk rates between the two cohort groups using a retrospective cohort study design. Hence, given the retrospective cohort design employed in this study, all data were sourced from archival records. Observations were initiated from a distinct historical juncture and systematically scrutinized to identify outcomes spanning from that temporal point to the present. Despite the retrospective nature of the study, the analytical framework allowed for the comparison of groups manifesting divergent characteristics concerning specific outcomes that transpired in the past.
The research design was as follows: First, it largely comprised a wash-out period, an index period, and an observation period. For the wash-out period, we selected the first year from the cohort dataset (January–December 2002). This allowed us to eliminate the possibility that RA was diagnosed before HNC. Additionally, we defined the HNC cohort by identifying patients diagnosed with HNC during a specific index period (2003–2005). Participants were selected based on the presence or absence of a diagnostic code indicating HNC. The sub-diagnostic codes encompassing the HNC cohort comprised oral cavity malignancies (C00–C06), salivary gland neoplasms (C07–C08), oropharyngeal carcinomas (C09–C10), nasopharyngeal malignancies (C11–C12), hypopharyngeal tumors (C13–C14), paranasal sinus carcinomas (C30–C31), and laryngeal malignancies (C32).
Concurrently, in the process of assembling a comparison cohort, individuals without HNC were meticulously chosen from the residual cohorts documented in the database. Subsequently, these non-HNC individuals were randomly paired with HNC patients through the application of the propensity score methodology, entailing a fourfold ratio of cancer-free participants for each HNC-afflicted patient. During this matching procedure, variables such as gender, age, residence, income level, and comorbidities were identified as independent variables for achieving equilibrium between the two cohort groups. The quantification of comorbidities was executed through the utilization of the Charlson Comorbidity Index (CCI), a widely employed metric in claims-based research. The CCI is commonly used in research studies and clinical settings to stratify patients based on the comorbidity burden, aiding in predicting outcomes, resource allocation, and treatment decision-making [
19,
20]. This index is used to predict the ten-year mortality for a patient who may have a range of comorbid conditions. Each condition is assigned a score based on the risk of dying associated with that condition. The scores are then summed to provide a total score that predicts mortality for a patient. The CCI includes a wide array of conditions, including heart disease, AIDS, lung disease, and others. The higher the score is, the more likely the patient’s chances of mortality due to these comorbid conditions. The index has been validated in numerous settings and is widely used in clinical research to adjust for the confounding effects of comorbidities on patient outcomes. It has been adapted for use with administrative health data and has undergone several revisions to update its conditions and scoring system to reflect changes in the understanding and treatment of diseases. The CCI is a valuable tool for researchers and healthcare providers in assessing the burden of comorbidities on patients and predicting their prognoses. The follow-up was terminated if the primary outcome RA; M05, M06 was detected or if the participant died during the observation period. If no specific event occurred in the patient until the final follow-up period in the cohort database, the observation was terminated. The participant selection process is depicted in
Figure 1, and the retrospective cohort design is summarized in
Figure 2.
2.3. Statistical Analysis of Primary Outcomes
The primary objectives of this study encompassed the determination of the incidence and risk ratio of RA in patients afflicted with HNC as compared to those without HNC. The overall RA incidence was computed by dividing the count of RA-diagnosed patients by 1000 person-years, the time frame extending from the patient’s enrollment date to the conclusion of the observational period. To ascertain whether HNC correlated with an elevated hazard ratio for RA development, the Cox proportional hazards regression model was employed. Hazard ratios, denoted as HRs, along with their associated 95% confidence intervals (CIs), were reported both in unweighted and weighted forms. Notably, the Kaplan–Meier method was applied to delineate and juxtapose the cumulative probability of RA onset throughout the entire follow-up duration in the two cohort groups. All statistical analyses were conducted utilizing R software (version 4.0.0), and a p-value threshold of <0.05 was established to denote statistical significance.
4. Discussion
The association between HNC and RA can be influenced by several factors, including common immunological factors, inflammation, genetic factors, immunomodulatory medications, and behaviorals and environmental factors. Thus, we postulated that the association between cancer and autoimmunity might be bidirectional. To our knowledge, this study constitutes a pioneering endeavor to systematically investigate the risk landscape associated with RA incidence among individuals diagnosed with HNC. By leveraging a robust nationwide population-based cohort dataset, our investigation revealed a discernible proclivity for elevated RA occurrence within the HNC patient cohort compared to their counterparts without HNC. Rigorous adjustment for covariates consistently demonstrated a statistically significant increase in the HR for RA within the HNC population. Temporal scrutiny of the RA risk trajectory revealed a noteworthy surge within the first year after HNC diagnosis, exhibiting sustained elevation and intensification throughout the protracted follow-up period. Intriguingly, the amplified risk of RA manifestation demonstrated statistical significance, specifically among male and middle-aged patients with HNC, in contrast to the absence of significant differences in females or other age strata. A meticulous subgroup analysis predicting distinct cancer subtypes revealed a substantive escalation solely in the risk of RA development among survivors of oral cancer. This refined granularity in our findings highlights the importance of nuanced consideration of cancer subtypes, enriching our understanding of the intricate interdependencies within this specific patient cohort.
The association between cancer and RA remains ambiguous and inadequately elucidated. As our understanding of the underlying mechanisms, long-term consequences, and emergence of new treatments for both cancer and RA continues to progress, the interplay between these conditions becomes more apparent and simultaneously more complex. To date, numerous studies have undertaken investigations into the plausible associations between autoimmune diseases and HNC [
2,
3,
4,
5,
6,
7,
8]. These studies reported that various immune and inflammatory pathways could trigger tumorigenesis or that the heightened inflammatory state associated with rheumatic diseases may play a role in initiating and promoting cancer. Conversely, immune responses designed to combat tumors and restrict their growth may inadvertently target self-tissue, leading to the development of autoimmune conditions. Therefore, tumors can function as antigen reservoirs and trigger the initiation of immune responses. Simultaneously, the regenerating cells within specific tissues may act as sources of antigens, thereby contributing to the propagation and maintenance of autoimmunity. Additionally, HNC and RA share a significant risk factor, namely smoking. Previous studies have demonstrated that smoking stands out as the predominant risk factor for the onset of HNC, doubling the likelihood of developing head and neck squamous-cell carcinoma [
21,
22]. The risk escalates further with an increased intensity of smoking, with a heavier smoking history being correlating with a heightened susceptibility to HNC. Other studies have also reported that smoking could not only elevate the risk of developing RA, particularly among individuals with a genetic predisposition, but could also complicate the treatment of RA if the individual is both a smoker and has the condition [
23,
24,
25]. Thus, the association between smoking and these diseases can be understood through several mechanisms. The shared risk factor of smoking for both HNC and RA highlights the complex interplay between environmental exposures, genetic predisposition, and immune system dysfunction. While the direct mechanisms may differ—carcinogenesis in the case of HNC and autoimmunity in RA—the systemic effects of smoking on inflammation, immune modulation, and cellular damage contribute to the increased risk of both conditions [
26,
27,
28]. Understanding these mechanisms underscores the importance of smoking cessation and prevention strategies in reducing the incidence and severity of both head and neck cancer and rheumatoid arthritis, alongside other smoking-related diseases.
As it is challenging to accurately assess infrequent events such as cancer within the context of rheumatic diseases, extensive longitudinal epidemiological studies offer a valuable opportunity to explore the connections between malignancy and autoimmunity. Additionally, the intricate relationship of cancer with specific autoimmune conditions is subject to a myriad of complex and diverse influences. Thus, large-scale studies could provide an excellent opportunity to gain insight into the intricate relationship between autoimmune conditions and cancer development. Notably, a recent systematic review and meta-analysis revealed a 2.35-fold increase in the overall incidence of HNC among individuals with autoimmune rheumatic diseases, surpassing the rates observed in the general population [
13]. Moreover, the present study identified a significant 1.49-fold increase in the overall incidence of RA events among patients with HNC compared to controls. Collectively, these findings provide robust epidemiological evidence supporting the hypothesis of a bidirectional relationship between HNC and RA.
Notably, previous longitudinal studies have shown different risks based on sex. One retrospective cohort study revealed that men exhibited significantly elevated risks of lung, liver, and esophageal cancers while experiencing a reduced risk of prostate cancer, whereas women demonstrated a notable decrease in the risk of several cancers, including breast, ovarian, uterine, cervical, and melanoma, with a risk reduction compared with the general population [
29]. Another extensive observational study demonstrated that an elevated risk was predominantly observed among males, manifesting as a 34% relative increase in the overall cancer risk, whereas females exhibited a comparatively modest increase in relative risk, totaling 8% when compared to the risk observed in the general population [
6]. Similarly, our investigation revealed a higher incidence and risk of RA development among male patients with HNC. Additionally, we found the highest incidence and risk of RA among patients aged 45–64 years.
Our study has several notable strengths. First, this was an inaugural cohort analysis utilizing nationwide population-based data to scrutinize the subsequent emergence of RA in individuals diagnosed with HNC. Employing the propensity score matching method, we effectively controlled for crucial confounding variables, facilitating a rigorous comparison of RA incidence between the HNC and matched control groups. While refraining from making definitive causal inferences, our findings offer valuable insights and implications for clinical practice, prompting the consideration of potential associations between HNC and RA. Second, our study had a robust design, encompassing a large patient cohort and an extensive 11-year observation period. This prolonged timeframe allowed a comprehensive examination of the temporal relationship between HNC and RA development. Third, endeavors were undertaken to attenuate surveillance bias in the evaluation of RA risk among patients diagnosed with HNC. This was achieved by meticulously selecting sociodemographically matched controls from a cohort database. By ensuring that the control group shared similar sociodemographic characteristics with the patients with HNC, this study aimed to reduce potential biases related to surveillance practices. This approach enhanced the reliability of the findings and provided a more accurate evaluation of the association between HNC and the risk of developing RA. Fourth, RA is a disease that requires time for patients to visit the hospital and receive an accurate diagnosis, even when symptoms appear. As well as the possibility of underdiagnosed RA, there is always a length-of-time bias in which RA may have existed before patients with cancer were diagnosed. Thus, to remove this, we set a wash-out period of 1 year before the index period. This means that our research design minimized length-of-time bias. Finally, we systematically assessed variations in the risk dynamics of RA development with increasing observation periods. This study aimed to determine whether the observed risk of RA in individuals with HNC was a coincidental occurrence confined to specific temporal intervals or manifested as a sustained phenomenon following a discernible pattern. Our findings revealed a sustained increase in the long-term risk of developing RA after HNC diagnosis. Similar to previous reports [
30,
31,
32], our findings suggest a substantive association between HNC and RA occurrence, indicating a non-random correlation rather than a chance event.
However, it is crucial to delineate the limitations of this study and approach its findings judiciously. Firstly, the identification of diseases relied upon diagnostic codes from the ICD-10-CM system, lacking the granularity provided by detailed medical records. Additionally, our investigation has omitted essential information such as comprehensive medical histories and pathological reports. Thus, our cohort database included not only squamous-cell carcinoma but also non-squamous-cell carcinoma. Due to these limitations, we have a plan for future prospective clinical studies, incorporating a more comprehensive array of variables, to be conducted to delineate the complex pathophysiological mechanisms underlying the correlation between these conditions. Second, we used a cohort sample database with a restricted set of identifiable variables. This limitation prevented the correction for some risk factors that could potentially influence the occurrence of RA, including family history and smoking. Third, due to the unavailability of information on treatment methods for HNC, this study could not assess the impact of cancer treatments, such as chemotherapy and radiotherapy, on the risk of developing RA. Comprehending the outcomes of these therapeutic interventions is pivotal for elucidating the intricate interplay between oncological disorders and autoimmune diseases. Such an understanding could illuminate potential shared pathogenic pathways, reveal implications for prognosis, and inform the development of targeted treatment strategies that address both the primary condition and its immunological consequences. Regrettably, the database did not encompass information on treatment modalities. We acknowledge this as a significant limitation and will endeavor to address it in subsequent research endeavors. Fourth, the age data in the database were presented in groups owing to de-identification concerns. Consequently, the matching of the two groups was based on categorized age data rather than on the actual age distribution, potentially introducing residual bias into the study analysis. Additionally, our cohort group may have a potential discrepancy between the time of diagnosis and the actual onset of the disease across patients. To overcome this issue, we have made concerted efforts to align variables such as access to healthcare facilities and sociodemographic characteristics (residence, income, age, gender, etc.) between the groups, thereby striving to minimize any biases and ensure a more balanced analysis. Finally, the retrospective cohort design of the study limited direct investigation and analysis of the pathological mechanisms underlying HNC and RA. Overall, while this study contributes valuable insights, the acknowledged limitations underscore the exigency for subsequent research endeavors to furnish a more nuanced comprehension of the interrelationship between HNC and RA.