1. Introduction
Deep endometriosis (DE) is characterized by the presence of endometriosis lesions that infiltrate the peritoneum by >5 mm [
1]. It occurs in approximately 1% of women of reproductive age and it affects between 4% and 37% of women with pelvic endometriosis [
2]. The DE lesions typically affect the Douglas pouch, the uterosacral ligaments, the posterior vaginal wall, the anterior rectal wall, the sigmoid colon, the rectum and the urinary tract [
3]. Between 90 and 95% of women diagnosed with DE experience severe pain, most commonly in the form of dysmenorrhea, deep dyspareunia and intermenstrual pelvic pain [
4]. Additional gastrointestinal symptoms depend on the affected area and raise the clinical suspicion for infiltrative endometriosis lesions of the bowel. When the rectum and the sigmoid are affected, symptoms include deep dyspareunia, dyschezia, rectorrhagia, catamenial diarrhea and narrowed stools. Surgical management of colorectal DE is required when the lesions become symptomatic and impair bowel, reproductive and sexual functions [
5].
Endometriosis affects the quality of life (QoL) of young, active women by disrupting their ability to work, their sexual functioning and their social relationships [
6]. The HRQoL (Health-Related Quality of Life) is a multidimensional concept and reflects how a patient perceives their own physical and mental state. This perception is shaped not only by the disease itself but also by the impact of the disease on the social and family life of the patient. As a result, morbidity and mortality alone are not comprehensive measures of evaluating the benefits of surgical interventions in endometriosis patients and subjective, patient-reported instruments are required. The 36-Item Short Form Survey (SF-36) is a well-established HRQoL instrument that hasn’t been validated yet for women with endometriosis. It includes eight distinct health status domains, with a high level of comprehensiveness. Higher scores indicate a better health status [
7].
The aim of this study is to evaluate the validity and reliability of SF-36 in patients with endometriosis before and after surgical treatment. A second objective is to compare the HRQoL pre and postoperatively, using different instruments apart from SF-36, such as the Gastrointestinal Quality of Life Index (GIQLI) and Knowles–Eccersley–Scott Symptom (KESS) questionnaire, which are validated and suitable for endometriosis-related gastrointestinal symptoms and the Visual Analogue Scale (VAS), as shown by Bourdel N. et al. in an exhaustive systematic review, which is considered the “gold-standard” for pain-measurement [
8].
2. Materials and Methods
The present study is a retrospective analysis of prospectively recorded data from June 2009 to November 2016 at Rouen University Hospital, Rouen, France and includes patients with surgical procedures for DE nodules of the rectum or sigmoid colon. The procedures, both laparoscopies and laparotomies, were performed by our team, which has significant experience in the surgical management of colorectal endometriosis. The women were enrolled in the CIRENDO database (the North-West Inter-Regional Female Cohort for Patients with Endometriosis), a prospective cohort with financial support by the G4 Group (The University Hospitals of Rouen, Lille, Amiens, and Caen). Prospective data recording and analysis were approved by the French authorities CNIL (Commission Nationale de l’Informatique et des Libertés: the French data protection commission) and CCTIRS (Comité Consultatif pour le Traitement de l’Information en matière de Recherche dans le domaine de la Santé: the advisory committee on information technology in healthcare research, NCT02294825).
Of the patients enrolled in CIRENDO, we selected for analysis those with histologically confirmed endometriosis and one-year follow-up visit. The surgical and histological records, as well as the self-assessment questionnaires completed before and after the surgery, provide information for the study. Data were recorded by a clinical researcher in charge of evaluating patients’ follow-up.
All patients filled in baseline standardized questionnaires, i.e., SF-36, GIQLI, KESS, before surgery and one year after, as implemented by the CIRENDO protocol. The Visual Analog Scale (VAS) was also used for the evaluation of dysmenorrhea and dyspareunia, with and without pain relief medication. The French-language versions of the SF-36, GIQLI and KESS questionnaires were validated long ago on large cohorts [
8].
The SF-36 self-administered questionnaire incorporates 36 standardized items evaluating the generic health status of a patient [
7]. This survey evaluates 8 health domains: physical functioning (PF); role limitations due to physical problems (RP); bodily pain (BP); general health perception (GH); vitality (VT); social functioning (SF); role limitation due to emotional problems (RE); mental health (MH). The Physical and Mental Component Summary scores (PCS and MCS) represent the composite groups based on the typical factor structures that are hypothesized in SF-36, as follows: PF, RP, BP and GH as being domains of the PCS and RE, VT, MH and SF as being domains of the MCS. Each of the 8 domains has a score that ranges from 0 to 100. Higher scores indicate a better health status [
9].
The GIQLI and KESS questionnaires are used when describing, comparing and differentiating the outcomes of surgical treatment in patients with gastrointestinal diseases. The GIQLI survey consists of 36 questions targeting intestinal dysfunctions, with scores ranging from 0 (worst) to 144 (best) [
10]. The KESS survey is an 11-item tool that measures various degrees of constipation. A KESS score can range from 0 (no symptoms) to 39 (high symptom severity), with a cut-off score ≥ 11 that indicates the presence of constipation [
11]. The VAS ranges from 0 to 10, with its extremes marked as “no pain” and “worst imaginable pain” [
8].
The baseline and management characteristics and questionnaire results before and after surgery are presented as absolute and relative frequencies for qualitative variables and as mean and standard deviations for quantitative variables.
Two techniques were used for the evaluation of the internal consistency of the SF-36 questionnaire: item-scale correlation and Cronbach’s α. The item-scale correlation was assessed by using item-internal consistency and item-discriminant validity. Item-internal consistency estimates the correlation between the implied item and the sum of the other items on the same domain; its value must be greater than 0.4 [
12]. Item-discriminant validity was evaluated using the scaling success rate (%) for each scale. A scaling success rate was calculated for each question of SF-36, as the percentage of item-own scale correlations that are significantly larger than the correlations between the item and the other concurrent scales [
13]. Cronbach’s α globally measures the correlations between the items of a scale. The reliability of a questionnaire is considered acceptable when Cronbach’s α value is greater than 0.7 [
14]. Both item-scale correlation and Cronbach’s α were analyzed pre and postoperatively.
Construct validity was evaluated by comparing the SF-36 scores and pain (dysmenorrhea and dyspareunia, with and without treatment) intensity levels, both pre and postoperatively. The Pearson correlation coefficient was used for evaluating the relationship between HRQoLs as measured by the SF-36 and pain intensity as measured by the VAS.
A paired T-test of differences before and one year after surgery was used to determine changes in the SF-36 (evaluating HRQoL), the KESS and GIQLI scores and in the pain intensity VAS score. Observed differences between pre-op and one-year post-op values were described using 95% confidence intervals (95% CI). For the patients whose KESS and GIQLI scores showed a worsening of digestive symptoms, the evaluation of the HRQoL was compared separately using a T-test for means comparison.
Statistical analysis was performed using the software IBM SPSS Statistics for Windows, Version 23.0. Armonk, NY, USA: IBM Corp.
3. Results
We enrolled 488 patients from the CIRENDO database. Patients’ characteristics at baseline are presented below in
Table 1. Of them, 56.6% had a history of gynecologic surgery. The mean revised American Fertility Society classification (AFS-r) score was 69.6 ± 41.4. Among the patients included in the study, 29.1% reported a history of mental health disorders (depression, anxiety, sleep disorders, panic attacks), 97.7% had dysmenorrhea and 76.2% had dyspareunia.
The most important intraoperative findings and surgical procedures are represented below. Postoperative complications were reported in the order of their appearance, as follows: ureteral fistula 7%, digestive tract fistula 2.2%, pelvic abscess (surgical management) 0.6%, bladder atony 0.6%, pelvic abscess (conservative management) 0.4% and others 1.8% (
Table 2).
Dysmenorrhea intensity, as measured by the VAS, both with and without pain relief therapy (PRT), which primarily consisted of nonsteroidal anti-inflammatory drugs (NSAIDs), decreased significantly (
p < 0.001) at one year post-surgery compared to preoperative evaluations. This intensity decreased from 8.4 ± 1.5 to 2.1 ± 3.2 points without PRT and from 4.7 ± 2.3 to 2.9 ± 2.1 with PRT. A similar evolution was noticed in dyspareunia intensity without PRT. We observed a significant decrease in VAS score (
p < 0.001) from 5.6 ± 2.4 preoperative to 1.9 ± 2.8 post-op at one year. Dyspareunia with PRT only slightly decreased after surgery, from 4.0 ± 2.3 to 3.8 ± 2.3, without achieving statistical significance (
p = 0.657) (
Table 3).
Item-internal consistency exhibited preoperative values of 0.472 (General Health domain), 0.760 (Bodily Pain dimension) and signaled good internal consistency. Item-internal consistency had better postoperative values for all of the eight domains of the SF-36.
Scaling success rates were used to evaluate item-discriminant validity. The preoperative scaling success rate had values greater than 98.2%. The postoperative scaling success rate showed values greater than 99% for 7 of the 8 domains of SF-36. The values of this reflected higher correlations with their hypothesis scales than with other scales, both pre and postoperatively, except the Vitality domain, which slightly decreased in postoperative, from 98.2% to 97.7%, due to high correlation between the vitality item (VT) and distinct scales.
Internal consistency measured with Cronbach’s α was greater than 0.7 for all of the eight domains of the SF-36, both pre- and post-op. Cronbach’s α for pre and postoperative PF and for postoperative BP exceeded the 0.9 threshold, meeting the recommended standard criteria for individual comparisons (
Table 4).
All of the SF-36 domains and the summary scales (physical and mental component scores) revealed significantly higher mean values postoperatively (p < 0.001), than the preoperative mean scores. The most remarkable improvements were observed in the domains of Role (limitation) physical (26.9), Bodily pain (25.4) and Role (limitation) emotional (20.8).
The Vitality, Mental Health and General Health domains showed a weaker improvement of HRQoL (10.2, 12.0 and 12.3, respectively). All of the SF-36 domains scored inferior values for the HRQoL compared to a nationally representative sample of healthy French women aged 25–34, both preoperatively and postoperatively (
Table 5).
All SF-36 domains correlated negatively with dysmenorrhea (with and without treatment) and dyspareunia (without treatment), as measured by the VAS, both preoperatively and postoperatively. The scores for dyspareunia with treatment were significantly correlated with the preoperatively Bodily Pain scores and with the postoperatively General Health and Social Functioning scores (
Table 6).
Both KESS and GIQLI scores significantly improved one year after surgery (
p < 0.001). The KESS score dropped from 10.6 ± 7.6 to 9.1 ± 7.1 and the GIQLI score increased from 76.6 ± 35.9 to 90.5 ± 38.5 (
Table 3). The KESS score improved in 50.8% of the patients, worsened in 30.5% and remained stationary for the rest. In patients whose KESS score worsened, the scores for the eight domains of the HRQoL significantly improved; however, they did so to a lesser extent compared to the whole cohort of the study. The smallest increases were recorded for the Vitality (5.2), General Health (6.4) and Mental Health (8.5) domains (
Table 7).
The GIQLI score improved in 66.8%, worsened in 21.3% and remained stationary in 11.9% of the patients. The 104 patients whose GIQLI score decreased also experienced a decrease in the scores for most HRQoL domains but without statistical significance. Bodily Pain is the only SF-36 domain that recorded a statistically significant increase (p = 0.006), in addition to the Physical component score (p = 0.047). Patients with worsened GIQLI scores also experienced higher scores for the Physical Functioning and Role (limitation) physical domains (p = 0.562 and p = 0.104, respectively), without statistical significance.
4. Discussion
Our study reports data from multiple questionnaires in order to assess the HRQoL in women with rectal and sigmoid DE, before and one year after surgical excision. All questionnaires that were administered showed mostly significantly higher HRQoL scores at one-year follow-up compared with the pre-op scores. Though some patients did not show improvement on several items on the questionnaires, the overall HRQoL appears to have increased. As in other studies, we concluded that surgery for DE involving the colon and the rectum generally improves a patients’ HRQoL. Our study demonstrates that score variations are concordant. These variations also seem suitable for assessing different dimensions of HRQoL: pelvic comfort, emotions, physical well-being, gastrointestinal function, etc.
Our study has several strengths. It is based on prospective data collected from a large cohort of 488 patients whose follow-up was monitored by a clinical researcher. Multiple questionnaires were used before and one year after surgery in order to assess the HRQoL and digestive function of the patients and to provide an accurate and comprehensive evaluation of the effect of surgical excision for colorectal endometriosis. The SF-36 questionnaire also allowed for a comparison with French normative data. This comparison underlined a higher HRQoL in treated patients, in contrast to patients who still suffer from colorectal DE.
One weakness of our study is that we do not provide an interpretation of the variation in the scores depending on postoperative complications, recurrences or sequelae. However, as mentioned above, the main objective of our study was to evaluate the validity and reliability of the SF-36 questionnaire in patients with colorectal DE. Moreover, an additional objective was using various tools in order to compare pre and postoperative HRQoL.
The SF-36 questionnaire has been validated in many studies that have established its importance in the setting of various pathologies [
16,
17]. As mentioned above, internal consistency measured with Cronbach’s α is greater than 0.7 for all eight domains, both pre and postoperatively. This translates into a good degree of validity and reliability for the SF-36 questionnaire. The cardinal advantages are represented by its self-administered form, the large addressability and high level of comprehensiveness. This generic tool is suitable for evaluating the HRQoL of patients in different stages of endometriosis. Moreover, SF-36 has the ability to point out different characteristics in a certain population (for example, the pain intensity that affects mental health) and it allows for comparisons with the general population [
17].
Redwine and Wright looked at the differences in the postoperative SF-36 scores across patients with both major and slight improvements in their HRQoL. They predicted a good response to surgery when spontaneous pain during physical examination correlated with Douglas pouch obliteration through a palpable nodule [
18]. Dubernard et al. further developed the idea of prediction for surgical responses and elaborated a mathematical model for women with symptomatic endometriosis that predicts improvement in their SF-36 scores. The probability of HRQoL improvement after surgery was 80.7% when preoperative Physical Component Summary (PCS) scores were below 37.5. The probability of HRQoL improvement after surgery was 84.2% when preoperative Mental Component Summary (MCS) scores were below 44.5 [
19]. Similarly, mean values of PCS and MCS increased after surgery from 50.9 to 70.4 and from 45.3 to 60.6, respectively. This suggests, in turn, a significant improvement in HRQoL.
The domains that exhibited the largest improvement 12 months after surgery were Role Limitation (RP) (physical), Role Limitation (RE) (emotional), and Bodily Pain (BP). These domains exhibited some of the lowest preoperative scores (all below 50). Our results are in line with most of the other studies [
20,
21,
22]. Ribeiro et al. reported increased mean values for the BP domain from 25.83 to 79.56 [
20]. A recent randomized study conducted by our team, which compared radical and conservative rectal surgery, used SF-36 to demonstrate the improvement in HRQoL after both conservative surgery and segmental resection. The results appear to be similar across these surgical techniques and they consist of positive outcomes at the clinical assessment 24 months after the surgery. The RP domain showed excellent values of 100, both in the conservative and the radical groups, while the BP domain revealed slightly better results in the conservative group (84, compared to 74 in the radical group). The RE domain revealed mean values of 100 as well [
23]. In the current study, the mean values for the RP and the RE domains also increased. Garavaglia et al. showed an overall improvement in all of the SF-36 domains for operated patients for colorectal endometriosis, with consistent increases for the physical role, emotional role and pain subscales (
p = 0.0001;
p = 0.001;
p = 0.003, respectively) [
24]. Another recent study conducted by Marlon de Freitas Fonseca et al. showed that all of the patients with dysmenorrhea and chronic pelvic pain experienced an improvement in all of the SF-36 subscales after laparoscopic surgery for endometriosis and concluded that “the more intense the pain, the worse the quality of life” [
25]. Meuleman et al. repeatedly stated in extensive reviews that, in general, most studies showed significant improvement in HRQoL after surgery [
26,
27].
It can be challenging for patients to deal with gastrointestinal symptoms due to colorectal endometriosis or surgical treatment procedures [
28]. In our study, both the KESS and the GIQLI scores showed improvement. Our previous studies have reported similar findings [
29,
30]. One of these pointed out that the worst KESS scores in untreated women were registered in DE of the rectum (with a value of 13.1) compared to cases of DE without rectum involvement (with a value of 10.4) [
31]. Previous KESS and GIQLI scores suggest that better digestive function results are achieved when using conservative techniques such as shaving or disc excision because of the ability to avoid low anterior resection syndrome [
5,
32]. Some of the patients did not show any improvement in the digestive symptoms and expressed worsening of the KESS and GIQLI scores. However, the analysis of the SF-36 results in this paper suggests that HRQoL was better after surgery. Most of our studies have reported similar findings, with a focus on the constipation subscale from the GIQLI score, which did not suffer any change [
29,
30,
31,
33].
Previous studies on the topic mainly emphasize the improvement in pain after laparoscopic surgery for colorectal DE [
34,
35]. In this regard, our findings showed significant improvement in dysmenorrhea one year after surgery. This was quantitatively illustrated with the results from the VAS and regardless of the association with pain relief therapy (PRT) (
p < 0.001). Therefore, pain intensity in the form of either dysmenorrhea or dyspareunia, with or without treatment, represents an important component of diminished self-perceived HRQoL. In a prospective study on patients with colorectal excision for endometriosis, Bailly et al. showed significant improvement in postoperative VAS scores for dysmenorrhea (
p < 0.0039), dyspareunia (
p < 0.0084), chronic pelvic pain (
p < 0.0002) and dyschezia (
p < 0.0312) [
35]. Our results also show improvement in dyspareunia symptoms, but only in the group without additional PRT. In a prospective cohort study on HRQoL after colorectal resection for endometriosis, Abbot et al. reported similar results to ours. Dyspareunia and dyschezia VAS median scores showed improvement at one-year follow-up, compared to the baseline (66 to 5 and 48 to 20, respectively) but did not reach statistical significance (
p < 0.080 and
p < 0.173, respectively). Hence, the same study proved that dysmenorrhea and chronic pelvic pain VAS median scores improved one year after surgery (71 to 5 and 74 to 11, respectively) and reached statistical significance (
p < 0.028 and
p < 0.046, respectively) [
36].