1. Introduction
Antimicrobial resistance (AMR) is an ever-growing concern in medicine and public health globally. Patients infected by AMR bacteria require extended hospital stays and costly and multiple treatments that result in an economic impact on both the patients and the healthcare system [
1].
Several pathogens have started to develop AMR, particularly that to first-line, inexpensive, broad-spectrum antibiotics, while the introduction of new drugs (e.g., fluoroquinolones) has been followed by the emergence and dissemination of resistant strains [
2,
3].
Although single resistance is an important public health problem, MDR is a more critical and growing problem in the world. MDR tuberculosis caused 1.5 million deaths in 2018 (251,000 with HVI) [
4]. In the United States (US), two of three deaths related to antibiotic-resistant pathogens are caused by MDR organisms commonly associated with healthcare [
5].
P. aeruginosa [
6,
7] is a Gram-negative bacterium widely recognized as a microorganism related to HCAI. It is a ubiquitous environmental bacterium that causes opportunistic human infections, such as urinary tract infections, respiratory system infections, dermatitis, soft tissue infections, bacteremia, bone and joint infections, gastrointestinal infections, and a variety of systemic infections, particularly in patients with severe burns, and in cancer and AIDS patients who are immunosuppressed [
6].
The eradication of
P. aeruginosa has become increasingly difficult due to its remarkable capacity to resist antibiotics, which includes biofilm-mediated resistance and the formation of multidrug-tolerant persisted cells [
8].
The European Antimicrobial Resistance Surveillance Network (EARS-Net) claims that
P. aeruginos a remains one of the major causes of healthcare-associated infection in Europe because of its ubiquitous nature and potential virulence [
9]. In 2019, EARS-Net reported that 30.8% of the
P. aeruginosa isolates were resistant to at least 1 of the 5 antimicrobial groups under regular surveillance (piperacillin-tazobactam, ceftazidime, fluoroquinolones, aminoglycosides, and carbapenems) across the European Union (EU) and European Economic Area (EEA) countries and reported a 12.1% population-weighted mean resistance in 2015–2019 to at least 3 antimicrobial groups [
9] or MDR
P. aeruginosa [
10].
MDR
P. aeruginosa (or MDR-Pa) is an important determinant of a higher rate of intensive care unit (ICU) admission and hospital mortality [
11] and an increase in patient morbidity due to a higher incidence of surgery and longer duration of hospital stays [
10]. Despite the enormous impact of MDR-Pa, the resistant mechanism is a complex and not fully understood phenomenon; several studies have described some of these mechanisms, which include well-described molecular phenomena, such as antibiotic-mediated selection, horizontal gene transfer, and others [
12], even though there are very few models that study the factors favoring MDR over simple AMR, in general [
13].
Although MDR and AMR phenomena occur in the molecular scenario, those mechanisms may be fostered and pushed by different social and behavioral determinants. Several studies have found that the general and imprudent consumption of antimicrobials, antimicrobial misuse, and overuse in hospitals—when inappropriate initial antibiotic therapy is prescribed—are the main causes for resistance [
12,
13,
14]. However, some factors have been suggested as conditional factors to those well known in clinical settings, such as socioeconomic and political factors across countries, mainly those related to governance and regulation, which are usually poorly understood and minimally taken into account as leading factors in MDR and AMR phenomena [
15].
Multiple studies have observed a pattern of AMR distribution across European countries, showing greater resistance in southern and eastern countries and lower resistance in northern and western countries. Researchers have suggested that this phenomenon is related to in-hospital and out-of-hospital imprudent antibiotic use practices [
16] and underdeveloped action plans against AMR in southeastern European countries [
17].
Additionally, there is evidence of higher AMR rates in low-income and middle-income countries, in comparison with high-income countries [
18,
19,
20]. Although this effect has been related to much lower community consumption of antibiotics in high-income countries than middle–low income countries [
19], this also could be related to differences in national health policies, since the control of AMR is generally centralized in policies, with national initiatives and commitments [
21].
The quality of government and socioeconomic development are key elements of people’s well-being and health. Thus, these indicators have been a priority among EU policy makers for decades. However, there are remarkable differences in socioeconomic and governance performances across political communities over time in EU/EAA [
22]: those differences are consistent with the AMR patterns in Europe. Stronger governance performance has been seen in northwestern over southeastern regions. The same differential pattern has been in socioeconomic performance. Additionally, there are important income differences between southeastern (lower income) and northwestern (higher income) EU/EAA countries [
23].
Following these particular patterns across EU/EAA countries, we suggest that there are other non-consumption-related variables, such as the quality of governance, poverty, education, and community infrastructure, which affect health outcomes as structural determinants [
15,
20], also influencing MDR. In this context, we explore the MDR in
P. aeruginosa, given that its resistance is less related to out-of-hospital care and antibiotic consumption per person, and more related to variables that could affect the institutional dynamics and the health system, in terms of resource distribution and national health policies.
For this purpose, firstly we performed a clustering analysis to determine the countries across Europe that contribute the most to the MDR-Pa. We identified three clusters, and found that the two that contribute the most to the MDR-Pa are shaped mainly by countries from the southeastern region.
Then, assuming the differences between these countries are mainly dependent on governance and socioeconomic factors, we ran a multidimensional data panel regression analysis for 30 EU/EAA countries from 2005 to 2018, including worldwide socioeconomic and governance indicators. Our results show that governance indicators are the variables that better fit our model in explaining the MDR-Pa variance across countries and over time. These results were validated by two machine learning methods (XGBoost and random forest).
Through our models, we show the complexity of the antibiotic resistance problem. Geographical and temporal differences of MDR-Pa across EU/EEA countries could be explained by governance factors, such as corruption control, the rule of law, and other economic factors. Our findings provide another layer (macro-level) of factors for understanding MDR-Pa, including governance and socioeconomic factors, which are related to MDR-Pa. Thus, interventions focused on controlling MDR-Pa should be country-specific interventions; moreover, in addition to individual guidelines, such as promoting the appropriate use of antibiotics, these interventions should be focused on solving national governance problems that impact resource distribution and create health inequalities across countries, especially in southeastern countries.
2. Materials and Methods
2.1. Study Area
Europe contains around 50 countries, 27 of which are part of the European Union (EU), and some of the others are members of the European Economic Area (EEA). The EU/EEA is an economic and political union of 30 countries. It operates an internal (or single) market, which allows for the free movement of goods, capital, services, and people between member states [
24]. A common geographical distribution of the EU/EEA countries into four regions: Northern Europe, Southern Europe, Eastern Europe, and Western Europe [
25].
Table A1 (see
Appendix A) shows the geographical distribution of the countries of the EU/EEA regions that were used in this study.
2.2. Data Collection
The data were collected from the European Antimicrobial Resistance Surveillance System (EARSS), the World Health Organization (WHO), and the World Bank. Their datasets are available through the European Centre for Disease Prevention and Control (ECDC) (
https://www.ecdc.europa.eu/en, accessed on 5 July 2021), the World Health Organization (WHO) (
https://www.who.int/health-topics/, accessed on 5 July 2021), and the Worldwide Governance Indicators (WGI) project (
https://info.worldbank.org/governance/wgi/, accessed on 5 July 2021).
We used the EARSS dataset, corresponding to the resistance percentages and MDR percentages to P. aeruginosa. The original dataset contained information collected from 2000 to 2018, of 8 bacteria, namely: Acinetobacter spp.; Enterococcus faecalis; Enterococcus faecium; Escherichia coli.; Klebsiella pneumoniae; Pseudomonas aeruginosa; Staphylococcus aureus; Streptococcus pneumoniae. This dataset corresponded to the 30 EU/EEA countries.
For this study, MDR-Pa was defined as 1 isolate resistant to at least 3 antibiotic classes [
10,
26]; therefore, the data for
P. aeruginosa reported by EARS was restricted to the percentage of resistance of combined resistance to at least 3 antibiotic groups, reported out of piperacillin/tazobactam, carbapenems, fluoroquinolones, ceftazidime, and aminoglycosides, per country, from 2005 to 2018.
Then, non-informative data was removed and the socioeconomic variables were incorporated; this was to classify the obtained regions of the European continent and match the data by country. After an exhaustive data curation and reorganization, 401 observations (for 30 geographical units: countries (
i1–30) and 14 periods of time (
t2005–2018)), 12 variables, 8 independent variables (GDP_total, GOV_effect, GDP_health, CTRL_corrup, Rule_law, Per_cap_US, Out_pocket_exp, HDI), 1 dependent variable (R_multi or MDR), a temporal variable (Year), and a cross-section variable (Country) were obtained. The region variable was included to classify the most affected countries by MDR-Pa. The final variables considered in this study are specified in
Table 1.
2.3. k-Means Clustering
Through a
-means clustering, we analyzed the correlation between the EU/EEA countries with respect to the resistance percentage to
P. aeruginosa. This technique was applied to group the percentage of the resistance data, to identify how the countries are grouped in relation to the resistance percentages. The Mojena criterion [
27] was used to determine the optimal number of clusters. Once the best number of clusters was determined, the
-means technique was applied to group the data.
2.4. Panel Data Analysis
The multidimensional data panel regression analysis was run in the software EViews v12. This approach provided us with observations on cross-section units (in this case: geographical unites = countries), , over repeated time periods, (in this case: years). Meanwhile, we found difficulty in obtaining information over time for the individual countries. Due to the amount of missing data, and in order to maintain the homogeneity of the data, a polynomial interpolation of the missing data was performed.
However, it was not possible to make up for the lack of data from the EARSS dataset (only with the resistance percentage information), which forced the construction of unbalanced panel data (i.e., when there are missing elements that result in an incomplete data series for an individual, or individuals are absent in some years for a given variable), with
and
, Slovakia being the country with the fewest periods (
), followed by Belgium (
), and with most of them (20) having periods of 14. According to Hsiao [
28], although many statistical proposals are built from the consideration of balanced panels, most of the empirical studies and the data that can be used only enable unbalanced panels, such as the one presented in this study.
We used the two-way fixed effects method (TWFE), fixing cross-section units and periods. Thus, it was necessary to apply a transformation of the data to eliminate the unobserved heterogeneity, which allowed the fixed effects estimator to take the form of an ordinary least squares (OLS) estimator. Finally, we proposed the TWFE model for MDR percentage across EU countries included in our dataset.
2.5. Pooling Panel Data Analysis Using Machine Learning
For the machine learning (ML) validation, we used models such as those used in the references [
29,
30,
31]. We used the
Scikit-learn package in
Python 3.1.2 (Netherlands). Firstly, we ran a Hausman test and an
-test to choose our model (pooled OLS or fixed effects). We modeled the relationship between the input (independent) variables,
or features, and the output (dependent) variable,
, or target variable, as a non-linear relationship with the polynomial of degree,
, called the complexity on the ML framework, on the variables
, as follows:
Then, we used a polynomial of degree of the Polynomial (1) and built the covariance matrix of these polynomial variables, such that , with respect the target variable, , and have selected the polynomial variables, , the covariance value of which concerns the target variable, , which has a greater value than a threshold value, .
Next, we split our data, as follows: 80% was used as training data and 20% was used as testing data. We retrieved the results of the and on the test data for polynomial degrees and the threshold values were selected.
We also used different types of variable filters. We first used the low variance filter, which consists of eliminating the variables whose variance is less than a threshold value, . Apart from the linear regression (LR) algorithm, we have used the k-nearest neighbors (kNN) and the decision tree (DT) algorithms. The hyperparameters for the KNN are k = 5 neighbors, and n = 5 for the decision trees.
Furthermore, we applied the best variable selection (kBVS) and the recursive feature elimination (RFE) methods as other filter techniques of the polynomial features. The best variable selection method selected the best variables, based on the Fisher test.
Finally, we used the XGBoost and the random forest algorithms as two more options for the pooling method, with the combination of the Shapley additive explanations (
SHAP) package [
32]. The latter package was used to formulate some plots which could better explain the models. We used the following hyperparameters for the XGBoost:
.
4. Discussion
P. aeruginosa is a well-known microorganism related to HCAI. In this context, control strategies based on the reasonable and adequate use of intra-hospital antibiotics, especially on patients in critical care, have been proposed. This problem has also led to extreme measures, such as the creation of new types of therapies or new antibiotics [
33]. Here, we provided insights into the macro-level factors, across EU/EEA countries, related to MDR-Pa. To date, studies have been exclusively performed to identify the in-hospital factors [
11,
14] and the molecular mechanisms [
28,
34] related to MDR-Pa as strategies to understand this phenomenon. Our findings provide another layer of understanding both the transnational and temporal variances of MDR-Pa, defined by governance and socioeconomic variables, which work as possible conditional factors for appropriate institutional dynamics and adequate distribution of resources within the health systems across EU/EEA countries.
Other authors [
18,
19,
20,
21,
22,
23] have seen the southeastern (higher)–northwestern (lower) pattern, especially as it is associated with AMR. Few studies have shown this effect in MDR. Gunther et al. [
35] indicates that Eastern Europe shows a high incidence of multidrug-resistant Mycobacterium tuberculosis, with a low incidence in Western Europe. To the best of the authors’ knowledge, this is the first study applying clustering methods to reveal differences in MDR-Pa between southeastern and northwestern EU/EA countries. These results increase the evidence about how national differences in a community as large as the EU could be related to public health problems, such as MDR. Although these differences could be related to antibiotic usage between countries, as private practices carried out by providers and consumers [
36,
37], here, we moved forward, proposing that these dynamics are conditioned across countries and over time by macro-level factors, such as the government quality and the socioeconomic characteristics of the countries.
In 2015, Collignon et al. [
15] showed that factors other than antibiotic usage, such as the quality of governance and private health expenditures, are potentially very important in explaining the different levels of AMR seen in different EU countries. The authors include antibiotic usage as community consumption; however,
P. aeruginosa is an HCAI-related microorganism—antibiotic resistance is poorly impacted by this factor. Here, we propose a clean model, validated by ML algorithms, with similar results for MDR-Pa, which avoid the community consumption effect.
Control corruption was the most important variable explaining the MDR-Pa variance across countries and over time in our data panel model. Our model shows that, the lower the control corruption is in a country, the higher the MDR-Pa. ML models show that the control corruption explains most of the variation of MDR-Pa, with this indicator becoming the most important in the ML validation. Data panel and ML models exclude total GDP as an important variable explaining MDR-Pa; these results question the presumed relation between poverty and higher AMR [
19].
On the other hand, our results show that gross domestic product for health (GDP_health) is an important factor. GDP for health has been shown to have a positive correlation with health outcomes [
38]. In this context, the higher GDP for health in a country, the lesser the MDR-Pa. The corruption in a nation impacts the resource distribution for dealing with important health public issues, such as MDR-Pa. Factor and Kang [
39], observing 133 countries, concluded that corruption is associated with lower levels of health expenditure. Thus, following our findings, the control of corruption is essential to guarantee the adequate distribution of the resources intended to cover the basic health needs of the population, so that, in a country where the control of corruption is lower, there is less probability of adequate spending on health to respond to problems, such as antimicrobial resistance, and there is also less confidence on the part of the population in their institutions.
Private health expenditure, such as current health expenditure per capita in the U.S. and out-of-pocket expenses, were included as part of the final data model as positively related factors. However, the XGBoost and random forest models show these variables are the less important features. High private health expenditure in a country suggests that healthcare is being delivered predominantly in the private sector, this means fewer controls and supervision. This impacts the health sector, resulting in fewer controls on antimicrobial distribution, time of drug therapy, and the volumes used [
15]. This is true for community-level antimicrobial usage. Nevertheless, in the case of MDR-Pa, the ML algorithms results are expected, because MDR-Pa is mainly related to hospital-regulated factors.
Interestingly, the rule of law (Rule_law) governance indicator, which evaluates the confidence in, and abides by, the rules of society, is negatively associated with MDR-Pa. The rule of law could be seen as a social determinant of health [
40], in several ways, in a communitarian institution such as the EU/EAA. Implementing transnational health policies, improving skills of government health policymakers and providers, and the dissemination of information and experience across countries requires trust, not only in national institutions, but also in transnationals, such as the EU [
41]. Here, we propose that countries with higher confidence in transnational institutions are more likely to follow the action plans against MDR, promoted by the EU [
17,
41]. Although there is a generally favorable view of the EU across EU countries [
41], Greece, Czech Republic, Italy (southeastern countries), and France (western country), which are the countries that contribute the most in our MDR-Pa data panel model, have the largest proportion of unfavorable views of the EU. Even Italy, France, and Slovakia have deteriorated over time in their opinions of the EU. On the other hand, most of the northwestern countries (except France) had maintained or increased their favorability on the EU [
42].
To our knowledge, this is the first study to include the human development index (HDI) as an integrated socioeconomic determinant to explain MDR. Strikingly, our data panel model excludes this variable, because we accept a
p-value < 0.05 (it was significant for a
p-value < 0.1 (0.0702)); however, the XGBoost model shows HDI as an important feature and inversely related MDR-Pa—that is, the higher human development in a country, the lower the MDR-Pa. This indicator includes three variables that must be taken into account: life expectancy at birth, schooling (expected years and mean years schooling), and GNI (gross national income) per capita. Since life expectancy at birth is an indirect health status, and quality of health is expected, in this context, MDR-Pa could be associated with lower quality health care systems, an association which has increased MDR threat to life expectancy, reducing neonatal survival in critical care settings [
43,
44]. On the other hand, increased use of emergency and hospitalization services due to lower health literacy [
45] can increase the risk of infection for an MDR-Pa. Recently, Zhen et al. [
46] found that GDP per capita is positively spatiotemporally related with antibiotic resistance in China, which, according to the authors, may be related to higher consumption of antibiotics. GNI per capita could be interpreted in the same way, but more studies are required to understand the importance of this variable.
As far as we know, this is the first study that compares different multivariate methods (data panel and ML algorithms) to evaluate the impact of socioeconomic and governance indicators as macro-level factors explaining the MDR-Pa. Our findings increase the knowledge surrounding MDR, especially concerning such a rarely studied microorganism as P. aeruginosa. Additionally, this study provides another layer (macro-level) of factors in understanding MDR-Pa, including governance and socioeconomic factors explaining the variance of MDR-Pa across EU/EAA countries over time.
Thus, interventions focused on controlling MDR-Pa should be country-specific; in addition to individual guidelines, such as those promoting the appropriate use of antibiotics, these interventions should be focused on solving national governance problems that impact resource distribution, and which also create health inequalities across countries, especially in southeastern countries.
A limitation of this study was the impossibility to include the in-hospital antibiotic consumption variable. European Surveillance of Antimicrobial Consumption Network (ESAC-Net) does not have complete data on the period and countries here studied. This hindered us from concluding that the governance and socioeconomic variables have a greater impact on the MDR than the antibiotic usage variable. It also reveals the need for better MDR surveillance systems in clinical settings to test this hypothesis.
We explored these patterns in exclusively EU/EAA countries because of the robust ESAC-Net surveillance system, and because it is a continent where good data are available from multiple countries. Since MDR is growing, it is necessary to be able to access better and better data to determine the factors related to MDR and to improve the antibiotic resistance surveillances systems, promoting action plans against this serious health public issue. Likewise, ML methods are becoming recursive, robust, and important technics to deal with the complexity of health problems. Here, we used two algorithms that revealed governance indicators as the most important determinants of MDR-Pa. However, the amount of data used (402 observations) could have restricted the power of the ML-based approximations; however, these results are supported by our data panel. This shows the strength of mixed methods in understanding and dealing with complex problems, such as MDR.