1. Introduction
Health care organizations are one of the largest consumers of energy resources. This is due to:
- (a)
The large number of services offered.
- (b)
Care services are provided continuously throughout the day and the year.
- (c)
They have strict requirements to guarantee optimum conditions of temperature and humidity for patients, and to provide a certain degree of comfort for visitors.
The distribution of energy consumption into thermal and electrical energy depends on many factors such as the location of the hospital, its size, the characteristics of its apparatus and facilities, the services offered, the number of services provided, etc. It is a very complex matter to calculate a standard distribution for energy consumption in the health sector. This is because it depends on the number of care services provided. Most energy consumption, however, is linked to the working of systems for climate control and lighting. Thus, 45% of consumption may be considered to be due to air conditioning, 35% to lighting and 20% to production of sanitary hot water [
1].
Water for human consumption is called domestic or clean water. The production facility consists of a cold-water and a hot-water circuit. Drinking water heated to 60 °C is then called Hot Domestic Water (HDW) and reaches 70 °C, in order to pasteurize it, in accordance with current Spanish regulations for the treatment of legionella. The system most commonly used in HDW facilities in the hospital environment is centralization, which uses boilers operating with solid, liquid or gas fuel. HDW facilities that have an accumulator tank, as hospitals do, allow for the availability of a large volume of water, but accumulating in this way creates the conditions for the development of legionella, if the temperature in the tanks is not guaranteed to stay above 60°.
Therefore, the production of HDW and its treatment to prevent legionella requires a large amount of energy due to the necessary high temperatures. There is also a need to ensure a minimum of preventive maintenance, as covered by the regulations for thermal facilities in buildings [
2].
Up to now health care organizations have considered the need to save energy by focusing on investment in more efficient technologies, reducing demand, optimizing contracts or optimizing facilities. Up to now, however, the literature has not addressed the question on how efficient decision making in maintenance can affect the consumption of energy and resources such as water, as well as the quality of care in the hospital.
For example, avoiding faults, leaks or breakdowns may lead to a significant decrease in water consumption. This also leads to significant savings in energy because of the reduced fuel consumption involved in heating the water, a reduction in treatment products and in working hours of the pumping equipment. The decrease in pump working time leads to an extension of the life cycle of the equipment, and a more efficient use of resources.
If appropriate preventive maintenance is carried out, it will reduce the need for corrective maintenance, and there will be an improvement in performance of the facilities, a reduction in costs and better service, to the benefit of patients. A preventive programme in this type of facility usually consists of the actions shown in
Table 1.
The choice of the best combination of maintenance policies is a complex decision [
4], as it involves multiple factors: technical, environmental, economic, safety, and relating to business strategy. Multi-Criteria Decision Making (MCDM) techniques are considered ideal when multiple criteria are involved [
5]. These criteria are generally in conflict, and may also involve multiple stakeholders, decision groups or scenarios. MCDM techniques assist in justifying decisions [
6,
7] to the community or the management of the company, as they guarantee a participatory decision model that is objective and easy to understand [
8].
However, there are no precedents in the literature applying MCDM techniques to thermal energy production systems in health care organizations. That is, there is no analysis of systems intended to guarantee the correct operation of care services such as operating theatres, casualty areas, ICU’s, neonatology, etc., and elimination of legionella, production of primary air, etc. as in the case analysed here.
The literature that uses MCDM techniques for selection of maintenance policies mostly considers alternative maintenance policies in isolation. They consider only the use of, for example, preventive maintenance. However, organizations apply a combination of maintenance policies, since, for example, corrective maintenance should always exist.
There are many combinations of maintenance policies, and the choice of the most appropriate should be made through an objective decision-making process, based on mathematical techniques guaranteeing, in the case of health care organizations, the provision of care services to the required standard. It should be remembered that the quality of healthcare is a major concern for regional, national and international authorities [
9]. This combination of maintenance policies was taken into consideration in this study when identifying the alternatives.
Total consumption in the Spanish hospital sector is 6x10
6 watt/hours, which represents 2% of total national consumption [
10]. The large number of state-run hospitals in Spain, together with the current economic recession, suggests the need to improve energy efficiency. This is defined as applying appropriate management of available resources to obtain the greatest benefit, avoiding any unnecessary cost [
11].
The main aim of this study is to optimize maintenance in the thermal energy production systems in hospitals. A multicriteria model will be described, providing the optimal combination of maintenance policies to be applied to these systems. The idea is to obtain greater availability of the systems and thus, greater functionality of the care services that depend on them, leading to improvements in quality of care. This study was carried out in the University General Hospital of Ciudad Real (UGHCR), in Spain. This is a public hospital that opened in 2005.
The novel contributions of this research are:
Development of a multicriteria model using the Measuring Attractiveness by a Categorical Based Evaluation Technique (MACBETH) approach for maintenance optimization.
Applying a multicriteria model to thermal energy production systems in Health Care Organizations. They have critical characteristics as compared with other buildings and organizations, because of the quantity of water used and the number of services and people that depend on this provision. Some of these people will be patients with immune deficits, or illnesses that may be exacerbated by deficient provision of the systems under analysis.
Modelling the thermal energy production systems in a hospital via continuous time Markov chains, for different combinations of maintenance policies.
Analysing the solutions obtained by the multicriteria model and their implications for quality of care.
This study, then, gives an easily-applied methodology for any health care organization to choose the best combination of maintenance policies. The lack of precedents in the literature on this kind of decision making means that the criteria, descriptors, weightings, value functions, etc., given here may serve as standards for other health care organizations. The way the criterion ‘Impact on health care organization activity’ is designed is especially noteworthy. This criterion takes into account the availability of the system analysed, and the number of critical, important and normal systems that depend on it.
However, the most important contribution of this study is that it aims to improve quality of care. Particularly, the role played by systems or facilities not directly involved in patient care.
The integration of Markov chains and the MACBETH approach allowed the alternatives of the multicriteria model to be defined more accurately, the systems analysed to be better understood, and it provided up-to-date availability information for UGHCR. Availability is an important parameter in guaranteeing proper care services. The choice of the MACBETH approach made it possible to work around the hesitation or uncertainty inherent in group decision making. Furthermore, the alternatives of the problem are evaluated using only qualitative judgements about the difference of attractiveness between two elements at a time. This avoids the drawbacks of ordinal aggregation methods. The MACBETH approach has clearly defined procedures for carrying out each step of the decision process, which include the need to validate the results obtained by the group at each stage. This guarantees that the results of the research will be reliable.
The entire study was carried out on thermal energy production systems currently operational in a public hospital in Spain, thus guaranteeing the viability of the methodology used, and that good results, specific to each health care organization, can be obtained with relative ease.
This article is structured as follows: First comes a literature review on MCDM applied to energy and the selection of maintenance policies.
Section 3 sets out a description of the mathematical grounds for the application of continuous time Markov chains. Next, Markov chains are applied to different thermal energy production systems in a hospital. The multicriteria model following the MACBETH approach is then explained; this is done by describing the criteria, descriptors, value functions and criteria weightings, following the methodology in logical order of application. Then, the results and the real implications their application would have on quality of care at the Hospital are shown. Finally, the conclusions, acknowledgements and references are presented.
2. Literature Review
MCDM methods have been applied with success to different research areas and problems (see for example [
12,
13] for an extensive literature review on application of MCDM methods). MCDM methods are held to be most suitable for solving problems related to energy [
14]. MCDM methods can be classified into continuous and discrete methods depending on the characteristics of the alternatives evaluated [
15]. Continuous MCDM methods, or Multi-Objective Decision Making (MODM) methods, are applied to problems where the decision space is continuous and the alternatives are not explicitly known. The solution is obtained by solving the mathematical model [
16]. Discrete MCDM methods or Multi-Attribute Decision Making (MADM) methods have a finite number of alternatives, explicitly defined, a set of goals and criteria, and a method for classifying the alternatives based on how they satisfy the goals and the criteria [
17].
Among different applications of MODM methods in the energy field, the following could be recalled for their importance: In [
18], Zheng et al. (2015) develop a multi-objective group search optimizer with adaptive covariance and Lévy flights to optimize the power dispatch of a large-scale integrated energy system. The goals of economy and reliability of the power grid and distributed district heating and cooling units are taken into consideration. In [
19], Yu et al. (2017) describe a multi-objective mixed integer non-linear programming model for investment decision making in a coal mine in China. The model described maximizes company profits and minimizes energy consumption and pollutant levels. In the field of renewable energy, Kantas et al. (2015) [
20] analyse the economic and environmental viability of ethanol production from multiple biomass sources. They consider factors such as amount of biofuel inventory held per unit time, amount of biomass purchased and energy produced from each biomass type per unit time, CO
2 emissions released and total water used during biofuel production per unit time, penalties and incentives for CO
2 emissions and penalties for excessive water usage during biofuel production. In [
21], Cobuloglu et al. (2015) set out a multi-objective mixed-integer optimization model to find optimal solutions on land allocations for food and energy crops, harvesting time and quantity, seeding time, and budget allocations to farm operation, while maximizing the economic and environmental benefits. A full and up-to-date review of MCDM methods applied to renewable energies can be found in Kumar et al. (2017) [
14].
Additionally, MADM techniques are considered highly suited to problems of reliability, maintainability, availability and safety [
22]. They have, as such, been successfully applied to the choice of maintenance policies in different types of company. Among the literature analysing choice of maintenance policy the following contributions are particularly noteworthy. In [
23], Bevilacqua and Braglia (2000) apply the Failure Mode Effect and Criticality Analysis Technique (FMECA) to obtain a criticality index and to categorize the machinery by risk. Then, they apply the Analytic Hierarchy Process (AHP) to choose the optimal maintenance policy for each of the groups. In [
24], Al-Najjar and Alsyouf (2003) classify maintenance policies by their ability to provide information about the changes in the behaviour of failure causes. They do this with a fuzzy multiple criteria decision-making methodology, where the failure modes are the criteria. Emblemsvåg and Tonning (2003) [
25] apply AHP to identify, from among four options, the best maintenance policy for weapons system in the Norwegian Army. Monte Carlo simulations are used to evaluate the robustness of the decision. In [
26], Bertolini and Bevilacqua (2006) build a model using AHP to obtain the priority levels for each maintenance policy, corrective, preventive and predictive. Next, a goal-programming model was applied to identify the best set of maintenance policies for the failure modes in a set of the 10 most critical centrifugal pumps operating in an Italian oil refinery. In [
27], Gómez de León and Ruiz (2006) propose calculating an equipment criticality index via a normalized Simple Additive Weighting (SAW) method. The various maintenance policies are assigned according to the value obtained from the criticality test. In [
28], Wang et al. (2007) construct a model that uses a modified fuzzy AHP, deriving crisp priorities from a consistent or inconsistent fuzzy judgement matrix and solving an optimization problem with non-linear constraints. In [
29], Shyjith et al. (2008) combine AHP and Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) to choose a maintenance policy in a textile industry. In a similar industry, Ilangkumaran and Kumanan (2009) [
30] apply fuzzy AHP and TOPSIS to obtain a classification of maintenance policies. In [
31], Mousavi et al. (2009) combine Factor Analysis and fuzzy TOPSIS in the choice of maintenance policies. In [
32], Ahmadi et al. (2010) use AHP to calculate the effectiveness of appraisal criteria for a maintenance policy. The different maintenance policies in an aircraft system are ranked using the benefit-cost ratio, TOPSIS and VIseKriterijumska Optimizacija I Kompromisno Resenje (VIKOR). In [
33], Arunraj and Maiti (2010) apply AHP to obtain a priority vector for the maintenance policies analysed. The priority vector is a goal-programming model with three goals: global scores of maintenance policies, local scores of maintenance policies based on risk contribution, and local scores of maintenance policies based on cost. In [
34], Ghosh and Roy (2010) use the worst-case failure mode to calculate criticality of equipment, and with this information they prioritize maintenance activity. They then use fuzzy AHP and a goal-programming model to optimize the goals of risk reduction and cost minimization. In [
35], Chen and Chen (2010) apply AHP, TOPSIS and grey relational analysis to choose the optimal maintenance policies and their performance in a semiconductor company, while Siew-Hong and Shahrul (2012) [
4] use fuzzy TOPSIS to assess and select the optimal maintenance policy. In [
36], Zhaoyang et al. (2011) combine a risk-based inspection method and AHP to choose the optimal maintenance policy in oil and gas industries. Risk is defined from the product of the probability of failure and the consequences of that failure, and the resulting combinations are presented in a risk matrix. The equipment is assigned a given maintenance policy depending on the level of risk. In [
37], Zaim et al. (2012) apply AHP and the Analytic Network Process (ANP) to choose the most appropriate maintenance policy in a local newspaper printing facility in Turkey, while Shahin et al. (2012) [
38] apply ANP to the machinery in a mining company. In [
39], Cavalcante and Lopes (2015) use a multi-attribute value function to choose opportunistic maintenance, preventive maintenance or actions of minimal repairs in a cogeneration system using sugarcane bagasse at a power plant in Brazil. In [
40], Kirubakaran and Ilangkumaran (2016) use fuzzy AHP, grey relational analysis and the TOPSIS technique to decide, from corrective maintenance, predictive maintenance, time-based preventive maintenance, and condition-based maintenance, which is best for the pumps of a paper manufacturing plant.
Following [
41], the top of
Figure 1 shows the criteria used in the MADM literature, ordered by frequency, for the choice of most suitable maintenance policy. Criteria of costs and savings are clearly most significant, together with other criteria such as competitiveness or quality, which are very important to industrial companies. On the other hand, the bottom part of
Figure 1 gives the subcriteria for the most commonly used criteria: costs, savings, safety and availability.
The only precedent for analysing maintenance in health care organizations is in [
42] where Taghipour et al. perform a statistical analysis of the maintenance data of complex medical equipment with censoring and missing information for the development of a maintenance/inspection optimization model. In [
43], Sezdi (2013) studies the performance of medical devices to establish programmes of preventive maintenance. Specifically, the use of MADM techniques in the literature is limited to Taghipour et al. (2011) [
44], which prioritizes medical devices according to their criticality through the use of AHP. The criticality values obtained are used to establish guidelines for selecting appropriate maintenance strategies for different classes of device. In [
45], Carnero and Gómez (2016) describe a multicriteria methodology for the choice of maintenance policy in dialysis systems for patients infected with hepatitis C, and with hepatitis B, acute and chronic. The result is that corrective and preventive maintenance plus two reserve machines is the best choice.
3. Markov Chains
Markov processes are widely used in the literature as they allow systems to be modelled, and their reliability, maintainability, availability and safety parameters to be estimated [
46].
Continuous-time Markov chains consider a discrete set of exhaustive and mutually exclusive states where the time at which there is a change from one state to another is random.
Mathematical modelling of a system by Markov chains is described in [
47,
48,
49,
50]. It consists of obtaining a graph defining the states of the system, and transition between states occurs through failure or repair. Failure may cause stoppage of the system directly (catastrophic failure) or through a process of wear. In the latter case the states of wear and decline are considered to be non-catastrophic failures.
The conditional probability of a transition in homogeneous continuous-time Markov chains is defined in Equation (1).
and the probability matrix for transition
between states is shown in Equation (2).
where
.
Furthermore, the transition probabilities should be continuous at t = 0.
The stationary state is obtained by letting
and if the chain is regular the resulting matrix of transition probabilities is shown in Equation (3).
Substituting in the first equation of state in the form of a matrix:
gives Equation (5):
And since
this gives:
Equations (3) and (6) state that in a regular Markov chain, considering time t sufficiently large, the transition and state probabilities stabilize to values independent of the initial state and of time t. This is called a stationary state.
While taking the derivative the second equality in Equation (4) for
gives Equation (7).
and since
Writing out Equation (8) gives,
and deriving with respect to
in
,
where, from Equation (9)
Furthermore, since
, Equations (11) and (12) may be combined into a single equation:
and in matrix form,
Thus, the vector
of probabilities in the stationary state may be expressed through Equation (15).
Given the specific structure of , the vector comprises the last line of matrix . This system of equations may be interpreted as a system of probabilistic fluid balances for the continuous chain.
4. Markov Chains for Thermal Energy Production Systems
The thermal energy production systems analysed in this study are in the UGHCR in Spain. This is a state-run hospital opened in 2005. It is one of the largest hospitals in Spain. Its direct catchment area includes 66 towns and villages, meaning it offers health care to some 174,550 people directly, and potentially to 370,000 inhabitants from other hospitals. It is a regional reference in the specialities of nuclear medicine, eating disorders, and for its blood bank.
The thermal energy production systems are responsible for producing hot and cold water centrally to meet the needs of UGHCR. These systems are codified in the UGHCR as Pracal, Prafri, Prafcd, Pacbom and Pafbom.
4.1. The Pracal System
The Pracal system (
Figure 2) produces the hot water needed for the climate conditioning systems and also produces hot domestic water.
The system has four mixed boilers with a modulating burner, three of them are identical at 3000 kW each, while the other is of 900 kW, and is used to produce hot domestic water in summer. Three boilers working at full power can provide the minimum needs for the UGHCR. At times of low demand, such as in summer, or in the case of a breakdown in the smaller, 900 kW boiler, any of the three boilers is able to provide the required energy. Failure of the system is when two boilers break down.
Starting the boilers is an automatic process, carried out by the control system, depending on the load required. All incidents are registered in the control centre. A system failure is a breakdown in the burner, or breakage or blockage of the tube bundle of the boiler. If there is a failure in the control system, the boiler can be started manually.
λ
11 and μ
11 are defined as the rates of failure and repair due to breakdowns in the burner. λ
12 and μ
12 are the rates of failure and repair associated with breakage or blockage of the tube bundle of the boiler. The equivalent repair rate is calculated from Equation (16). Using the data for the specific system gives Equation (17).
The equivalent failure rate is
.
Figure 3 shows the Markov graph for the Pracal system. In the graph, the nodes represent states. A link from state 0 to state 1 describes the transition probability from 0 to 1, in this case 4
. The link from state 1 to state 0 represents the transition probability between these states, in this case the repair rate
. A description of the five feasible states of the Pracal system is shown in
Figure 2.
The transition matrix corresponding to the Markov graph is shown in Equation (18).
Solving the system of equations in Equation (15) gives the values for the probability vector in the stationary state. The solutions of these equations in all the systems analysed in this study were obtained using a recursive approach developed in MATLAB®.
Given that the system is operational in states 0, 1 and 2, the mean availability of the system is .
4.2. The Prafri System
The so-called Prafri system at UGHCR produces all the cold water required by the hospital for climate control purposes.
The system has three identical rotary coolers of 2700 kW each and a screw cooler of 900 kW, used to produce cold water at times of reduced demand. Two coolers working at full power are sufficient to provide the minimum needs for UGHCR. At times of low power, such as in winter or if there is a breakdown in the 900 kW cooler, any of the other three coolers is able to provide enough power to meet the demand.
System failure is when there is a breakdown in the second cooler. Once a fault is detected in any of the coolers, it will be repaired immediately. The coolers are started automatically according to the demand. All incidents are recorded at the control centre and tele-maintenance is provided in real time by the manufacturer’s support service. If there is a fault in the control system, the coolers can be started manually as required.
The failure and repair rates due to breakdown of the rotary cooler are defined as
and
.
and
are the failure and repair rates due to breakdown of the electric motor, and
and
are the failure and repair rates due to the starting-stopping mechanism. The equivalent repair rate is calculated from Equation (16), giving, for this system, Equation (19).
The equivalent failure rate is .
The graph and the transition matrix for this system are identical to those produced for the Pracal system, as the sequence of states is the same. The resulting transition matrix is, therefore, that shown in Equation (20).
The mean availability of the system is thus .
4.3. The Prafcd System
The so-called Prafcd system at UGHCR cools water from the condensers of the cooling units of the Prafri system.
The system comprises four cooling towers, which work in active redundancy, fed by a group of four pumps that also work in active redundancy. These pumps have similar characteristics (variable speed) and automatically alternate according to the number of working hours, and so, when calculating availability, they are considered as two groups operating in series: one comprising a set of cooling towers and the other a set of pumps. Each cooling tower contains an inbuilt automatic biocide dosage system to treat legionella.
The system is considered to have failed when two towers cease to function, including the anti-legionella system, or two pumps fail. The control centre indicates the existence of a fault in the towers but not in the anti-legionella system.
is defined as the equivalent failure rate for each tower calculated from Equation (21), where
and
are the failure rates of the anti-legionella system, the mechanical and filtering system and the ventilation and mechanical transmission system, respectively.
is defined as the equivalent repair rate of each tower calculated from Equation (22), where
and
are the repair rates of the anti-legionella system, the mechanical and filtering system and the ventilation and mechnical transmission system, respectively.
The failure and repair rates of each pump are calculated in the same way as for the towers, where: and , are the failure and repair rates of the axle-bearing group, the mechanical closure system and the motor, respectively, for each pump. The equivalent failure and repair rates for each pump are defined as and .
The Markov graph and the availability matrix for each block are similar to those obtained for the Pracal system (
Figure 3), but the failure and repair rates assigned in each case are specific to this system.
The mean availability of the system
is the product of the mean availability obtained for each block, as shown in Equation (23).
where
is the mean availability of the block comprising the cooling towers, and
is the mean availability associated with the group of pumps.
are the coefficients found by solving Equation (15).
4.4. The Pacbom and Pafbom Systems
The Pacbom and Pafbom systems feed hot and cold water produced by the Pracal and Prafri systems to other systems in the hospital.
The pumping system of the primary hot and cold water circuit comprises four identical pumps operating in parallel. These pumps have variable speed and alternate automatically according to the number of working hours. At full load three pumps operate simultaneously, so there is at least one reserve pump connected automatically if a breakdown in one of the operational would happen. We have assumed that the failure rate of the automatic control system is negligible. The system is considered to have failed when two pumps break down.
As the pumps in this system are similar to those in the Prafcd system, the same failure and repair rates will be used. Thus,
. and
are the failure and repair rates of each pump. The Markov graph for these systems is similar to that shown in
Figure 4.
The transition matrix for this system is shown in Equation (24).
The system is operational in states 0 and 1, therefore, the mean availability of the system is .
5. A Multicriteria Model for Optimizing Maintenance Policies
The MACBETH approach was created by Bana e Costa and Vansnick [
51]. It is an interactive approach for cardinal measurement that has been used on many real problems [
52,
53,
54,
55,
56,
57,
58,
59], among others on administration or public evaluation. MACBETH has the advantages of other MCDM methods, such as including a value tree resolving a complex problem into elements that are easier to analyse, structured in the form of a tree. This gives an organized visual overview of the concerns to be analysed. A large number of quantitative and/or qualitative decision criteria can also be included, as well as a tool to check the consistency of judgements. Additionally, the use of pairwise comparisons allow more accurate information to be obtained about the preferences of the decision makers [
60]. MACBETH is easily understood by the decision group. Nevertheless, the final decision to choose the MACBETH approach was taken because:
The alternatives of a problem can be evaluated using only qualitative judgments about the difference of attractiveness between two elements at a time, in order to generate numerical scores for the options in each criterion and to weight the criteria. It also avoids the drawbacks of ordinal aggregation methods, such as the Condorcet’s paradox or Arrow’s theorem [
61].
It comes with a user-friendly software called M-MACBETH [
62]. M-MACBEH simplifies the application of the entire multicriteria evaluation process. Among the important tools are the construction of value functions by linear programming, the ability to show variation intervals for the valuations associated with the performance levels so as to retain compatibility with the remaining judgements of the judgement matrix. It also checks the consistency each time a judgement is given, and provides suggestions for solving any inconsistencies found, and it facilitates the performance of a broad sensitivity and robustness analysis.
It offers the possibility of including different levels of precision in the judgements given. For example, if the decision group knows that one alternative is better than another but cannot specify how much better it is, the judgement “positive” may be chosen. On the other hand, if the group knows accurately how much better one alternative is than another, they can choose a value from among the various semantic categories of difference in attractiveness: no difference, very weak, weak, moderate, strong, very strong and extreme. It also allows hesitation or disagreement between the members of the group, allowing the choice of a number of consecutive categories. This permits uncertainty to be included in the decision process without having to use a fuzzy multicriteria model, with a far higher level of computational complexity.
MACBETH, unlike other MCDM techniques, requires validation of the results obtained from the judgements given by the decision group. It requires, for example, the validation of the weightings of the criteria, and acceptance or adjustment of the judgement. This is to match them with the result expected by the group. The extra effort to analyse the intermediate results of the model leads to better decisions.
MACBETH constructs the value functions, the weighing process, etc., using two reference levels (good and neutral) that must be defined for each evaluation criterion. Inconsistencies will be due to the unstructured way that pairwise comparisons are made [
63]. MACBETH, like other MCDM models such as, for example, the Best-Worst Method [
63], uses two references in the decision process, which leads to more consistent comparisons, and so to more objective and reliable results.
-
The MACBETH approach consists of a complete methodology that facilitates objective decision making, requiring a series of steps that are not defined or not specified with other techniques. MACBETH has clearly defined procedures to guarantee that each step of the decision process: structuring, weighting and ranking (analysis of results and sensitivity analysis) is properly carried out.
For these reasons, the MCDM considered optimal for this study was the MACBETH approach.
We now describe the multicriteria model for optimizing maintenance in thermal energy production systems of UGHCR, using the MACBETH approach. The stages of the MACBETH methodology are first described, and then applied to the analysed systems.
5.1. Structuring
For the building of a multicriteria model, a decision group was set up, made up of those in charge of different areas of the UGHCR: maintenance of facilities, electromedicine, programming and admissions, environment, care staff, and health and safety. The decision group was coordinated by the head of technical services at UGHCR. This decision group established decision criteria, their descriptors and associated performance levels.
The decision group analysed the criteria applied in the previous literature. As this was, however, applied mostly to manufacturing, transport and energy organizations with different objectives, requirements and characteristics than a health care organization, they decided to propose their own criteria. Availability is, nonetheless, a criterion found in a large number of studies (
Figure 1). The availability has been considered in this model, through the criterion Impact on health care organization activity, in an innovative manner. The literature also considers the criterion Investment cost together with Maintenance cost. However, the maintenance policies in this study do not require investment cost, as periodic and continuous predictive maintenance, proposed here as alternatives, would be outsourced by UGHCR. This assumes that there will be no need for equipment to gather or handle data, or fault diagnosis software. Therefore, the costs of outsourcing this policy are included in the maintenance costs. Environmental damage and zero pollution are the criteria used in the literature in the few existing studies that include environmental questions in the choice of maintenance policy. A descriptor was associated with each criterion or subcriterion to make an operational description. A descriptor is an ordered set of plausible performance levels in a criterion to describe objectively the impacts of alternatives with respect to that criterion [
53]. Scale levels were defined for each descriptor. This allows unambiguous subsequent interpretation of the descriptors. Each scale has the reference levels Neutral (N), for a level considered by the group to be neither satisfactory nor unsatisfactory, and Good (G) considered by the group to be fully satisfactory [
52]. These reference levels make the comparisons between criteria easier.
The decision group established the following decision criteria:
Maintenance costs (MC). These are the annual maintenance costs for labour and materials associated with a given alternative for each system analysed. The scale levels of the descriptor Annual maintenance costs, ordered in decreasing order of relative attractiveness are:
- -
L11 (highest level of performance). €1000 (G)
- -
L12. €2000
- -
L13. €3000 (N)
- -
L14. €4000
- -
L15 (lowest level of performance). €5000
Operativity. This criterion is related with the working of the system from the perspective of health care professionals or maintenance of the medical equipment. This criterion includes the subcriteria Safety of maintenance workers and Suitability of maintenance actions. The scale levels of the descriptors associated with these subcriteria are common to all the systems analysed. Now, the subcriteria and the scale levels of the descriptors used are defined:
- ▪
Safety of maintenance workers (SW). This assesses the risk that maintenance operatives and, in general, those who work directly with the systems, are exposed to. To assess the impact of this subcriterion on the alternatives, the descriptor Need to take safety precautions due to the existence of risk was defined by the decision group. The scale levels of this descriptor are as follows:
- -
L21 (highest level of performance). There is no potential risk to the worker at any time (G).
- -
L22 It is necessary to take certain basic precautions when undertaking the activity, due to the existence of some potential risk.
- -
L23 It is necessary to take basic precautions when undertaking the activity, due to the existence of a definite risk (N).
- -
L24 It is necessary to take basic precautions plus some specific precautions when undertaking the activity, due to the existence of a definite risk.
- -
L25 (lowest level of performance). It is necessary to take basic, specific and emergency precautions when undertaking the activity, due to the existence of definite risk.
- ▪
Suitability of maintenance actions (MA). This measures the level of suitability in the technical working of thermal energy production systems. That is, it considers safety when diagnosing faults, and the level of programming of the corrective tasks. The scale levels of the descriptor Capacity for diagnosing faults and programming corrective actions are:
- -
L31 (highest level of performance). The maintenance worker gives reliable diagnoses of the breakdowns and faults analysed and with sufficient notice of the failure, so that corrective action can be programmed jointly with other systems involved (G).
- -
L32 The maintenance worker gives reliable diagnoses of the breakdowns and faults analysed and with sufficient notice that corrective action needs to be programmed in the system.
- -
L33 The maintenance worker gives reliable diagnoses of the breakdowns and faults analysed and with a period of notice such that corrective action must be carried out immediately in the system (N).
- -
L34 The maintenance worker cannot give reliable diagnoses of all breakdowns and faults analysed and requires those in charge to take a decision. The period of advance warning in detecting the breakdown or fault is limited, and so immediate intervention is required, to carry out corrective action in the system.
- -
L35 (lowest level of performance). The maintenance worker cannot give reliable diagnoses of the breakdowns and faults analysed and the period of advance warning in detecting the breakdown or fault means that immediate intervention is required, to carry out corrective action in the system.
Impact on the environment (IE). This measures the level of negative consequences for the environment, both inside and in the areas surrounding UGHCR. The descriptor used is severity of environmental damage, and the scale levels are as follows:
- -
L41 (highest level of performance). Does not cause any type of environmental damage (G).
- -
L42. May cause environmental damage whose effects do not break regulations on the environment.
- -
L43. May cause environmental damage requiring corrective measures to restrict such damage (N).
- -
L44 (lowest level of performance). May cause irreversible environmental damage outside the limits of the hospital, which needs to be communicated to the authorities.
Impact on health care organization activity (IA). This measures the impact that a failure in the thermal energy production systems would have on other hospital systems. To evaluate this, the descriptor shown in Equation (25) was defined.
is the number of systems (critical, important and normal) that depend on the system analysed.
is the weighting index, which may take the values one, two or four, accordingly, depending on whether the system analysed has been classified as normal, important or critical, respectively.
is the mean availability of the system.
Therefore, this descriptor takes into account the mean availability of the system evaluated and the influence that that system has on other systems that depend on it directly or indirectly, weighted according to criticality for the health care organization.
The scale levels of this descriptor lie between the most favourable and most unfavourable possible, obtained in all the systems analysed in UGHCR. The most favourable value corresponds to the minimum possible value in the number of dependent systems and the minimum weighting and maximum mean availability possible, where the resulting value is . The most unfavourable scale level corresponds to . The scale levels, defined for the descriptor take into account the most and least favourable previous levels, are:
- -
L51 (highest level of performance) 0 (G).
- -
L52 (0, 50] (N).
- -
L53 (50, 100]
- -
L54 (100, 200]
- -
L55 (lowest level of performance) (200, 400]
The hierarchical structure or value tree of the thermal energy production systems is shown in
Figure 5.
A value function is required to assign value scores to the performance levels of a descriptor relative to the fixed scores of 0 and 100 assigned to the good and neutral reference levels. To convert the various scale levels of each descriptor into a value scale, the decision group gives qualitative judgements of differences in attractiveness between pairs of performance levels. The decision group was asked to choose one of the MACBETH semantic categories of difference in attractiveness: no, very weak, weak, moderate, strong, very strong or extreme. If necessary, judgements can be included that cover two or more consecutive MACBETH semantic categories. Each time the decision group provides a qualitative judgement, the consistency of all the judgements previously made by the group is checked automatically by the M-MACBETH software. The value function is generated by M-MACBETH through linear programming [
64].
The obtained value functions were checked by the decision group to ensure that they adequately represent the relative magnitude of the decision makers’ judgements [
65].
Figure 6 shows the MACBETH judgement matrices for the criteria/subcriteria: Maintenance costs, Safety of maintenance workers, Suitability of maintenance actions, Impact on the environment and Impact on health care organization activity.
All the judgement matrices given by the decision group are consistent. The value functions obtained by linear programming in M-MACBETH for the different criteria and subcriteria are shown in
Figure 7. MACBETH identifies when a criterion uses qualitative performance levels (qualitative levels of scale of the descriptor) or quantitative performance levels. The subcriteria Safety of maintenance workers and Suitability of maintenance actions and for the criterion Impact on the environment, use qualitative levels of scale of the descriptor. In this case, the value function consists of a numerical scale that assigns the neutral reference level of the descriptor the value 0, and the good reference level the value 100. For example,
Figure 7a shows that the value function of the criterion Safety of maintenance workers, the value 100 is assigned to the performance level L21, which considers that there is no potential risk and, therefore, has the good reference level. The scale level L23, assigned to the neutral reference level, is assigned the value 0. This means that the numerical scales are linear and continuous.
The criterion Maintenance cost uses quantitative performance levels.
Figure 7c shows two graphs; on the left is a vertical axis with a numerical scale that assigns the neutral reference level of the descriptor Maintenance costs, in this case €3000, to the value 0, and the good reference level (€1000) to the value 100. On the right is a piecewise-linear value function graph. The horizontal axis shows the scale levels of the descriptor, and the vertical axis the scores. The value function of the criterion Maintenance costs gave a numerical scale in two linear pieces with different slopes, which implies a discontinuity in the scale levels: 1000 to 3000, and 3000 and 5000. These linear segments facilitate the calculation of the score of any alternative whose performance with respect to the Maintenance cost criterion is between two consecutive scale levels of its descriptor.
If, given the value function, the decision group wishes to change the score, the M-MACBETH software provides a graph interval, and within this interval the score of the scale level can be adjusted, keeping the scores of the other performance levels the same, and preserving the consistency of the judgement matrix. For example, if the decision group considers that the difference in attractiveness between 1000 and 2000 should be slightly greater than the difference between 2000 and 3000, M-MACBETH produces the graph in
Figure 8. This shows that 2000 could be given a value in the interval 0.01 to 74.99 and the consistency of the judgement matrix would be maintained. This was not, however, the case in this study, and the decision group validated the value function originally obtained.
5.2. Weighting
To obtain weightings for the criteria, an additional alternative was defined at the neutral level for all criteria and subcriteria. The decision group was asked to qualitatively judge, using the MACBETH semantic categories from “very weak” to “extreme”, the increase in overall attractiveness provided by a swing from the neutral level to the most attractive impact level, in each of the criteria/subcriteria considered. For example, how much would a swing from neutral to good in Impact on the environment increase its overall attractiveness? The decision group answered “strong”. Next, how much would a swing from neutral to good in Suitability of maintenance actions increase its overall attractiveness? The answer of the decision group was “very weak”. A similar question was subsequently asked for the criteria Impact on health care organization activity, Maintenance costs and Safety of maintenance workers. In the case of the criterion Safety of maintenance workers, the group hesitated between weak and moderate qualitative judgements. Finally, they decided that to include both semantic categories in the answer was most suitable. These judgements make up the last column of the matrix of
Figure 9. M-MACBETH software derives an incomplete ranking of the swings, which the decision group was asked to validate and complete, changing the order of the criteria/subcriteria in the matrix from the most attractive swing to the least attractive (see in
Figure 10 the new ordering of the criteria/subcriteria by rows and columns).
The group then compared how much more preferable the change from the neutral to the good level is in the criterion Impact on hospital activity as opposed to the same change in the criterion Impact on the environment. The qualitative judgement given by the decision group was “weak”. Next, the group was asked How much more attractive is a swing from neutral to good in Impact on hospital activity than in Maintenance costs? The group was doubtful between weak and moderate. Therefore, the judgement matrix of
Figure 10 includes both consecutive categories “weak-moderate”. This comparison process is repeated between the criterion Impact on health care organization activity and the criteria Safety of maintenance workers and Suitability of maintenance actions respectively. The first row of the weighting matrix is completed in this way.
This process was repeated, now first asking the group How much more attractive is a swing from neutral to good in Impact on the environment than in Maintenance costs? Of course, when a criterion is compared with itself, the absence of difference in attractiveness is shown with a “no” in the diagonal of the judgement matrix. The process is then repeated row-by-row (from left to right), comparing the swings pairwise, also using the MACBETH semantic categories, until the weighting of the judgement matrix was completed (see
Figure 10) [
65]. This procedure for producing the weightings of the criteria using references in the performance scales avoids the most common critical error found in decision making, according to Keeney [
66].
The consistency of each judgement given by the group was tested by M-MACBETH. The weights
obtained are:
,
,
,
and
. The group was asked for their degree of agreement with the weights obtained and if they consider it necessary to adjust the values obtained interactively to obtain more appropriate values. Nevertheless, the decision group expressed agreement with the weightings obtained, and no adjustment was necessary. The weightings were therefore validated by the decision group. The validated weightings are shown in the scale column in
Figure 10.
5.3. Alternatives
The alternatives considered in this study are a combination of various maintenance policies.
- -
Corrective maintenance (ALT1). Corrective maintenance is carried out only when an unexpected failure occurs, or as a consequence of performing preventive or predictive activity if a need for corrective action is discovered. Once a fault has been detected, corrective maintenance has an almost immediate response. However, the resolution time depends on the nature and extent of the fault.
- -
Corrective maintenance and preventive maintenance (ALT2). Preventive maintenance is carried out with differing frequencies, depending on the system: fortnightly for Pracal and Prafcd, and monthly for Prafri, Pacbom and Pafbom. Corrective maintenance is carried out in a complementary fashion, as stated above. In the Prafcd, the maintenance policies used for the cooling towers are corrective and preventive maintenance.
- -
Corrective maintenance, preventive maintenance and periodic predictive maintenance (ALT3). Predictive maintenance is carried out quarterly in the Prafcd, Pacbom and Pafbom systems. This is done with portable devices that analyse the vibrations of the pumps and ventilators, to be later analysed for diagnosis. In the Pracal system, predictive maintenance is carried out quarterly by analysing the burning of the fuel. Corrective and preventive maintenance are performed as indicated for the previous alternative.
- -
Corrective maintenance, preventive maintenance and continuous predictive maintenance (ALT4). In the Prafri system predictive maintenance is performed continuously by means of vibration analysis. The vibration data are transmitted to the control centre of the official support service. Corrective and preventive maintenance are performed as previously indicated.
6. Results
To obtain the Impact on health care organization activity for each system it is necessary to calculate in advance the mean availability for each alternative.
Table 2,
Table 3,
Table 4 and
Table 5 show the failure and repair rates as a function of the alternative under consideration, for the burner-boiler unit, the cooler units, the cooling towers and the pumps, respectively.
Table 6 shows the results of the mean availability for each alternative and analysed system.
Equation (25) serves to calculate the impact of each system on hospital activity.
Table 7 shows the number of critical, important and normal systems that depend on each system to be evaluated, and the resulting value of
.
The results of the assessment of the criteria and subcriteria for each system and alternative are shown in
Table 8.
The evaluation of each alternative in each thermal energy production system is carried out by simple additive aggregation from bottom to top in the value tree. When considering
decision criteria, the performance
of an alternative
is calculated using Equation (26) [
67].
with
is the weight of each criterion and
is the value score of
in criterion
.
Figure 11 shows the classification of alternatives obtained for the different thermal energy production systems at UGHCR by using Equation (26). The model gives as the most suitable alternative the use of corrective maintenance together with preventive maintenance (ALT2) in the Pracal, Pacbom and Pafbom systems. On the other hand, it recommends using corrective, preventive and periodic predictive maintenance (ALT3) in the Prafcd system, and corrective, preventive and monitored predictive maintenance (ALT4) in the Prafri system. UGHCR is currently applying, in all the analysed systems, corrective maintenance and preventive maintenance (ALT2). This assumes that the Pracal system has the highest possible availability (1), while the Prafcd and Prafri systems have an availability of 0.9990. In the case of Pacbom and Pafbom the availability is 0.9999 and, therefore, very close to the maximum possible.
In three of the analysed systems the model suggests continuing with the combination of maintenance policies that was already being used at UGHCR, that is, corrective and preventive maintenance. The model in these cases allows maintenance decisions to be justified based on objective models where the decisions have been based on the consensus of a multidisciplinary decision group.
In the case of the Prafcd system, the model suggests applying corrective, preventive and predictive maintenance. This assumes an improvement in availability of 3.20% and would require a cost increase of 41.33%.
The model suggests, together with the use of corrective and preventive maintenance, using continuous predictive maintenance in the Prafri system. This would mean the introduction of an additional maintenance policy with respect to those that were being used until then in the system at UGHCR. In this way, the system would go from an availability of 0.9990 to 0.9999. Maintenance costs would increase by 80.65%. Such a significant increase in cost is due to the high technological level of continuous predictive maintenance. That is, the use of continuous predictive maintenance requires purchasing and maintaining advanced systems for acquisition and handling of predictive data (vibration, oil used parameters, thermography images, etc.). Furthermore, high-level training of staff in order to use and develop the policy is required. Nonetheless, this cost increase is compensated by the fact that the use only of corrective and preventive maintenance can have serious consequences for the normal working of UGHCR, albeit for a short time. The change in the Prafcd and Prafri systems to the alternatives ALT3 and ALT4 respectively, would eliminate these serious consequences, which stem from the large number of systems that depend directly on the correct working of the Prafcd and Prafri systems, such as: 14 systems that produce conditioned air for operating theatres, three systems producing primary air in Casualty, two systems dedicated to producing primary air for ICUs, production of primary air for neonatology and paediatric services or haemodialysis. Thus, the best choice could eliminate small stoppages in the service of the system that might otherwise seriously affect a variety of hospital services that are considered critical or important. Furthermore, the application of ALT3 and ALT4 would improve maintenance programming, and safety of care and maintenance workers, increasing the availability of maintenance workers to perform maintenance activities of greater added value to the organization.
Table 9 shows the values obtained by the model for the alternatives currently applied at UGHCR and those proposed by the model in the Prafcd and Prafri. The availability and consequences of applying each alternative are also included. In the remaining systems, no change of alternative is suggested and therefore they are not included in
Table 9.
The sensitivity analysis is performed by modifying a specific criterion weight slightly, to values considered feasible by the decision group, and subsequently adjusting the difference equally over the remaining criteria in such a way that
. The classification of alternatives does not change. The results of the sensitivity analysis for the Pracal system when the criterion Maintenance cost is modified is shown in
Figure 12a. The current weight of Maintenance cost in percentage is shown in red at the top of the figure (18.18%) and is accompanied by a continuous vertical red line. ALT2 is the alternative chosen for weighting in the criterion Maintenance cost, above 11.3%, given that it is the one with the largest overall variation. The intersection point between a pair of alternatives can be seen as a pink line. The weighting of the criterion that reaches this intersection point is shown on the x-axis (11.3%) and the overall score of the alternatives that cross (ALT2 and ALT4), is shown on the y-axis (66.5). For weightings in the criterion Maintenance cost lower than 11.3%, the alternative chosen would be ALT4. It would be necessary to assign to Maintenance costs a weighting 37.84% lower than the current one in order for the alternatives ALT2 and ALT4 to change places. This decrease is very high and is not considered feasible by the decision-making group.
Figure 12b shows that the current weight of the criterion Safety of maintenance workers is 13.64% and the alternative chosen for this value is ALT2. The intersection point between ALT2 and ALT4 is reached when the weighting of this criterion is 26.0%. For weightings higher than 26.0%, the alternative chosen would be ALT4. This supposes, an increase of 90.62% with respect to the current value, to produce this reversal of places. This increase required to change the classification of alternatives is so great that the decision group does not consider it feasible.
Figure 12c shows that the current weight of the criterion Suitability of maintenance actions is 4.55%. The intersection point between ALT2 and ALT4 is reached at the weighting 11.9%. Therefore, below this weighting the chosen alternative would be ALT2. An increase in the weighting of 161.54% in this criterion with respect to the current weighting would be required for the alternative chosen to switch to ALT4.
Figure 12d shows that the current weighting of the criterion Impact on the environment is 27.27% and in no case would there be a switch in the classification of alternatives if the weighting were modified. No intersection point can be perceived.
Figure 12e shows that the current weighting of the criterion Impact on health care organization activity is 36.36%. There is no intersection point between the alternatives in first and second place. The chosen alternative is ALT2 regardless of the weighting assigned to this criterion.
Thus, in all cases the decreases and increases required to change the classification from ALT2 to ALT4 are significantly large that the decision group does not consider them feasible. As a result, the model described is considered robust. Similar analyses were performed on the other systems, reaching the same conclusion in each case, as can be seen in
Appendix A.
UGHCR developed a plan for introducing the most suitable alternatives suggested by the model. Thus, the introduction of ALT4 in the Prafri system is considered to take 15 days, specifying the need for specific additional training for maintenance staff.
The theoretical contribution of this research is a methodology that can be easily applied in other health care organizations, for the selection of the best maintenance strategy. This strategy combines various maintenance policies, following the usual practice of maintenance departments, unlike much of the current literature. Also, as there are no precedents in the literature, this model describes a number of decision criteria and descriptors that could be applied to other health care organizations. These criteria could also be adapted to the specific circumstances of each hospital.
7. Conclusions
This research analyses, for the first time, the optimal combination of maintenance policies to be applied to thermal energy production systems in hospitals. Although these systems are important in most organizations, in the case of health care organizations their criticality is considerably greater, given the implications for patient health. This is because these systems control the exact temperature and humidity as required by patients, in critical care areas such as theatres, casualty and neonatology, and they also provide hot domestic water ensure the absence of legionella.
To obtain an objective decision on the best combination of maintenance policies, an innovative multicriteria model was designed using the MACBETH approach. This guarantees a full, objective classification of the possible alternatives, which is also produced from judgements given by those in charge of difference care and non-care services in the hospital. To build the model, it was first necessary to model the systems to be analysed by continuous time Markov chains.
The result was that in Pracal, Pacbom and Pafbom the optimal combination of maintenance policies was already in place, and no later action was necessary. In Prafcd and Prafri systems, however, a need to change the maintenance alternative was found, in order to guarantee the best quality of care to patients. To this end, the Health Care Organization created a plan of action to achieve the policy changes recommended.
The most important contribution of this research is, however, the fact that it addresses improvements in quality of care in systems and facilities not directly involved in care, such as thermal energy production systems. The results of the study show in a practical way how a change in maintenance strategy to a combination of corrective, preventive and predictive maintenance (thermal analysis), alternating the preventive with the predictive policy in programmed quarterly checks, together with redundancy of transformers, would prevent stoppages in radiology, and therefore guarantee the proper provision of this care service. It would also improve safety of staff and patients, assist in compliance with current maintenance review standards, improve maintenance programming, etc. This research was carried out wholly in real thermal energy production systems in a state hospital in Spain, thus guaranteeing the viability of the methodology, and the ease that the best results for each health care organization can be obtained with.
Future development will consider the need to review the models periodically to see if it is possible to include new alternatives, in the form of new combinations of maintenance policies, for example, include the application of Total Productive Maintenance (TPM) or Reliability Centered Maintenance (RCM). It is also important to review the judgements given by the decision group, since changing economic or environmental circumstances, or the needs of care quality, may lead to different weightings for the criteria, or the need to include additional criteria in the model.
Although the MACBETH approach allows uncertainty, doubt or ambiguity to be included in certain areas of the decision process, the idea is to update the model in the future using a fuzzy MACBETH methodology and comparing the results with those from the model in this study.