1. Introduction
Data envelopment analysis (DEA) [
1] is a non-parametric method for measuring the efficiencies of decision-making units (DMUs) by finding the most favorable inputs and outputs for them. Such a self-evaluation can classify all the DMUs into efficient ones and inefficient ones, but efficient DMUs are not further discriminated [
2,
3]. To enhance the discrimination power of the original DEA, the cross-efficiency evaluation was then developed to calculate the cross-efficiency scores of DMUs linked to all DMUs [
4]. However, the cross-efficiency evaluations obtained from the original DEA are generally not unique due to the optimal solution to the DEA linear program not being unique. Sexton et al. [
4] and Doyle and Green [
5] proposed the secondary goals (the aggressive and benevolent formulations) to deal with this issue. Several authors [
6,
7] extended Doyle and Green’s model by introducing a number of different secondary objective functions.
The above DEA models are focused on the calculation of the self-efficiency scores or cross-efficiency scores of DMUs, which are based on some specific weight information. However, the weight information space or the evaluation information space that makes a DMU efficient is not distributed symmetrically. For two efficient DMUs, the corresponding weight information space or evaluation information space is different, and the efficient DMU with a bigger information space should be better than the one with a smaller information space [
8,
9,
10]. Therefore, the corresponding information space distributed symmetrically can be used to discriminate efficient DMUs, which is not considered in the classical DEA models. Stochastic multicriteria acceptability analysis (SMAA) [
9] is used to find the information space that supports each alternative for the best ranking. Lahdelma and Salminen [
8] introduced the SMAA-2 method, which extends SMAA by considering all the rankings in the analysis.
However, these methods are not suitable to deal with DEA problems. Lahdelma and Salminen [
10] presented the SMAA-DEA method, which is a combination of DEA and SMAA-2 and is intended for evaluating the efficiencies of DMUs by exploring the corresponding weight spaces, according to which, clearly efficient and barely efficient DMUs can be identified. Yang et al. [
11] considered all possible weights in the weight space when computing the cross-efficiency, and each DMU was given an interval cross-efficiency. By using the SMAA-2 method, all DMUs in the interval cross-efficiency matrix could be fully ranked according to acceptability indices. However, in Lahdelma and Salminen’s method [
10] and Yang et al.’s method [
11], the inputs and outputs of DMUs were assumed to be independent of each other. By considering the interactions between criteria, Angilella et al. [
12,
13] applied the Choquet integral [
14] to the SMAA-2 method.
In most of the existing DEA methods, the inputs and outputs are assumed independent. Actually, there exist interactions between inputs or outputs in many practical problems [
15,
16,
17]. Recently, Ji et al. [
18] and Xia and Chen [
19] gave the efficiency evaluation model with interactive inputs and outputs, in which the Choquet integral was used to aggregate the multiple inputs and outputs into a single efficiency index. Pereira et al. [
20] used a Choquet-integral-based approach for incorporating decision-maker’s preference judgments in DEA. Ji et al.’s [
18] method and Xia and Chen’s [
19] method were based on the CCR model [
1], while Pereira et al.’s [
20] method was based on the value-based ADD model [
21]. However, their method also has multiple solutions, and the calculated self-efficiency and cross-efficiency scores of DMUs were also based on some specific input and output weight vectors. Their method can identify the efficient and inefficient DMUs, but cannot further identify which is better in the efficient DMUs. In addition, their method can only deal with the DMUs with determined input and output evaluations, which is not common in practical problems, because there always exists uncertainty, fuzziness, or randomness in the process of estimating input and output evaluations due to its inherent stochastic nature or specification errors [
22,
23].
Based on the above analysis, we can find that the DEA models taking into account the interactive variables do not consider the information space when calculating the efficiency scores of DMUs, while the DEA models taking into account the information space do not consider the interactions between inputs and outputs. This paper fills this gap by using the SMAA-2 method to deal with DEA models with interactive variables to explore the information space that is favorable for a DMU at any ranking. In the process, the Choquet integral is employed to reflect the interactions between inputs or outputs. The contributions of this paper are given as follows:
- (1)
The proposed method gives a combined method, which not only considers the interactions between inputs and outputs, but also can discriminate the efficient DMUs by exploring the corresponding supported information space;
- (2)
Compared to the DEA models with interactive variables, the proposed method can explore the information space that supports each DMU, which not only can discriminate the efficient DMUs, but also can give a ranking of efficient DMUs according to the supported information space;
- (3)
Compared to the DEA models with interactive variables, the proposed method not only can deal with the DEA problem with determined input and output evaluations, but also can deal with the DEA problem, in which the weight vector and evaluations of the input and output are stochastic;
- (4)
Compared to the DEA models with dependent input and output evaluations, the proposed method can deal with the DEA problems in which the inputs and outputs are interactive.
The remainder of this paper is constructed as follows:
Section 2 introduces the Choquet-based DEA models;
Section 3 gives the CH-SMAA-DEA method to measure DMUs in terms of rank acceptability indices, central weights, and confidence factors. Examples are given in
Section 4 to compare the proposed method with the existing ones.
Section 5 gives the conclusions.
2. DEA Models Based on the Choquet Integral
Suppose there are
m decision-making units (DMUs) with
h inputs and
s outputs. Let
and
be the input and output values of DMU
, respectively. Let
and
be the input and output weights, respectively. Then, the efficiency
of DMU
can be calculated by the ratio of its weighted score for the output criteria to its weighted score for the input criteria:
For convenience, let be the set of input and output weight information and be the set of all input and output evaluations with being the evaluation vector corresponding to DMU.
The DEA model aims to maximize each DMU’s self-efficiency score by finding the most favorable weights, then the maximum efficiency score of DMU
can be calculated by the following CCR model [
1]:
Based on Equation (
1), (MOD 1) can be written as the following linear programming model:
Suppose the optimal value of (MOD 2) is denoted by
. If
, then DMU
is efficient in the CCR model [
1]. (MOD 2) is a self-efficiency model, which can identify the efficient DMUs, but cannot further discriminate between efficient DMUs. The cross-efficiency is defined by considering all the DMUs. For DMU
, a group of optimal weights
and
can be obtained by solving (MOD 1), and its cross-efficiency of DMU
to DMU
, namely
, can be calculated by using the weights of DMU
.
Then, the average of all
can be calculated as:
which is called the cross-efficiency score of DMU
.
It is noticed that (MOD 1) may have multiple optimal solutions, and the cross-efficiency calculated by Equation (
3) is referred to as an arbitrary strategy. To resolve this problem, one remedy suggested by Sexton et al. [
4] is to introduce a secondary goal to choose the one from multiple optimal solutions while keeping the self-efficiency obtained by (MOD 2) unchanged. Many other strategies [
5,
6,
24,
25] have also been developed about the secondary goal. However, when different strategies are used, different results may be obtained.
It is noted that the classical DEA models assume that the inputs and outputs are independent. However, many authors [
13,
18] showed that there are interactions between inputs or outputs. The Choquet integral [
14] is the generally used technique to reflect the interactions between criteria. Before introducing the concept of the Choquet integral, several definitions are given first:
Definition 1 ([
26])
. A fuzzy measure μ on is a function μ: , satisfying the axioms: (i) ; (ii) implies . Definition 2 ([
14])
. Let f be a positive real-valued function on and μ be a fuzzy measure on Y. The discrete Choquet integral of a function with respect to μ is defined by:where indicates that the indices have been permuted so that , and is the set of , and let here. As the Choquet integral takes into account the interactions between criteria, Ji et al. [
18] utilized it to aggregate the input and output evaluations of DMU
:
where
indicates that the indices have been permuted so that
,
is the set of
and
and
indicates that the indices have been permuted so that
and
is the set of
and
.
For DMU 1 and DMU 20 in Example 2 (see
Table A5), we have:
and:
However, it is noted that the input and output evaluations of DMUs should be ordered before being aggregated. Therefore, it is not convenient when there are many input and output evaluations to be ordered. Especially when the input and output evaluations are not expressed exactly, it is hard to give an exact order. In addition, if there are two DMUs and the orders of their input and output evaluations are not the same, then the corresponding weight vectors associated with these two DMUs are not the same. This means that we have to determine the corresponding weight vector for each DMU, which makes it difficult in modeling and calculation, especially when the fuzzy measures are unknown and should be determined from the known information.
To deal with such issues, another form of the Choquet integral can be defined in the following.
Definition 3 ([
27,
28,
29])
. The Möbius transform of μ is a function on defined as , ; is the set of elements in A excluding the elements in B; is the cardinality of . In terms of Möbius representation, (i) and (ii) can be represented by [
27,
28]: (iii)
; (iv)
and
,
, where
S is the subset of
Y excluding
i and
T is the subset of
S.
The Choquet integral can be redefined in terms of the Möbius representation, without reordering the aggregated values [
27,
28,
30]:
Definition 4 ([
27,
28,
30])
. With respect to the Möbius representation, the Choquet integral defined in Definition 2 can be rewritten as:where , , , , , and . Different from Definition 2, Definition 4 does not have to reorder the aggregated arguments, and the weight vector is associated with the aggregated arguments, but not the position, and therefore would be easy to use. In Definition 4, for any two alternatives, the corresponding weight vectors are the same, which provides much convenience in deriving the unknown weight vector.
Let
,
,
,
and
be the set of input and output weight information with interactions, then:
where
includes the input weights and output weights with interactions between each other, respectively.
By considering the interactions between inputs or between outputs, the efficiency of DMU
can be written as [
19]
In Example 2, take DMU 1 and DMU 20 as an example (see
Table A5); we have:
It is noted that, when calculating the efficiency scores of DMUs, we do not have to reorder the aggregated input and output evaluations, and the corresponding input and output weight vectors will not change as the DMU changes, which provides much convenience in modeling and calculation.
The difference between Equation (
6) and Equation (
1) is that the former considers the interactions between inputs or outputs, but the latter does not. Especially, if the interactions between inputs or outputs are not considered, that is
,
, where
t is the cardinality of the coalition
T and
, then Equation (
6) reduces to Equation (
1). Comparing Equation (
6) with Equation (
4) given by [
18], both of them are based on the Choquet integral and can be converted between each other; this is because Definition 2 and Definition 4 can be converted between each other. However, they are based on different forms of the Choquet integral: Ji et al.’s model [
18] is based on Definition 2, while Equation (
6) is based on Definition 4. The most important is that Equation (
6) does not have to reorder the input or output evaluations and the corresponding input and output weight vectors will not change as the DMUs change, which provides much convenience in the process of deriving the efficiency scores of DMUs.
Based on Equation (
3) and (MOD 2), the DEA model considering interactions between inputs and outputs can be established as [
19]:
where
is the Choquet integral input and output weights associated with DMU
. Since
x is determined, (MOD 3) is linear. Similar to (MOD 2), the solution of (MOD 3) may not be unique. By solving (MOD 3) using Lingo 14, we obtain a set of optimal input and output interactive weights
for each DMU
. In (MOD 3), each DMU is self-evaluated and termed efficient if and only if the optimal objective function is equal to 1. The cross-efficiency of DMU
using the weights of DMU
, namely
, can be calculated as:
For DMU
, the average of all
, namely
, can be considered as the cross-efficiency score of DMU
:
Suppose the optimal value of (MOD 3) is denoted by
. By comparing (MOD 2) and (MOD 3), it is noted that all the feasible solutions of (MOD 2) are also those of (MOD 3), which indicates that the feasible region of (MOD 3) is not smaller than that of (MOD 2). Therefore, the optimal solution of (MOD 3),
, is not smaller than that of (MOD 2),
, that is
. That is because (MOD 3) takes into account the interactions between inputs and outputs, while (MOD 2) does not. Especially, if
,
, then (MOD 3) reduces to (MOD 2), and Equation (
8) reduces to Equation (
3).
It has been proven that Definitions 2 and 4 can be transformed between each other; therefore, Xia and Chen’s model [
19] based on Definition 4 is equivalent to the one given by Ji et al. [
18], which is based on Definition 2. The only difference is that Xia and Chen’s model [
19] does not have to reorder the input and output evaluations when aggregating them, and the corresponding input and output weight vectors will remain unchanged with different DMUs. We denote the Choquet-integral-based DEA model as CH-DEA hereafter.
However, the disadvantage of (MOD 2) and (MOD 3) is that they can only discriminate efficient DMUs from inefficient ones, but cannot further identify between efficient ones. Although the cross-efficiency scores calculated by Equations (3) and (8) can give a ranking of DMUs, different results may be obtained when the input and output weights are calculated by using different strategies. Actually, self-efficiency scores and cross-efficiency scores are all based on some specific weight vectors and do not consider the whole set of the information space. In addition, neither of them can provide the acceptability of an efficient DMU. For two efficient DMUs, one may correspond to a large space of weight information, and the other may correspond to a smaller one, which indicates the former should be better than the latter. Therefore, these two efficient DMUs should be discriminated, which will be discussed in the following section.
3. SMAA-DEA Based on the Choquet Integral
By exploring the corresponding information spaces of DMUs, Lahdelma and Salminen [
10] presented the SMAA-DEA method, in which the DMUs are evaluated by using several indices including the acceptability index, the central weight vector, the confidence factor, the maximum efficiency, the central efficiency, and the average efficiency. However, their method does not consider the interactions between inputs or outputs. In this section, the Choquet-integral-based SMAA-DEA method (CH-SMAA-DEA) is proposed to explore the information space of each DMU by taking into account the interactions between inputs or outputs.
In DEA models, suppose the input and output evaluations of DMU
are represented by the stochastic variables
with the probability distribution
over the space
. Similarly, the decision-makers’ unknown or partially known preference about the input and output weights is represented by a stochastic weight vector
w with joint density function
in the feasible input and output weight space
W.
can be given by the decision-makers; we assumed that it is an independent uniform distribution in this paper. In the SMAA-DEA method, the input and output weights are normalized to give a finite information space for simple computation. Similarly, the Choquet-based input and output weights in space
W are also normalized, respectively, as:
For each
in
X and each
w in
W, the efficient score
of DMU
can be denoted by the following formula:
One difference between Equations (6) and (9) is that the input and output evaluations are determined in Equation (
6), while they are stochastic in Equation (
9). When the input and output evaluations are determined in Equation (
9), the difference between Equations (6) and (9) is that the input and output weights are respectively normalized in Equation (
9), but are not in Equation (
6). For a DMU, the biggest efficiency score derived from Equation (
6) is 1, but may not be 1 in Equation (
9). To deal with such a situation, we first normalize the values of
into the interval
, that is:
For each
in
X and each
w in
W,
provides a complete ranking of alternatives, then the rank of DMU
is denoted by:
where
and
.
For each
, suppose DMU
ranks
rth; we can compute the set of the possible input and output weight space based on SMAA-2 [
8]:
which is called the favorable weights of DMU
ranking
rth.
contains all the input and output weights that make DMU
rank
rth. It is noted that
is distributed asymmetrically. If
, then it is possible that DMU
ranks
rth, and the bigger the space
, the bigger the likelihood that DMU
ranks
rth for
. Considering all the
in
X, an index can be given to measure the acceptability of DMU
ranking
rth.
On the basis of the favorable weight information space
and all input and output evaluations
, the Choquet-integral-based acceptability index for DMU
ranking
rth is given as:
which is described by the shared information space that supports DMU
ranking
rth over all the information space; in particular,
measures the shared information space making DMU
the most preferred one. If
, then DMU
is efficient according to the CCR model; otherwise, DMU
is not. The bigger the
, the more efficient DMU
is. Therefore,
can not only discriminate the efficient DMUs, but also can measure the acceptability of efficiency.
For efficient DMUs, to describe which weight vector supports DMU
ranking first, the Choquet-integral-based central weight vector can be defined as:
The Choquet-integral-based central weight vector describes the preference of a typical weight vector that makes DMU
the most preferred one, which can help decision-makers understand which weights support which alternative. For inefficient DMUs, we have
; suppose their Choquet-integral-based central weight vector is that which makes them attain their maximum efficiency or attain their best rank
, that is:
Based on the Choquet-integral-based central weight vector, the Choquet-integral-based confidence factor is defined as:
which measures the likelihood of DMU
becoming the best one when the Choquet-integral-based central weight vector is used.
It is noted that the above measures are all based on the calculation of the information space for supporting DMUs. Except for the above indices, the following stochastic efficiency measures can be defined to reflect the efficiencies of DMUs from different views.
The Choquet-integral-based maximum efficiency
is the best efficiency score for DMU
and can be calculated by maximizing the efficiency score over all the stochastic evaluation values and weight values.
The process to calculate the Choquet-integral-based maximum efficiency is to find the most favorable evaluations and weights for a DMU.
The Choquet-integral-based central efficiency
is the expected efficiency score of DMU
when the Choquet-integral-based central weight vector is used:
which can estimate the average performance of a DMU when the most favorable weight vector is used.
The Choquet-integral-based average efficiency
is the Choquet-integral-based expected efficiency score of DMU
over all the stochastic evaluation values and weights:
According to
, the average performance of DMU
can be estimated when all possible evaluations and weights are considered. The flowchart of the proposed method is illustrated in
Figure A1 in
Appendix A.
Ji et al. [
18] calculated the self-efficiencies and cross-efficiencies of DMUs considering the interactions between inputs and outputs. However, their method has multiple solutions, and the calculated self-efficiency and cross-efficiency scores of DMUs are based on some specific input and output weight vectors. Their method can identify the efficient and inefficient DMUs, but cannot provide the acceptability of efficient DMUs. Although the cross-efficiencies of DMUs can distinguish efficient DMUs, different optimal input and output weights will produce different cross-efficiencies of DMUs, which will produce different results in distinguishing efficient DMUs. In addition, their method can only deal with the DMUs with determined input and output evaluations and will be invalid when the input and output evaluations are uncertain.
Lahdelma and Salminen [
10] developed the SMAA-DEA method to derive the acceptability indices of DMUs by exploring the information space that supports the ranking of DMUs, but they did not consider the interactions between inputs or outputs. Especially, if the interactions between inputs or outputs are not considered, that is
,
, then the proposed method reduces to Lahdelma and Salminen’s method [
10] and the proposed indices reduce to the ones defined by Lahdelma and Salminen [
10].
4. Examples
In this section, two examples are given to compare the proposed methods with the ones given by Lahdelma and Salminen [
10] and Ji et al. [
18].
Example 1 ([
10])
. Consider eight DMUs A, B, C, D, E, F, G, and H with one input and two outputs as listed in Table A1. Lahdelma and Salminen [
10] assumed that there was no interaction between inputs and outputs. As discussed in the Introduction, it is reasonable to assume that the there exist interactions between inputs and outputs, which is also the assumption in this paper.
First, we treat the problem as deterministic. SMAA-DEA [
10] and CH-SMAA-DEA were implemented by Monte Carlo simulation in the MATLAB environment. The results obtained by the classical DEA model [
1], the SMAA-DEA [
10], the CH-DEA [
18], and the CH-SMAA-DEA are given in
Table A2. Based on the optimal weights (
and
w) and the efficiency scores (
and
) obtained by the DEA and CH-DEA methods, we can find that A, B, C, and D are all efficient DMUs, but the DEA and CH-DEA method cannot further describe which is better. Based on the acceptability indices (
) and the confidence factors (
) obtained by the SMAA-DEA and CH-SMAA-DEA methods, it is shown that SMAA-DEA identified A, B, and C as efficient DMUs with 100% confidence and acceptability indices 33%, 38%, and 29%, correspondingly, while CH-SMAA-DEA identified A, B, and C as efficient DMUs with 100% confidence and acceptability indices 13%, 77%, and 10%, correspondingly. It is obvious that the deviation of the acceptability indices between the efficient DMUs (A, B, and C) obtained by CH-SMAA-DEA was bigger than that obtained by SMAA-DEA, which indicates that the CH-SMAA-DEA method has better discriminability than the SMAA-DEA method. It was found that B has the highest acceptability both in the SMAA-DEA and the CH-SMAA-DEA method. Further, the values of the maximum, central, and average efficiencies (
,
, and
) obtained by CH-SMAA-DEA were not smaller than those obtained by SMAA-DEA, that is because CH-SMAA-DEA considers the interactions between inputs or outputs and can provide better results.
Next, we introduce uncertainty to the problem in Example 1. The inputs are accurate, but the outputs follow an independent uniform distribution
in the range
with
. Ji et al.’s method [
18] will be invalid in this situation, because their method is only used for the DEA problems with determined input and output values. The rank acceptability indices obtained by SMAA-DEA and CH-SMAA-DEA are illustrated in
Table A3; here,
. We can find that A, B, C, D, and E are classified as efficient DMUs by both SMAA-DEA and CH-SMAA-DEA, and B has the highest acceptability, while F has the lowest one. However, the ranking acceptability indices are different by using SMAA-DEA and CH-SMAA-DEA; for example, the acceptability of B is 36% by SMAA-DEA, which is slightly bigger that of A with 33%, while the acceptability of B is 74% by CH-SMAA-DEA, which is much bigger than other ones. The acceptability of F is 0.05% by SMAA-DEA and 0.00015% by CH-SMAA-DEA, which shows that CH-SMAA-DEA can discriminate the efficient DMUs better than SMAA-DEA.
The central weight vector (
), confidence factors (
), and maximum, central, and average efficiencies (
,
,
) obtained by SMAA-DEA and CH-SMAA-DEA are listed in
Table A4, from which it was found that most of the values of the maximum, central, and average efficiencies obtained by CH-SMAA-DEA are not smaller than those obtained by SMAA-DEA. This is because CH-SMAA-DEA considers the interactions between inputs and outputs, which can enlarge the information space.
Example 2 ([
18])
. The Community Health Center (DMU) of Hebei Province in China was evaluated. The evaluated input indices were the public expenditure (CNY 10,000 Yuan), the number of medical staff, and the fixed assets (CNY 10,000); the output indices were the number of medical services (thousands) (including inpatient service and childhood immunization) and the number of managed of chronic diseases (thousands). The data are shown in Table A5. Ji et al. [
18] assumed that there exist low the interactions (correlations) between the input (output) variables, but they did not give the evidence to show that the interactions between input (output) variables are low. It is usually known that there exist interactions between inputs and outputs, but it is not easy to give exactly the interactions between them. Therefore, we assumed that there exist interactions between inputs and outputs, but we do not know whether the interactions are low or high.
The results obtained by the CCR, CH-CCR, SMAA-DEA, and CH-SMAA-DEA methods are listed in
Table A6, from which we can find that DMUs 2, 3, 6, 9, 10, 12, 16, 18, 19, and 20 were classified as efficient DMUs by the CCR, CH-CCR, SMAA-DEA, and CH-SMAA-DEA methods. DMU 18 was efficient by CCR and CH-CCR methods and was almost efficient by the SMAA-DEA and CH-SMAA-DEA methods with maximum efficiency scores of 0.9997 and 0.9963, respectively. This may be because CCR and CH-CCR calculate the efficiency scores of DMUs based on the optimization programming with the whole feasible region, while SMAA-DEA and CH-SMAA-DEA derive the maximum efficiency scores of DMUs based on Monte Carlo simulation, which is a sampling analysis. SMAA-DEA and CH-SMAA-DEA can measure DMUs from different views, such as confidence factors, maximum efficiencies, confidence efficiencies, and average efficiencies, while CCR and CH-CCR measure DMUs based on self-efficiency scores and cross-efficiency scores. Most of the results obtained by CH-CCR were not smaller than those obtained by CCR, and most of the values of
,
,
, and
obtained by CH-SMAA-DEA were not smaller than those obtained by SMAA-DEA.
It was found that the measures in
Table A6, i.e.,
,
, and
, can classify DMUs into efficient and inefficient ones, but cannot further discriminate between efficient ones, and the measures
and
can give a ranking of DMUs, but cannot identify which DMU is efficient. All of the measures in
Table A6 cannot give the acceptability of an efficient DMU. Then, the rank acceptability indices of DMUs were calculated by CH-SMAA-DEA and SMAA-DEA, and are listed in
Table A7 and
Table A8 and
Figure A2 and
Figure A3, and the central weight vectors are listed in
Table A9. From the data, we can find that DMUs 2, 3, 6, 9, 10, 12, 16, 18, 19, and 20 are efficient DMUs with different rank acceptability indices; DMU 10 had the biggest acceptability 62% by CH-SMAA-DEA and 48% by SMAA-DEA; DMU 3 had the smallest acceptability 0.001% by CH-SMAA-DEA; DMU 9 had the smallest acceptability 0.003% by SMAA-DEA. The results showed that CH-SMAA-DEA can discriminate the efficient DMUs better than SMAA-DEA. In other rankings, the acceptability indices of DMUs obtained by CH-SMAA-DEA and SMAA-DEA were different. For example, DMU 14 had the highest acceptability with 18% for ranking sixth by CH-SMAA-DEA, but DMU 1 had the highest acceptability with 35% for ranking sixth by SMAA-DEA.