1. Introduction
The main motivation for this study comes from the need for standards for the assignment of importance to criteria and for the modeling of the interactions between them in a multiple criteria decision analysis (MCDA) dealing with a large variety of criteria subject to random disturbances and interacting with each other. The abundance of distinct methods in the field [
1,
2] makes the search for such standards more relevant. Standards streamline decisions, for instance, in situations such as making an emergency health decision [
3], facing a sudden enemy military attack [
4], offering advice regarding stock trading under volatile market conditions [
5], or choosing a supplier for a failing critical component [
6].
Another aspect of situations that makes the establishment of standards important is the complexity and subjectivity of the application of the related criteria [
7,
8]. As a consequence, it is important not only to simplify the measurements but also to make them comparable and to ensure that the relations between the measurements according to different criteria, the forms of criteria importance assessment, and the criteria combination rules can be easily explained. In these situations, combining a large variety of criteria, avoiding arbitrarily assigning unequal importance to them, and accounting for the complexity of their relations necessitates a set of simple and well-defined standards governing the key analysis procedures.
One situation involving complex and subjective decisions is that of the prediction of electoral preferences. Electoral preference prediction models can be classified into two categories: fundamental models and polling models. Fundamental models attempt to derive votes from economic and social explanatory variables. Polling models address only the final voting preferences. They pose the single question: “if the election were today, who would you vote for?”
Due to a low predictive power observed in the models of the first category, the currently prevailing models tend to combine variables from poll models with other variables, which mostly serve to evaluate the personal qualities of the candidates or the approval of parties’ incumbents [
9,
10,
11]. Formulating the choice in terms of individual preferences regarding individual qualities strengthens individualistic motives.
Behind such an assessment model for individuals is the rational choice assumption that the members of the sample weigh their reasons mentally. In contrast, by making explicit the multiple objectives that interact in the formation of personal preference, complex MCDA models emphasize these objectives themselves. In an assessment of the preferences of the population, this has the effect of strengthening the pursuit of the common good. To project a long-term goal, a nation must have it as a collective target.
Collective objectives that lose importance in the modeling of preferences also lose importance in the culture in which those preferences are formed. In contrast, the modeling of preferences based on matters of collective interest will favor the prevalence of a culture of collaboration around collective goals. If elicited in preference assessments, the positioning with respect to civil rights, the size of the government, environmental protection, development goals, and other objectives of high importance will, in turn, be more present in the political scenery.
In recent years, modern economic growth theory has shifted from the classical theory emphasizing capital and labor productivity to a theory emphasizing the roots of productivity in the form of ideas and institutions [
12]. Countries that are open to ideas invest in education and create efficient institutions. The predictions based on economic and social objectives supply the motivation for these efforts.
From a parsimony perspective, a small number of variables with high predictive power for certain aspects of the decision can make a prediction model more efficient. To become competitive, models encompassing a large set of interacting variables must be supported by efficient fitting strategies.
The standard procedures in MCDA provide for the safe consideration of a large number of objectives that drive the choice of a model. These standards should be simple but comprehensive to allow for models that capture the complexity of the reality to be addressed.
The development of such standards is, then, an important research issue, and the lack of a reliable set of standards constitutes a serious gap in the MCDA research area. The present study fills this gap.
Here, standards for combining preferences established by multiple criteria whose importance is derived from their ability to identify the most preferred alternatives are discussed. Included are standards for the comparison of alternatives according to each criterion [
13,
14], standards for the combination of evaluations based on separate criteria into a global score [
15,
16], and standards for the measurement of the importance of interacting criteria [
17,
18,
19]. The development of an analysis applying these standards will always serve as a basis for the comparison of the results of the application, in every case, of any MCDA method recommended by the peculiarities of the case.
The article is structured as follows. After this introduction,
Section 2 sets the foundations for establishing standards for data collection, criteria comparison, and evaluations combination. Desirable properties of the decision rules are then established.
Section 3 deals with the application of the principles and procedures advanced in the preceding section.
Section 4 discusses the results obtained in
Section 3 based on the analysis of practical situations. A set of conclusions on the viability and the usefulness of the standards established is presented in
Section 5.
2. Materials and Methods
To be able to account for a wide variety of criteria, the assessment of preferences must be simple. By reducing the opportunities for the occurrence of errors in the measurement, simplicity engenders reliability.
Another important quality is comparability. To combine preferences among the alternatives being evaluated according to diverse criteria to obtain global scores, it is necessary to determine measurement standards that make the preferences assessed comparable in accordance with the different criteria.
An even more fundamental feature is representativeness. Alternatives must be evaluated based on their degree of commitment to an objective that is considered relevant to sustaining a criterion. How can such degrees of commitment be measured in a simple and comparable form?
The sampling of pairwise trichotomic comparisons exhibits all the desired properties. This assessment standard consists of assessing a sample of randomly selected representatives, either of the population or of experts, to determine, for every pair of alternatives, whether each of them is more capable than the other of reaching the desired objective. In the case of a tie, the preference measure is equal for both.
The small number of possible values, namely three, enhances robustness [
20,
21]. For instance, if, in a more detailed preliminary assessment, a distorted “very high” preference classification is given to one alternative, this alternative shares, in the comparison with an alternative with a “small” initial evaluation, the same preference assessment with any other alternative with a “high” classification.
If different scales of measurement of a single attribute have different meanings for different objectives, this same attribute may be used in different criteria with different roughness [
22,
23]. For instance, a complementary criterion can be built by imposing a transitivity requirement on experts’ ratings to tie more pairs of alternatives.
In the same sense, if considering more precise distances between alternatives is found to be useful, additional criteria can be created to represent narrower levels of variance. For instance, in a first criterion, two alternatives may be considered equivalent if the difference between their values in a certain attribute is smaller than 10, whereas, in a second criterion, they may be considered equivalent only if the difference is smaller than 1.
Counting the number of pairs exhibiting a preference for each alternative serves as the basis for calculating preference probabilities. Shifting from attribute measures to probabilities of preference has the benefit of unifying all evaluations to the same scale of measurement. This property of the commensurability is an essential condition to enable the combination of the criteria by means of the Choquet Integral [
24].
From assessments of preference based on diverse criteria, we need to move to a measure of preference that considers all the criteria collectively. Counting favorable cases in pairwise trichotomic comparisons may still be useful to measure joint preference. However, this count does not consider the interactions between criteria. For such a calculation, the Choquet Integral of the preference probabilities according to each criterion is the simplest form. It will be calculated with respect to a capacity designed to adequately measure the relevance of each criterion considering their interactions.
The principle of concentration of preferences [
19] makes it possible to derive such capacity. This principle, consistent with aversion to uncertainty in the decision-making process, leads a decision-maker to seek the maximization of the ability to discriminate the most preferred alternative. It leads, in the case of determining capacities of sets of criteria, to assign greater importance to those sets that exhibit a higher ability to point to an alternative as the most preferred. The importance of criteria is related to the personal aim of the decision-maker to choose the best among the available alternatives, and, barring possible gross mistakes, high measurements provide more reliable information about differences in preference than low measurements do.
Assuming an absence of interaction, the principle of preference concentration allows for the use of weighted averages with weights indirectly extracted from the results of the application of the criteria to the alternatives. In this case, the weights will be proportional to the maximal probabilities. However, by allowing for interaction, the combination via the Choquet Integral provides a more general standard. In the opposite direction, assumptions of generalized additive independence and other forms of modeling relations among criteria [
25] are even less restrictive, but, compared to such approaches, the Choquet Integral has the advantage of simplicity.
When interaction is assumed to be absent and weighting average of the preference probabilities for isolated criteria is performed, the global preferences still add to 1, while, when the preferences are combined via the Choquet Integral with respect to a nonadditive capacity, this sum can vary. Nevertheless, the possibility of interaction being present must always be considered.
When pairs of criteria are evaluated, instead of isolated criteria, if two criteria agree in giving high preference to an alternative, even if it is not the most preferred by either of them, their high joint preference for this alternative results in a high capacity for the union of the two corresponding unitary sets. The capacities of sets of more than two criteria are determined similarly.
Why does this construction suitably consider interactions? Interaction is present in the selection process if a choice of an alternative according to a given criterion either increases or decreases the effect of the preference according to other criteria. A preference criterion interacts with others in the selection of the best alternative if its inclusion in addition to the others in the evaluation process increases or reduces the probability that the best alternative will be identified.
The same applies to groups of criteria regardless of the reason for the influence. Such reasons may include exchangeability, complementarity, substitutability, or preferential dependence [
26], but only the effect, not the origin, of the interaction needs to be established.
This application of the principle of preference concentration is similar to maximum likelihood estimation. It attributes to a criterion a capacity proportional to the preference value that this criterion assigns to the alternative to which it assigns the highest preference.
Finally, and more importantly in the search for standards, extracting information on preferences for criteria from available information about preferences for alternatives according to those criteria is simpler and more reliable than asking directly for information about the relative importance of the criteria in abstract comparisons.
Figure 1 highlights the benefits of the introduction in MCDA of the approaches advocated here.
3. Results
This section presents a set of procedures that constitute standards for (i) calculating the preference probabilities according to each criterion, (ii) deriving the capacities of the sets of criteria, and (iii) ranking the alternatives by the Choquet Integral.
3.1. Standards for the Individual Assessments
The probability of preference for each alternative according to each criterion is obtained from evaluations by a sample of the population whose preference is to be measured. Preference probabilities are derived from the values of the trichotomic pairwise comparisons that they produce.
The preference counts are transformed into probabilities by dividing the counts by the total number of evaluations. Let us denote by the count of preferences for the i-th alternative according to the j-th criterion and by the result of the trichotomic comparison between alternatives i and u according to the j-th criterion by the k-th evaluator. This result has three possible values: 1 if the k-th evaluator declares i preferable to u, 0 if the evaluator declares u preferable to i, and ½ if the evaluator declares indifference between i and u.
The preference count for
i according to the
j-th criterion,
, is the sum
for
k varying across all the raters evaluating according to criterion
j and
u ranging over all the alternatives that
i is compared with.
This count is, therefore, the sum of the number of pairwise comparisons where
i is preferred with half the number of comparisons where
i is considered equivalent to another alternative. The estimate of the probability of preference for
i is the quotient
of the count
by the number of comparisons
for
, denoting the number of evaluators by the
j-th criterion, and
N, the total number of alternatives.
The sum of these probabilities is exactly 1.
It is interesting to note that
granting antisymmetry.
However, the trichotomic pairwise comparison approach avoids the requirement for transitivity [
27]. Transitivity need not hold, generally, in the evaluation of preferences. Consider, for instance, the case of ranking a group of tennis players when a criterion may be based on the observation of matches in some number of tournaments. In pairwise comparisons, the evaluator declares a player better than another by looking at the result of a match. It is possible to observe player
beating player
, who beats player
, while player
beats player
.
3.2. Standards for the Initial Joint Assessments
Once the preference probabilities according to each criterion have been obtained, the criteria capacities that will be used to combine these preference probabilities according to the individual criteria into a global score can be calculated.
Leaving aside the weighting of the criteria and the interactions between them, global scores can be obtained by using the preference counts to directly estimate the preference probabilities for each alternative. Extending the case of only one criterion, in the case of a set
J of two or more criteria, a preference score
PiJ for alternative
i according to
J will then be obtained by adding the
given by Equation (1) along the criteria
j in
J. To avoid implicitly overvaluing those criteria
j with a large
, it will instead be employed thusly:
This initial assessment assumes additivity. Other forms of additive composition are described in [
28]. They include probabilistic rules to obtain the preference according to at least one of the criteria in J. Assuming, respectively, the maximum dependence and independence between the indicators involved,
will be given by
max j∈J or by 1 − Π
j∈J(1 −
). Nevertheless, counting is the easier form of evaluating the effect of joining the criteria.
These counts give all the criteria and all the experts equal importance, although weights can be included if there is sufficient information to consider the ratings differently by distinct criteria or by distinct experts. For instance, to control the effect of any factor unduly raising the preference according to some criteria, when translating the counting into the probabilities of preference , the importance of any criterion j may be corrected by applying a proportional reduction to the vector of probabilities of preference according to j (for instance, dividing all by j).
3.3. Standards for Considering Interactions
A capacity on S is a nondecreasing function μ defined on the power set of S with values of 0 at ø and 1 at S. Capacities are not-necessarily additive measures that express, for each subset of S, the subjective importance associated with that subset.
Estimates of the capacities of the criteria can be obtained directly from the experts’ evaluations. However, it is simpler to extract them from the preference counts.
Starting with the preferences according to each subset of criteria above denoted , the principle of concentration of preferences leads to the measurement of the capacities as proportional to the vector of maxima along these preferences. The exact capacity values will be obtained by scaling such that a capacity of 1 is assigned to the set of all the criteria.
Formally, the capacity assignment algorithm for the subset
J will have the central step of computing, along all the alternatives, the maximum of the joint preference probabilities
The final value of the capacity is achieved with the final standardization that consists of dividing by the largest value. Thus, the capacity of
J is
3.3.1. Simplified Capacities
In the case of a large number of criteria, the above derivation of capacities may be limited to sets with a small number
L of criteria, while capacity 1 is assigned to all the sets of a larger size. That, is, while the sets of more than
L criteria receive capacity 1, the sets
J of 1, 2, …
L criteria have the capacity given by
for
H varying along the sets of criteria of cardinality
L.
This simplification may reduce the importance of the final score of criteria with strong interactions with large sets of criteria. To prevent distortions, computation for a few other small values of L is advisable if the scores for L = 2 do not present a clear preference for a best alternative.
3.3.2. Combination via the Choquet Integral
To consider in the global scores, in addition to the preferences between the criteria, the interactions between them, the preference probabilities according to the isolated criteria are combined via the Choquet Integral.
The Choquet Integral is a form of aggregation used in place of the weighted average when it is possible that interactions between criteria may invalidate the use of compensatory addition.
For any function
x = (
x1, ...,
xt), of domain S = {1, ... ,
t} and values in R
+, the Choquet Integral of
x with respect to the capacity μ on S associates with x the non-negative real number
for τ denoting a permutation of S such that
The Choquet Integral is equivalently given replacing (9) by
for
for all
j from 1 to
t, and
When combining preferences via the Choquet Integral, one is also following the principle of concentration of preferences. In fact, the Choquet Integral assigns greater value to the highest preferences if they are obtained by applying a criterion with greater positive interactions with other criteria and lesser value otherwise.
To see how this happens, let us consider the case of only two criteria. If the interaction between the two is positive, the capacity (equal to 1) of the set of both criteria is greater than the sum of the capacities of each in isolation. Thus, the integral value is closer to the highest value than the arithmetic mean is. In fact, the integral is the sum of the smallest value and the product of the multiplication of the difference between the two values by the complement of that smallest capacity, and this complement is higher than the capacity of the second criterion if there is positive interaction and lower if there is negative interaction. In the weighted average, this complement would be replaced by the capacity of the second criterion.
5. Conclusions
This article brings a new approach to MCDA based on the application of standard rules. A set of standards is proposed. Its validity is checked and its usefulness for practical situations of large numbers of alternatives and of criteria is demonstrated.
The combination of preferences based on multiple criteria can be achieved via the simple procedures developed here, which serve as standards for the analysis of complex decisions with a high degree of subjectivity. The simplest standards comprise the combination via the Choquet Integral, the consideration of the interactions between the criteria by assigning importance to the sets of criteria proportional to the highest preference that they assign, and the employment of trichotomic pairwise comparisons in the data collection.
The strategy for criteria evaluation proposed is designed to concentrate preferences. It is based on the information on preferences for the alternatives according to the criteria instead of on the direct comparison of the criteria.
The results of the application of these standards may serve as a basis for comparison with the application of any MCDA method recommended by the peculiarities of each case. Variations in the capacities, which may amplify the basis of comparison, were studied in practical applications, presenting consistent results.