1. Introduction
The concept of concentration is used in industrial economics. This variable measures the level of market concentration as a function of firms’ size distribution, that is, it measures the degree of control that larger firms have over the market. If all the firms are of the same size, this variable reaches its minimum value, whereas if only a few companies make up a major part of the market, it reaches its highest value. A monopoly would represent the extreme case.
While it is true that concentration has been comprehensively developed in economics, with its principal application being market structure in the field of industrial economics, it can be conceptually applied or lend itself to other areas of analysis. In fact, one of the pioneering authors in this sense, Hirschman [
1], proposed an index for studying foreign trade that bears his name. Health economics [
2,
3], political science [
4,
5], sports economics [
6,
7], space economics [
8], linguistics [
9,
10], and finance [
11,
12] are also noteworthy. These fields have highlighted the relationship between concentration and other variables, such as inequality, polarization, diversity, dispersion or fractionalization [
13].
However, there are certain areas in which the representative configuration of the maximum concentration is not that of a monopoly. In effect, in the case of the markets, the vector of the size of the firms is the result of a competitive process in which each and every firm interacts with its competitors in multiple ways. However, there are competitive processes in which this is not true: sports competitions are competitions of a bilateral nature, there are no multiple encounters, and no one can accumulate the total result [
14].
The most commonly used concept to analyze this phenomenon is competitive balance. Competitive balance is a basic and current concept in sports economics [
15]. As in industrial economics, this concept indicates the degree of control exercised by the major teams, measured in terms of points obtained at the end of the championship or other metrics, such as wins. Therefore, there is an inverse relationship between competitive balance and concentration. From an ex-post point of view, this is related to the number of points (or wins) reached by the teams and, therefore, the competitiveness demonstrated over the season. This extent of competitive balance is a central feature in the design of professional team sports leagues [
16].
The extent to which playing strengths vary between teams has important implications for the uncertainty surrounding the outcomes of individual matches and overall competitions. According to the uncertainty of outcome hypothesis [
17], the less predictable the outcome of a sports match, the greater the consumer interest, which is reflected in higher match attendance and higher television audiences. Therefore, if the uncertainty contributes to consumer demand for sports, we can expect that greater competitive balance should increase team revenue. Nevertheless, it may be that there is no reason for fans to even care about outcome uncertainty, so competitive balance should have little impact on fan demand [
18]. Moreover, it is possible for fan interest to increase, even as win uncertainty decreases [
19].
Consequently, the competitive balance and attractiveness of leagues is a key consideration for league organizers and a focus for the analysis of sport competitions in several disciplines, such as economics, operations research and statistics [
20,
21,
22,
23,
24,
25,
26,
27]. Measurement is therefore important, either to track movements in the within-season competitive equilibrium over successive seasons and assess the effects of regulatory and institutional changes, or to examine the effects of changes in the competitive equilibrium on consumer demand for sports competitions [
28].
How do we measure competitive balance? There are approaches based on probabilistic measures [
29], although these measures do not contemplate the possibility of ties, and there are measures of uncertainty of outcome that are also used [
30,
31,
32] to measure competitive balance, using money lines. However, to measure the competitive balance of a tournament, it is common to use indices.
These indices summarize the distribution of scoring at the end of the tournament [
6,
33,
34,
35] and they are linked to inequality and concentration indices: Gini [
36,
37,
38,
39], concentration ratio [
40], entropy index [
41,
42,
43] or Herfindahl–Hirschman index [
7,
44,
45].
It is possible to find many examples of real situations in which, given the characteristics of a competition, a monopoly does not represent the configuration that generates the minimum competitive balance as measured by concentration indices, such as the Herfindahl–Hirschman index applied to the point shares achieved by the teams that contested the championship. This is the case in bilateral sports competitions, such as the leagues of the most popular sports (football, rugby, American football, baseball, basketball, hockey, and handball, to name just a few) in which it is not possible for a single team to accumulate the total number of points, as it does not participate in all the encounters. Kharrat [
46] pointed out the need to take into account differences in the competitiveness of leagues to evaluate the performance of football players. This assessment is of interest to team managers, coaches and fans.
In the case of industrial concentration measurements, various indices have been proposed and traditionally used [
47,
48,
49]. The Herfindahl–Hirschman index (HHI), defined as the sum of the squares of the individual market share of firms, has reached the greatest acceptance and dominance in the literature. Different versions of the HHI have been used in studies on a wide variety of sports. A non-exhaustive list includes American football [
33,
50,
51], association football [
44,
45,
52,
53,
54,
55], Australian rules football [
56,
57], baseball [
7,
58], basketball [
59,
60,
61], ice hockey [
62], rugby league [
57,
63], and table tennis [
64].
Most of these indices incorporate specific characteristics of sport competitions. In a bilateral sports competition, the range of variation in competitive balance measurements is subject to specific constraints. Indeed, there are aspects that influence competitive balance measurements and restrict the theoretical range of traditional concentration indices [
14]. Among these sport-specific issues, there are three relevant ones that influence the index range: (i) the impossibility of a team reaching a monopoly position, (ii) the scoring system adopted, and (iii) the number of teams in the tournament. When the value of competitive balance reaches its maximum, that is, when all teams have the same score, the number of teams competing becomes relevant to define the theoretical range of the index to compare different leagues, which can have different numbers of teams [
7,
33,
62,
64,
65]. Similarly, as the monopoly position cannot be reached due to the bilateralism of the encounters (only two teams can meet in any one competitive encounter, as mentioned at the beginning of the Introduction), the results configuration that generates the minimum competitive balance must be considered, and this is determined by the scoring system used.
For this reason, various proposed indices take these issues into account to a greater or lesser degree, and most of the measures currently used are relativized with respect to the maximum or minimum values that they can reach. Since these indices are normalized, they have a theoretical unit interval. More specifically, we can find two competitive balance indices that have index concentration standardization included in their definition:
Standardized Herfindahl–Hirschman index [
58]:
Distance to competitive balance [
66]:
It is shown that
, where
σ2 represents the variance of the shares of teams [
39]. On the other hand, DCB index enables the result to be interpreted as a mathematical distance that fulfils the cardinality property and, therefore, maintains its ratios and permits the differences between index values to be significant.
Both indices require knowing the maximum and minimum values that can be reached in a sports championship, the sum of the square of the score share of all participating teams.
Given the number of teams in a championship, the maximum competitive equilibrium is reached when there is an even distribution of results (that is, when none of them stands out from the rest). This result is provided by the
perfectly balanced distribution [
43]. In this distribution, the share of points is equal for all of them, so the teams get the same number of points. In these cases, the value of the HHI is equal to the inverse of the number of teams [
7]. Therefore, the maximum balance depends on the number of teams in the competition.
The perfectly balanced distribution of scores indicates that all teams have an equal likelihood of winning a match and, subsequently, the championship. This result can be obtained when either all matches end in a tie, or, if there is no possibility of a tie and with a double round-robin system, each team wins and loses matches against the same opponent. In the field of sports economics, these issues are well known as bilateral confrontations.
Our interest is focused on the calculation of the perfectly unbalanced distribution of the scores, that is, the distribution that generates the HHImax. Its configuration allows any competitive balance index that requires this value, such as HHIN or DCB index, to be calculated.
What is the minimum value, and what distribution of results generates it? The distribution generally used so far is not the perfectly unbalanced distribution if the scoring system does not fulfil a stability condition (the total number of points of the championship is constant). The current scoring system in European football does not fulfil this condition because pw = 3 points are awarded for a win, pt = 1 point is awarded for a tie, and pl = 0 points are awarded for a loss. Therefore, the standardized indices being used are not correct.
Until now, the distribution of results that generates the maximum level of concentration has been identified in the literature as
Complete cascade distribution. However, if the scoring system used does not fulfil the
stability condition, which implies that the total number of points of a championship is constant, it can be demonstrated that
Complete cascade distribution does not generate the maximum level of concentration [
14,
67]. This is the case, for example, with major European football leagues. Therefore, if the total number of points at the end of the season varies, the
perfectly unbalanced distribution so far used in the literature is not correct. Note that in a championship, several matches are played. At the end of each match, each team is remunerated according to the result achieved. This award is a certain number of points for winning, losing and, if necessary, tying. At the end of the championship, after all teams have played against each other, each team has a certain number of points. The amount of points will be the sum of the points obtained after each match. Therefore, if the remuneration system distributes a fixed number of points (e.g., two for the winner, zero for the loser, and one for each team in the case of a tie), the total number of points of the championship is constant (
stability condition). However, a system that allocates three points to the winner, zero to the loser and one point to each team in the case of a tie implies that the total number of points at the end of the championship is not known from the beginning, that is, it is not constant and it does not fulfil the
stability condition.
In this article, we calculate the minimum value of competitive balance, obtained from the
perfectly unbalanced distribution. This value can be calculated prior to the competition so as to avoid recalculating the results and, as proposed, when the scoring system does not fulfil the stability condition [
39,
45]. We introduce a methodology for obtaining the
perfectly unbalanced distribution, which allows to solve the knowledge gap in these situations. We demonstrate that the concentration index has a unique maximum for each number of teams. We also obtain the value of the index for a generic scoring system and show that there is a critical value of
q with which we obtain the
perfectly unbalanced distribution of points. The top
q teams win their matches in cascade, while for the teams in positions higher than
q, all teams are tied. This critical value of
q is characterized by the solution of a fifth degree polynomial inequation. This value defines the
Truncated cascade distribution of points that reaches the maximum index value of concentration, that is, the
perfectly unbalanced distribution. This distribution generates the maximum concentration level and, if we do not use it, there is an overestimation of the concentration measured with normalized indices. Then, the calculated competitive balance will be wrong, that is, underestimated.
In particular, we obtain the
perfectly unbalanced distribution for current European soccer with a scoring system of {3, 1, 0}, and characterize this distribution. In the
Supplementary Materials, we propose a tool to obtain the most imbalanced distribution for any other scoring system of the same type {
pw,
pt, 0}.
In the Materials and Methods section, we define the formal frame of reference for bilateral sports competitions, analyzing their characteristics. In the Results section, we characterize the
Truncated cascade distribution and show that this distribution reaches the maximum concentration level of a championship. The fourth section presents the Discussion and Conclusions in which we highlight the contribution of knowing the distribution of the maximum concentration in advance, for example, for normalization and, if necessary, for comparison. Three appendices provide proof of the results formulated in the Results section. Moreover, in the
Supplementary Materials, we provide a Microsoft Excel spreadsheet that presents the
Truncated cascade distribution for different scoring systems.
2. Materials and Methods
Perfectly unbalanced distribution is identified in the literature as the result of a competition, where, in the final ranking, the first team has beaten the rest; the second, all others except the first; and so on, until the last, which has lost all its games [
68]. Due to its graphic representation, we called this the
Complete cascade distribution. It is true to say that if the total number of points of a championship is established from the beginning, this distribution will generate the HHI
max. However, this stability condition is not always fulfilled; in the scoring system currently used by the Union of European Football Associations (UEFA), it is not met, and therefore, the
Complete cascade is not the distribution that generates the HHI
max. In the following paragraphs, we present the basic model that we follow in our analysis and characterize the
Complete cascade distribution as the
perfectly unbalanced distribution when this stability condition is fulfilled.
2.1. The Model and the Perfectly Unbalanced Distribution of Scores
We consider a league with teams. The tournament can be played in a single or double round-robin system. The points assigned to each team in a match correspond to a pattern of the type ( points are awarded for a win, points are awarded for a tie, and points are awarded for a loss), where , and is assumed. Let be the number of points obtained at the end of the championship by the i team; the subscript i indicates the position of each team in the final ranking. The vector represents the final score of the championship. A new vector of shares is defined, where . We define a competitive balance index as a function defined on the space of vectors of shares, which assigns to each vector of shares a value of the unit interval.
The number of matches played in a championship with N teams is N(N − 1)/2, while each team plays (N − 1) matches in a single round-robin system. In the case of a k-round system, the total number of matches played in a season is kN(N − 1)/2, while each team plays k(N − 1) matches. This proportionality undermines the importance of the number of rounds in the aggregate analytical terms of a season.
The so-called
perfectly unbalanced distribution was characterized by Fort and Quirk [
69], Gayant and Le Pape [
39,
70], Horowitz [
41], Owen et al. [
58], and Utt and Fort [
38], among others. In this distribution, “each team has defeated all those below them and have lost to all the teams above them. Once the tournament is over and the positions have been ordered, the results of distribution can be visualised as being ‘in cascade’ in which each team has fewer points than the team in the preceding position” [
38].
Figure 1 shows this for
N = 20 with a {2, 1, 0} pattern in a single round-robin system.
The
Complete cascade distribution, identified in the literature as the most concentrated for a {2, 1.0} point pattern, visually corresponds to an inverted staircase (
Figure 1). This most unbalanced distribution is also achieved with other patterns as is the case when no ties are allowed (as in baseball or basketball). The reason is that in the case of the pattern {2, 1, 0} as in the case {1, 0, 0} or {k, 0, 0}, in each match the same number of points is distributed as opposed to a pattern of the type {3, 1, 0} in which the number of points distributed depends on the outcome. The European soccer leagues use a point pattern {3, 1, 0}. The points distribution becomes unstable depending on the number of ties: with a point pattern {2, 1, 0}, at the beginning of each game, there are two points to be distributed among the teams and those two points are those that are assigned (both to the winner, or one to each team if they tie). However, with a system {3, 1, 0}, in the event of a tie, two points are awarded; if it is a win, three points are awarded to the winner. With a {2, 1, 0} system, both teams are always awarded two points together at the end of the match; with a {3, 1, 0} system, they can be awarded 3 if someone wins, or 2 if they tie.
The
Complete cascade distribution is rarely formalized [
39]. We will do so below.
First, in a single-round competition, the points obtained by the team in position
i have won
N −
i matches:
In the case of competitions with k rounds, the value of pi shall be multiplied by k, that is, 2 in the case of round-robin competitions.
Second, the higher concentration, lower competitive balance is obtained in such a distribution where the team in position i wins all its matches against teams in position j > i and loses its matches with teams in position 1 < m < i. The scoring system {2, 1, 0} at the end of the season distributes a fixed number of points, that is, it does not depend on the results of each match.
If
is constant, the marginal value of HHI is as follows:
Thus, as is constant, the marginal value increases with the value of points obtained by the team. The maximum number of points that any team can score is limited by the maximum number of matches that it can win. The first team wins its N − 1 matches, but from the second onwards, team i can only defeat teams j > i since it has lost to the first i − 1 teams. This feature could be called the principle of saturation.
Ultimately, the contribution to the value of the concentration index for each point is increased with the points won by a team but it is limited (saturated) by the number of matches that it can win.
This result applies to scoring patterns of the type .
Thirdly, the distribution of the results are
p . Given the total points
, the vector of shares are as follows:
The share of the first team in this distribution is that of the N-teams in the
perfectly balanced distribution [
43].
2.2. Complete Cascade Distribution and Stability Condition
Let us imagine a league with four teams, each playing double round-robin with a scoring system {3, 1, 0}. This is the case for each group in the final classifying rounds of the UEFA Champions League (UCL). The Complete cascade distribution implies the vector of results and generates . Would this be the maximum value that the concentration could reach? If so, then y .
However, in this competition, there are, at least, three possible results, which contradict the above: (a) two teams win all their matches, except the one played against each other, which is a tie, and the remaining two teams tie with each other; (b) one team wins all its matches, as does the next team, except those that played against the first, and the other two teams tie with each other (but lose against the first two); and (c) one team wins all its matches, and the other three tie with each other.
The distribution of results is the following , , and . The corresponding values of the sum of the square of the score share is the following: HHI(a) = 0.391, HHI(b) = 0.412 y HHI(c) = 0.413. All these are greater than . Therefore, if we assume that represents the distribution that generates the maximum concentration, the values that we calculate for and the DCB index are greater than the whole, which is impossible, given the fact they are standardized.
This example clearly shows that the Complete cascade does not necessarily have to be the distribution that generates the maximum value of as it does not represent the perfectly unbalanced distribution in all cases. What is the reason for this?
The scoring system establishes the rewards or incentives for winning. It is part of the mechanism that generates results in a competition. The possibility of ties occurring with a system of the type {3, 1, 0}, as mentioned above, demonstrates that the number of total points at the end of the championship can vary depending on the results of each match, and the distribution of points will be unstable.
Below is a condition that guarantees the stability of the number of points allocated in a tournament.
Stability Condition.If , the number of points obtained in a tournament, , with N teams does not depend on the results obtained by each match [39,43]. This stability condition can be generalized with Lemma 1, including the points obtained by the losing teams, .
Lemma 1. If or , then the stability condition is fulfilled.
Proof. Let
be the number of wins and
the number of ties in a tournament. Then
. Let
be the proportion of matches with a winner,
be the proportion of matches without a winner, that is, tied matches. The total points at the end of a championship is as follows:
Then, if or , the total championship points only depends on N and scoring pattern. □
When the
stability condition is not fulfilled, instability is generated, which is identified as one of the main problems in measuring competitive balance [
39].
In this case, how can we calculate a perfectly unbalanced distribution? Additionally, how can we apply the principle of saturation? Given that the marginal contribution to the value of the HHI concentration index increases with the number of points obtained by the team, the winning matches should be accumulated among the top teams in the ranking, while the rest should be tied. Thus, the highest HHI value is obtained when the teams at the top of the ranking win all their matches with the teams below them in the final ranking.
Therefore, the real issue is that the Complete cascade distribution is not perfectly unbalanced in all cases. The Complete cascade distribution is the least balanced distribution when the stability condition is satisfied. However, with another pattern, the perfectly unbalanced distribution is different from the Complete cascade distribution.
3. Results
In this section, we demonstrate that the HHI has a unique maximum for each N, called HHImax. We also obtain the value of HHI for a generic scoring system and show that there exists a critical value of q with which we obtain the most concentrated distribution of points. The top q teams win their games in cascade, while for teams in positions higher than q, all teams are tied. This critical value of q is characterized by the solution of a fifth-degree polynomial equation.
3.1. Distribution of Scores
We define a type of asymmetric distribution of points, characterized by a q value in which the top q teams win their matches in cascade, while for teams in positions higher than q, all teams are tied. Therefore, in this type of point distribution, two groups of teams emerge: those that have won at least one match and those that have not won any match, tying with each other.
Remark 1. Note that, for each q, the number of points is constant. Therefore, the saturation principle can be applied to each value of q.
For every , a generalized distribution is obtained, which we call the Truncated cascade in () distribution. For the value , the teams tie all their matches, and we obtain a perfectly balanced distribution. For the value , the teams win their matches in a cascade, that is, their point distribution coincides with the classical Complete cascade distribution.
Let us now consider a pattern
; in this case, the points obtained by the teams are as follows:
3.2. The Corresponding Herfindahl–Hirschman Index
In a competition with N teams, Proposition 1 allows us to obtain the expression of HHI as a function of the q-value, for a Truncated cascade distribution in (q).
Proposition 1. For every , and {3, 1, 0} scoring system, the value of HHI is the following: From there, for a competition with N teams with an established scoring pattern, we want to calculate the critical value (q*) that forces the HHI to reach its maximum value.
Theorem 1. Let , with , then HHI(q) has a unique maximum and minimum in [1/N,1].
3.3. The Cut-Off Value of q for HHImax
In this section, we obtain, with Theorem 2, the cut-off value of q that causes to reach its maximum value. For this purpose, we build a recurrent formulation of the transition of HHI between two consecutive values of q, with 0 ≤ q < N.
Theorem 2. Let N be the number of teams in a tournament with a scoring pattern {3, 1, 0}; then, the maximum value of HHI is obtained by accumulating wins in the first q* teams, where q* satisfies the fifth-degree inequality: While with Theorem 1 we defined the existence of a single maximum value of HHI, with Theorem 2, we constructively generated a polynomial inequation of the fifth degree from which we can obtain the cut-off value of q to define the Truncated cascade distribution of points that reaches the maximum value of HHI, that is, the perfectly unbalanced distribution, in a competition with N teams and a scoring pattern {3, 1, 0}.
3.4. Calculus and Results
To perform the calculations that allow us to obtain the cut-off value of Theorem 2, we developed a Microsoft Excel spreadsheet, a tool that allows us to confirm the theoretical results already formulated and to obtain the cut-off value of
q for any
N. This tool is provided in the
Supplementary Materials.
Table 1 shows, for a score pattern {3, 1, 0} and values of
N between 2 and 25, the maximum value reached by HHI. As shown, the values of HHI
max decreased with
N, while the values of
q increased. However, the critical point of the cut-off value of
q, where the HHI
max is reached, can be observed in relation to higher and lower values. In the
Supplementary Materials, we observed that the value of HHI, its numerator and denominator evolve. The values in bold in
Table 1 show a
perfectly unbalanced distribution.
Figure 2 shows graphically, for comparison with
Figure 1, how the cascade of teams that won some games was interrupted at the 7th rank with a scoring pattern {3, 1, 0}.
Table 2 compares the values of
q and HHI
max for two scoring patterns: one that fulfils the
stability condition, {2, 1, 0}, and one that does not, {3, 1, 0}. We observed that with the pattern of scores {2, 1, 0}, the cut-off value of
q was
N − 1, whereas with the pattern of scores {3, 1, 0}, the cut-off was truncated at a value less than
N − 1, which increased with
N.
With the known maximum and minimum values of the HHI for a competition with N teams and scoring pattern {3, 1, 0}, we can standardize and, therefore, compare, using the DCB. This result provides an answer to the problem [
39] of not being able to standardize the HHI for scoring patterns that do not fulfil the
stability condition.
In the major European leagues and in the UEFA Champions League (UCL), the current scoring pattern is {3, 1, 0}. Specifically, in the qualifying round of UCL, there are groups of four teams. In this case,
Table 2 shows that the cut-off value of
q was 1, so the
perfectly balanced distribution was associated with one team winning all its matches, and the rest of the teams tying with each other and losing to the first ranked team. So HHI
max = 0.413333, and if the
Complete cascade distribution is incorrectly used, HHI
max = 0.388888. Similarly, in a league with 20 teams and a scoring system {3, 1, 0}, the cut-off value was 7, and the HHI
max was 0.075402 with the
Truncated cascade distribution, while the HHI
max value for the
Complete cascade distribution was 0.068421.
For each N, the maximum of HHI with {3, 1, 0} pattern was greater than the maximum of HHI with the {2, 1, 0} pattern. Therefore, if we do not use the Truncated cascade distribution, there is an overestimation of the concentration measured with HHIN. Then, the calculated competitive balance will be wrong, that is, underestimated.
In the literature, there are measurements of competitive balance, based on points scored at the end of the championship, which do not use standardized indices. We believe that such estimates should be reviewed. The use of standardized indices allows comparability between championships with different numbers of contenders.
If the championship scoring pattern fulfils the stability condition, the minimum competitive balance is well constructed from the Complete cascade distribution. However, if the scoring pattern does not fulfil the stability condition and normalized indices are used, the measurements should be redone using the Truncated cascade distribution. In the next section, we elaborate on this issue.
4. Discussion and Conclusions
So far, to compare the competitive balance between seasons, it was proposed to reconstruct the scores obtained with a {3, 1, 0} scoring pattern to a {2, 1, 0} pattern, based on the actual match results [
39,
45] and using the
Complete cascade distribution as a
perfectly unbalanced distribution.
This recalculation generates cardinal and ordinal negative effects, and alters the values of the competitive balance index [
55]. Therefore, for any model using competitive balance as a variable, the use of the reconstructed values for another scoring pattern may generate both quantitative and ranking errors. For example, if elasticity is estimated in a model, the results obtained using the reconstructed competitive balance are not correct. Note that calculating the competitive balance usually entails analyzing its evolution, together with its relationship with other characteristic variables of championships of sports: baseball [
7], American football [
33], football [
71,
72] are all good examples.
These problems arise from the inappropriate use of the Complete cascade distribution as a perfectly unbalanced distribution for scoring patterns that do not fulfil the stability condition.
In the case of the {3, 1, 0} scoring pattern, the most common in soccer, it does not fulfil the
stability condition and, in addition to the problem mentioned above, a new problem emerges, which Gayant and Le Pape [
39] identified as one of the main problems in sport economics: the impossibility of standardization due to the lack of knowledge, at that time, of the least balanced, that is, most concentrated, distribution for that scoring pattern.
In this paper, we identified and characterized the least balanced distribution for that scoring pattern, and therefore, we solved the problem generated by the fact of not knowing it a priori.
This identified generic distribution, which we call the Truncated cascade distribution, includes as a particular case the Complete cascade distribution, widely used in the literature for the {2, 1, 0} scoring pattern.
Failure to use the most appropriate unbalanced distribution linked to the corresponding scoring pattern leads to an overestimation of the concentration and, as a consequence, to an underestimation of the competitive balance, generating both cardinal and ordinal problems. The reconstruction of the results is not neutral. Cardinal effects affect possible modeling in comparative studies, as the values of the indices change. The ordinal effects are even more severe, as they can affect the league’s ranking for a season. One example is as follows. In the 2016/17 season, among the five major European football leagues, the Spanish Primera División was the most concentrated league (DCB = 0.5367 versus the Italian Serie A, with a DCB = 0.5366); if the data were reconstructed under a {2, 1, 0} system, the league with the most concentrated results are the Italian Serie A (DCB = 0.5749 versus the Spanish one, with a DCB = 0.5697). This same error can be checked for eleven of the twenty-one seasons between 1997/98 and 2017/18.
Another aspect to highlight is that to be able to compare competitions when some of their parameters change (number of teams or scoring system), it is necessary to normalize and, for this, it is necessary to know the maximum and minimum values of the index used, in this case, the HHI. This type of index also requires normalization that takes into account the specific characteristics of bilateral sports competitions.
Once the perfectly unbalanced distribution linked to the scoring pattern is known, we can construct a comparison with the necessary rigor and avoid problems derived from an unnecessary recalculation of the teams´ scores in the respective seasons.
Another relevant issue is that the formulation developed in this article is generic as can be seen in
Appendix A,
Appendix B and
Appendix C, so its particularization to other similar scoring systems is easily accessible. The spreadsheet contained in the
Supplementary Materials allows for this without any difficulty.
An extension of these findings to other types of scoring patterns (inclusion of bonuses, double alternative scoring of ties, etc.) in bilateral sports competitions could be the focus of a future study.