1. Introduction
Football is one of the most popular sports in the world, and the impact of “the beautiful game” on the international economy is growing fast (
Ashtonet et al. 2011). As a consequence, it has become a major subject of interest for researchers (
Salgado-Barandela et al. 2017). Nowadays, football means big money, and the economic consequences of failure can be huge for professional football teams playing in the national first divisions. Deloitte Business Group estimates that relegation for any English Premier League team costs approximately £50 million. Not surprisingly, a lacklustre performance is a cause for concern for owners of professional football teams and attracts a lot of media interest (
van Ours and van Tuijl 2016). One of the most frequent decisions made by football team owners in such a situation is to dismiss their head coach. Based on 22 years of data,
Frick et al. (
2010) state that on average 36% of head coaches are dismissed during a season in the German Bundesliga. As regards the French Ligue 1 and the English Premier League during the 2015–2016 and 2016–2017 seasons, 31 head coaches were dismissed or left their clubs before the end of the season. The prevalence of dismissal cases is quite high everywhere, even though it may differ from one league to another. For example,
Flores et al. (
2012) report that the head coaches in Argentina are dismissed more often than in Europe.
Head coaches are key actors in professional sports because of the strong impact they are expected to have on the performance of their team (
Detotto et al. 2018). It is true that, quite often, their role is not just coaching. Most of them also have the power to recruit the football players of their choice and to pick the team for games from the squad. This is the case in the French and UK professional football leagues, where the role and power of head coaches are very similar. Even if the quality of the individual players under contract seems to be the main determinant of the final league position of a team,
Frick et al. (
2010) suggest that the influence of the head coach on team performance is far from negligible.
Most papers examine two questions connected with head coach replacement. First, why and when is the head coach dismissed? Second, what is the effect of dismissal on team performance? These questions have been raised for the main American and European professional team sports: football, American football, baseball, basketball, and ice hockey (
Koning 2003;
Holmes 2011;
Hill 2009;
White et al. 2007).
The literature provides various reasons that head coaches are dismissed (
Allen and Chadwick 2012). One obvious reason is poor team performance attributed to bad coaching. If results do not match the club’s expectations, owners start to make changes, including coach dismissal, in order to achieve the expected results. The correlation between coach replacement and poor game results has been well documented in recent years, (for example, in
Barros et al. 2009;
Hughes et al. 2010;
Frick et al. 2010;
van Ours and van Tuijl 2016). Being in the relegation zone may trigger a coach dismissal (
de Dios Tena and Forrest 2007). Some authors have compared expectations (expressed as bookmaker odds) with actual results and suggest that a string of outcomes below expectations may lead to coach dismissal (
van Ours and van Tuijl 2016;
Pieper et al. 2014). Even if poor past performance seems to be the main determinant of the decision to dismiss, some other variables may also play a role in that decision, including the race, sex, ethnicity, and salary of the head coach (
Mixon and Trevino 2004;
Kahn 2006).
There is no clear-cut answer regarding the effects of coach turnover on team performance. A recent comprehensive review of the literature is found in
Scelles and Llorca (
2020). They identify 32 studies addressing that issue regarding 12 different countries. Among these studies, 11 find no significant improvement in performance after a coach replacement, 11 a negative impact, and 10 studies find a positive effect. The empirical results may depend on the period of time under scrutiny, that is, short-term versus long-term periods (
Lago-Peñas 2011;
Hughes et al. 2010;
Audas et al. 2006), and also on the methods used.
The key point of those studies is how to measure team performance. Most of the time, performance is measured on short-term results, for example over the 3, 4, or 5 games prior to dismissal, and it is expressed as the number of points won, the number of goals scored and allowed, the change in the league position, or the presence in the relegation zone (
Salomo and Teichmann 2000;
Frick et al. 2010;
d’Addona and Kind 2014;
Pieper et al. 2014). In this paper, we propose an innovative evaluation method of professional sports team performance based on Monte Carlo simulations. This technique makes it possible to estimate the probability distribution of the final team ranking and thus to calculate the expected final league position of teams. The computation of those probability distributions is based on team payroll. That expected final ranking is revised after each game day. Expected performance (over- or under-performance) at game day
t is measured by the difference between the expected final ranking calculated at the beginning of the season and the expected final ranking knowing the results of the first
t game days
1 .
We use that method to assess the performance of the English Premier League and the French Ligue 1 professional football teams over the 2015–2016 and 2016–2017 seasons. We show that the failure to perform according to expectations over the season is the main reason for the decision to dismiss head coaches: 93% of teams with an average expected final ranking two positions lower than what was expected at the beginning of the season dismissed their coach. That percentage falls to 83% for teams whose average expected final ranking is lower than the expected final ranking before the season started by one position. We also find that coach replacement improves performance only when the teams under-performed before the dismissal. If the teams performed well before the coach was dismissed, the dismissal hinders performance.
The rest of the paper is as follows.
Section 2 gives the team probabilities of winning, drawing, or losing a game computed from the team payroll.
Section 3 presents the performance evaluation method based on a Monte Carlo simulation. In
Section 4, we use that method to establish a link between performance and coach dismissal. In
Section 5, we investigate the relation between dismissal and performance. Finally, we present our conclusions in
Section 6 and comment on our results.
2. The Probability of Winning, Drawing, or Losing a Game
Calculating the probability of winning, drawing, or losing a game is the first step in building our stochastic model. We define those probabilities as contest success functions depending on the relative total payroll of the teams. We assume that the probability of team
i winning against team
j is not the same whether team
i plays at home or away. The probability that team
i wins, loses, or draws at home against team
j on game day
t is given as:
where
is the total payroll of team
k (
). This variable is supposed to be a proxy for team
k’s stock of talent (
Breunig et al. 2014) . From the previous set of probabilities, we can deduce the probabilities that team
j wins, loses, or draws in an away game against team
i. We have:
Parameter
has two functions. First, it allows us to compute the probability of team
i losing or drawing at home. If
, the home team never loses (wins or draws) and if
, the home team never draws (wins or loses). Second, it may also be considered as a measure of the Home/Away effect. The difference between the probability of team
i winning against team
j at home and in an away game is given by:
The lower
, the higher the Home effect. As we may see, for
,
,
, while the sign of
is ambiguous. Thus, a team has a higher chance of winning and a lower chance of losing when it plays a home game. In the same way, we can compute
,
and
and deduce
,
and
.
Table 1 displays those probabilities.
We consider a league championship to be a competition among professional sport teams organized by a league, each team playing against each other twice a season (at home and away). For football, that competition is known as the English Premier League in England and the Ligue 1 in France. From this set of probabilities, we can compute the expected total number of points each team may win when the league championship is composed of
n teams. The expected number of points for team
i against team
j at home,
, is written as:
where
and
are the points assigned for a win and a draw respectively, no point is assigned for a loss. Similarly, the expected number of points for team
i against team
j in an away game,
, is as follows:
It yields the expected number of points won by team
i against team
j:
and team
i’s expected total number of points:
Finally, after a few computation steps, we end up with the expression for the league championship expected total number of points as:
where
n is the number of teams playing in the league championship (20 teams in the Ligue 1 and English Premier League). To run our simulations (shown in the next section), we need to estimate the value of
. From Equation (
10), that value may be expressed as:
To estimate
, we replace
with the total number of points won over the season. For the English Premier League, that number is equal to 1033 for the 2015–2016 season and 1056 for the 2016–2017 season, and to 1032 and 1046 for the Ligue 1. Replacing
with those numbers in Equation (
11) and knowing that
,
and
gives us for
: 0.43 and 0.55 for the English Premier League and 0.42 and 0.5 for the Ligue 1. In the simulations, we take the average value of
, that is,
.
3. The Expected Performance of the English Premier League and Ligue 1 Football Teams from Monte Carlo Simulation
We take the English Premier League and the Ligue 1 fixtures for the 2015–2016 and 2016–2017 seasons, and we use a Monte Carlo simulation method to produce probability distributions for the final league positions of the teams. Using the total wage costs as a measure for the team payrolls (see
Table 2), we run a simulation model to generate 10,000 point paths for each team. A path displays at each game day the number of points a team has accumulated over the previous game days. The point paths are the results of a stochastic process in that the number of points the teams win at each game day is random and depends on the probability of winning, drawing, or losing the game. To compute that number, we use the RandomChoice command of Mathematica, which gives us the pseudorandom choice of getting 3, 1, or 0 points weighted by the probability of winning, drawing, or losing the game, as specified in
Section 22. More precisely, the number of points home team
i wins against away team
j at game day
t is given by the following Mathematica command: RandomChoice[{
}->{
}]. By running the simulations, we act as if the 2015–2016 and 2016–2017 seasons were played 10,000 times. Thus, we get 10,000 rankings of teams and we can compute the number of times a team was ranked 1st, 2nd, …, 20th, which gives us the probability distribution of final position for each team and allows us to compute the expected final ranking of teams. Being based on exogenous factors, the teams’ wage costs, those rank expectations cannot be manipulated by any stakeholder (owners or shareholders, sponsors, head-coaches, players, media, bookmakers, …).
The
Figure 1,
Figure 2 and
Figure 3 depict the 10,000 point paths (on the left-hand side of the figures) and the probability distribution of final positions (on the right-hand side of the figures) for a few Ligue 1 and English Premier League teams over the 2015–2016 season, namely Toulouse Football Club, Leicester City, and Manchester City. Two examples are depicted in the figures: the point paths computed at the beginning of the season (first line of the figures) and the point paths computed at mid-season, that is, at game day 19, after knowing the results of the first 19 games (second line of the figures). On the point path figures, the dashed curve is the expected point path. It displays the average accumulated number of points resulting from the simulations, and the solid curve gives the real accumulated number of points obtained by the team over the season. If the solid curve is below the dashed curve at date
t, the team has under-performed over the first
t game days. For example, Toulouse has consistently performed below what was expected at the beginning of the season and the solid curve is below the dashed curve over the 38 game days; it is the opposite for Leicester City. The performance of Manchester City is very close to what was expected when the season started.
On the figures showing the probability distribution of ranks (right-hand side of the figures), the bars give the team probability of ending 1st, 2nd, …, 20th. The expected final rank is the product of the probability and the rank. For example, the expected final rank computed at the beginning of the season is 11.375 for Toulouse (see
Figure 1). On the same figures, the dot shows the observed final rank. Toulouse ended 17th. The expected final rank of a team may also be revised at any game day
t by taking into account the observed total number of points won over the first
t game days. For example, the expected final rank of Manchester City computed at game day 19 is equal to 2.63 (see
Figure 3). This is the result of a non-random component, the total number of points Manchester City won over the first 19 game days, and from a random component, namely, the number of points the team could win after game day 19.
We measure the under- or over-performance of team
i at game day
t as the difference between the expected final rank computed at that date (knowing the results of the
t first game days) and the expected final rank computed at the beginning of the season (without any information regarding the results of the team over the first
t game days). We call that difference the expected margin,
:
where
and
denote the expected final rank of team
i at dates 0 (before the season starts) and
t respectively. A positive expected margin at game day
t means that the team performance expectation at that date is worse than what was expected at the beginning of the season. Conversely, when the expected margin is negative, the performance of the team is better in expectation than what its relative payroll would have suggested at the beginning of the season.
Figure 4 and
Figure 5 display the evolution of the expected margin for the 20 teams of the French Ligue 1 and the English Premier League respectively, for the 2015–2016 season. The same is provided in
Figure 6 and
Figure 7 for the 2016–2017 season. As an example, let us consider the evolution of the expected rank for Marseille over the 2015–2016 season (second figure of
Figure 4). At the beginning of the season (Game day 0), the expected final rank for Marseille was 3.28. Until game day 26, Marseille performed slightly less well than what was expected at game day 0. Its expected rank amounted to 5 on average. After game day 26, there was a dramatic increase in the expected final rank of the team. At game day 34, Marseille coach, Míchel, was dismissed. The expected margin at the date of dismissal was around 8.5. Marseille ended 13th.
For each game day, the sum of the teams’ expected margins is equal to zero, because when one team performs better than expected, at least one other team under-performs. Formally, we have:
That result is easy to prove. For a league championship composed of 20 teams, we know that the expected final rank of team i at game day t, , where is the probability computed at date t that team i will finish at rank r. Summing the expected ranks gives: . Since , we get: . Finally, .
According to this result, the sum of the expected margins of the under-performing teams at game day
t is equal to the opposite of that of the over-performing teams:
where
denotes the set of teams that under-performed (
) and
denotes the set of teams that over-performed (
).
The average expected margin of team
i over the entire season, denoted
, is computed as:
where 38 is the number of game days. Of course, the previous results also hold for the average expected margin, namely:
and
Note that may be seen as a “Surprise Index” of the team rankings over the season. The higher , the less expected the final rankings of the teams on average over the season. The maximal value of is 100. This corresponds to a situation in which the team that was expected to finish 1st ended 20th (), and the one that was expected to finish 2nd ended 19th (), and so forth. In that case, On the other hand, the minimal value of is zero when the rankings on average are exactly those expected. We may express this “Surprise Index” in terms of the percentage of surprise: . The higher the , the more surprising or unexpected the season. If , there was no surprise at all regarding the season; if , the season was totally unexpected.
Table 3 reports the computation results of the average expected margins for the Ligue 1 and English Premier League teams over the 2015–2016 and 2016–2017 seasons. The teams are ranked with respect to their margin (in decreasing order). The upper part of the table shows the results for the under-performing teams (
); the results of the over-performing teams are reported in the lower part of the table (
). According to our “Surprise Index”, the English Premier League 2015–2016 season was the most unexpected season (
); the English Premier League 2016–2017 season was the least surprising one (
). That index reaches 19.21% for the Ligue 1 2015–2016 season and 18.55% for the Ligue 1 2016–2017 season.
4. The Dismissal of Head Coaches
Over seasons 2015–2016 and 2016–2017, 31 coaches were dismissed or left their club before the end of the season, 18 in the French Ligue 1 and 13 in the English Premier League (see
Table 43). For the clubs which sacked their head coach, the game day just before dismissal is depicted by a vertical dashed line in the figures of
Figure 4,
Figure 5,
Figure 6 and
Figure 7. As regards the Ligue 1, the average expected margin computed at the game day just before dismissal is equal to 3.32 and 3.50 for seasons 2015–2016 and 2016–2017 respectively and to 3.19 and 2.11 for the English Premier League. Regarding the two leagues and for the two seasons together, 28 clubs fired their coach and 52 did not.
We assume that the dismissals of head coaches are linked to the teams lack of performance. We hypothesis that dismissal is actually the consequence of under-performance
4. Head coach dismissals thus should happen for teams displaying a positive average expected margin (
). It is what our simulation results show on average: The mean average expected margin of teams which dismissed their coach,
, is equal to
, whereas that of teams which did not,
is equal to
(see bottom of
Table 3). Here also, there is a relationship between those two mean average expected margins. We know that:
where
D denotes the set of teams that dismissed their coach and
denotes the set of teams that did not dismiss their coach. We can write:
where
stands for the number of teams that dismissed their coach and
stands for the number of teams that did not. This expression can be written as:
Consequently, for the two leagues and the two seasons, .
Whatever the league and the season, the mean of teams that dismissed their coach is positive. Symmetrically, the mean of the teams that did not dismiss their coach is negative. As for Ligue 1, for the 2015–2016 season and for the 2016–2017 season. As for the English Premier League, is equal to for the 2015–2016 season and for the 2016–2017 season and is equal to for the 2015–2016 season and for the 2016–2017 season.
In addition, it is clear from
Table 3 that most of the teams that dismissed their coach (in bold) underperformed—61% of the teams that exhibit a positive average expected margin (22 teams out of 36) dismissed their coach. Only 13% of the teams displaying a negative average expected margin dismissed their coach (6 teams out of 44).
Figure 8 displays the distribution of teams according to their average expected margin (grey bars). The black bars show the distribution of teams that dismissed their head coach. For example, the number of teams having an
between 0 and −1 is equal to 18. Among those teams, 4 teams dismissed their coach. Among the 15 teams that exhibited an
greater than 2, 14 teams dismissed their head coach (93%). As for the teams displaying an
greater than 1 (23 teams), 83% of them dismissed their coach. The link between bad performance and the dismissal of coaches is clearly established here.
To show graphically the relation between performance and dismissal (
Figure 9), we plot the expected final rank computed at the beginning of the season,
, against the average expected final rank for teams that dismissed their coach (circles) and for those that did not (black dots). The figure displays 80 points (two leagues of 20 clubs for two seasons). The average expected margin,
is measured by the distance of the points (circles or black dots) to the first bisector. The further above (below) the bisecting line the points are, the lower (higher) the performance of the teams. It is clear from the figure that most of the points above the first bisector are circles showing that most of the under-performing clubs dismissed their coach, whereas clubs that performed quite well on average did not dismiss their coach (the black dots are mostly below the first bisector).
We see that a few well performing teams dismissed their head coach (a few circles are below the bisecting line). The reason might be because the national league championships are not the only competitions in which the teams play. There are also other competitions, such as the
Coupe de France and the
FA Cup in England, that may have an influence on the perceived performance of teams. Dismissals may occur because of poor performances in those competitions. For example, this was the case for the French club Stade Rennais (Rennes) whose coach, Philippe Montanier, was dismissed in 2016 even though the team performance in Ligue 1 was better than expected (
). Coach Montanier was dismissed under pressure from fans who blamed him for the elimination of the team from the
Coupe de France against Bourg-en-Bresse, a poorly ranked team of the French Ligue 2.
5The interpretation of our results is straightforward: Almost all the clubs whose average expected performance was below pre-season expectations dismissed their coach. Another question is whether the club decision to dismiss is different according to the seasons and the leagues. To investigate this question, we estimate the following equation on the sample of teams that dismissed their head coach:
where
and
The results are reported in
Table 5. We have 1036 observations (28 dismissed coaches times 37 game days) Parameter
is the mean average expected margin of English Premier League teams that dismissed their head coach over the 2015–2016 season (
). The mean average expected margin is significantly lower for the 2016–2017 season (
). This is mostly because Sunderland’s head coach was not dismissed despite the poor performance of his team (
). For the Ligue 1, the difference in the mean average expected margin between the 2015–2016 and 2016–2017 seasons is positive and significant (
).
The difference in the mean average expected margin between the Ligue 1 and the English Premier League is negative and significant for the 2015–2016 season ( and positive and significant for the 2016–2017 season (β0 + β1 + β2 + β3 − β0 − β1 = β2 + β3 = 1.5). Even if the two leagues and the two seasons differ in terms of dismissal decisions, the common feature is that dismissal and poor performance are strongly related in both leagues for both seasons.
5. The Effect of Dismissal on Team Performance
In this section, we assess the impact of dismissal on team performance. Because of the perfect symmetry between the average expected margins of teams that dismissed their coach and the average expected margins of those that did not, we focus on the sub-sample of teams that dismissed their coach. To evaluate whether dismissal improves performance, we estimate the following equation:
where
The expected margin before dismissal is usually computed as:
. The expected margin after dismissal takes the value of
. It is as if a new season started for teams that dismissed their coach. The performance of the new coaches is assessed from date
d onward. For those coaches, the reference expected final rank is
; it is not
anymore (see
Figure 4,
Figure 5,
Figure 6 and
Figure 7 for clubs that dismissed their coach). We also consider the same period of time in charge for coaches who were dismissed and those who took over. For example, if a team dismissed its coach at game day 10, we only consider the next 10 game days of coaching for the new coach. If a coach is dismissed at game day 25, we only take into account the last 13 game days of the former coach (the reference expected final rank is then computed at game day 12) and the 13 remaining game days of the season for the new coach (38−25). In that sample, the average coaching time period under consideration is about 12 game days. We thus focus on what we may consider as the short-term effects of coach dismissal.
In this equation,
is a dummy variable that takes a value of 1 for the game days after dismissal and a value of 0 before. Parameter
thus measures the mean average expected margin of teams before dismissal. If
is negative and significant, it means that the mean average expected margin is significantly lower after the dismissal than before and consequently that the teams performed better after the dismissal than before. The results are given in
Table 6 (1st column). Parameter
is significant and equal to −0.455, suggesting that the
after the dismissal is lower (by about 0.455) than the
before dismissal. The
is equal to 0.740 before dismissal and 0.285 after (0.740−0.455). According to that result, performance improved after dismissal, even if, on average, the teams of the new coaches still under-performed (
is still positive after dismissal). We confirm here the results obtained by
Scelles and Llorca (
2020) that coach change improved the post-dismissal performance of the French Ligue 1 teams over the 2000–2016 period. Symmetrically, the performance of teams that did not dismiss their coaches decreased on average.
6Figure 10 illustrates this result. The horizontal axis gives the expected margin of teams before dismissal (
), while the expected margin after dismissal is shown on the vertical axis (
). Each point in the figure corresponds to the same day of coaching for the former coach and for the new one. For example, if a coach is dismissed just before game day 8, there will be 8 points in the figure regarding that team. The coordinates of the 1st point correspond to the expected margin of the former coach at game day 1 and to that of the new coach at game day 9, which is the first day that the new coach is in charge. The 8th point displays the expected margin of the former coach on her last game day in charge and that of the new coach on her 8th game day in place (game day 16). The dashed line represents the first bisector. The figure shows that a higher number of points are below the first bisector, illustrating the fact that the mean average expected margin is lower after than before dismissal. We also see that this may not be true for teams that over-performed before dismissal (
. For those teams (left-hand side of the figure), it seems that the number of points above the bisector is higher. It is as if teams that performed well before dismissal displayed a lower performance after dismissing their coach.
To investigate this issue econometrically, we estimate the following equation:
where
and
The dummy variable
Perf is equal to 0 for teams that under-performed before dismissal
and 1 for those that over-performed before dismissal
. The results are reported in
Table 6 (2nd column). All the estimated coefficients are significantly different from zero. The mean average expected margin of under-performing teams is equal to 1.23 before dismissal (
) and 0.10 after (
). On the other hand, the mean average expected margin of over-performing teams is equal to −0.36 before dismissal (
) and 0.68 after (
β0 +
β1 +
β2 +
β3). The two points (1.23, 0.10) and (−0.36, 0.68) are displayed in
Figure 10 (black dots). Regarding the teams that over-performed before they dismissed their coach, that point is above the first bisector; for those teams that did not perform well before dismissal, the point is below the bisecting line. This result suggests that for teams performing quite well before dismissing their coach, dismissal hinders performance. Conversely, for teams performing poorly before dismissal, dismissing their coach enhances performance.
Poulsen (
2000) finds similar results regarding the top two English divisions for the 1993–1994 to 1997–1998 seasons. According to him, “if teams are doing well, or just “not to bad” then there appears to be no gain from sacking the manager”. We may use a medical allegory to illustrate our results: Dismissing a coach is like taking medication, it may help if you are sick but if you take medication for an illness you do not have, you will only experience the negative side effects.
We may wonder whether this result is not due to a regression to the mean phenomenon. It might be the case that poor team performance is due to bad luck. But bad luck is unlikely to last for long. After a while, performance will improve whether the coach is changed or not. The improvement in performance would thus be a pure statistical phenomenon. The same statistical effect may be observed for teams performing well before dismissal and whose performance is hindered after dismissing their coach. However, luck is probably not the right explanation here because the period of time we consider is quite long: 12 game days on average. A team may be unlucky for a few game days, but if it is unlucky for 12 game days in a row, the cause may not be only a matter of bad luck.
6. Conclusions
In this paper, we propose an innovative method for assessing the performance of professional sport teams using Monte Carlo simulation. To apply the method, we need only information regarding the teams’ stock of talent. We use the team wage costs as a proxy for that stock. The key principle of our method is to compare the average expected performance, expressed as the average expected final position over the season, with the position expected before the season starts (depending only on the teams’ relative payroll). We call that difference an average expected margin.
We show that in-season head coach dismissals are the result of poor performance over the season in comparison to pre-season expectations. Among the teams displaying an average expected margin greater than two, that is, an average expected final ranking two positions lower than what was expected at the beginning of the season, 93% of them dismissed their coach. For the Ligue 1, the average expected margin computed at the game day just before dismissal is equal to 3.32 for the 2015–2016 season and 3.50 for the 2016–2017 season. It means that on average the expected final ranking at date of dismissal is between 3 and 4 positions lower than the pre-season expectations. For the English Premier League, it is equal to 3.19 and 2.11 for the 2015–2016 and 2016–2017 season, respectively. We also find that dismissing a coach may enhance performance only if the team under-performed before the dismissal. According to our findings, even if a team improves its results after coach dismissal, it may still perform below expectations. Conversely, if the teams performed well before the coach was dismissed, the dismissal hinders performance.
There is no obstacle to using the same method for professional teams in other sports. The method is easily reproducible and does not require much information in order to be applied.