1. Introduction
Modern portfolio theory originated with the introduction of mean-variance optimization (MVO) by
Harry Markowitz (
1952). By formulating the question of optimal portfolio selection as a quadratic optimization exercise, Markowitz reduced a complex problem to the estimation of expected returns and covariances for investable assets. While his solution was conceptually and mathematically elegant, practical problems in its implementation became clear over time.
Michaud (
1989) and others noted that MVO often results in portfolios with dramatically unequal position sizes, with many assets reduced to zero weight, while a few assets take on very large weights.
Green and Hollifield (
1992) analyzed the sensitivity of MVO solutions to even modest changes in expected returns or the covariance matrix. This lack of robustness is especially troubling given the known difficulty of estimating forward-looking measures, particularly forward-looking returns (
Merton 1980). Finally, research has established that MVO portfolios often perform very poorly in practice, even underperforming naive portfolio construction methods like equal weighting (
Kan and Zhou 2007). Spurred by these realizations, many institutional investors have sought alternatives to portfolio optimization (
Hurst et al. 2010) or have attempted to correct it in various ways.
Early proposals for MVO improvement, such as those of
Frost and Savarino (
1988), added constraints to portfolio weights beyond the standard requirement that all weights sum to 100% and the common avoidance of negative weights (short-selling). While added constraints can improve on the weaknesses of MVO, asset-level or asset class-level constraints are inherently arbitrary and portfolio-specific, making them difficult to generalize.
Black and Litterman (
1992) applied a Bayesian approach to the problem, seeking to merge an investor’s expectations with market-implied figures. However, this requires the existence of a market portfolio, and especially the ability to identify such a portfolio.
Jorion (
1986) showed the value of “shrinking” estimates of the expected return vector to increase stability; other shrinkage methods have focused on the covariance matrix, imposing structure beyond the sample covariances taken from historical returns (
Ledoit and Wolf 2003). The main complication for any shrinkage technique is the need to specify a target and a shrinkage intensity a priori.
Michaud (
1998) suggested the use of resampled portfolios, where the final weights would be an average of numerous portfolio optimizations, each drawn from a distribution defined by expected returns and covariances. While this approach has merit, it produces a single output portfolio without the ability to adjust the balance between optimality and robustness.
The field of robust optimization has also taken an interest in improving MVO by employing uncertainty sets (
Tutuncu and Koenig 2004). In this approach, the analyst solves for the weights that produce the best outcomes under the worst realizations of returns and covariances. More recently, statistical and machine learning methods have introduced the idea of regularization to portfolio optimization (
Brodie et al. 2009;
Carrasco and Noumon 2010). Simply put, regularization imposes an explicit or implicit penalty on large target weights, often driving optimization toward more general solutions that should perform better across a range of outcomes (for an overview, see
Tian and Zhang 2022). In the utility maximization context, this can mean adding an extra term to the objective function to discourage large weights in any single asset, or it can involve more complex reformulations of the optimization problem (
Ban et al. 2018). While all of the aforementioned MVO improvements have their merits, few have achieved the combination of simplicity and results necessary to become popular among practicing portfolio managers.
We propose two intuitive methods of regularized portfolio optimization in the context of multi-asset portfolios. In both, the key idea is to approximate mean-variance optimality while reducing the concentration of the portfolio—simply defined as the sum of squared weights. The first method starts by running standard mean-variance optimization to obtain the maximum possible portfolio utility. It then performs another optimization, minimizing concentration while keeping utility at or above a preset percentage of the maximum possible utility. The second method resamples the assets across iterations, analogous to a scenario where only a certain number of assets are available for investment in a given iteration. Each method allows for adjustable levels of regularization, and each noticeably reduces portfolio concentration. Finally, we test the performance of both methods using a novel approach. We generate thousands of hypothetical return series for all available assets using a multivariate Gaussian distribution, then set expected returns and covariances as a weighted average of past values and “future” values. By toggling the weight assigned to past versus future values, we simulate different forecasting skill levels, allowing us to demonstrate the potential benefits of the two methods. We show that for all but the most implausibly skilled investors, some degree of regularization in portfolio optimization will improve risk-adjusted returns. We also show that both methods reduce the volatility in portfolio weights as optimization inputs change.
2. Background
The canonical MVO problem is given in matrix form by
where
is the column vector of expected returns,
x is the column vector of weights,
is the risk aversion parameter, and
V is the covariance matrix (
Cornuejols et al. 2018). The first constraint ensures that the portfolio weights sum to one. For most investors, the second constraint applies and mandates that there are no negative weights (i.e., no short selling). MVO requires estimating expected returns and covariances among all assets, as well as knowing the investor’s level of risk aversion. Following
Paravisini et al. (
2010), we will assume an average level of investor risk aversion throughout this paper, with
set to 3.
To illustrate the workings of MVO and our proposed regularization methods, we will utilize a set of self-created capital markets assumptions (CMAs) detailed in
Appendix A. Using standard MVO with no short-selling, we obtain the weights shown in
Table 1.
One feature becomes apparent immediately: the portfolio is highly concentrated in only three assets, with the other seven assets receiving no allocations at all. This is despite relatively modest differences in expected returns and covariances among the ten total assets. Less obvious is how sensitive these estimates are to small changes.
Table 2 demonstrates that adjusting the expected return of Non-US Developed Equities up or down by only 50 bps can lead to swings of over 15% in total portfolio weights. Many asset owners rightly question the wisdom of a process that produces such extreme weights and is so reactive to minor shifts in assumptions.
Besides the inherent difficulty of predicting the future, there is an additional reason the weights are so unstable: very different weights can often produce similar utility. In
Figure 1 below, we have constructed a simple two-asset portfolio. Asset A has an expected return of 8% with an expected standard deviation of 16%, while Asset B has an expected return of 4% with an expected standard deviation of 8%. The expected correlation between the assets is 0.2, and we assume an investor risk aversion of 3. While the maximum utility of 4.81% is achieved when Asset A has a weight of 62% (and Asset B’s weight is thus 38%), the portfolio can achieve at least 98% of the maximum utility with Asset A weights of between 47% and 77%. Though the exact results depend on the expectations, most realistic capital markets assumptions will induce similar outcomes. This insight leads directly to the first method we propose.
4. Results
Arguably the best test of any portfolio construction method is how its created portfolios perform in terms of returns and volatility in an out-of-sample time period. To that end, we offer a way of testing the portfolio construction methods we have described, as well as almost any other approach to portfolio construction. The challenge of back-testing portfolio construction methodologies is that many require some ex ante assumptions, which can be difficult to formulate after the fact. In the case of MVO and the two methods we have proposed, tests require expected returns, expected volatilities and expected correlations.
We begin by simulating daily returns for all ten available assets using the annualized capital markets assumptions found in
Appendix A. We draw the returns from a multivariate Gaussian distribution, though more complex and realistic methods could be used instead. We simulate 10,000 runs, each lasting 504 trading days (2 years) to obtain a large sample size of potential market activity. In each simulation run, we assume the position of an investor standing at the end of day 252, denoted as time
t, that is the midpoint of the time series. While the investor knows the realized returns and covariances for days 1 through 252 (in-sample period), they are investing for days 253 through 504 (out-of-sample period), and, thus, must forecast those unseen values. For each of the 10,000 runs, the investor’s return expectations at time
t are a weighted average of the previous 252 days (R
P) and the forward 252 days (R
F), as seen in (3) below. The same relation holds for expected volatilities and expected correlations.
The parameter that controls the relative weights is η, which can be thought of as a measure of investor forecasting skill. In the case of an investor with no forecasting skill, η equals zero, and expectations are naively set to the realized values of the past 252 days; any minimal correspondence between past and future returns will be due to autocorrelation. If an investor is perfectly clairvoyant, η equals one, and expectations are the same as future 252-day realized values. The η value is thus conceptually similar to the information coefficient (IC) developed by
Grinold and Kahn (
2000), as each measures the correlation between predictions and outcomes.
For a given degree of regularization and level of forecasting skill, we create 10,000 portfolios (one for each simulation run) and store the weights. We also compute the realized return and volatility for each portfolio over the forward 252 days, as well as the Sharpe ratio. Finally, we record the out-of-sample median Sharpe ratios for all combinations of regularization and skill, which are shown in
Table 5 and
Table 6 for Method One and Method Two, respectively. Each row of each table corresponds to a skill level, while the columns are degrees of regularization; note that the first column of each table is equivalent to MVO because it assumes no regularization. The median Sharpe ratios in each row are colored in green if they exceed the median MVO Sharpe for their row, while they are colored in red if they fall below.
Two results stand out in this analysis. First, regularization does appear to improve ex-post portfolio Sharpe ratios, especially at lower levels of forecasting skill. Only once forecasting skill (η) rises to or above 0.4 does MVO routinely outperform the regularized methods. Considering that an IC of even 0.2 corresponds to an impressive hit rate of 60%, it seems that the methods described could have wide applicability. Second, the two methods exhibit somewhat different patterns of Sharpe ratio change. Method One achieves improvements more gradually at lower skill levels than does Method 2, while Method 2 appears to be somewhat more robust at higher levels of skill. It is worth remembering that Method One explicitly solves for minimum concentration for a given level of utility, while Method Two is more of a heuristic that achieves broadly similar outcomes. Method One can also accommodate any constraints, while Method Two cannot guarantee the same due to its final step of averaging weights across iterations.
The weights calculated by each method also contain valuable information, showing the reduction in weight volatility as the degree of regularization increases.
Figure 3a,b illustrate this for a subset of assets, while
Table 7 and
Table 8 contain the results for the full asset lineup. All else equal, less volatility in weights leads to less portfolio turnover and thus reduced transaction costs. Less movement in portfolio weights over time should also mitigate harm from estimation error in forecasting asset returns and covariances.
5. Discussion and Concluding Remarks
There are obvious practical implications for the ideas we have presented. While the results of a study can never guarantee future outcomes, our work suggests that professional investors managing institutional multi-asset class portfolios can make use of either regularization method to build portfolios that are more diversified, more robust to changing conditions, and possibly more efficient than those produced by classic MVO. The choice of which method to employ will likely depend on how many constraints are needed for the portfolio construction process, with Method One generally being more accommodating of constraints beyond those in Equation (1). Both methods allow portfolio managers to tailor the degree of regularization to their assumed skill level, as humbling as such an exercise might be. Just as importantly, portfolio managers can explain the logic of these methods to their clients without resorting to complex mathematics. Asset owners may also be drawn to the aesthetic aspects of regularized optimization, as portfolios will contain fewer extreme weights than in MVO. But practitioners should not be the only beneficiaries of this work. Finance is both an applied and an academic discipline, and the latter community can continue to develop these ideas. As always, more research is needed and welcomed.
One direction is to adapt the regularized optimization methods to single-asset class portfolios, possibly in combination with benchmark-relative performance metrics. This paper has argued for the benefits of regularized optimization in a common multi-asset setting, but we cannot guarantee that our methods produce comparable results in all investment universes (e.g., within emerging markets or among individual small-cap equities). A second option is to extend the two methods presented to the case of multi-period portfolio optimization. A more computationally intensive study could measure the effects of different volatility and correlation regimes on the results we have obtained. This might be combined with more realistic simulated returns by drawing from non-normal distributions and conditional correlation assumptions. One could also separately model skill in forecasting future returns from skill in forecasting future covariances, as the former task has generally proven much more difficult than the latter. Yet another extension may involve applying regularization to non-MVO methods of portfolio construction such as risk budgeting. Whatever the course of future research, we hope this paper has contributed both in terms of promising improvements to MVO and a method of testing various portfolio construction approaches.