2. Description of the Approach
Consider a managed portfolio p comprised of N assets, consisting of M asset classes with assets in class , such that . Let b denote a benchmark portfolio composed of Q assets comprising the same M asset classes, with assets in class , such that . Let the index pair, , identify portfolio asset j in class i, with the analogous identification for benchmark assets. Denote the daily closing price of an asset as and its corresponding log-return as . For brevity, we will suppress the time variable for most of the discussion in this section. Let denote the weight of asset in portfolio p, and denote asset weight in the benchmark. We assume all weights are non-negative; that is, all portfolios considered take long-only positions. Let and represent the total weights of the assets in class i in the portfolio and benchmark respectively. Note that for any portfolio fully invested in its component assets (which we assume is the case in this study), .
The quantities AA and SE for asset class
i are defined as follows (
Biglova and Rachev 2007):
1
where
and
denotes expected value. In (
3), the ratio
represents the fractional weight held by asset
j in class
i in portfolio
p. (That is
.) Thus
(similarly
) represents an expected log-return for asset class
i considered as a fully-invested portfolio by itself. In contrast,
represents the usual expected log-return for the entire benchmark portfolio.
2 From (
3) we have
. Similarly we have the usual expected log-return for portfolio
p,
The excess return,
, can be viewed as the value added by portfolio management. From (
1) through (
4),
where
is an “interaction” term. AA, SE and I are, respectively, the total asset allocation, total selection effect, and total interaction terms for portfolio
p. The contribution to the total value added to the excess return,
S, from asset class
i is
, while
represents the contribution to
S determined by the choice of assets within class
i. To understand these interpretations, consider first the sign of the value of
in (
1).
If , the expected return from asset class i in the benchmark is outperforming the total expected return for the benchmark. Therefore if , the weight of asset class i in portfolio p is larger than in the benchmark, capitalizing further on the better return from class i. Otherwise, if , the class i weighting in portfolio p is hurting the potential performance of that class (as determined by the benchmark).
If , the expected return from asset class i in the benchmark is under-performing the total expected return for the benchmark. Therefore if , the weight of asset class i in portfolio p is smaller than in the benchmark, further suppressing the poorer return from that class. Otherwise, if , the class i weighting in portfolio p is overweighting the poor performance of that class.
Thus, a positive sign for the value of indicates a “correct” decision in the management of portfolio p relative to the benchmark while a negative sign indicates a “poor” decision. The magnitude of quantifies how good or poor the decision is.
Similarly, as we assume
3 , a positive sign for the value of
in (2) indicates that the expected return from the choice of assets in class
i in portfolio
p is outperforming that class in the benchmark, while a negative sign indicates that the expected return from the choice of assets in class
i in portfolio
p is under-performing.
The interaction term,
, captures the part of the excess return unexplained by asset allocation and selection effect. Written as
it can be viewed as the product of the asset allocation and selection effect contributions of class
i to portfolio
p compared to the weighted excess return of class
i in the benchmark
b. Alternatively, written as
it can be interpreted as the product of the asset selection effect and the over- or under-weighted part of asset class
i. The relationship (
7) between
and
reveals a simple form for the sum of the selection effect and interaction terms,
Equation (
8) provides a way to incorporate a constraint on the sum of the selection and interaction effects for class
i; however, we will not consider such a combined constraint in this study.
Portfolio optimizations that maximize return while minimizing risk (subject to additional constraints) require specification of a proxy measure for risk. Common examples of risk proxy include: the variance of the portfolio (
Markowitz 1952); value-at-risk (VaR) (
JP Morgan 1996); expected tail loss (ETL); conditional value-at-risk (CVaR) (
Rockafellar and Uryasev 2000);
4 and mean absolute-deviation (
Konno and Yamazaki 1991). Measures, such as VaR and CVaR, that focus on tail-risk became very popular as the result of the need to understand exposure to loss under ‘extreme’ market events. (See
Gava et al. (
2021) for a recent study demonstrating that consideration of tail risk can successfully reduce sharp losses in multi-asset portfolios). However VaR has undesirable mathematical characteristics; except when the underlying random process is Gaussian, VaR is not a coherent risk measure as it lacks the properties of subadditivity and convexity (
Artzner et al. 1999). As a risk measure, CVaR is coherent (
Pflug 2000); its use as a standard has grown to the point that the Basel III regulatory framework for banks requires it. We therefore use CVaR as the risk measure for our portfolio optimizations.
The random return
Y of a portfolio is expressed as realizations,
y, of a profit(+) - loss(−) function
of the (column vectors of) asset weights
and returns
. Let
denote the probability density function determining the daily asset returns
. For any fixed value of
, the cumulative distribution of the daily portfolio return is given by
The value-at-risk is
5
where
is a prescribed tail risk probability; equivalently
is a prescribed quantile level, typically having value of
or
. Assuming
is continuous, conditional value-at-risk can be expressed as
6Rockafellar and Uryasev (
2000) show that the function
7
where
, has the following properties: (1) for fixed
,
minimizes
; (2)
; and (3)
is convex in
(and convex with respect to
if
is convex in
).
When evaluated for a portfolio consisting of a finite sample of asset returns,
…,
T with
, the discrete form of (
12) results in the following minimization problem,
The approach by Rockafellar and Uryasev proceeds by converting (
13) to a linear objective function by introducing the variable
. This conversion is particularly appropriate if all constraints are also linear, in which case the constrained minimization problem can be solved by linear programming. As we will be dealing with nonlinear constraints, we leave the objective function in the form (
13) and solve using nonlinear optimization.
We describe our approach for solving (
13) with a general constraint here, and discuss the specific constraints below. Consider optimization of (
13) under the constraint
. If, for any day
t, the feasible set to the constrained optimization is null, the constraint is removed and replaced for that day by a quadratic penalty term in (
13),
The coefficient
can be set by the user. If
k constraints need to be removed, they are replaced in (
14) by the sum
.
We consider four portfolio optimization problems, P
, based upon constrained minimization of (
13) or, in case of a null feasible set, (
14):
- P:
(a) ; and (b)
- P:
(a) ; (b) ; and (c) .
- P:
(a) ; (b) ; and (d) .
- P:
(a) ; (b) ; (c) ; and (d) .
Here
is a turnover constraint,
used as a proxy to control transaction costs. The ‘base case’ portfolio P
considers no performance attribute constraints and is therefore independent of the benchmark portfolio. Optimization problems P
through P
, successively add further performance attribute constraints to the long-only, fully invested, CVaR
-minimized base portfolio.
The constants
can be user-specified to meet particular goals. For example, the constraint
requires that, on average, the asset classes in the optimized portfolio
p equal-or-outperform those in the benchmark. A constraint
requires that the weights of the portfolio assets in class
i be adjusted to perform as well as, or better than, class
i in the benchmark. Since individual asset weights can be zero, this is equivalent to choice of assets in the class. The constraint
requires that this be true averaged over classes. As
involves the ratio
, constraints involving
terms are nonlinear. In contrast, constraints involving terms
and
are linear.
8 Our implementation is done in MatLab using the constrained, nonlinear multivariate function fmincon() and the solver sqp().
Performance of these four optimized portfolios, relative to each other, will be judged based upon cumulative price and four common risk measures. Let
,
,
,
denote daily weights obtained from one of these optimizations.
9 Recalling that
is the log-return based upon the closing price of asset
on day
t, the portfolio log-return
10 and cumulative price are
and
. The four measures used are:
maximum drawdown (MDD),
which characterizes the maximum loss incurred from peak to trough during the time period
;
Sharpe ratio (
Sharpe 1994),
where
is a risk-free rate, and
and
are the expected mean and standard deviation of the portfolio’s excess return,
;
Rachev ratio (
Rachev et al. 2008),
which represents the reward potential for positive returns compared to the risk potential for negative returns at quantile levels defined by the user. In our analysis, we set
.
The choice of these reward-to-risk ratios was influenced by the work of
Chertido and Kromer (
2013) who classified reward-to-risk measures and considered their properties relative to monotonicity, quasi-concavity, scale invariance and whether distribution-based. Their stance was that every performance measure should be at least monotonic (a measure of “more” is better than a measure of “less”) and quasi-concave (the measure prefers averages to extremes and encourages diversification of risk rather than concentration). The (most commonly used) Sharpe ratio does not guarantee monotonicity; perhaps the most critical property a risk-measure should have. The Rachev ratio, used by hedge funds which seek excessive returns and insure against big losses, does not guarantee quasi-concavity. The Sortino–Satchell ratio guarantees both.
3. Application to a Test Portfolio
To illustrate portfolio optimization under performance attribution constraints, we consider a specific portfolio comprised of stocks from the Dow Jones Industrial Average (DJIA). As a limited-information index of the performance of the U.S. stock market, the DJIA consists of the weighted stock price of 30 large, publicly-traded companies. The stock composition of the DJIA and their weights in the index, as of 1 February 2021, are presented in
Table A1 of
Appendix A. To preserve a sufficiently long trading history, our test portfolio comprises 29 of the 30 stocks from the DJIA.
11 Daily closing price data for all 29 stocks were available
12 covering the period 19 March 2008 through 1 February 2021. We grouped the stocks in our test portfolio into six classes based upon their weighted value in the DJIA. Class composition and their total weight in the DJIA are presented in
Table 1. As is apparent from Equations (
1)–(
7), results from an attribution analysis depend on the choice of benchmark. We separately consider two benchmarks. For ease of assignment to asset classes, both benchmarks comprise the same assets as the test portfolios
13 but one benchmark (EQW) is equi-weighted while the other (PW) is price-weighted. The 10-year U.S. Treasury yield curve rate was used as the risk-free rate.
Daily return data for the stocks covered 3240 trading days. Using a standard rolling-window strategy for optimization with a window size of 1008 days (four years), optimized portfolio weights were computed for an in-sample period of days. The historic return sample distribution in each window was used for the computation of CVaR. We optimized at two separate quantile levels, . Daily turnover constraints were set to one of three values: (no turnover constraint), 4% and 0.4%. For the attribution constraints in optimizations P, P, and P, we set the lower bounds and set no upper bounds (). Thus, for example, optimization P minimizes CVaR for the long-only portfolio while requiring that, on average, its asset classes outperform the benchmark.
If the constrained optimization problem resulted in a null feasible set for day t, constraints were replaced by penalty terms in the following order.
- P:
The turnover constraint was replaced by a penalty term.
- P:
The turnover constraint was replaced by a penalty term. If the feasible set was still null, the asset allocation constraint was then additionally replaced by a penalty term.
- P:
The turnover constraint was replaced by a penalty term; if necessary, the selection effect constraint was also replaced.
- P:
The order of additional conversion to penalty terms was turnover constraint, selection effect, and finally asset allocation.
If the feasible set was still null after all indicated hard constraints were converted to penalty terms for day t, the optimized weights obtained for day were used for day t.
For the optimization of CVaR
when the PW benchmark was used to determine values for AA and SE,
Table 2 summarizes the frequency of conversion of a ‘hard’ constraint to a penalty term. For example, for optimization of P
under the turnover constraint
: 45.54% of the timesteps resulted in a feasible solution to the fully constrained problem; 19.03% of the timesteps required converting the turnover constraint to a penalty term; 34.39% of the timesteps required converting both the turnover and SE constraints to penalty terms; 1.03% of the timesteps required conversion of turnover, SE and AA constraints; and there were no timesteps for which a null feasible set was never obtained under this rubric. The results when the EQW benchmark was used differ from those in
Table 2 by only a few percent.
The results in
Table 2 provide support for the TO, SE, AA order of conversion of hard constraints to penalty terms. Note that, when the turnover constraint is not imposed or is ‘relatively mild’ (i.e.,
%), a large percentage of the time the SE constraint had to be converted to a penalty term in optimizations P
and P
. However, when the turnover constraint is 0.4%, the TO constraint needed to be converted essentially every 9 of 10 days in order to produce a feasible solution, and conversion rates for the SE constraint dropped to a few percent. Note, in all optimizations, frequency of conversion of the AA constraint never exceeds 2%.
Figure 1 displays the box-whisker plot summaries of the distribution of TO, AA, and SE values observed in the resulting optimized portfolios when the EQW benchmark was used to determine values for AA and SE. For the base portfolio P
(with similar results for P
), 58% of the daily AA values were negative, while 93% of the daily SE values were negative. Thus imposition of the constraints AA
and SE
are ‘strong’ requirements. As we saw from
Table 2, the AA constraint was achieved ‘easily’; this is confirmed in
Figure 1 (plots labeled P
and P
), which show 100% of the daily AA values to be non-negative. However achieving the SE constraint requires softening of either the daily turnover constraint or the SE constraint itself. For the value
, as indicated in
Table 214 and summarized in
Figure 1, 88% of the daily TO values for P
required softening. As a result, less than
of the SE values remained negative in P
and P
. For P
, less than
of the AA values were negative, while fewer than
of the SE values were negative.
Figure 2 shows the price performance of the base portfolios, P
and P
, under change of turnover constraint. We note the relative insensitivity of price performance of P
to the changing TO constraint. The price performance is more sensitive at the
quantile level.
Figure 3 shows the cumulative price performance of each of the performance attribute constrained, CVaR
minimized portfolios under changing TO constraint level computed using both the EQW (left plots) and PW (right plots) benchmarks to determine AA and SE values. As with the base portfolio, the price performance of the AA-constrained portfolio P
is relatively insensitive to changing the TO constraint level. Much greater sensitivity is seen in the price performance of the SE-constrained portfolio P
for the EQW benchmark. In contrast, the price performance is relatively insensitive to
for P
computed with the PW benchmark. Since the SE constraint affects the sensitivity of the total excess return to the weighting of assets within each individual asset class we ascribe this sensitivity difference as due to the difference in asset weighting between the EQW and a PW benchmark portfolios. The sensitivity of P
to changing the TO constraint level reflects a “compromise” between the sensitivity of the AA-constrained P
and the SE constrained P
. For both
and
, the result is that the price performance of the doubly constrained portfolio improves under tighter daily TO constraint.
Figure 4 summarizes the cumulative price performance of the CVaR
minimized portfolios under the TO
constraint level for both the EQW and PW benchmarks. For both benchmark valuations, we see that the AA-constrained portfolio, P
, produces only slight price performance improvement compared to the base portfolio. For both benchmarks, the AA and SE constrained portfolio, P
, produces strong improved price performance compared to the base portfolio. For the reason attributed above, the price performance improvement for the SE-constrained portfolio, P
, is different between the EQW and PW portfolios.
Figure 5 summarizes total time period risk measures for the
portfolios constrained by a daily turnover of 0.4%. The reported maximum drawdowns all reflect behavior related to the onset of the Covid-19 pandemic. Our optimized portfolios all outperform the benchmarks in MDD, but only the P
portfolio using PW benchmark valuations for SE outperforms the base portfolio P
. Surprisingly, the benchmarks have better Sharpe ratios than any of the optimized portfolios. However, all of the performance attribute constrained portfolios outperform the base portfolio. For the EQW benchmark, all optimized portfolios have better Sortino–Satchell ratios than the benchmarks. For both EQW and PW, the SE-constrained portfolios also outperform those of the base portfolio. All optimized portfolios equal or are better than the Rachev ratios of the benchmarks. The SE-constrained portfolio using PW benchmark valuation also outperforms that of the base portfolio.