1. Introduction
Interest rates are fundamental to the economy. Understanding their dynamics is essential in situations where either discounting of cash flows is performed or the interest rate influences the variables of interest (investing in bonds, interest rate derivatives, etc.). Therefore, their mathematical modeling is necessary as a part of the modeling of the original problems. Also, since interest rates are an important indicator of the state of the economy, information about their behavior and tendencies is of interest by itself. We refer the reader to books [
1,
2] for a complex coverage of interest rate modeling.
Short rate models of interest rates are a class of models in which the instantaneous interest rate (called short rate) is modeled by a stochastic differential equation. A concept of shadow rate dates back to Black [
3], who argued that the nominal interest rate cannot become negative, but its zero values can be observed for a certain time interval. This can be explained by the existence of a so-called shadow rate. In the case it is positive, the short rate coincides with it. However, if the shadow rate becomes negative, the short rate will be equal to zero.
At the time when negative interest rates were considered to be unrealistic, it was this zero lower bound that characterized shadow rate models. This property can be achieved in different ways. Usually, a particular choice of the process in Black’s framework is given. Often, it is an affine combination of Gaussian factors; see [
4,
5,
6,
7]. There are also other possible approaches: the sum of squares of the factors and the sum of exponentials were considered in [
8]. Paper [
9] studies a shadow rate version of the model from [
10], which is based on the Nelson–Siegel model [
11]. A nonzero lower bound can be found in some of the papers, either as a note about a possible generalization or as a particular question of interest; see [
8,
9]. When interest rates in some countries fell to negative values, a modification of Black’s model became necessary. This motivated different ways of incorporating this feature into modeling. Dividing the time period into subperiods with different lower bounds was considered in [
12], and several ways of estimating the lower bound from the data were implemented in [
13]. The stochastic character of the shadow rate was proposed in [
14,
15], while Ref. [
15] also assumed dependence of the shadow rate on an exogenous variable.
Since the factors are not observable, a typical approach to statistical analysis of interest rate time series is the Kalman filter and the extended Kalman filter. Pricing bonds, which are used to evaluate interest rates with different maturities, are complicated, and several different methods can be found in the literature. They include a numerical solution of the partial differential equation in [
8], an approximation based on options in [
6,
9], the computation of forward rates in [
15], and eigenfunction expansion in [
16].
Because of the complicated calculations in shadow rate models, the simple Vasicek model [
17] with a Gaussian short-term rate and an explicit formula for the bond prices is often employed to allow negative interest rates when the interest rate model is not the main question but one of the building blocks of an analysis. Such complex problems include a pension planning analysis [
18,
19], a portfolio optimization [
20], and a stochastic optimization arising from a reinsurance–investment problem [
21]. Therefore, there is undoubtedly a demand for uncomplicated interest rate models that still capture important features of the observed interest rates.
Ueno in [
14] considered a rather complicated model and subsequently its simplified versions. One of the models for the short rate
and the one that we study in this paper was given by the Vasicek-type (cf. [
17] for the original short rate model) shadow rate
and the transformation rule
for
. We show that this particular model has the potential to serve as a computationally tractable model that, at the same time, allows different dynamics for positive and negative interest rates.
The paper is organized as follows. In
Section 2, we describe the model under consideration. Subsequently, in
Section 3, we derive the probability density of the short rate in this model, and in
Section 4, we derive the conditional expected value of the short rate and note some of its properties. Before the calibration of the model, we introduce the data in
Section 5. Firstly, we use the density to construct the likelihood function in
Section 6, and we obtain the maximum likelihood estimates for the market data of short-term interest rates. Afterward, in
Section 7, we use the method of lines to numerically compute the solution of the partial differential equation for the bond prices. Using this computation, we estimate the market price of risk of the model by fitting the long-term interest rates. A discussion of the results and further considerations are given in
Section 8.
7. Pricing Bonds and Estimating the Market Price of Risk
A discount bond is a security that pays its holder a unit amount of money at a specified time
T, called the maturity of the bond. If the short rate
is defined via a stochastic process
x governed by a stochastic differential equation
then, after the specification of the so-called market price of risk
, the price of a discount bond
is a solution of a parabolic partial differential equation
where
and the range of
x is given by the range of values attainable by the stochastic process (
7). The terminal condition is given by
for all
x. Let us note that this approach to pricing bonds (see [
25], [Chapter 7.2.1]) is based on the construction of a portfolio of bonds, for which its stochastic part vanishes. As a consequence, the partial differential Equation (7) emerges as a condition satisfied by the bond price. Another approach leading to Equation (7) is based on the so-called risk-neutral probability measure
, which is used to express the bond price as the expected value
. The Feynman–Kac formula then enables us to obtain the partial differential equation satisfied by the expected value and, therefore, the bond price. The change in the measure and the specification of market prices of risk are closely related; we refer the reader to [
25] for more details regarding pricing bonds in interest rate models.
Let T be a fixed maturity. The interest rate at time when the stochastic factor equals x is connected with the price of a discount bond through the relation . This allows us to compute the bond prices by solving the partial differential equation, transform them into interest rates, and compare them with market data.
We consider Ueno’s shadow rate model for two choices of the market price of risk:
Constant market price of risk ;
Continuous piecewise linear market price of risk, taking for a negative shadow rate and for a shadow rate greater than 0.01 (i.e., one percent).
We note that these boundary values, when the market price of risk changes from its constant value, correspond to the same values of the observed short rate since they are both nonnegative. However, the values of zero and one percent were chosen only for illustration purposes and can also be taken as parameters that are being estimated.
We use the methodology from [
8]: we consider the boundary conditions given by vanishing second derivative with respect to
x at
and the method of lines for solving the partial differential equation for bond prices. In order to make the paper self-contained, we review the method of lines according to [
8] while performing necessary modifications in order to apply it to our model. The discretization is achieved in space variable
x, and the solution is computed on grid points
, uniformly distributed over the interval
with uniform step
. The time
t is transformed into the time remaining to maturity
, where
T is the maturity of the bond. Then, after denoting the vector of values at the grid points by
, the discretization that we use takes the form of the system of the ordinary differential equations
for the interior points. Taking the boundary conditions into account, the system can be written in the matrix form as
where
is a
J-dimensional vector with all elements equal to 1,
is computed by differentiating the components of the vector
u, and
with
The solution can be written in terms of a matrix exponential as
To evaluate the matrix exponential, we use Padé’s computation, as implemented in the expm package [
26] for R. We use the same number of discretization points (i.e., 200) as in [
8], but we use a smaller interval for
x, namely,
instead of
. We test the accuracy on the Vasicek model, which is a special case for
and which has an explicit solution for the bond prices; see [
17]. We consider the parameter values estimated for the Vasicek model by the maximum likelihood method for EA data, which are equal to
,
, and
. The market price of risk is taken to be zero. Since we are going to fit the 10-year interest rates, we consider the absolute error of the 10-year interest rates when using the numerical method for bond prices, as described above, and computing the corresponding interest rates.
Table 2 provides the statistics about the error in the following cases:
Test 1. Interest rates computed for all the discretization points.
Test 2. Interest rates computed for the discretization points withing the range of EA short rate.
Test 3. Using linear interpolation of the computed interest rates for the vector of x values with the step 10−6 (the precision used to quote the EA data), with minimum and maximum values equal to those of the EA short rate.
As we can see in
Table 2, the numerical results are already more precise than the precision of the data. Furthermore, in the paper [
8], the author refers to a good precision of this method when applied to Black’s model with the short rate given by
, where
follows a process of the Vasicek type.
We compare the calibration results for the proposed shadow rate model and the classical Vasicek model (i.e., using to obtain maximum likelihood estimates of , , and ) with its usual choice of a constant market price of risk.
For every dataset, we define the objective function equal to the sum of squares of differences between observed long rates and the long rates implied by the model for a particular choice of market price of risk. The latter are computed by linear interpolation of the interest rates obtained by the numerical method described above, while the remaining parameters of the model (i.e., , , , and k) are taken from the maximum likelihood estimation from the previous section. In the case of a constant market price of risk, the resulting optimization problem is one-dimensional. If the market price of risk is switching its value in the way specified above, the optimization is two-dimensional, and as an initial approximation, we use the point corresponding to the optimal constant market price of risk. We use the L-BFGS-B method to solve the resulting optimization problems. For comparison, we compute the optimal constant market price of risk in the Vasicek model, in which case the objective function is quadratic.
When interpreting the quality of fit, we need to keep in mind the difference between moving from the Vasicek model to a shadow rate model with a constant market price of risk and moving from a constant market price of risk to switching in the shadow rate model. While the shadow rate model is a generalization of the Vasicek model when both have constant market prices of risk (the Vasicek model corresponds to ), it is no longer the case at the moment of fitting the long-term rates in our approach. At this moment, the parameters from the maximum likelihood step are fixed, and therefore, it is not necessarily true that the shadow rate model cannot perform worse than the Vasicek model. On the other hand, when specifying the market price of risk in the shadow rate model, the fixed parameters are the same. Therefore, since considering the same market prices of risk in the switching case reduces the model to a constant market price of risk, the switching model cannot lead to a worse fit.
The optimal values of the objective function are presented in
Table 3 and
Figure 7; the estimated market values of risk are given in
Table 4. The quality of fit in each case is shown in
Figure 8, where the observed data of the long rates are compared with the fitted values. We can observe different behaviors: In the case of Denmark and Sweden, the Vasicek model and the shadow rate model with a constant market price of risk provide a very similar fit. However, allowing the market price of risk to switch its value lowers the objective function (and, therefore, makes the fit better) more pronouncedly in the case of Denmark. In the case of the Euro Area, the improvement is nontrivial both when changing from the Vasicek model to the shadow rate model with a constant market price of risk and when moving from a constant to a switching market price of risk in the shadow rate model.
8. Conclusions
In this paper, we studied one of the shadow rate models considered in the literature and provided a methodology for estimating its parameters. Firstly, the parameters from the time evolution of the short rate were estimated by means of the maximum likelihood method. The form of the model made it possible to reduce the maximization of the likelihood function to a one-dimensional optimization problem. In the second step, the market price of risk was estimated. We considered a constant market price or risk and a special form of switching market price of risk; for comparison, we also assumed the classical Vasicek model.
The first major implication of our study is a clear rejection of the Vasicek model in favor of the proposed shadow rate model based on the short rate evolution in all of the datasets. This approach uses only one new parameter, and the shadow rate can be reconstructed after its specification, which simplifies the use of the model. Therefore, it can be seen as a suitable alternative when modeling interest rates is only a part of a more complicated setting. In such a case, the interest rate model has to be kept sufficiently simple so that the whole analysis is trackable.
The second important result is the importance of a suitable specification of the market price of risk. In two out of three cases, the shadow rate model with a constant market price of risk was able to produce only practically the same fit of the long-term rates as the Vasicek model. Also, in two out of three datasets, a nonconstant market price of risk in the shadow rate model decreased the objective function by factors of 2.9 and 3.3, respectively, although the assumed form of the market price of risk was rather restricted by fixing some of its parameters and estimating only two constants. We consider this to be a very promising result, and we see a potential to improve the fit of the model by a proper choice of the market price of risk. This can be accomplished by introducing new parameters (in a generalization of our example, we may also be estimating the values for which the market price of risk switches its values) or by considering different classes of functions for its specification. Depending on the number of parameters and the behavior of the objective function, more consideration may be necessary regarding the choice of the algorithm used for its optimization. Furthermore, the optimization of the likelihood of the short rates and fitting the long rates do not need to be performed sequentially. While it simplifies the optimization problems, an equally interesting path of possible future research lies in a bicriteria optimization, i.e., looking for Pareto optimal parameters with respect to these two criteria. Furthermore, it is possible to look for a suitable market price of risk using the techniques for solving inverse problems instead of using a parametric form for its specification.
We estimated the model using data from 2011 to 2020, a period marked by negative short-term rates in its later years. Our results, therefore, offer insights into how interest rate behavior changes in such an environment. It can be, for example, noticed that the estimated equilibrium level of the shadow rate is negative across all cases. Obviously, this prevents the model from predicting subsequent growth of the interest rates. However, with sufficient amounts of “post-negative period” data, the model can be reestimated. In principle, a model with a positive equilibrium short rate can accommodate temporary negative rates before their reversion to the equilibrium and return to the positive region. During this period, the model would constrain the extent of negative rates. A practical evaluation of this concept can provide valuable new insights.
Finally, we note that the idea of a modified behavior when the short-term rate is negative can also be applied to any other model with negative values, which would take the role of the Vasicek process. It may be suitably modified also for the models with exogenous variables and model their different effects at the times with positive and negative short-term rates.