Next Article in Journal
Optimal Propagating Fronts Using Hamilton-Jacobi Equations
Next Article in Special Issue
Preference Based Portfolio for Private Investors: Discrete Choice Analysis Approach
Previous Article in Journal
Sine-Cosine Algorithm to Enhance Simulated Annealing for Unrelated Parallel Machine Scheduling with Setup Times
Previous Article in Special Issue
Predicting Contagion from the US Financial Crisis to International Stock Markets Using Dynamic Copula with Google Trends

Article

# A Nonparametric Approach to Bond Portfolio Immunization

National Research University Higher School of Economics, 101000 Moscow, Russia
Mathematics 2019, 7(11), 1121; https://doi.org/10.3390/math7111121
Received: 16 October 2019 / Revised: 8 November 2019 / Accepted: 14 November 2019 / Published: 16 November 2019

## Abstract

We consider the problem of short term immunization of a bond-like obligation with respect to changes in interest rates using a portfolio of bonds. In the case that the zero-coupon yield curve belongs to a fixed low-dimensional manifold, the problem is widely known as parametric immunization. Parametric immunization seeks to make the sensitivities of the hedged portfolio price with respect to all model parameters equal to zero. However, within a popular approach of nonparametric (smoothing spline) term structure estimation, parametric hedging is not applicable right away. We present a nonparametric approach to hedging a bond-like obligation allowing for a general form of the term structure estimator with possible smoothing. We show that our approach yields the standard duration based immunization in the limit when the amount of smoothing goes to infinity. We also recover the industry best practice approach of hedging based on key rate durations as another particular case. The hedging portfolio is straightforward to calculate using only basic linear algebra operations.

## 1. Introduction

Hedging a bond-like obligation with a bond portfolio is a classical topic in finance, first presented in [1]. We only give several references here to sketch the research context. Fisher and Weil [2] introduced the duration based immunization against parallel shifts in the term structure of interest rates. Cooper [3] proposed to use a parametric function to estimate the term structure and considered partial derivatives with respect to function parameters; however, his choice of parametrization was not particularly helpful. Other authors [4,5,6] used the Nelson–Siegel parametric model [7] to estimate the term structure and considered partial derivatives with respect to Nelson–Siegel parameters. Almeida and Lund [8] considered a richer parametric model for the term structure. Barber and Copper [9] estimated the dominant term structure change patterns via principal component analysis for the purpose of bond portfolio immunization.
Another branch of research, which is non-parametric by nature, started with Fong and Vasicek [10] and was continued by Nawalkha et al. [11,12]. They considered a Taylor approximation to the change in the portfolio price in terms of the interest rate term structure changes.
Research in this area has been concentrated on testing and comparing various methods in various settings since then.
In contrast to developed markets where it is customary to estimate interest rate term structure from derivative instruments such as interest rate swaps (IRS) or overnight indexed swaps (OIS), in many developing markets, these instruments are unavailable or relatively illiquid. This leaves government bonds as the only viable source of interest rate term structure information. Furthermore, even in developed markets, bonds can have much longer maturities than swaps. Finally, when the obligation is itself bond-like, it makes sense to price it using the bond-implied term structure to avoid the necessity to estimate the basis between swap and bond zero-coupon rates.
This makes the term structure endogenous and the problem of hedging an obligation with bonds different from what was studied in the literature.
To overcome this problem, we adopt a different approach. We assume that the interest rate term structure for pricing the portfolio is estimated from prices of a set of benchmark bonds via a nonparametric term structure estimation method with smoothing, i.e., by solving some variational problem. We derive the necessary hedging equations from this variational problem formulation.
To gain some intuition about the result, we also investigate some particular cases, in which we recover existing hedging approaches like the conventional duration based and convexity based hedging and the key rate duration approach introduced by Ho [13].
The remainder of the paper is structured as follows. We present the problem and its mathematical formulation in Section 2. Then, we provide the solution in Section 3. Section 4 shows that several conventional immunization methods (duration based, convexity based, and key rate durations) can be considered special cases of our general approach. Section 5 concludes.

## 2. The Hedging Problem

We consider the following hedging problem. An obligation to be hedged is described by its cash flow amounts row vector $F 0 ∈ R 1 × N$ corresponding to the cash flow moments vector $t ∈ R N$, i.e., the cash flow at time $t i$ is $[ F 0 ] i$. The obligation is supposed to be regularly repriced by marking to model, by using a pricing operator $P V ( F 0 ; r )$, where $r ∈ R N$ is the estimated market term structure of interest rates to be defined later, i.e., a vector of market zero-coupon rates for the corresponding terms to maturity t. We consider a relatively short term hedge, until the next repricing. For simplicity, ignore the time related features of investing in a bond portfolio such as changes in durations, receiving cash coupon payments and bond redemptions, etc. These only matter for a long term hedge.
The pricing operator is assumed to be linear in $F 0$ to allow hedging. This assumption is shown to be not restrictive even for risky coupon bearing bonds [14]. Thus, the pricing operator can be represented as:
$P V ( F 0 ; r ) = F 0 · d ( r ) ,$
where $d ( r ) ∈ R N$ is the vector of discount factors for the corresponding terms to maturity t. We assume that $[ d ( r ) ] i$ is a function of $r i$ and $t i$ only; however, the exact formula depends on the interest rate compounding convention chosen. In a more general framework, the discount factor could also be a function of credit quality, liquidity, eligibility for repo operations with the centralized counterparty, etc. However, all these parameters can be assumed constant in the short run and thus not sources of additional risks. For a long term hedge, they could be important.
Example 1.
Assuming continuously compounded interest rates, we get $[ d ( r ) ] i = e − r i t i .$
As for the term structure r, we assume that it is estimated from a set of K benchmark risk-free (e.g., government) bonds with the cash flow matrix $F ∈ R K × N$, where $F k , i$ is the cash flow for bond k at term $t i$. Note that the vector of payment terms t can be considered common for the obligation and all the benchmark bonds. We can always introduce zero cash flows where necessary without any loss of generality.
The estimation procedure is assumed to be nonparametric, fitting the benchmark bond model prices $P V ( F ; r ) ∈ R K$ to their observed market prices $P ∈ R K$ while requiring some sort of smoothness as follows:
$P V ( F ; r ( · ) ) − P 2 + α · ∥ E [ r ] ∥ H ′ 2 → min r ( · ) ∈ H ,$
where $r ( · )$ is a function to be estimated belonging to some functional space H, e.g., Sobolev–Hilbert space $H 1 [ 0 , T ]$, E is a linear functional from H to some $H ′$, and $α$ is the smoothing (regularization) parameter. The most popular nonparametric methods of estimating the term structure of interest rates assume problems of this form [15,16,17,18].
Example 2.
Two common choices for E are the differentiation operator D (with H being $H 1 [ 0 , T ]$) and $D 2$ (with $H = H 2 [ 0 , T ]$); however, more elaborate choices might be appropriate in developing markets, as discussed in [19]. It is well known that D yields a piecewise linear term structure, while $D 2$ yields a smoothing cubic spline as solutions of (2).
A hedging portfolio is chosen from among the same set of benchmark bonds; let $w ∈ R K$ be the corresponding investment amounts. To simplify the exposition, we assume no trading restrictions.
The hedging problem can be formulated as follows. Given $F 0 , P , F$, find w so that the sensitivities of the hedged obligation price to price changes should be zero, which corresponds to minimizing the risk associated with the zero-coupon yield curve changes:
$∂ P V ( − F 0 + w T F ; r ) ∂ P = 0 .$
Here and in what follows, we use the notation $A T$ for the transpose of A, $A +$ for the Moore–Penrose inverse of A, $ker A$ for the null space of A, $im A$ for the range of A, $V ⊥$ for the orthogonal complement of the subspace V, and $P V A$ for the orthogonal projection of A onto the subspace V.

## 3. Results

We now present the main result regarding the solution of the hedging problem (3). It gives the explicit expression for the hedging coefficients w.
Theorem 1.
The hedging coefficients w can be found via weighted least squares regression by regressing the vector of the obligation point sensitivities $B 0 = ∂ P V ( F 0 ; r ) ∂ r T$ onto the corresponding point sensitivities of the benchmark bonds $B = ∂ P V ( F ; r ) ∂ r T$:
$B 0 = B w + ε$
with the error covariance matrix of ε equal to $Ω = B B T + A + α J T J$, where:
$A = ∑ i = 1 K P V ( F i ; r ) − P i ∂ 2 P V ( F i ; r ) ∂ r 2 , J T J = 1 2 ∂ 2 G ( r ) ∂ r 2 , G ( r ) = min f ∈ H | f ( t i ) = r i ∥ E [ f ] ∥ H ′ 2 .$
Alternatively, the hedging coefficients w solve $M T B w = M T B 0 ,$ where columns of the matrix $M = Ω − 1 B$ are the elementary term structure perturbations against which we hedge.
Note that the covariance matrix $Ω$ is not singular if the solution of (2) is unique; this will be proven later. We cannot guarantee that this is indeed the case as it depends on the choice of the regularization functional $E [ f ]$; but since the overall idea of regularization includes guaranteeing the uniqueness of the solution, this can be safely assumed. We also assume that F has full rank, which means that benchmark bond payment schedules are linearly independent.
Example 3.
For $E = D$, it is always not singular because $ker D$ only consists of constant functions $r ( t ) = r$, and $∥ B T r ∥ 2 = ∥ F d ′ ( r ) r ∥ 2 ≠ 0$ for constant $r > 0$, where $d ′ ( r ) = ∂ d ( r ) ∂ r$ is a diagonal matrix. For the continuously compounded interest rates, $d ′ ( r )$ is given by $[ d ′ ( r ) ] i , i = − t i e − r i t i$.
Proof of Theorem 1.
Note that even though the problem (2) is infinite-dimensional, the first term only depends on the finite-dimensional projection of $r ( · )$, namely on the values $r ( t i )$. Therefore, we can consider the following hierarchical optimization problem:
$min r ∈ R N min f ( · ) ∈ H | f ( t i ) = r i P V ( F ; r ) − P 2 + α · E [ f ] ,$
which can be written in a more compact way as:
$P V ( F ; r ) − P 2 + α · G ( r ) → min r ∈ R N ,$
where r is a vector of zero-coupon rates at corresponding terms t, and $G ( r ) = min f ∈ H | f ( t i ) = r i ∥ E [ f ] ∥ H ′ 2 .$ □
Example 4.
When $E [ f ] = D f = f ′$, the optimal f is actually a piecewise linear function connecting the known values $r i$ at points $t i$; therefore, $G ( r ) = ∑ i = 1 N ( r i − r i − 1 ) 2 t i − t i − 1$. Similar reasoning applies for higher derivatives with $G ( r )$ also being a quadratic form in r.
A first order condition for an extremum in (7) is:
$2 ( P V ( F ; r ) − P ) T ∂ P V ( F ; r ) ∂ r + α ∂ G ( r ) ∂ r = 0 .$
Now, differentiate this with respect to P:
$2 ∂ P V ( F ; r ) ∂ r T ∂ P V ( F ; r ) ∂ r ∂ r ∂ P − I + 2 ∑ i = 1 K P V ( F i ; r ) − P i ∂ 2 P V ( F i ; r ) ∂ r 2 ∂ r ∂ P + α ∂ 2 G ( r ) ∂ r 2 ∂ r ∂ P = 0 ,$
$2 B B T ∂ r ∂ P − I + 2 A + 2 α J T J ∂ r ∂ P = 0 ,$
where:
$B T = ∂ P V ( F ; r ) ∂ r = F · d ′ ( r ) , A = ∑ i = 1 K P V ( F i ; r ) − P i ∂ 2 P V ( F i ; r ) ∂ r 2 , J T J = 1 2 ∂ 2 G ( r ) ∂ r 2 .$
Therefore,
$∂ r ∂ P = B B T + A + α J T J − 1 B = Ω − 1 B$
for $Ω = B B T + A + α J T J$.
Let us prove that $Ω$ is not singular, i.e., $ker J ∩ ker ( B B T + A ) = { 0 }$. Assume that this is not the case and there is a nonzero $f ∈ ker J ∩ ker ( B B T + A )$. Note that $f ∈ ker J$ would mean that the regularization term $G ( r )$ is invariant to the perturbation by f: $G ( r + f ) = G ( r )$ for any r. Furthermore, $f ∈ ker ( B B T + A )$ would mean that $f i = 0$ for every i for which there is nonzero cash flow for at least one benchmark bond (this easily follows from the structure of A), which in turn implies that $P V ( F ; r + f ) = P V ( F ; r )$. Thus, adding f to the optimal solution $r *$ of (7) would not change the functional as $P V ( F ; r * + f ) = P V ( F ; r * )$ and $G ( r * + f ) = G ( r * )$. Therefore, the existence of such f immediately implies the non-uniqueness of the solution of (7) or equivalently of the solution of (2), which we assumed to be unique.
In what follows, we denote the matrix $Ω − 1 B$ as M or $M α$ if the dependence on $α$ is important. Using the linearity of the pricing operator, we can rewrite (3) as:
$∂ P V ( F 0 ; r ) ∂ P = w T ∂ P V ( F ; r ) ∂ P = w T B T ∂ r ∂ P ;$
or in terms of M,
$M T B 0 = M T B w$
In what follows, we assume that F has full row rank, i.e., there are no bonds with linearly dependent payoff schedules. This is always the case in practice. Then, $∂ P V ( F ; r ) ∂ P$ is invertible because $Ω$ has full rank, $B = d ′ ( r ) F T$, where $d ′ ( r )$ is diagonal and not singular regardless of the chosen discounting convention.
Therefore, we can solve for w:
$w = ∂ P V ( F ; r ) ∂ P − T ∂ P V ( F 0 ; r ) ∂ P T = B T Ω − 1 B − 1 B T Ω − 1 B 0 ,$
where $B 0 = ∂ P V ( F 0 ; r ) ∂ r T$.
The expression (15) is exactly the weighted least squares estimator for the linear regression:
$B 0 = B w + ε$
with the error covariance matrix of $ε$ given by $Ω$ [20].
Note that the elements of $B 0$ and B can be thought of as point sensitivities of the corresponding bond prices with respect to the spot rate $r i$ for the respective term to maturity $t i$:
$[ B 0 ] i = ∂ P V ( F 0 ; r ) ∂ r i .$
Thus, the hedging coefficients can be found by regressing the vector of the obligation point sensitivities $B 0$ onto the corresponding point sensitivities of the benchmark bonds B. The error covariance matrix depends on both the set of benchmark bonds and the smoothing functional.
As we show in Section 4, this result can be seen as a generalization of duration based, convexity based, and key rate duration hedging.

## 4. Special Cases

To develop some intuition about the solution, we now consider two special cases, as the amount of smoothing $α$ goes to zero and to infinity. We show that in these special cases, we can recover the conventional duration based and convexity based immunization approaches, as well as key rate duration hedging.

#### 4.1. Infinite Smoothing: $α → ∞$

First, let us show that with a suitable choice of regularization functional $E [ f ]$, we can recover conventional duration based and convexity based approaches.
Theorem 2.
Let $w α$ be the vector of hedging coefficients for the smoothing parameter α. Then:
$lim α → ∞ w α = w ∞ ,$
where $w ∞$ satisfies:
$ε ∞ = B 0 − B w ∞ ∈ ker J .$
Example 5.
If $E [ f ] = D f$ is the first derivative operator, then the hedging coefficients $w ∞$ provide a duration based hedge assuming continuous compounding, i.e.,
$D u r 0 = D u r · w ∞ ,$
where:
$D u r 0 = ∑ i = 1 N ∂ P V ( F 0 ; r ) ∂ r i r = r ∞ = P V ( F 0 ; r ∞ ) ∑ i = 1 N t i · [ F 0 ] i e − t i r ∞ P V ( F 0 ; r ∞ )$
is the dollar Fisher–Weil duration of the obligation and $D u r$ is the row vector of the corresponding benchmark bond dollar Fisher–Weil durations.
Example 6.
If $E [ f ] = D 2 f$ is the second derivative operator, then the hedging coefficients $w ∞$ provide a hedge based on duration and convexity assuming continuous compounding, i.e.,
$D u r 0 = D u r · w ∞ , C 0 = C · w ∞ ,$
$C 0 = ∑ i = 1 N ∂ 2 P V ( F 0 ; r ) ∂ r i 2 r = r ∞ = P V ( F 0 ; r ∞ ) ∑ i = 1 N t i 2 · [ F 0 ] i e − t i r ∞ P V ( F 0 ; r ∞ )$
is the dollar convexity of the obligation and C is the row vector of the corresponding bond dollar convexities.
Proof.
It is known [21] that for any matrices M and L of suitable dimensions,
$lim α → ∞ M T M + α L T L + M = [ M P ker L ] + ,$
where $P ker L = I − L + L$ is the orthogonal projector onto $ker L$. However, in the expression for $Ω$ (12), we have an additional convexity term A inside the brackets, which is symmetric, but not nonnegative definite; thus, a separate proof is in order.
Factor $B B T + A$ onto $ker J ⊗ im J T$:
$B B T + A = P ker J ( B B T + A ) + P im J T ( B B T + A ) = P ker J ( B B T + A ) + J T X .$
Now, let $Ω = V T + U T$, where $V = ( B B T + A ) P ker J$, $U = ( α J T + X T ) J$ and $U V T = 0$. Therefore, by Theorem 2 of [22],
$Ω − 1 = ( U + V ) − T = U + + ( I − U + V ) [ C + + ( I − C + C ) K V T U + T U + ( I − V C + ) ] T ,$
where $C = ( I − U U + ) V$ and $K = [ I + ( I − C + C ) V T U + T U + ( I − C + C ) ] − 1$. Note that $lim α → ∞ U + = 0$ and $lim α → ∞ U + V = 0$, both because J has full column rank and thus $ker U = ker J$. Therefore, $lim α → ∞ K = I$ and:
$M = lim α → ∞ Ω − 1 B = C + T B = [ P ker U V ] + T = [ P ker J ( B B T + A ) P ker J ] + B .$
Since $ker J ∪ ker ( B B T + A ) = { 0 }$, $C + T ker J$ has full rank as a mapping $ker J → ker J$ and is zero otherwise. Therefore, $ker M T = ker C + T = ( ker J ) ⊥$.
Finally, (15) can be written as $M α T ( B w α − B 0 ) = 0$, where $M α = ( B B T + A + α J T J ) − 1 B$. For $α → ∞$, we have:
$M T ( B w − B 0 ) = M T ε = 0 .$
To observe the particular cases, note that for $E [ f ]$, $ker J = { x | x = k · 1 }$, where 1 is a vector of ones and $k ∈ R$. Therefore, the condition $ε ∈ ( ker J ) ⊥$ reduces to $1 T ε = 0$.
In terms of our regression, this means that:
$1 T B 0 = 1 T B w .$
Now,
$1 T B 0 = ∑ i = 1 N ∂ P V ( F 0 ; r ) ∂ r i = ∂ P V ( F 0 ; r + ξ · 1 ) ∂ ξ = D 0 ,$
exactly equal to the dollar Fisher–Weil duration of the obligation, while by the same logic, $1 T B$ is the row vector of dollar Fisher–Weil durations of the benchmark bonds. The condition (29) thus requires us to equate the dollar Fisher–Weil durations of the hedge and the obligation.
For $E [ f ] = D 2$, we similarly have $ker J = { x | x = a · t + b · 1 }$. Thus, in addition to (30), we have:
$t T B 0 = ∑ i = 1 N t i ∂ P V ( F 0 ; r ) ∂ r i = − ∑ i = 1 N t i 2 e − r · t i [ F 0 ] i = − 1 2 ∑ i = 1 N ∂ 2 P V ( F 0 ; r + ξ · 1 ) ∂ ξ 2 = − 1 2 C 0 ,$
where $C 0$ is the dollar Fisher–Weil convexity of the obligation, while by the same logic, $t T B$ is the row vector of dollar Fisher–Weil convexities of the benchmark bonds times $− 1 2$. The condition (29) thus requires us to equate both dollar Fisher–Weil durations and convexities of the hedge and the obligation. □

#### 4.2. Virtually No Smoothing: $α → 0$

Now, we establish the limit of the hedging portfolio composition as $α → 0$, which corresponds to estimating the term structure with non-smoothing splines.
Proposition 1.
As $α → 0$, $α − 1 A → A ′$, where $A ′$ is a constant diagonal matrix.
Proof.
From (8) and from the fact that B has full column rank, we have $P V ( F ; r ) − P = − α 2 B + ∂ G ( r ) ∂ r T$. First note that it follows from (2) that $r → r 0$ as $α → 0$, where:
$r 0 = arg min r : P V ( F ; r ) = P 0 G ( r ) .$
Then:
$α − 1 A = ∑ i = 1 K α − 1 P V ( F i ; r ) − P i ∂ 2 P V ( F i ; r ) ∂ r 2 → A 0 = − ∑ i = 1 K B + J T J r 0 i ∂ 2 P V ( F i ; r ) ∂ r 2 r = r 0 ,$
because the convergence can easily be shown to be uniform with respect to the vector r.  □
Thus, $A ∼ α$ as $α → 0$.
Theorem 3.
As $α → 0$, the optimal hedging coefficients $w α → w 0$, where:
$w 0 = ( M 0 B ) − 1 M 0 B 0 , M 0 = I − [ P ( im B ) T A 0 P ( im B ) ⊥ ] + P ( im B ) ⊥ A 0 B + T .$
Proof.
The following could be perceived as overly complicated. If we could assert that $α − 1 A + J T J$ is not only symmetric, but also non-negative definite (and therefore could be represented as $L T L$), a much simpler proof would follow right away along the lines of Section 4.3.
To deal with the general case, we apply the same matrix inversion approach to $Ω − 1$, but with:
$U = B B T + α ( α − 1 A + J T J ) P im B , V = α ( α − 1 A + J T J ) P ( im B ) ⊥ .$
Note that as $α → 0$, $V → 0$, $U → B B T$, and $U + → B +$ since U does not change its rank in the limit.
$Ω − 1 = ( U + V ) − 1 = U + + ( I − U + V ) [ C + + ( I − C + C ) K V T U + T U + ( I − V C + ) ] ,$
where $C = ( I − U U + ) V$ and $K = [ I + ( I − C + C ) V T U + T U + ( I − C + C ) ] − 1$. One can easily see that $K → I$, and thus, $( I − C + C ) K V T U + T U + ( I − V C + ) → 0$. We now demonstrate that the limit of $C + B$ exists.
$C + B = α − 1 [ P ( im U ) ⊥ ( α − 1 A + J T J ) P ( im B ) ⊥ ] + B = α − 1 [ P ( im U ) ⊥ ( α − 1 A + J T J ) P ( im B ) ⊥ ] + P ( im U ) ⊥ B .$
Let us calculate $P ( im U ) ⊥ B$ as $α → 0$. First of all, note that since B has full rank,
$im U = im ( B B T + α ( α − 1 A + J T J ) P im B ) = = im ( ( I + α A ˜ ( B B T ) ) + ( B B T ) ) = im ( ( I + α A ˜ ( B B T ) + ) B ) = im U ˜ ,$
where $A ˜ = ( α − 1 A + J T J )$.
$P ( im U ) ⊥ = P ( im U ˜ ) ⊥ = I − U ˜ ( U ˜ T U ˜ ) − 1 U ˜ T = I − ( I + α A ˜ ( B B T ) + ) B ( U ˜ T U ˜ ) − 1 B T ( I + α ( B B T ) + A ˜ ) ,$
where:
$( U ˜ T U ˜ ) − 1 = B T ( I + α ( B B T ) + A ˜ ) ( I + α A ˜ ( B B T ) + ) B − 1 = B T B + α [ B T ( B B T ) + A ˜ B + B T A ˜ ( B B T ) + B ] + o ( α ) − 1 = I − α ( B T B ) − 1 [ B T ( B B T ) + A ˜ B + B T A ˜ ( B B T ) + B ] ( B T B ) − 1 + o ( α ) = ( B T B ) − 1 − α ( B T B ) − 1 [ B T ( B B T ) + A ˜ B + B T A ˜ ( B B T ) + B ] ( B T B ) − 1 + o ( α ) .$
Now, plug this into $P ( im U ) ⊥$, and using $B ( B T B ) − 1 B T = P im B$:
$P ( im U ) ⊥ = I − P im B + α A ˜ ( B B T ) + P im B + P im B ( B B T ) + A ˜ − P im B A ˜ ( B B T ) + P im B − P im B ( B B T ) + A ˜ P im B + o ( α ) = P ( im B ) ⊥ − α P ( im B ) ⊥ A ˜ ( B B T ) + P im B + P im B ( B B T ) + A ˜ P ( im B ) ⊥ + o ( α ) .$
Thus,
$P ( im U ) ⊥ B = − α P ( im B ) ⊥ A ˜ ( B B T ) + B + o ( α ) = − α P ( im B ) ⊥ A ˜ B + T + o ( α ) .$
Now,
$C + B = − [ P ( im U ) ⊥ A ˜ P ( im B ) ⊥ ] + P ( im B ) ⊥ A ˜ B + T + o ( 1 ) → − [ P ( im B ) ⊥ A 0 P ( im B ) ⊥ ] + P ( im B ) ⊥ A 0 B + T .$
Finally,
$M α = Ω − 1 B → U + B + ( I − U + V ) C + B → ( U + + C + ) B → M 0 = I − [ P ( im B ) T A 0 P ( im B ) ⊥ ] + P ( im B ) ⊥ A 0 B + T$
and:
$w α = ( M α B ) − 1 M α B 0 → w 0 = ( M 0 B ) − 1 M 0 B 0 .$
□

#### 4.3. Key Rate Durations

This result requires more simplifying assumptions and is itself rather simple. However, it is illustrative to see where the industry best practice approach of matching key rate durations is placed within our framework, to understand where it is not.
Proposition 2.
Let the model term structure be estimated from known key rates $r *$ at predefined key terms to maturity $t *$ via linear interpolation and constant extrapolation. Then, the hedging weights $w *$ satisfy:
$M T B w * = M T B 0 ,$
where the columns $m k$ of the matrix M correspond to the key rate durations loadings, i.e., they are:
• equal to one for one key term to maturity $t k *$;
• equal to zero for all other key terms to maturity;
• are linearly interpolated for maturities between key terms;
• are constantly extrapolated outside key terms to maturity.
Proof.
Note that $r = M r *$ where M is the linear interpolation and constant extrapolation operator defined above. The hedging equation becomes:
$∂ P V ( F 0 ; r ) ∂ r * = w T ∂ P V ( F ; r ) ∂ r * ,$
which can in turn be simplified using $∂ r ∂ r * = M$ to:
$M T B w = M T B 0 .$
□
We now present two lemmas characterizing the limit behavior of the optimal solution when partially relaxing the strict assumptions of Proposition 2.
Lemma 1.
In the setup of Proposition 2, let the model term structure be determined via regularized curve fitting as follows:
$P 1 ′ r − r * 2 + α · ∥ D r ∥ H ′ 2 → min r ( · ) ∈ H ,$
where D is the differentiation operator, $P 1 ′$ is the projection operator:
$P 1 ′ r = [ r ( t 1 * ) , r ( t 2 * ) . . r ( t K * ) ] T$
and $r *$ is the vector of observed market yields at key terms to maturity $t *$.
Then, as $α → 0$, the hedging portfolio composition approaches the portfolio chosen via the key rate duration approach.
Proof.
It is easy to see that Theorem 1 still holds with $B = ∂ P 1 ′ r ∂ r T$ consisting only of ones and zeroes and $A = 0$. In our setting, (15) can be written as:
$w α = ( M α T B ) − 1 M α T B 0 ,$
where:
$M α = ( B B T + α J T J ) − 1 B .$
It is known that:
$lim α → 0 ( ( M K ) T ( M K ) + α L T L ) − 1 ( M K ) T M = K M , L + ,$
where $K M , L + = ( I − ( L P ) + L ) ( M K ) + M$ is the $M , L$-weighted generalized pseudoinverse of K and $P = I − ( M K ) + ( M K )$. Thus,
$lim α → 0 M α = M 0 = ( I − ( J P ) + J ) B + T = ( I − ( J P ) + J ) B , where P = I − B B T ,$
because $B + = B T$.
Let $r α$ be the optimal solution to (49). Then, $∂ r α ∂ r * = M α$. $M α → M 0$ and $w α → w 0 = ( M 0 T B ) − 1 M 0 T B 0$ uniformly over $r *$ as $M α$ does not depend on $r *$ at all. Obviously, $r 0 = lim α → ∞ r α = M r *$; the linearly interpolated key yields, so $∂ r 0 ∂ r * = M$. Therefore, $M 0 = M$, and $w 0$ coincides with the optimal hedging weight w from Proposition 2. □
Lemma 2.
Let the term structure be estimated from:
$∥ Y ( r ) − r * ∥ 2 + α · ∥ D r ∥ H ′ 2 → min r ( · ) ∈ H ,$
where $Y ( r )$ is the vector of benchmark bond yields to maturity. Furthermore, let the benchmark bonds be coupon-bearing with coupon rates uniformly decreasing towards zero and terms to maturity $t *$. If in addition, $α → 0$, then the hedging portfolio composition approaches the portfolio chosen via the key rate duration approach.
Proof.
Theorem 1 holds for this case with $B = ∂ Y ( r ) ∂ r T$ and:
$A = ∑ i = 1 K Y i ( r ) − r i * ∂ 2 Y ( r ) ∂ r 2 .$
First, let us calculate B.
$∂ Y i ( r ) ∂ r = ∂ P V ( F i ; y · 1 ) ∂ y − 1 ∂ P V ( F i ; r ) ∂ r = 1 δ i F i d ′ ( r ) ,$
where:
$δ i = ∂ P V ( F i ; y · 1 ) ∂ y = ∑ j = 1 N F i , j d ′ ( y · 1 ) j , j .$
Note that as the coupon rate of the benchmark bonds approaches zero, $F i , t i * → 1$ and all other $F i , j → 0$, where $i *$ is the index of the ith bond term to maturity $t i * = t i *$.
Since at the same time $Y i ( r ) → r t i *$, as the coupon rate decreases to zero, $∂ Y i ( r ) ∂ r → e i *$, where $e i *$ is the corresponding unit vector with one at the coordinate $i *$ and zero everywhere else.
Now, let us calculate A.
$∂ 2 Y i ( r ) ∂ r 2 = − 1 δ i 2 ∂ δ i ∂ y ( F i d ′ ( r ) ) T ∂ y ∂ r + 1 δ i F i d ″ ( r ) = 1 δ i F i d ″ ( r ) − 1 δ i 2 ∂ δ i ∂ y ( F i d ′ ( r ) ) T ( F i d ′ ( r ) ) = 1 δ i F i d ″ ( r ) − 1 δ i 2 ∑ j = 1 N F i , j d ″ ( y · 1 ) j , j d ′ ( r ) F i T F i d ′ ( r ) .$
Using the same logic, we get that as the coupon rate of the benchmark bonds decreases to zero, $∂ 2 Y i ( r ) ∂ r 2 → 0$, which means $A → 0$.
Finally, to get the joint convergence, observe that the convergences above is uniform in $α$ since it only depends on r and $r → r 0$ as $α → 0$. □

## 5. Conclusions

The de facto industry standard of immunizing a bond-like obligation with a portfolio of bonds is based on key rate durations. It is nonparametric in its nature, but relies on having an exogenous reference term structure of interest rates determined via piecewise linear interpolation from a set of observed benchmark zero-coupon rates (e.g., interest rate swap rates).
We presented a general treatment of the immunization problem in a usual setting for developing markets, when the reference term structure of interest rates is endogenous, i.e., estimated via a smoothing spline approach from the same set of bonds (usually government bonds), which is being used for hedging.
We showed that this framework was general enough to contain both the standard duration based (Example 5) and convexity based (Example 6) hedges, as well as the key rate duration approach (Proposition 2 with the subsequent lemmas) as special cases.
This could give rise to more general practical implementations of bond portfolio immunization in developing markets. Our model was limited to short term time frames (no time effects and no changes in bond payment schedule), risk-free bonds, and perfect frictionless markets, so future research is desired to introduce the effects of time, heterogeneous credit quality, liquidity, and market frictions.
The practical implications of this work for developing markets are as follows. In the relatively widely spread case of estimating the term structure via linear interpolation of benchmark bonds’ yields to maturity, the key rate duration approach could be more or less justified if the coupon rates of the benchmark bonds are small (with the key rates being equal to the benchmark bonds’ terms to maturity). However, for coupon-bearing bonds or for higher orders of smoothing, this is generally not the case, and a specially designed immunization approach as described in this paper could be more desirable.

## Funding

This research received no external funding.

## Conflicts of Interest

The author declares no conflict of interest.

## References

1. Redington, F.M. Review of the Principles of Life-office Valuations. J. Inst. Actuar. 1952, 78, 286–340. [Google Scholar] [CrossRef]
2. Fisher, L.; Weil, R.L. Coping with the risk of interest-rate fluctuations: returns to bondholders from naive and optimal strategies. J. Bus. 1971, 44, 408–431. [Google Scholar] [CrossRef]
3. Cooper, I. Asset values, interest-rate changes, and duration. J. Financ. Quant. Anal. 1977, 12, 701–723. [Google Scholar] [CrossRef]
4. Willner, R. A new tool for portfolio managers: Level, slope, and curvature durations. J. Fixed Income 1996, 6, 48–59. [Google Scholar] [CrossRef]
5. Bravo, J. Parametric interest rate risk immunization. In New Developments in Banking and Finance; Nova Science Publishers, Inc.: New York, NY, USA, 2007; pp. 35–64. [Google Scholar]
6. Bravo, J.; Fonseca, J. Parametric Immunization in Bond Portfolio Management. In Proceedings of the 9th AFE International Conference on Applied Financial Economics, Samos Island, Greece, 28–30 June 2012. [Google Scholar]
7. Nelson, C.R.; Siegel, A.F. Parsimonious Modeling of Yield Curves. J. Bus. 1987, 60, 473–489. [Google Scholar] [CrossRef]
8. Almeida, C.; Lund, B. Immunization of fixed-income portfolios using an exponential parametric model. Braz. Rev. Econom. 2014, 34, 155–201. [Google Scholar]
9. Barber, J.R.; Copper, M.L. Immunization using principal component analysis. J. Portf. Manag. 1996, 23, 99–105. [Google Scholar] [CrossRef]
10. Fong, H.G.; Vasicek, O.A. A risk minimizing strategy for portfolio immunization. J. Financ. 1984, 39, 1541–1546. [Google Scholar] [CrossRef]
11. Nawalkha, S.K.; Chambers, D.R. The M-Vector Model: Derivation and Testing of Extensions to M-Square. Available online: https://ssrn.com/abstract=979600 (accessed on 14 November 2019).
12. Nawalkha, S.K.; Soto, G.M.; Zhang, J. Generalized M-vector models for hedging interest rate risk. J. Bank. Financ. 2003, 27, 1581–1604. [Google Scholar] [CrossRef]
13. Ho, T.S. Key rate durations: Measures of interest rate risks. J. Fixed Income 1992, 2, 29–44. [Google Scholar] [CrossRef]
14. Jarrow, R.A. Risky coupon bonds as a portfolio of zero-coupon bonds. Financ. Res. Lett. 2004, 1, 100–105. [Google Scholar] [CrossRef]
15. Waggoner, D.F. Spline Methods for Extracting Interest Rate Curves from Coupon Bond Prices; Federal Reserve Bank of Atlanta Working Paper 97-10; SSRN: Rochester, NY, USA, 1997. [Google Scholar]
16. Fisher, M.; Nychka, D.; Zervos, D. Fitting the Term Structure of Interest Rates with Smoothing Splines; Federal Reserve System Working Paper No. 95-1; SSRN: Rochester, NY, USA, 1997. [Google Scholar]
17. Laurini, M.P.; Moura, M. Constrained smoothing B-splines for the term structure of interest rates. Insur. Math. Econ. 2010, 46, 339–350. [Google Scholar] [CrossRef]
18. Tanggaard, C. Nonparametric Smoothing of Yield Curves. Rev. Quant. Financ. Account. 1997, 9, 251–267. [Google Scholar] [CrossRef]
19. Kaushanskiy, V.; Lapshin, V. A nonparametric method for term structure fitting with automatic smoothing. Appl. Econ. 2016, 48, 5654–5666. [Google Scholar] [CrossRef]
20. Strutz, T. Data Fitting and Uncertainty: A Practical Introduction to Weighted Least Squares and Beyond; Springer: Berlin/Heidelberg, Germany, 2011; p. 244. [Google Scholar] [CrossRef]
21. Eldén, L. A weighted pseudoinverse, generalized singular values, and constrained least squares problems. BIT 1982, 22, 487–502. [Google Scholar] [CrossRef]
22. Cline, R.E. Representations for the generalized inverse of sums of matrices. J. Soc. Ind. Appl. Math. Ser. Numer. Anal. 1965, 2, 99–114. [Google Scholar] [CrossRef]