1. Introduction
The modelling of the trend component of economic time series has a long history (Mills, 2003 [
1]). The most primitive and popular model displays the trend as a linear function of time. However, the residuals of such a linear trend frequently exhibit long-period fluctuations (see, for example, King and Rebelo (1993, Figure 1) [
2] and Yamada (2018, Figure 2) [
3]). The Hodrick–Prescott (HP) filter (Hodrick and Prescott, 1997 [
4]) is able to capture such long-period fluctuations well, resulting in a very realistic trend-cycle decomposition. This is probably why the HP trend is often used instead of the linear trend. It removes a smooth trend as one would draw it with a free hand (Pedersen, 2001 [
5]). For a historical review of the HP filter, see Weinert (2007) [
6] and Phillips and Jin (2021) [
7].
It may be queried whether the HP trend residuals no longer contain useful long-period fluctuations. If such long-period fluctuations are present, then taking them into consideration could improve the HP trend. In a recent article, Phillips and Shi (2021) [
8] proposed a new approach to address this issue. They applied the
-boosting (Bühlmann, 2006 [
9]; Tutz and Binder, 2007 [
10]), a popular technique in machine learning. They called it the boosted HP (bHP) filter. For
unit root nonstationary time series, the HP filter cannot properly estimate the trend (Phillips and Jin, 2021, Theorem 3 [
7]). However, the bHP filter is capable of doing so (Phillips and Shi, 2021, Theorem 1 [
8]), making it a highly promising alternative.
Several studies concerning the bHP filter have since emerged: (i) Knight (2021) [
11] established the properties of the bHP filter by deriving the penalized least-squares problems corresponding to the bHP filter; (ii) Tomal (2022) [
12], Trojanek et al. (2023) [
13], Xu (2023) [
14], and Andrián et al. (2024) [
15] conducted empirical studies using the bHP filter; (iii) Hall and Thomson (2024) [
16] provided a way to use the bHP filter as a frequency-selective filter; (iv) Mei et al. (2024) [
17] and Biswas et al. (2024) [
18] provided further results that demonstrate the usefulness of the bHP filter; (v) Yamada (2024) [
19] established the properties of the bHP filter following Knight (2021) [
11]; and (vi) Jin and Yamada (2024) [
20] and Bao and Yamada (2025) [
21] studied the boosted version of the Whittaker–Henderson graduation (Weinert, 2007 [
6]).
The three trends mentioned above, (i.e., the linear trend, the HP trend, and the bHP trend) can be treated in a unified manner. In this paper, we demonstrate the relationship in detail. We show how the bHP trend is constructed from the linear/HP trend and long-period fluctuations remained in their trend residuals.
The organization of the paper is as follows: In
Section 2, we describe the three trends. In
Section 3, we present a unified perspective of them. In
Section 4, we add some more details to the perspective. In
Section 5, we provide an empirical illustration of the obtained results.
Section 6 concludes the paper. In Appendices
Appendix A and
Appendix B, we provide additional results and several proofs.
2. Preliminaries
In this section, we review the three trends mentioned in
Section 1.
Let
denote a macroeconomic variable
y at time
t for
and
. Let
be the
matrix whose
t-th row is
for
. We assume that
does not belong to the column space of
. That is,
cannot be expressed as
, where
is a two-dimensional column vector. The case where
belongs to the column space of
is discussed in
Appendix A.
The linear trend estimated by ordinary least squares, denoted by
, is
where
where
. Here, for a column vector
,
denotes the squared
-norm of
, and it thus identical to
.
The HP trend, denoted by
, is
where
is a smoothing parameter,
,
is the
matrix such that
, and
Here,
is the
identity matrix. Since
in (
4) is a positive definite matrix (Danthine and Girardin, 1989 [
22]), it is nonsingular. Given that
is a symmetric matrix.
As is widely acknowledged, there is a relationship between
and
, as follows:
Here, (
5) follows by applying the Sherman–Morrison–Woodbury formula to
. More specifically, it follows from
where
. Here,
is an
positive definite matrix. Note that the last equality in (
6) holds because
is a nonsingular matrix such that
where
denotes the
matrix of zeros.
The bHP trend, denoted by
, can be considered a generalization of the HP filter. It is defined using the smoother matrix of the HP filter,
in (
4), as follows:
where
Given that
, it follows that
Therefore, as stated, the bHP trend can be considered a generalization of the HP trend.
Given that
is a symmetric matrix, it follows that
Hence,
is also a symmetric matrix.
3. A Unified Perspective of the Three Trends
The three trends defined in the previous section, , , and , can be treated in a unified way. In this section, we show this.
This perspective can be regarded as an extension of the result presented in Kim et al. (2009) [
23] and Yamada (2018) [
3]. Thus, here, we review the result. Using (
7), it follows that
, which leads to
Therefore, it follows that
from which we obtain
Given that
is a low-pass filter (Yamada, 2018 [
3], Figure 1),
in (
14) is a low-frequency component of the linear trend residuals,
. Therefore,
is the sum of the linear trend and a low-frequency component of the linear trend residuals.
Let
. Then, since
, (
14) can be represented as
Next, consider
. Given that
it follows that
Then, substituting (
15) into (
17) yields
The next proposition generalizes (
15) and (
18).
Proposition 1. in (8) can be decomposed asfor , where in (1). Proof. Let
. Given that
the bHP trend,
, can be represented as
Here, since
, it follows that
, which yields
for
. In addition,
. Accordingly, (
21) can be rewritten using
as
Finally, by substituting (
15) into (
22), we obtain (
19). □
Remark 1. (a) Again, given that is a low-pass filter, for in (19) are trend components retained in the trend residuals, . In this regard, see also Proposition 3 in the next section. (b) The next (23)–(26) illustrate how the linear trend, the HP trend, and the bHP trend can be treated in a unified way.Here, the term ‘twicing’ is used in Hall and Thompson (2024) [16]. Interestingly, not only and but also can be considered as the gains from boosting. (c) From Proposition 1, we immediately obtain the following recursive formula:for . Let
. Then, it follows that
. In addition, given that
, (
12) can be represented as
. Moreover, for
, it follows that
Combining these results, we obtain
This is a generalization of (
12).
Given that
is symmetric, from (
29), it follows that
Thereby, we obtain
From (
30) and (
31), we have the following results.
Proposition 2. for in (19) belong to the orthogonal complement of the column space of Π. Proof. From (
30) and (
31), it follows that
Therefore,
for
belong to the orthogonal complement of the column space of
. □
Remark 2. (a) Proposition 2 implies that the total gains of boosting, , in (19) is orthogonal to . In other words, the right-hand side of (19) shows an orthogonal decomposition of . (b) Denote the t-th entry of by for . Proposition 2 implies that 4. Some More Details to the Perspective
In this section, we add some more results concerning
for
in (
19).
Denote a spectral decomposition of an
real symmetric matrix
by
, where
and
. Here, the eigenvalues,
, are in ascending order. Given that
is a positive semi-definite matrix whose rank is
, the eigenvalues satisfy the inequalities given by
Note that
follows from the Gershgorin circle theorem (Estrada and Knight, 2015 [
24]).
Figure 1 plots the eigenvalues
for
.
is an orthogonal matrix. Given that
, it follows that
Accordingly, given that
is of full column rank, we can let
where
,
, and
. Note that, since
, both
and
belong to the column space of
. Denote the
i-th entry of
by
for
, i.e.,
. Accordingly,
, where
.
The next proposition shows how the residuals, , are smoothed by the smoother matrix .
Proposition 3. can be represented bywhere denotes the i-th entry of . Moreover, in (36), the eigenvectors, , satisfy the inequalities given byand the coefficients on the eigenvectors satisfy the inequalities given by Remark 3. (a) The inequalities in (37) show that the degree of smoothness of is higher than that of for . Figure 2 illustrates the latter results by plotting , , , and for . (b) The inequalities in (38) are the core of smoothing. This is because only relatively short-period fluctuations are compressed. For example, is close to zero, whereas is close to unity. We note that are the eigenvalues of in descending order. See Figure 3, which plots for and . The next proposition shows the results concerning the squared -norm of for .
Proposition 4. If does not belong to the column space of Π, then it follows that Remark 4. From Proposition 4, it follows thatHere, we show that this result is reasonable. Given that is an orthonormal basis of the column space of Π, it follows that , where , from which we haveIn addition, given that and , we havewhich is consistent with (40). The next proposition documents the results for the arithmetic mean and the variance of for .
Proposition 5. (i) The sum of the entries of is equal to zero for . (ii) If does not belong to the column space of Π, then it follows that Remark 5. (a) (i) shows that the arithmetic mean of equals zero for . (b) (i) and (ii) imply that the variance of strictly decreases as k increases from zero. Moreover, in addition to (43), given that for , from Proposition 4, it follows that 5. An Empirical Illustration
The dashed line in the top panel of
Figure 4 depicts the log of US real gross domestic product (GDP) from 1947:1 to 1998:2,
. This is the same data used by Morley et al. (2003) [
25] and Perron and Wada (2009) [
26]. The solid line in the panel is the linear trend,
. The dashed line in the middle panel plots the linearly detrended series,
. It clearly shows that the linear trend residuals contain long-period fluctuations. The solid line in the panel plots
is estimated with
. Recall that
The two panels presented here are identical to those shown in Yamada (2018, Figure 2) [
3].
The dashed line in the bottom panel plots the HP detrended series,
. The solid line in the panel plots its low-frequency component,
, which is again estimated with
. From the bottom panel, we observe that the HP detrended series,
still contain long-period fluctuations and they are extracted by
. Recall that
which is twicing. Finally, we report that the following inequalities hold:
which illustrate Propostions 4 and 5.
As a supplementary examination, the same analysis was conducted with
= 800,000. Note that both 1600 and 800,000 are the values used in Perron and Wada (2009, Figure 5) [
26]. The results are illustrated in
Figure 5. Even in this case, the following inequalities hold:
Regarding (
47) and (
48), we make a remark. If
does not belong to the column space of
, then it follows that
A proof of (
49) is provided in
Appendix B.
6. Concluding Remarks
Yamada (2018) [
3] clarified the relationship between the linear trend in (
1) and the HP trend in (
3) and explained why the HP trend seems to be more plausible than the linear trend. This paper is an extension of Yamada (2018) [
3]. In this paper, we treated the three trends, the linear trend, the HP trend, and the recently proposed bHP trend in (
8), in a unified manner and clarified their relationship in detail. We showed how the bHP trend is constructed from the linear/HP trend, and long-period fluctuations remained in their trend residuals. The results obtained are summarized in Propositions 1–5 and illustrated in
Figure 4 and
Figure 5.
Finally, we make one remark. It concerns the value of
m. As stated by Phillips and Shi (2021) [
8], when
is fixed, the effective degrees of freedom of the bHP filter, defined as
), is an increasing function of
m. Thus, increasing
m reduces the sum of squares of the bHP trend residuals. (As shown in Proposition 1 of Yamada (2024) [
19], it tends to 0 as
m goes to
∞.) This relationship is similar to that between the number of explanatory variables and the sum of squares of residuals in a linear regression model. For selecting the value of
m, Phillips and Shi (2021) [
8] proposed an information criterion.