1. Introduction
With the integration of the global economy, international trade has developed rapidly and has become an indispensable source of economic growth. Fluctuations in total trade volume affect not only a country’s economic growth but also its international trade situation. An increasing number of countries earn large revenues from international trade. Total trade volume significantly influences countries’ economies and is an important indicator of a country’s international trade situation. A trade volume forecast helps enterprises to schedule their production efficiently and promote sustainable international trade. For these reasons, it is important for company managers to forecast total trade amounts for relevant government departments and business investors. A trade volume forecast is subject to complicated factors, and because of its complex and irregular patterns, it still has not been explained fully in the academic literature. Consequently, the demand for the accurate modeling and forecasting of trade volume has increased. At present, methods of forecasting total import and export trade are divided into three categories: the linear prediction method, the nonlinear prediction method, and a combination forecasting method. While linear prediction is simple and intuitive, and its parameters have strong economic significance, it has difficulty in fitting complex data [
1,
2]. The advantage of nonlinear prediction is that it can fit a complicated nonlinear system, so nonlinear predictions have better fitting precision than linear predictions for nonlinear import and export trade volumes [
3,
4,
5]. The combination forecasting method addresses these shortcomings by integrating multiple single-prediction models, including linear and nonlinear prediction models, using advantages of both models to obtain better predictions results [
6,
7,
8]. In this paper, the Grey wave prediction method with an HP filter combines linear and nonlinear predictions, thereby improving prediction accuracy.
Grey system theory [
9,
10], proposed by Professor Deng, is based on various forms of uncertainty information theory and methods, characterized by small data amounts that can disregard data distribution. The theory is applicable in many fields [
11,
12,
13].
The Grey prediction model is one of the most active and widely used Grey system models. Some data processing programs can scientifically and quantitatively predict the future of a system. Grey prediction models have numerous applications. For instance, Chirwa and Mao, using U.S. and UK data, applied a GM (1,1) model to estimate the risk of car accidents [
14]; Cempel used a Grey prediction model to monitor the state of mechanical vibration [
15]; Grey system models were also used to analyze and predict hydraulic processes at the Karst watershed [
16,
17]; Wang adopted a Grey neural network method for the Chinese civil aviation operation’s nonlinear, on-line prediction model [
18]; and Li et al. used a Grey system model to predict spacecraft failure [
19].
Nearly all previous studies used the GM (1,1) or GM (1,N) model to derive predictions [
20,
21,
22] for monotonic time series. While these studies showed very effective results with high prediction accuracy [
23], they are not suitable for cyclical fluctuations and a wide range of data series. Many researchers propose that GM (1,N) models be modified on the basis of the characteristics of data series. For a series of fluctuating data, Feng proposed the GM (1,1, sinω) model [
24], and Qian and Dang proposed an oscillation series in the GM (1,1) model. However, the application of both models is very limited [
25]. Only Guo et al. used GM (1,1, sinω) to predict epidemiological trends of hemorrhagic fever and renal syndrome in Shenyang [
26]. The GM (1,1) model, which is based on an oscillation series, is only proven by a numerical example. Both models of mathematical theory are very complex, and few practitioners can master them. In addition, these two models only apply to stable fluctuations in data series; in other words, they are not suitable for time series with growth trends.
Grey wave predictions based on data series predictions are one of the most important research topics in Grey system prediction research. Grey wave prediction models not only have the advantage of a Grey system prediction model but also that of forecasting short-term fluctuations of time series based on limited data [
27,
28]. These models do not consider the distribution and stability of a time series. In fact, the Grey wave prediction model accurately predicts data series with large fluctuations, such as that of Chen et al., who analyzed an economic cycle using a Grey wave forecasting model and proposed Grey wave prediction in the economic cycle [
29]. However, there is little research regarding Grey wave prediction models for time series forecasting, let alone an improved model for Grey wave forecasting, when compared with the explosive growth of studies on GM (1,1) or GM (1,N) [
22,
30,
31,
32,
33]. A pioneer in Grey wave prediction is by Wan et al., who advanced a basic Grey wave prediction model beyond data series with a regular fluctuation range based on a data series and trough selection contours [
34]. However, this method is inefficient because all crests and valleys are found to carry heavy workloads. Chen proposes a non-equidistant Grey wave prediction model that begins with selecting qualified contour lines and using a GM (1,1) model. This method, however, is only applicable to time series that do not have significant trends [
35]. A few international studies exist regarding oscillation series with obvious trend characteristics. Chen proposed a Grey wave prediction method based on general contours. This method uses the least squares method to select contours, which can make up for shortcomings in a Grey wave prediction model by applying ascending and descending trend data [
36]. Based on the above research, this paper proposes a Grey wave prediction model with an HP filter that aims to reduce the interference of long-term trend items, using wave prediction by trend decomposition. After applying an improved Grey wave prediction model to China’s total trade volume forecast, this paper concludes that a Grey wave prediction with an HP filter can improve the accuracy of non-equidistant Grey wave predictions.
  2. A Grey Wave Prediction Model with an HP Filter
Grey wave prediction is a graphical prediction method that uses information from a time series data graph to predict future series. It focuses on irregular, developing trends of data series, and it is applicable to data series with frequent and irregular fluctuations. Contours are selected and identified to pick up graphical information. The principle is to obtain coordinate information regarding the intersection of the contour lines and the data graph.
Finally, GM (1,1) is used to forecast and model intersections. To improve the adaptability of a Grey wave forecasting model in a fluctuating data series with increasing or decreasing trends, a Grey wave forecasting model with an HP filter is proposed in this research.
  2.1. The HP Filter
Proposed by Hodrick and Prescott (1980), an HP filter [
37,
38,
39] method is adopted to eliminate the long-term tendency of a time series in economics. This method is widely used in the field of macroeconomics. An HP-filtering method is a time series decomposition method in state space. The HP-filtering method assumes that macroeconomic data is composed of a long-term tendency and a short-term circulation. Trend decomposition is the first step of a Grey wave prediction model with an HP filter. A traditional Grey wave prediction model is based on original data; hence, subject to trend term interference, it is easy to create a prediction effect of the model.
Given 
 observations on a variable 
, 
 is the trend component as a smooth series that does not differ too much from the observed 
. The tendency of total trade volume can be decomposed through optimization:
		
As we mentioned before, for minimization, we use  to adjust any changes in the long-term trend. When , which means that the trends satisfy the minimization problem (in this connection, with increasing ), we estimate a smoother tendency, moving toward infinity, with a quasi-linear, long-term trend.
  2.2. Predictions of Tendency
The main task of this step is to find a series of trends after HP filtering. We apply a GM (1,1) model to fit and predict the trend series.
Definition: Assume that 
 is an original series 
. Let 
 be the raw data series, generating the first-order accumulated generating operation (AGO) series 
, where:
		
The first-order differential equation of the GM (1,1) model is obtained as follows:
		
By using the least squares method, parameters 
 and 
 can be obtained as follows:
 is the sequence mean generated based on consecutive neighbors of 
.
        
Furthermore, an accumulated matrix 
B and vector 
 are given by the following:
The discrete form of the solution for the GM (1,1) model is given below.
        
The recovered data 
 can be retrieved from the following equation:
		
  2.3. Prediction of Circulation
The purpose of this step is to obtain a series of circulation after HP filtering. We use a Grey wave prediction model to fit and predict the cycle series.
  2.3.1. Choosing Contour Lines
Original Grey wave prediction models utilize horizontal contour lines, which only achieve preferable prediction results when using data series with steady, periodic fluctuations. However, because economic data fluctuates with upward and downward tendencies, an original Grey wave forecasting model is no longer suitable for this type of data series. Therefore, we choose to use the quantile of time series as a contour line to seize graphical information correctly proposed by Chen (2016) [
36] (shown in 
Figure 1).
Definition 1. (unequal-interval contour lines):
Let the original series be , and sort the series in ascending order to get .
Let , , and let  be the -quantile of the data , where the i ths-quantile is:where , the  horizontal lines decided by  are unequal-interval contour lines.
    2.3.2. Identifying Contour Time Series
The contour time series should be determined according to the quantile of short-term cycle time series. Each element of a contour time series is the abscissa of the intersection point between contour lines and a short-term cyclic time series graph.
Definition 2. (contour time series):
Let the cyclic time series be , with contour line  intersecting with the original time series, and let  be the set of intersection points.  is located on the th broken line, and the coordinates of  are as follows:and let:where by  is the contour time series of contour line .
    2.3.3. Filtrating Contour Time Series
The basic Grey wave prediction method is based on a contour time series of four or more values. However, when the time series fluctuates violently with a long-term tendency component (as shown in 
Figure 1), the different number of intersection points for different contour lines is obvious. Therefore, the traditional method, based on four or more values, is not appropriate, and the contour time series needs to be filtered to perform the GM (1,1) prediction process.
This paper uses autocorrelation of time series to select the contour lines, which means that future series observations will be affected by current or late observations. Therefore, when the last value of the contour time series is close to the first forecasted observation, we choose it as the qualified contour time series. The rest are defined as unqualified contour time series.
Definition 3. (qualified and unqualified contour time series):
Let  be the first predicted observation and . If  is less than or equal to the threshold ,  is the qualified contour time series. If  is larger than the threshold ,  is the unqualified contour time series.
   2.3.4. Establishing GM (1,1) Models
For the final step, we use GM (1,1) models based on qualified contour time series for both in-sample fitting and out-of-sample forecasting, and we use GM (1,1) models based on unqualified contour time series only for in-sample fitting.
Proposition 1 establishes GM (1,1) based on  to obtain forecasting values .
Sort all the values in contour time series 
 in ascending order, and delete invalid elements. The forecasting series is as follows:
		  
In which 
. If 
		  is on contour line 
, the generated wave through in-sample modeling and out-of sample prediction is:
		
The novel Grey wave predicting procedure of total Chinese trade volume mentioned above is shown in 
Figure 2.
  3. Empirical Studies
  3.1. Data
In this study, we use monthly trade data of the total Chinese import and export trade volume, a valid estimation of international trade. The data are prepared and obtained from the National Bureau of Statistics of the People’s Republic of China. Datasets span from November 2013 to December 2016. 
Figure 3 indicates the frequent and irregular fluctuations in total trade volume.
  3.2. The Forecasting Process
This study divides the datasets into two parts: an in-sample data form November 2013 to August 2016 (34 observations) and an out-of-sample data from September 2016 to December 2016 (four observations). In-sample data, which are applied to select contour lines, determine contour time series. Out-of-sample data appraise predictive performance. We expect a four-step prediction process.
Table 1 lists the autocorrelation coefficients from 1 to 10 lags. It indicates that the total trade volume incorporates autocorrelation; hence, we can use Definition 3 to classify contour time series.
 This paper uses a Hodrick–Prescott filter to further decompose long-term trend items and short-term cycle items. According to an ordinary decomposition of the monthly time series, 
λ was set to 14,400. The result of Hodrick–Prescott filter is shown in 
Figure 4.
We divide the prediction process into two parts. The first part establishes the GM (1,1) model to predict long-term tendencies, and the second part predicts short-term cyclic time series.
  3.2.1. Tendency-Component Prediction
The first part of the Grey forecasting process established the GM (1,1) model to forecast long-term tendency series.
By using the least squares method, matrix 
 and vector 
 are as follows:
		  
According to Equation (3), the first-order differential equation of the GM (1,1) model is as follows:
		  
The solution of the GM (1,1) model is as given below.
          
  3.2.2. Cyclical Component Predictions
The second part of the forecasting process involves predicting the short-term cyclic time series. Fourteen contour lines are shown, as follows:
		  
Since the serial number of the first forecasting observation is 38, and the data show autocorrelation according to 
Table 1, this paper chooses 
, 
, 
, 
, and 
 as qualified contour time series. Other counter series are just for in-sample fitting purposes.
The final step involves GM (1,1) to predict the contour time series. In this section, we only give the time-modeling series of a qualified contour time series 
, 
, 
, 
, and 
. The modeling process of the remaining contour time series is the same.
          
Let 
; thus, the forecasting value of the contour time series is as follows:
		  
  3.3. Forecasting Results
To better reflect the superiority of a novel Grey predicting method, this paper chooses the improved basic method (using unequal-interval contour lines) and the most commonly used model in time series analysis ARMA (1,3) to forecast China’s total monthly trade volume.
We use the Mean Absolute Percentage Error (MAPE) to evaluate the prediction accuracy of the model. 
Table 2 suggests that the predictive effect of an improved Grey wave prediction method is superior to the basic method, in both modeling and predicting. Although the performance of ARMA (1,3) is better than those of the basic methods for out-of-sample forecasting, the novel Grey wave prediction method is significantly superior to the above mentioned two models, whether using in-sample data-fitting or out-of-sample predictions. 
Figure 5 compares the forecasting results. In a four-step prediction, ARMA (1,3) tracks the movement of the existing trade volume closely, which affects the accuracy of prediction, whereas the improved method considers possible fluctuations.
  3.4. An Asymptotic Test
The asymptotic test is a statistical method used to test the equality of the forecast accuracy [
40]. It uses the null hypothesis of equal forecast accuracy for two forecast methods to test the accuracy of the prediction models. The asymptotic test does not assume the distribution of the loss, and there is no limitation to the loss function. Therefore, the asymptotic test is more practical to compare the predictive accuracy of the models.
We assess two forecasts, the basic grey wave forecast and the forecast implicit in the grey wave prediction model with HP filter (the difference between the basic model and the novel model). As ARMA forecast have been widely used in time series forecast, we chose ARMA model as the “no change” forecast.
The actual and predicted changes are shown in 
Figure 6. We shall assess the forecasts’ accuracy under absolute error loss. The loss differential series is shown in 
Figure 7, in which no obvious non-stationarities are visually apparently. Approximate stationarity is also supported by the sample autocorrelation function of the loss differential, shown in 
Figure 8, which decays quickly.
Because the forecasts are four-step-ahead, according to the condition of the asymptotic test, the forecast allows least three-dependent forecast error. This intuition is confirmed by the sample autocorrelation function of the loss differential, in which sizable and significant sample autocorrelation appear at lags 1, 2 and 3 and nowhere else.
We then proceed to test the null of equal expected loss. F, MGN, and MR are inapplicable because one or more of their maintained assumptions are explicitly violated. The statistic for testing the null hypotheses of equal forecast accuracy is
        
        where 
 is a consistent estimate of 
.
By taking a weight sum of the available sample autocovariances,
        
        where:
 is the truncation lag
 is the lag window, defined as follows:
		
We therefore focus on test statistic  setting the truncation lag at 3 in light of the preceding discussion. We obtain  of the novel model is negative 1.2391, implying a p value of 0.2178. Thus, for the sample at hand, we do not reject at conventional levels the hypothesis of equal expected absolute error. Moreover, we can also obtain 
        of the basic model is negative 1.7925 with a p value of 0.1936, the result is the same as the novel model, which means that both the basic Grey wave forecast and the novel Grey wave forecast are not a statistically significantly worse predictor of the future total trade volume than is the ARMA forecast.
  4. Discussion
This paper divides the data into in-sample data and out-of-sample data. In-sample data-fitting results obtained are based on 34 observations. The 35th to 38th observations are used for a four-step forward prediction. To better reflect the superiority of the improved Grey wave prediction method, this paper chooses the unequal-interval prediction method (shown in 
Figure 5b) and the most commonly used ARMA (1,3) model in a time series analysis (shown in 
Figure 5c) to forecast total monthly Chinese import and export trade volume. From 
Figure 5, we conclude that the unequal-interval Grey wave prediction method fluctuates more intensely, mainly because in an unequal-interval Grey wave model, it is easy to locate data in the extraction, ignoring the impact of the tendency. Therefore, the amplitude of the restored data series is greater. It can be seen from 
Figure 5c that a significant hysteresis effect exists for the in-sample fitting value of the ARMA (1,3) model, which is mainly due to the joint action between the AR term and the MA term. Because of this effect, the ARMA (1,3) model shows a slow downward trend in the total import and export volume forecast in the four-step prediction, which is inconsistent with the fact that the actual volume of trade rose suddenly. The novel method obviously has been improved, for both in-sample data-fitting and out-of-sample forecasting. The novel method has the same fluctuation as the in-sample fitting, with little hysteresis effect. In addition, the improved method shows a precipitous rise in the four-step prediction, which is consistent with the actual total trade volume.
  5. Conclusions
In recent years, China’s trade in goods has declined substantially, which suggests that the current international trade situation is not optimistic. Through China’s “One Belt One Road” initiative, the country is actively promoting infrastructure construction and the realization of interconnectivity. In the future, with the optimization and upgrading of its industrial structure, China’s reforms will begin to bear fruit. As a measure of China’s foreign trade, its total trade volume, a quantitative indicator of Chinese import and export trade, is currently recognized as a reliable indicator of the country’s actual trade. A prediction of the total Chinese trade volume is important for the country’s export enterprises. It can help them to accurately estimate China’s trade situation and schedule production efficiently. It would also be helpful to China in adjusting its trade policy in due course and to promote sustained international trade.
With graphic prediction, we use a novel method to model China’s total import and export trade volume from November 2013 to December 2016. During this period, the total Chinese trade volume declined in the trend with the irregular cycle characteristics. This method not only reflects the cyclical fluctuation characteristics of trade but also is not limited to conventional time series modeling. Based on limited data, the novel Grey wave prediction model with HP filter performs better than basic Grey wave prediction model and also better than ARMA (1,3). The novel model can be applied in the forecasting of fluctuant data series with upward/downward tendency. In real life, many economic variables such as GDP and PMI Inevitably have the same characteristics of fluctuation with total Chinese trade volume. Therefore, the novel model has a wide range of applications. Besides, the advantage of Grey theory is that it can use limited data to analyze problems systematically. The traditional model must control the stability of the original data and assume that artificial intelligence technology needs a lot of data as training set. Grey wave forecasting with HP filter improves the adaptability of Grey wave forecasting model and is meaningful to forecast the developing tendency of emerging situations.
At present, Grey wave prediction is applied sparingly in Grey system theory, despite being worth further investigation. Although this paper shows that our Grey wave forecasting model with an HP filter outperforms a traditional time series processing method (ARMA), regarding prediction accuracy, forecasts of total Chinese trade volume are based only the on graphical characteristics of the series itself. Trade volume variability always includes other factors, such as exchange rates. Therefore, the processing of raw data or contour line selection needs to be combined with other graphical prediction methods in future research. Our novel method is based on one-side HP filter, while there are still some drawbacks of the HP filter that need to be further studied. Above all, there is still great potential for the improvement of the Grey wave forecasting model.