3.2. Unit Root Test, Cointegration, Vector Correction Model (VECM), and Granger Causality Test
(1) Unit root
Before estimating the dynamic relationship, the order of integration (i.e., the stationarity properties of an individual variable) of all variables must be identified. If a non-stationary time series tend to have a unit root, then a spurious relationship among these variables tends to be revealed in a regression analysis, hence leading to invalid causality. A unit root test should therefore be performed under the framework of regression analysis. Based on the augmented Dickey–Fuller (ADF) test, the introduction of lagged terms enables the variables to capture the omitted dynamics and eliminate the biased standard errors. The ADF test needs to assume that the error process has equal statistical variances; however, the asymptotic distribution of unit root test statistics remains unchanged despite the heteroscedasticity. To test for a unit root or non-stationarity, the ADF test is performed using Equation (1):
        where 
 is an error term that is assumed to be stationary with a zero mean and a constant variance. Under the null hypothesis of the existence of a unit root, McKinnoncritical values are used for testing on the coefficients of 
. 
Zivot and Andrews [
23] proposed an improved unit root test that considers structural breaks. In this study, a structural break with an unknown break date is assumed. These authors also noted that all locations of data are likely to be breakpoints, and all T statistics that test 
 are calculated through ADF cycle tests. The minimum 
tvalue is chosen as the corresponding 
 value (
 time of structural breakpoints), where the estimated break date is obtained through 
. Given that the null hypothesis of a unit root is 
, the regression equations used for testing a unit root are listed as follows:
        where 
 is the time of structural breakpoints; when 
, 
, and 
 if otherwise; when 
,
 and 0 if otherwise. 
 refers to the estimated value at break fraction. The number of extra regressors k is the number of the kth order lag, which is determined by the significance of t statistics. The criterion is that t-statistic on 
 is less than 1.6 in absolute value if 
 whereasthe t-statistic on 
 is more than 1.6 when 
. The critical value of asymptotic normal distribution at the 10% significant level is 1.6.
For the traditional unit root test, the structural breakpoints tend to be ignored, which leads to spurious regression; however, the Zivot–Andrews unit root test is used to test the endogenous structural unit root, based on the null hypothesis of 
 being a unit root. Three different models exist:
In the empirical studies, Model C is assumed to be superior to Models A and B because the majority of variables have an increasing trend over time. Their results are more robust.
(2) Cointegration test and VECM model
While assuming the causality between international trade and economic growth, a cointegration test should be conducted prior the Granger causality analysis. The Engle-Granger test evaluates the cointegration between two variables; however, for multiple variable regression, the cointegrated relationship between trade and growth is checked by Gregory–Hansen cointegration test when controlling for structural breaks. Granger’s representation theorem posits that if multiple variables are cointegrated, then an error correction mechanism (ECM) model that represents their dynamic connection exists. If the first differences of two variables are stationary but their levels are non-stationary, then these variables are cointegrated. Under a statistically established cointegration, the residuals can be used to formulate the dynamic ECM, which is used to study the long- and short-term causality between trade and GDP growth. According to Engle and Granger [
24], to prove the causality from trade to GDP growth, all coefficients of the lagged differences of export and import growths are jointly significant. At the same time, the coefficient of the one-period lagged error term from Equation (2) is statistically significant.
        
Following cointegration theory, the VECM is specified with (p) lags; however, the model is estimated with (p-1) lags. By computing the maximum likelihood estimates in multivariate ECM, models can elaborate on howvariables respond to shocks when temporarily deviating from long-term dynamics.
        
        where 
 is an (n × 1) column vector of k variables, 
 is an (n × 1) vector of constant terms, 
 and Π represent coefficient matrices, 
 is the lagged error correction term (ECT), Δ is a difference operator, i denotes lag length, and 
 The coefficient matrix Π is known as the impact matrix that contains information about the long-term relationships among the variables. Before using the Johansen VECM model, the order of integration of the variables is tested. On the one hand, the VAR system can be used when Rank (
) = 
 which indicates that the full rank of the matrix 
 has the stationary vector process 
. On the other hand, the null matrix 
, which is designated as Rank (
) = 0, implies that the non-stationary 
 is non-cointegrated; therefore, the VAR portrayal of the involved variables can still be used only if these variables conduct the first difference. Meanwhile, 0 < Rank (
) < k suggests that 
 series is non-stationary yet cointegrated.The empirical implication of variables being cointegrated is that the involved variable temporarily deviates from the long-run equilibrium due to shocks; however, these variables stick to the long-run equilibrium throughout the entire period. In this study, two maximum cointegrating equations exist consequently. 
 contains information on the term of 
 derived from the equation, which leads to either a temporary difference from the long-run equilibrium or the equilibrium state. The estimated coefficients of the lagged variables 
 can capture the fluctuations resulting from short-term shocks. The VECM based on the VAR model focuses on the characteristics of time series and passes the diagnostic tests. Under the framework of the time series, the estimated models must be free from autoregression, while the residuals obtained from this estimation regression are supposed to be stationary, follow a normal distribution, and have the same variance; therefore, the VEC model is presented in Equations (10) in first differences:
        where 
 equals to lag length minus 1, 
 are the short-term coefficients, and 
 denote the speed of the adjustment parameter, which lies between 0 and 1, and the ECT, which is the lagged value of the residuals from the estimation of cointegrated regression, respectively.
However, with a break in any series, Gregory and Hansen [
25] designed a test for cointegration when controlling for structural breaks. The null hypothesis of the Gregory and Hansen test is that no cointegration exists at the break point in an unknown date against the alternative hypothesis of cointegration at the break point. This rejection of the null hypothesis implies that the linear combination of variables shows the long-run stable relationship. The following equations are the cointegration models:
        when 
, 
, and 
 if otherwise. 
TB is the break date.
(3) Granger causality test
Granger [
26] argued that the VECM model can be used if the variables have a cointegrated relationship. The Granger causality test that follows the chi-distribution with p degree of freedom is based on the null hypothesis that all estimated coefficients are 0 and on the alternative hypothesis of non-zero coefficients. Rejecting the null hypothesis suggests the existence of unilaterally directional links from one variable to another. Specifically, the null hypothesis (
 the coefficient of an independent variable equals 0) highlights that the independent variables fail to explain the changes in the dependent variable. For the joint causality test, the variables fail to jointly explain the explained variable if they support 
 (i.e., all coefficients are 0). In sum, the existence of a long-run link allows the identification of one or more Granger causalities; however, the stably permanent relationship cannot be inferred from the existence of a Granger causal relationship because of the short-term Granger causality test.