Some Dynamic and Steady-State Properties of Threshold Auto-Regressions with Applications to Stationarity and Local Explosivity

The purpose of this paper is to investigate the dynamics and steady-state properties of threshold autoregressive models with exogenous states that follow Markovian processes. Markovian processes are widely used in applied economics although their statistical properties have not been explored in detail. We use characteristic functions to carry out the analysis, and this allows us to describe limiting distributions for processes not considered in the literature previously. We also calculate analytical expressions for some moments. Furthermore, we see that we can have locally explosive processes that are explosive in one regime whilst being strongly stationary overall. This is explored through simulation analysis, where we also show how the distribution changes when the explosive state becomes more frequent although the overall process remains stationary. In doing so, we are able to relate our analysis to asset prices which exhibit similar distributional properties.


Introduction
The purpose of this paper is to investigate the dynamics and steady-state properties of threshold autoregressive models with exogenous states that follow Markovian processes.These models fall within the class of regime-switching models, which have become increasingly popular in applied economics and finance.Initially introduced by Goldfeld and Quandt (1973) and Tong (1978), regime-switching models have been used in economics and finance for a wide variety of applications including forecasting exchange rates (Engel 1994), understanding price transmission (Goodwin and Harper 2000), detecting bubbles in the art market (Knight et al. 2014), and providing a metric of market efficiency (Ahmed and Satchell 2018).Hansen (2011) provides a concise summary of threshold autoregressive processes and their applications.Hamilton (1989Hamilton ( , 1990Hamilton ( , 2010) ) has made seminal contributions to the theory and application of regime-switching models.As outlined above, this article discusses a particular class of regime-switching models.The problems we discuss appear to have much in common with Markov switching models, and Timmermann (2000) has provided a detailed analysis of moments and autocorrelations, which would include our model as "MSIII" in his terminology.However, his analysis does not address non-moment distributional properties or the non-existence of moment-generating functions (mgfs).In addition to deriving theoretical moments, we also use simulation analysis to show how the distributions of threshold autoregressive models change when the process's two states consist of one stationary state and an explosive state.Our analysis focuses on these models for two reasons.Firstly, we are able to derive a characteristic function for this case, thereby adding to the literature on analytical results for threshold autoregressive models.We believe this is a significant contribution to the statistical literature.Secondly, this class of models is of interest in the financial literature concerned with explosive roots.In particular, Theorem 6 in Section 2 specifically refers to a special case that is of interest to researchers working on asset pricing with some non-stationarity.The latter is what we consider for our simulation analysis, and thus we consider it the most important contribution of the current article.We also believe that these models can prove to be useful in the applied macroeconomic literature.The simulation analysis presented in this article will help the reader appreciate how these models can be useful in practice.
In applied Macroeconomics, for instance, Dynamic Stochastic General Equilibrium (DSGE) models often model shocks as AR(1) processes (see Schmitt-Grohe and Uribe 2004).The literature came under particular scrutiny after the financial crisis for its inability to simulate and thereby predict conditions and outcomes that were observed during the crisis.In addition to the absence of financial markets, such models are also restricted by their reliance on a stationary AR(1) model as a shock process, as these processes can rarely be used to study the kind of macroeconomic shocks that led to the financial crisis.On the other hand, these models will not have analytical or numerical solutions if the process is non-stationary.
We postulate that using a TAR(1) shock process which is stationary but nevertheless can exhibit locally non-stationary behavior can improve these models.Our work will enable calculation of moments for such shocks (where such moments exist), allowing the user to work with analytical solutions, or, if the user is deriving a numerical solution, ensuring that such a solution will exist.Indeed, some work has already started relying on Markov-switching DSGE models (Foerster et al. 2016).This paper complements the proposed methodology by enabling researchers to control and simulate shocks of specific variances.
There are further applications in the finance literature where TAR models have become popular.The applications may extend to forecasting oil prices through threshold models or in modelling exchange rate fluctuations.There are many areas where a TAR model and the characteristic functions we derive can provide more depth to the underlying analysis.To use one recent example, Aleem and Lahiani (2014) estimated a TAR model of exchange-rate pass through for Mexico.Their analysis was limited to an estimation of the threshold above which the pass through is greater.The characteristic functions from our article would have allowed them to estimate the volatility of exchange rates in their model, improving both their model and the resulting predictions.Similarly, in Ahmed and Satchell (2018), the empirical application relied on a Markovian exogenous trigger.The results from this article would have allowed them to derive moments of their empirical TAR(1).Corollary 2 in Section 2 offers one example of how the results of this article may contribute to applied and empirical work in finance.
The rest of the article is organized as follows.In Section 2, we present the derivation and formulae for characteristic functions of threshold autoregressive (1) models with exogenous Markovian triggers.Section 3 outlines the simulation methodology.A separate section is necessary since obtaining a sample from the steady-state distribution of a TAR model with a Markov-switching exogenous trigger is a non-trivial exercise.Section 4 presents and discusses simulation results and Section 5 concludes.

Moment-Generating Functions of TAR(1) Models with Exogenous Markov-Triggers
In this section, we introduce the threshold autoregressive model with a Markov-switching exogenous trigger.After introducing the model, we derive the moment-generating functions for this model and present some interesting results.We shift to characteristic functions after Theorem 3, where we do not have to assume the existence of all moments so that moment-generating functions for such processes need not exist.Characteristic functions on the other hand will always exist (Stuart and Ord 1994, chps. 3 and 4).Such processes are often used to model prices (particularly in finance), therefore, we refer to our model as a price process indicated by p t .
The price process has a switching AR(1) form: where α t−1 is a switching drift, β t−1 is a switching coefficient term, and σ t−1 is a switching variance term for the error process.Let: where α is a vector containing all drift terms, β is a vector of coefficients, and σ is a vector of error standard deviations.In general, all the above vectors are k × 1, but we illustrate them when k = 3 for notational convenience.We assume that z t is Markovian and follows a multinomial distribution, and that η t has a moment-generating function ψ(u) which is assumed to be location-scale.z t determines what state α t−1 and β t−1 are in.In particular, , where P represents the Transition Matrix for the Markovian state variable z t .For econometric purposes, we envisage an exogenous continuous random variable Z t and constants ϑ 0, . . ..ϑ 3 , so that z t = e j if ϑ j−1, ≤ Z t < ϑ j , i.e., when the continuous random variable Z t is between thresholds ϑ j−1 and ϑ j , and the value of the Markovian variable z t is equal to e j .The nature of Z t determines P and the kind of regime-switching model Z t is. Here is the transition matrix, which describes the probability of switching states.Note that i P = i , where i is a vector of ones, and Pπ = π where π is the vector of stationary (steady-state) probabilities and i π = 1.
p ji = P z t+1 = e j z t = e i = P z 1 = e j z 0 = e i 1 ≤ i, j ≤ 3 so that the Markov Chain is stationary.Whilst we can estimate P by counting frequencies, we can also hypothesize a Markov process for Z t and then integrate over the appropriate rectangle of the probability density function of (Z t , Z t+1 ).
We now consider exp(up t ) in order to derive the moment-generating function for p t .Here, u ∈ R.
The moment-generating function of p t is defined by where, From (1), we have that: Using iterated expectations, we find that the moment-generating function for p t is: For functions F (p t−1 , Z t−1 ), we note that: by the law of total probability.Thus, φ t (u) = k j=1 π j exp u α, e j φ t−1 β, e j u ψ σ, e j u) is a dynamic recursion for the moment-generating function of p t .

Steady-State Distribution under Markovian States
The above discussion leads to the following result.
Theorem 1. Assuming a steady state for prices, denote E(exp(up t ) ) = φ(u) and E(exp( uη t ) ) = ψ(u) as the appropriate mgfs (or characteristic functions with a trivial definitional change).Then, is the steady-state relationship.
We can use Theorem 1 to arrive at analytical expressions for different moments of the process, p t . Define: We differentiate (4) once to obtain the first moment of p t , and we get: Differentiating the mgf a second time gives us the second moment of p t : .
Further calculations and simplifications lead to an expression for the variance of p t : Various interesting results can be derived from Theorem 1 for plausible parameter values.We list one case below, but other results can be regarded as special cases.Here, we are concerned with the case where k = 2, α j = 0, β 1 = β, β 2 = 0, and σ 2 = 0, so that the mgf function becomes: Corollary 1.If ψ(u) is the moment-generating function of a negative exponential with parameter λ, and α j = 0, σ 1 = σ, σ 2 = 0, and π = β where β 1 = β 2 = β which is less than 1, then φ(u) = λ λ−u , i.e., a negative exponential random variable with parameter λ.
Proof.To show that Equation ( 5) has a solution for some ψ(u), we consider the negative exponential function, i.e., we assume that the disturbance term is distributed as a negative exponential with parameter λ.The corresponding moment-generating function, φ(u), for this disturbance term is λ λ−u if we further assume that β = π and that 0 ≤ β < 1.Note that Equation (5) corresponds to the situation where the β coefficient does not move across states, but the standard deviation of the disturbance term does, i.e., σ 1 = σ and σ 2 = 0. Our result implies that σ = 1 λ .
For our distributional assumption regarding the disturbance, the corresponding moment-generating function is (taking into consideration the two states): Substituting this in (5) and using the trial solution, φ(u) = λ λ−u , we have: If we further assume that β = π, the LHS and RHS are equal, thereby proving our result.We recognize the solution as being a negative exponential auto-regression of degree 1 (NEAR(1)).These models were investigated in detail by Gaver and Lewis (1980), which also included earlier references to related models.We note that the same arguments could be applied to Gamma random variables with integer degrees of freedom.
The attractiveness of these models is that they are AR(1) models where the underlying process is always positive, and hence, can be used to model equity or bond prices in finance.Our version is a slight extension of existing NEAR(1) models, in that Corollary 2 will be consistent with a Markov process for the state process rather than an i.i.d.process, as in the current NEAR(1) literature.
Furthermore, if Z t−1 is Markovian with transition matrix P such that π = Pπ, then: There are a number of observations relevant to (4) and ( 6) which we present below: Theorem 2. Since π = Pπ has multiple solutions for P given π, these different P s do not change the solution to Equation (6).As an example, for k = 2, suppose π = 0.5.It then follows that P 11 = P 22 , but if P 11 = 0.2 or 0.8 in this context, the steady-state distribution will be unaffected except through a change in position.
The steady-state values are equal (i.e., 0.5), and thus, such changes in the structure of the transition matrix should not influence steady-state values.Note, however, that this does not say anything about the speed at which the two processes in this example converge to the steady state.For more on speed of convergence, refer to Rosenthal (1995).Since the processes converge to the steady state through different paths, simulating the steady state becomes a non-trivial procedure, as explained in Section 3.
Theorem 3. Suppose that in Equation ( 6), α is zero, and φ(u) and ψ(u) are infinitely differentiable moment-generating functions and that the variance of the error process is constant.
Theorem 4. I f φ(u) is symmetric, then ψ(u) is symmetric.The proof is trivial.
Theorem 6. Suppose that α j = 0, the variance of the error term is constant, and that we treat as a statement about characteristic functions.Then, if at least one of the β j s is greater than 1 and all of them are non-negative, for some n, the nth moment will not exist.The proof follows from using (7) again and noting that φ n (the nth differential of φ(u)), which is proportional to the nth moment (if it exists), can be expressed as: The requirement for the existence of φ n is that k j=1 π j β n j < 1, which cannot hold for a large enough n under the assumptions of Theorem 6.This result links local explosivity to fat tails.Thus, processes with locally explosive states will cease to have moments once n becomes sufficiently large.Ahmed and Satchell (2018) derived similar conditions for the existence of a mean and variance for a TAR(1) process with an independently distributed exogenous trigger for state switching.We have generalized the result for the nth moment and for an exogenous trigger that is Markovian.Pourahmadi (1988) arrived at a similar result in Section 2 of his article (see Equations (2.3)-(2.5) in Pourahmadi (ibid.)),but he carried out his derivation in the context of doubly stochastic processes, as opposed to the specific case of a threshold autoregressive process that we consider.Secondly, Pourahmadi was mainly interested in second-order stationarity, while we present results for the existence of all moments.Thus, we substantially improve upon the results contained in Pourahmadi (1988) and Ahmed and Satchell (2018).Below, we consider a special case which will be of particular interest to finance practitioners.
Consider now the special case k = 2, β 1 = 1, and β 2 = 0.This is an important special case as it gives us a random walk in one regime and white noise in the other.Substituting into (8), we see that: This can be re-arranged to yield: Since ψ(u) ≤ 1, π ψ(u) < 1 and φ(u) can be represented in terms of a valid series expansion which can be analyzed term by term.Indeed, The right-hand side is uniformly and absolutely convergent because of the Weirstrass M-test, and thus we can integrate term by term.Pourahmadi (1988) also considered this as a special case in his article and derives conditions for the existence of a variance and covariance (see Sections 3 and 4 in his article).
We can now consider different choices for ψ(u).Suppose we have a normally distributed error term with a mean of zero and a variance of σ 2 , i.e., ψ(u) = exp − σ 2 u 2 2 .Then, ψ(u) j+1 represents a normal random variable with a mean of zero and a variance of (j + 1) σ 2 .We can identify the distribution of p t as an infinite weighted sum of normal random variables of increasing variances, but whose relative importance declines with a power of π.This process was analyzed in Knight and Satchell (2011) and extended in Grynkiv and Stentoft (2018).
Likewise, assume the variance of the error is constant.If we consider a mean corrected Poisson so that ψ(u) = exp(θ(exp(iu) − 1 − iu)) with mean parameter θ, we can identify the distribution of p t as an infinite weighted sum of Poisson random variables of increasing means (j + 1) θ, but whose relative importance declines with a power of π.
This case can be extended to include intercepts, in which case, Since exp(iα 1 u)πψ(u) < 1, the analysis proceeds as before, and: We see that the jth component is as above, but has a mean augmented by jα 1 + α 2 .For other examples, we refer the interested reader to Pourahmadi (1988) Section 4, who derived marginal distributions for different processes.
The results can be generalized to Vector threshold auto-regressions.These results are not presented here but are available upon request.The interested reader may also refer to Grynkiv and Stentoft (2018) for some discussion.

A Caveat on Simulating the Steady State
Simulating the steady state for processes similar to those considered in Section 2 is not as straight forward as it may appear at first glance and warrants further consideration, which this section seeks to provide.Generating a discrete Markov chain, which is essentially a variable that takes discrete integer values of 0 or 1 depending on the transition matrix P, does not generate a steady-state Markov chain, but rather a path to the steady state.It is common in this literature to simulate steady-state paths rather than a steady-state Markov chain.Indeed, in our earlier work, we worked with paths and not steady states, as did Timmerman in his article (see Figures 1-6 in his articles for example).While simulating steady-state paths sufficed for our earlier work, we need to simulate the steady state in order to corroborate our results from Section 2. Otherwise, the underlying moments of the simulated series can be different even if they converge to the same steady state.
This path is dependent upon the transition matrix P. If states are persistent, as determined by the transition probabilities of the process staying in the prevailing state (p ii ), this path may diverge significantly from the steady state.On the opposite spectrum is a transition matrix with frequent state switches, due to higher switching probabilities, which will take a different path to the steady state.Although the steady-state probabilities of both paths are identical, the dynamics vary due to the different paths taken by the processes.
We need to consider how discrete Markov chains converge to their steady states.The usual definition is based on total variation distance, and considers the supremum, taken over measurable subsets A, of the absolute difference between v(A) and u(A), where v() and u() are the two probability measures (see Rosenthal 1995).Whilst our process will converge in this sense, it will almost surely not converge along a sample path.Intuitively, it keeps moving from state to state.
We illustrate this by considering two different transition matrices that correspond to the steady-state probability vector 0.5 0.5 .We consider a transition matrix with highly persistent states P 1 = 0.9 0.1 0.1 0.9 and a transition matrix with frequent state switches P 2 = 0.5 0.5 0.5 0.5 .While both processes approach the same steady states, the simulated series have different probability distributions.Specifically, persistence of the non-stationary process, corresponding to β j > 1, causes the path to diverge far from the steady-state values, resulting in a process that has extreme values with significant probability, which also obtains very high kurtosis.With frequent state switches, the simulated series comes closer to a normally distributed process.
To remedy this, each of the above paths was simulated for 10,000 periods and the process was repeated 2000 times.The parameter values in the two switching states are 0 in state 1 and 1 in state 2. We also carried out simulations when the switching states corresponded to values of 0.1 in state 1 and 1.1 in state 2. We recorded the average of the first four moments of both paths in Table 1.As mentioned previously, the persistent path obtains a much higher kurtosis and standard deviation than the more frequently switching path, even though both paths continue to be symmetrically distributed.An alternative approach would be to write out the solution to Equation (1) and simulate directly by taking long samples of the error term and the exogenous process.From Table 1, we observe that the highest 2nd and 4th moments are obtained for the persistent state when β 1 = 0.1 and β 2 = 1.1.This is despite the fact that this process also converges to a steady-state vector [0.50 0.50].The moments are significantly different for the alternative paths corresponding to P 2 .Figures 1 and 2 below highlight the different distributions that result from the different paths, along with cumulative probabilities corresponding to the normal probability density function's quintile values.Tail probabilities are much higher for the more persistent path, which also has more extreme values, especially when the non-stationary state becomes explosive.Tail probabilities are 28.7% and 30.8% for δ = 0 and δ = 0.10, respectively, in Figure 1, which correspond to the more persistent transition matrix, P 1 .In fact, the distribution corresponding to δ = 0.10 appears like a horizontal line instead of a bell-shaped curve.
On the other hand, the distributions of simulated series corresponding to the more frequently switching path (transition matrix P 2 ) have lower tail probabilities and distributions that are closer to the standard normal distribution.Figure 2 contrasts the distribution for the series corresponding to path P 2 to a standard normal.While this simulated distribution has heavier tails, as evident from the higher probabilities corresponding to normal quintiles, its 2nd and 4th moments are much closer to the normal than to the simulated series for path P 1 .Similarly, for the distributions in Figure 2, corresponding to β 1 = 0.10 and β 2 = 1.10, path P 2 look much closer to a normal distribution than to their P 1 counterparts in Figure 1.
Thus, we need a different approach to generate the steady-state distribution of the threshold autoregressive process with Markovian triggers that are independent of the transition matrix, subject to the same steady state.It is important to understand that the results derived in the section above correspond to the steady state itself and not to the path of the process tending to a steady state, which, as we have shown in this section, depends on the transition matrix.We describe how we simulate the steady-state distribution in the next section.
switching path (transition matrix  ) have lower tail probabilities and distributions that are closer to the standard normal distribution.Figure 2 contrasts the distribution for the series corresponding to path  to a standard normal.While this simulated distribution has heavier tails, as evident from the higher probabilities corresponding to normal quintiles, its 2nd and 4th moments are much closer to the normal than to the simulated series for path  .Similarly, for the distributions in Figure 2, corresponding to  = 0.10   = 1.10, path  look much closer to a normal distribution than to their  counterparts in Figure 1.Thus, we need a different approach to generate the steady-state distribution of the threshold autoregressive process with Markovian triggers that are independent of the transition matrix, subject to the same steady state.It is important to understand that the results derived in the section above correspond to the steady state itself and not to the path of the process tending to a steady state, which, as we have shown in this section, depends on the transition matrix.We describe how we simulate the steady-state distribution in the next section.

Simulation Results
In order to simulate a distribution for the steady state, we observed the Markov chain 10,000 times.However, each observation was 1000 time periods or steps apart.Thus, the Markov chain we simulated was 10,000,000 periods long, and the steady-state simulation was 10,000 observations in length.We verified that each series simulated this way converges to the steady-state probability

Simulation Results
In order to simulate a distribution for the steady state, we observed the Markov chain 10,000 times.However, each observation was 1000 time periods or steps apart.Thus, the Markov chain we simulated was 10,000,000 periods long, and the steady-state simulation was 10,000 observations in length.We verified that each series simulated this way converges to the steady-state probability vector while at the same time being independent of the transition matrix, P. We did this by simulating steady states for different transition matrices P that shared the same steady states. 1 Thus, the analysis considered in this section depends only on the steady-state vector and not on the transition matrices.The steady-state chain was then used to simulate the following threshold model: where β it depends on the value taken by the exogenous discrete Markov state variable z i .Our simulations assumed that η t ∼ N(0, 1) and that α = 0.In order for our simulated series to have a steady-state stationary distribution, we required that the criterion 2 i=1 π i ln β i < 0 be satisfied.Since we considered a two-state process, the criterion can alternatively be written as We considered a maximum δ of 0.10, and checked that the criterion was satisfied for all our simulations and that a steady-state distribution does exist.For exposition, we included the value taken by the criterion function for each set of simulations in column 7 of Table 2 below.
Steady-state distributions obtained this way can be analyzed through the results derived in Section 2. For each set of steady-state vectors in Table 2, we simulated threshold autoregressive series (as described above) 2000 times.The parameter values in the two switching states are δ in state 1 and 1 + δ in state 2. Thus, the first state is always stationary, while the second state is either a random walk or explosive.Some patterns are exhibited clearly.The distributions of the series appear to be centered on zero, and statistically their skewness (column 5) is not significantly different from zero, which follows from Theorem 3. Since the disturbance term has an even distribution, it follows that the distribution of our simulated series will also be even, symmetric, and centered on zero, i.e., ψ(u) = ψ(−u) implies that φ(u) = φ(−u).
The standard deviation (column 2), however, does appear to be much larger than the standard deviation of the error process driving the threshold process.The series have excess kurtosis (column 6), which should not come as a surprise since the series display non-stationary behavior when β i ≥ 1.This state leads to excess kurtosis and higher standard deviation.As we deviate from our base case, (δ = 0), we note a clear pattern in the 2nd and 4th moments of the series.Both the standard deviation and kurtosis start to increase, since this behavior is caused by the non-stationary state becoming increasingly more explosive (β i = 1 + δ, when δ > 0).The pattern is repeated irrespective of the steady-state vector chosen.
Unsurprisingly, the higher the steady-state probability of the stationary state (β i = δ), the closer the process's distribution is to a normal distribution.This is reflected in the first four moments.For instance, when the stationary state occurs 90% of the time (as in rows 2-6 of Table 2), the standard deviation and kurtosis are both lower compared to corresponding cases (i.e., same δ) when the stationary state occurs less than 90% of the time.When δ = 0.05, the standard deviation and kurtosis respectively are 1.061 and 3.362 for π = 0.90, 1.134 and 3.762 for π = 0.80, 1.339 and 4.660 for π = 0.60, and 1.494 and 5.302 for π = 0.5.While all distributions appear symmetric and centered on 0, they become increasingly leptokurtic as π falls. 1 These results have not been included but are available upon request.We also plotted sample distributions for δ = 0 and δ = 0.10 for each set of steady-state vectors (Figures 3-6) and carried out quintile analysis by calculating the weights in the distribution corresponding to the quintile values of a normal distribution, i.e., we found P(y t ) < q 1 , q 1 < P(y t ) < q 2 , q 2 < P(y t ) < q 3 , q 3 < P(y t )< q 4 , and P(y t ) >q 4 , where q i corresponds to the normal distribution's quintile values.We note that tail probabilities, i.e., those corresponding to the 1st and 5th normal quintile values, go up as the steady-state probabilities for the non-stationary state go up.They are also dependent on the value of δ, and as we increase explosivity in the non-stationary state (by increasing δ), tail probabilities increase and the distributions moves farther away from a normal.The symmetry of the distribution is also reflected in these probabilities.
The results above depend only on the steady-state vector π 1 − π and not on the transition matrix (the dynamic path) that generates this steady-state vector.We note that all series generated this way have excess kurtosis due to the presence of a non-stationary state. 2 Below, we simulate probability density functions for some of the cases considered in Table 2.The graphs only report the distribution for δ = 0 and δ = 0.10, so we can analyze how far the distribution moves from a normal as we increase δ and increase the probability of the non-stationary part of the distribution.We also draw a comparison with a normal distribution for illustrative purposes.
It is worth mentioning at this point that δ = 0.10 may not present as interesting an empirical case as say δ = 0.01 or δ = 0.03.In empirical work (e.g., Ahmed and Satchell 2018), estimates have suggested that even with an explosive state in a threshold autoregressive model, the δ of the explosive state from the random walk state does not extend beyond 0.03.Our analysis is thus meant to provide a graphical view of how the distribution evolves as δ is varied, and this is most obvious when considering the two extreme values.Distributional analysis for other values was also considered and is available upon 2 We have checked that the kurtosis exist by verifying that 1 − 2 j=1 π j β 4 j < 0. Indeed, for our most extreme case (π 1 = 0.5, pi 2 = 0.5, β 1 = 0.10, β 2 = 1.1), the kurtosis does exist as the criterion for its existence is satisfied: 1 − 2 j=1 π j β 4 j = 0.7321.
request.While results for other values of δ follow in the same direction as those considered here, the differences are less stark as would be expected.For researchers interested in modelling non-stationary behavior using threshold auto-regressions, a value of δ < 0.03 may be more appropriate.
From Figure 3a,b, we can see that when the stationary state is dominant (π = 0.90), the distribution appears very close to a normal distribution, even if δ is increased from 0 to 0.10.Contrast this distribution with Figure 6a,b, where the stationary and non-stationary states occur with equal probability.The distributions in Figure 6a,b are significantly leptokurtic, with tail probabilities of 25.2% and 26.2%, respectively, as opposed to tail probabilities of 20.8% and 21% in Figure 3a,b, respectively.These results are interesting, particularly for those relying on DSGE models.In finance, asset prices often exhibit locally non-stationary behavior, which leads to leptokurtosis in the series.Similarly, macroeconomists often consider different shock mechanisms in DSGE models.The results in this article will assist macroeconomists in considering shock processes that follow threshold auto-regressions with a non-stationary state.Since we have outlined a procedure for deriving analytical expressions, this would enable macroeconomists to analyze locally explosive shock processes which nevertheless are stationary overall and facilitate the implementation of numerical solutions.

Conclusions
In this article, we have derived formulae for characteristic functions for threshold autoregressive models of order 1, which have a Markovian state-switching variable.In doing so, we have not only improved the results first considered in Timmermann (2000), but have also generalized the formulae to a great degree.These formulae will allow readers, if they are so inclined, to derive analytical moments for TAR models in this class for a range of different error specifications.We believe that these will have applications in both finance and applied macroeconomics, given increasing interest in these models.
Considering a special case of interest, we have also shown using simulation analysis that the existence of a non-stationary state in a TAR model can cause the distribution of a particular series to deviate significantly from normality.The further away the non-stationary state moves from a random walk, the farther away the distribution is from that of a normal.Models for asset prices often consider a mixture of stationary and non-stationary states, and we believe that this simulation analysis will aid researchers in better understanding the behavior of asset prices that go through locally explosive states.In Sections 1 and 2, we have further outlined how our results may be applicable to applied macroeconomic and finance literatures.
Indeed, articles like Ahmed and Satchell (2018) would have found these results useful.While Ahmed and Satchell (2018) provided an empirical example to estimate market efficiency using a threshold autoregressive model, they were unable to calculate means and variances for their series.Through the results in our article, they would have been able to compare the two asset price series using well-known metrics in finance, such as Sharpe ratios.Similarly, any researchers modelling asset prices through threshold autoregressive models with a Markov-switching variable will be able to calculate moments and provide a more holistic narrative.
Furthermore, the contributions in the article will allow researchers to consider non-normal distributions for asset price series.Normality or log-normality is often assumed in the asset pricing literature to allow derivation of analytical expressions at the cost of accuracy.As we have shown, not only are threshold autoregressive models better at capturing asset pricing-like behavior, but they also allow for analytical solutions that can be useful in helping us better understand asset pricing behavior.
Another avenue worth considering from a statistical point of view will be to consider relevant moment-generating functions for the error process, and analyzing the resulting distributions for the underlying process, which we have referred to as the price process.Whilst we have considered the standard normal and Poisson distributions, future research could consider other distributions, which may inform future Finance and Economics research.
Thus, our article makes significant contributions to the existing literature on TAR models and offers insights into how such models may be used in applied economics and finance.

Figure 2 .
Figure 2. Simulated Distributions for TAR(1) with  as the transition matrix.

Figure 2 .
Figure 2. Simulated Distributions for TAR(1) with P 2 as the transition matrix.

Figure 3 .
Figure 3. (a) Simulated Distributions for TAR(1) with  as the steady state vector and  = 0. (b) Simulated Distributions for TAR(1) with  as the steady state vector and  = 0.10.

Figure 4 .
Figure 4. (a) Simulated Distributions for TAR(1) with  as the steady state vector and  = 0. (b) Simulated Distributions for TAR(1) with  as the steady state vector and  = 0.10.

Figure 6 .
Figure 6.(a) Simulated Distributions for TAR(1) with  as the steady state vector and  = 0. (b) Simulated Distributions for TAR(1) with  as the steady state vector and  = 0.10.
Table1reports the first four moments of simulated threshold autoregressive series with Markov-switching exogenous variables.Each series is 10,000 observations long, and 2000 series were simulated for each set of values.

Table 2 .
Average moments of TAR(1) with changing Steady State Vectors.Table2reports the first four moments of the simulated series along with the stationary criterion.Each row corresponds to 2000 threshold autoregressive simulations with Markov triggers, 10,000 observations long; columns 3-6 report average moments.
J. Risk Financial Manag.2019, 12, x FOR PEER REVIEW 13 of 18regressions with a non-stationary state.Since we have outlined a procedure for deriving analytical expressions, this would enable macroeconomists to analyze locally explosive shock processes which nevertheless are stationary overall and facilitate the implementation of numerical solutions.