Next Article in Journal
The Efficiency of the European Non-Life Insurance: CEO Power, Macroeconomic, and Market Characteristics Impact
Previous Article in Journal
Acknowledgement to Reviewers of the International Journal of Financial Studies in 2015
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Level Set Analysis and A Nonparametric Regression on S&P 500 Daily Return

1
Department of Mathematics and Statistics, University of Houston-Clear Lake, 2700 Bay Area Blvd., Houston, TX 77058, USA
2
Department of Mathematics, University of Missouri-Columbia, Columbia, MO 65211, USA
*
Author to whom correspondence should be addressed.
Int. J. Financial Stud. 2016, 4(1), 3; https://doi.org/10.3390/ijfs4010003
Submission received: 12 October 2015 / Revised: 29 December 2015 / Accepted: 28 January 2016 / Published: 18 February 2016

Abstract

:
In this paper, a level set analysis is proposed which aims to analyze the S&P 500 return with a certain magnitude. It is found that the process of large jumps/drops of return tend to have negative serial correlation, and volatility clustering phenomenon can be easily seen. Then, a nonparametric analysis is performed and new patterns are discovered. An ARCH model is constructed based on the patterns we discovered and it is capable of manifesting the volatility skew in option pricing. A comparison of our model with the GARCH(1,1) model is carried out. The explanation of the validity on our model through prospect theory is provided, and, as a novelty, we linked the volatility skew phenomenon to the prospect theory in behavioral finance.

1. Introduction

Building a good mathematical or statistical model is fundamental to modern financial studies. Although the geometric Brownian Motion plays an important role in the modeling of modern financial studies due to its conciseness, it has become common sense that the log returns in the stock market are not i.i.d. (independent and identically distributed) normal random variables. New patterns on the stock returns have been discovered and new models have been constructed in the past three decades—for example, the phenomenon of volatility skew in the equity market and volatility smile in the currency market, and the ARCH models [1] and GARCH models [2] that were proposed to capture the clustering property of highly volatile periods in stock returns, although they lack the characterization of serial correlations in the return process. Stochastic volatility (SV) models, e.g., [3,4], were presented as an extension to the ARCH/GARCH models and they are shown, under some situations, to be capable of reproducing the volatility skew phenomenon [5]. The statistical aspects on ARCH and SV models was analyzed in [6]. In [7], Barndorff–Nielsen and Shephard constructed positive OU processes for the volatility processes and carried out extensive model calibration. The modelling with Lévy processes is presented in [8] and theoretical analysis on related processes can be found in [9]. Interested readers are referred to [10] for a comprehensive literature on financial time series analysis. Besides the traditional time domain analysis, other methods include the spectrum analysis and wavelet analysis [11,12], the analysis through P/E ratio [13], and some other recent progress [14].
Another major branch of study to capture the volatility skew/smile phenomenon is the so-called regime switching models, e.g., [15,16]. The parameters in these models vary due to the changing of market or economic states, and it can be shown that these models can replicate the volatility skew/smile phenomenon. However, the biggest challenge to this explanation is that volatility skew/smile phenomenon is seen only from market data after the 1987 crash but is not seen from market data before 1987. Therefore, the 1987 crash must have brought some fundamental change to the market and, very likely, in the behavior of investors.
Most of the aforementioned models are proposed to mimic the evolution of asset price in a statistical or mathematical point of view, and there is a tendency toward complication in these models. Furthermore, little work has been found to explain why these models are valid in a practical and realistic point of view. On the contrary, behavioral finance often provides an explanation from a different aspect. For example, Chen et al. [17] studied the price effects of changes to the S&P 500 Index and discovered an asymmetric price response around additions to and deletions from the S&P 500 Index. In [18] and also in [19], the authors presented an agent based model, where the agents’ trading behavior consists of a fundamental component and a trend chasing component. The final trading decision is a random combination of these two components, and the price is thus determined by the market clearing condition. A limiting diffusion model is derived, which is a random regime switching dynamic process.
Frey and Stremme [20] modeled the market participants as reference traders and program traders, where the former make decisions based on their utility function and investigated the effect of dynamic hedging on volatility. They showed that the heterogeneity of the distribution of hedged contracts is one of the key determinants for the transformation of volatility. Heemeijer et al. [21] showed that when the economic agents have positive expectations about the market development, due to the feedback effect, large fluctuations in realized prices and persistent deviations from the fundamental are likely; when they have negative expectations, prices converge quickly to their equilibrium values. According to the work of Danilova [22], the main explanation of the volatility of asset returns is the volume of trade. Lyengar and Ma [23] linked the asset price and trading volume through the study of behavioral finance, and proposed a logistic regression model for jointly predicting stock price and volume at the tick-by-tick level. This work provided a novel path for the search for more comprehensive results.
Furthermore, in order to explain the volatility smile and skewness phenomenon, Platen and Schweizer [24] assumed that the market participants consist of arbitrage-based agents (or speculators) and hedgers (or technical agents), and the volatility smile effect is obtained through the Black–Scholes hedging of European call options from the technical component. This research work through behavioral finance provides evidence that feedback effects from the traders’ behavior play an important role in the study of price fluctuations.
Market fluctuations are accompanied with the activities of investors, especially those of large financial agents who often have information advantages. Therefore, studying the market fluctuations such as large jumps or drops in the stock market can provide us with information about the patterns of activities of participating financial agents. In this paper, we propose a level set analysis on the S&P500 return in Section 2. We discovered that the magnitude of the return has patterns of correlations, and a volatility clustering property can be easily seen in this method. Furthermore, we find that the return process tends to have negative/oscillating serial correlation. Then, in Section 3, a nonparametric analysis is performed and new patterns on the data are discovered which are in accordance with our findings in Section 2. After that, an ARCH model is constructed which is capable of manifesting the volatility skew in option pricing, and a comparison of this model with the GARCH(1,1) model is carried out. Section 4 proposes an explanation on the validity of our model through the prospect theory [25] in behavioral finance, and the connection between the volatility skew phenomenon and prospect theory is novel to the best of our knowledge. The contributions and discoveries of this paper are summarized in Section 5.

2. Level Set Analysis

The term level set in this paper refers to the set of stock returns whose magnitudes exceed a certain level/threshold. Large jumps or drops in stock market are often associated with the actions of large financial agents, who usually have information advantages over individual investors. Therefore, we are very interested in finding out if there are patterns in large fluctuations in the stock market.
Figure 1 shows the log return of the daily S&P 500 Index and its histogram of the market data from Jan 2, 1990 to May 2, 2013. The normal distribution density curves are provided as well.
In fact, the i.i.d. normal assumption on returns can be easily challenged. For example, a simple normality test is provided in Figure 2 for the original return data.
In this section, we are especially interested in these questions such as “Does the magnitude of the log returns correlate with the return process?”, “What is the distribution of the log returns with a certain magnitude?”, and “Is there any pattern in the return process in consideration of the magnitude?”, etc. The reason we did this is that large jumps in the financial market are believed to contain special information.
For the same set of data as in Figure 1, we first remove the mean of the return and get the excessive return data. Notice that the all time mean of the original data is 0.000254, which is very close to zero. Table 1 provides some basic statistics on the excessive return with a certain minimum magnitude. For example, the variable “Return0.01” represents the return with absolute magnitude of at least 0.01, which indicates a daily index change being greater than or equal to 1% over the all time average. Thus, the entire observations are grouped into several level sets. The box plots for these groups of variables are shown in Figure 3.
It can be seen that, starting from the variable “Return 0.008” in the excessive return data, the group means begin to show large deviation from zero, and actually, most of the means are negative except the last group (“Return 0.1”) which has only two observations. In some sense, this mimics the so called “pump and dump” phenomenon. Although the “pump and dump” phenomenon is often associated with penny stocks, it seems that this phenomenon is also associated with the S&P 500 Index. To the best of the authors’ knowledge, looking into the daily S&P 500 return in this way is novel.
Then, how about the serial correlation in each group? Let us first look at the scatter plots of some groups in Figure 4, and the rest are omitted due to similarity.
An interesting question is that, if a return with a certain magnitude has been observed one day, what is the distribution of the number of days until the next occurrence with the same or larger magnitude? We call this time interval the waiting time. An observation with large magnitude indicates a shock on the market, and, therefore, this distribution is of much interest. Using the given data, we plot in Figure 5 the histograms of the waiting time for some of these groups with considerably large numbers of observations.
From Figure 5, it can be clearly seen that the probability density concentrates on the region of short waiting time, and this phenomenon is especially notable for returns with large magnitude. This phenomenon certainly agrees with the volatility clustering property in GARCH models or stochastic volatility models.
However, there is a feature that the traditional ARCH/GARCH models or stochastic volatility models might have missed, which is, besides the volatility clustering, would the observations tend to have positive or negative serial correlation? We run the Durbin–Waston (DW) test on some of the groups of data with considerably large numbers of observations, and the result is shown in Table 2. Since we are only interested in the serial correlation of the data in each group where the data tend to have a clustering manner, the time interval (or waiting time) between any two consecutive observations in a group is ignored for now.
It should not be a surprise that all of the data examined in Table 2 exhibit negative serial correlations (DW > 2). To further strengthen our findings, we perform a Run Test on the data to test the randomness. Once again, the time intervals are ignored temporarily so that we can focus on the serial correlation. The results are shown in Figure 6.
We notice that the oscillation in the group “Return 0.04” is especially significant (p-Value = 0.017). In the meantime, we understand that if we change the cutoff of the magnitude, a few observations are inserted into or discarded from the original set. Recall that the Run Test for randomness is very sensitive to the order of observations, and, as a consequence, the test results for randomness could be very different from group to group.
Then here comes a natural question: would the negative serial correlation be more manifest during economic crisis? We take the piece of data from Jan 2, 2008 to Dec 31, 2009 and perform the Durbin–Watson Test on the level sets, and the result is shown in Table 3.
If we compare Table 2 to Table 3, we see that the negative serial correlation in the level sets is more manifest during the 2008 crisis.
As a short summary of what we have found so far, the S&P 500 daily return, based on the given data, tends to have negative serial correlation as well as a volatility clustering property. Therefore, the assumption of independence of daily returns, which is the basis of much research work, is challenged. In the next section, we perform a nonparametric regression on the daily data and the results are consistent with our findings.

3. Nonparametric Regression

The level set analysis in Section 2 gives us a new insight into the S&P 500 daily return, yet we hope to build a time series model on this process that is capable of capturing the features we have found, and we accomplish this task through a nonparametric regression technique.

3.1. Local Polynomial Regression

Let { Y t } be a time series (of the return process), we want to fit an ARCH model so that
Y i = f ( Y i - 1 ) + g ( Y i - 1 ) ϵ i ,
where ϵ i are i.i.d. N ( 0 , 1 ) random variables. Local polynomial regression [26,27] is used to estimate the drift (f) and volatility (g) functions. An early work on the application of this technique on financial time series can be found in [28]. The procedure is as follows: we seek a function g ( x ) which satisfies
g 2 ( x ) = E ( Y i 2 | Y i - 1 = x ) - E 2 ( Y i | Y i - 1 = x ) .
Let n be the size of { Y t } , then for each x, consider the following two minimization problems:
[ α 1 ( x ) , α 2 ( x ) , α 3 ( x ) ] = a r g min α 1 , α 2 , α 3 i = 1 n Y i 2 - α 1 - α 2 Y i - 1 - x h - α 3 2 Y i - 1 - x h 2 2 K Y i - 1 - x h ,
[ β 1 ( x ) , β 2 ( x ) , β 3 ( x ) ] = a r g min β 1 , β 2 , β 3 i = 1 n Y i - β 1 - β 2 Y i - 1 - x h - β 3 2 Y i - 1 - x h 2 2 K Y i - 1 - x h ,
where K ( · ) denotes a nonnegative weight function and h is a positive number called the bandwidth. We choose K to be the standard normal p.d.f. Then, g 2 ( x ) is estimated at g ^ 2 ( x ) = α 1 ( x ) - β 1 2 ( x ) , and the estimation of f is given by f ^ ( x ) = β 1 ( x ) .
Let { P i } be the daily data of S&P500 index where the last date is May 2, 2013, and let { Y i } be the yield process, i.e., Y i = log P i - log P i - 1 . For each data size S, we choose the bandwidth h to be h = { max ( Y i ) - min ( Y i ) } / γ , where all Y i ’s belong to this set and γ is a chosen positive constant. In what follows, we shall pick different data sizes and different values of γ, and plot the graphs of f ^ and g ^ 2 .
From Figure 7, Figure 8, Figure 9, Figure 10, Figure 11 and Figure 12, we can observe the following phenomena.
Observations:
  • There is a pattern of f ^ . If Y i = y > 0 , f ( Y i ) is slightly negative. If Y i < 0 , f ( Y i ) is positive. This represents a mean reverting pattern, or negative serial correlation on { Y i } .
  • If Y i > 0 is large, i.e., there is a big gain at time t i , then on expectation, Y i + 1 tends to be slightly negative. However, if Y i is large negative, then on expectation, Y i + 1 tends to be largely positive. So, there is an obvious bend in the curve of f ^ .
  • All the graphs of g ^ 2 show U-shaped “smiling faces”, and the minimum is achieved at a point of Y i close to zero. In fact, a point that is slightly to the right of zero. Thus, large magnitude of Y i corresponds to large volatility. What is more, on each “smiling face”, the left side of the curve is higher than the right side of the curve. Thus, these are tilted “smiling faces”, or skew.
  • When γ is big, we observe the boundary effect on the boundaries of the interval, see, e.g., the graphs of g ^ 2 . This phenomenon was also observed in [26] where an explanation was provided.
  • These discoveries are pretty robust in view of various data sizes.
The relevance of price jumps to total variation derived from realized volatility has also been observed and investigated by other researchers. For example, Bollerslev et al. [29] developed a discrete-time daily stochastic volatility model which allows us to assess the structural, asymmetric inter-dependencies among the stocks’ returns and the volatility components. Christoffersen et al. [30] developed a discrete option valuation model with observable volatility and jump dynamics and applied it to S&P 500 index options. Our model in this paper is constructed differently through non-parametric regression and provides new discoveries on the S&P 500 return process. Traditional ARCH/GARCH models usually miss the feature of f ^ , and the forms of the volatility term are assumed a priori. Our discoveries can be used as a complement to the traditional ARCH/GARCH models and further improvement on the existing market models is expected.
The results from local regression methods agree with our conclusions in Section 2, i.e., the return process tends to be negatively serially correlated and has volatility clustering. Furthermore, these results enable us to build a time series model on the S&P 500 Index. The steps are shown in the next section.

3.2. An ARCH Model

Since Y i = log P i - log P i - 1 , we can rewrite model (1) as
log P i + 1 - log P i = f ( log P i - log P i - 1 ) + g ( log P i - log P i - 1 ) ϵ i , i = 1 , 2 , . . .
which provides us an ARCH model for the process log P i :
log P i + 1 = log P i + f ( log P i - log P i - 1 ) + g ( log P i - log P i - 1 ) ϵ i , i = 1 , 2 , . . .
Then, it is not hard to find polynomial approximations to f and g, respectively, and we get the following ARCH model
log P i + 1 = log P i - 8.948 × 10 - 5 - 7.557 × 10 - 2 Y i + 0.8305 Y i 2 - 13.60 Y i 3 + 52.84 Y i 4 + 1.288 × 10 - 2 - 0.1138 Y i + 5.503 Y i 2 + 6.492 Y i 3 - 3.306 × 10 2 Y i 4 ϵ i ,
where Y i = log P i - log P i - 1 . That is,
f ˜ ( Y i ) = - 8.948 × 10 - 5 - 7.557 × 10 - 2 Y i + 0.8305 Y i 2 - 13.60 Y i 3 + 52.84 Y i 4 , g ˜ ( Y i ) = 1.288 × 10 - 2 - 0.1138 Y i + 5.503 Y i 2 + 6.492 Y i 3 - 3.306 × 10 2 Y i 4 ,
where f ˜ , g ˜ represent the polynomial approximations of f , g , respectively. Figure 13 shows the graphs of f ˜ , g ˜ and g ˜ 2 .
It can be seen that this model is a local volatility model, but, different from most well known local volatility models which aim to replicate the implied volatility surface only, our model replicates both the drift term and volatility term through real data calibration using the technique of local polynomial regression. Our model is also different from most of the SV models, such as [7], in that the volatility term is a special function of the return. We believe that general linear regression is incapable, if not impossible, of achieving such a model. Figure 14 shows some sample paths of this ARCH model.

3.3. Model Comparison

In order to check the performance of model (5), we compare it to an GARCH(1,1) model. The GARCH(1,1) model is estimated, using our market data as in Table 1, at
y t = σ t ϵ t ,
σ t 2 = 1.013 × 10 - 6 + 0.07377 y t - 1 2 + 0.9185 σ t - 1 2 ,
where y t is the excessive daily return and ϵ t ’s are i.i.d. standard normal random variables (lower case letters are used to indicate that they come from a different model). This model can be obtained through standard techniques. We used the fGarch package in R (version 3.2.3).
A random path with 5880 daily returns is generated using model (5) with initial values P 0 = 359.69 and P 1 = 358.76 as of Jan 2, 1990 and Jan 3, 1990, respectively, and the DW test on the level sets is shown in Table 4.
We can see that model (5) shows a pattern of negative serial correlation in the return process. Due to Equation (6), the GARCH model assumes that the return process is white noise, and now we believe that there is large space to improve.
Then, using the GARCH(1,1) model (6) and (7), we generate a random path of the same length and compare the distributions of daily returns with model (5) and the market data. The normal fits and the density functions are also shown in Figure 15.
It can be seen from Figure 15 that both the GARCH(1,1) model (6) and (7) and the ARCH model (5) have “fatter” distribution than that of the market data, but the histogram of GARCH(1,1) model fits the histogram of S&P 500 return a little better. We believe that this is due to the natural advantage of GARCH models over ARCH models. A promising future research topic is to incorporate the GARCH model into the nonparametric regression in order to capture more features of the market data.
Finally, recall that Figure 14 shows only five random paths of the ARCH model (5), and we are now interested in the terminal distribution of the stock index using this model. Thus, we generated 10,000 random paths with length 300 using model (5) and examine the terminal distribution. The initial values are P 0 = 1462.42 , P 1 = 1459.37 . For the purpose of comparison, we did the same for the GARCH(1,1) model. The results and the log-normal fits are provided in Table 5 and Figure 16.
Table 5 and Figure 16 show that the GARCH(1,1) model has a slightly larger mean, smaller deviation but very large maximum. Recall that the GARCH model (6) and (7) captures the volatility clustering very well but ignores the negative serial correlation (or mean reverting) in the return, so it could generate very extreme values. Future research may focus on incorporating the mean reverting property and volatility clustering property of GARCH models into the nonparametric regression.

3.4. Option Pricing and Implied Volatility

To see if the model (5) could replicate the volatility skew in option pricing, we use Monte Carlo simulations. Other numerical methods solving option pricing problem with SV models or stochastic interest rate via partial differential equations can be seen, e.g., in [31,32].
In order to get the price of the European call options, we rewrite this model under the risk neutral measure and get
log P i + 1 - log P i = ( r - 1 2 σ 2 ( Y i ) ) Δ t + σ ( Y i ) Δ t ϵ ˜ i ,
where r is the risk free interest rate, Y i = log P i - log P i - 1 , ϵ ˜ i is the standard normal random variable under the risk neutral measure, and
σ ( Y i ) = g ˜ ( Y i ) / Δ t .
In order to find the proper number of paths, we use Monte Carlo simulation to price a European call option with the following parameter settings: annual interest rate r = 0.03 , time to maturity T = 60 months, strike price K = 800 and underlying price P 0 = 1462.42 , P 1 = 1459.37 . We chose this option because it has the largest variance in our model. The number of paths ranges from 10,000 to 1,000,000, and for each setting, 20 trials are run and the standard deviation of the option prices are calculated. The result is shown in Figure 17.
We can now set the number of paths to be 500,000. Figure 18 shows a Monte Carlo simulation on call option prices with 500,000 sample paths.
We pick the region 1100 K 2000 and 12 T 60 (months) to recover the implied volatility surface from the Black–Scholes formula. The reason is that, on this region the computation is considered to be stable because V e g a is not too close to zero. Figure 19 shows the implied volatility surface through the Black–Scholes formula. Thus, the volatility skew can be clearly seen.

4. Price Fluctuation and Market Participants

In this section, we shall give a tentative explanation of our findings in Section 3. As is widely accepted, asset price is determined by the temporary balance between demand and supply. Excessive demand would push the price high and excessive supply would press it down, with the market clearing condition being held. This effect is determined by the traders’ actions. In this approach, understanding the behavior of the interacting traders is the key to model the price process. In the agent-based model presented in [18,19] as introduced in Section 1, the reference of a fundamentalist is given by
log P t a = log P t - 1 + c F ( log F - log P t - 1 ) , c F > 0 ,
where P t a is an agent’s reference level. The idea is that the agent believes that the asset price will finally go to its fundamental value F. While by the trend chasing component, the agent believes that the asset price forms a trend and hence the reference is given by
log P t a = log P t - 1 + c C ( log P t - log P t - 1 ) , c C > 0 .
The final trading decision is a random combination of these two components, and the price is thus determined by the market clearing condition. The model in [24] assumed that the cumulative demand of the asset up to time t is given by
D ( t , log P t , U t ) = U t + γ ( log P t - L 0 ) + ξ ( t , log P t ) , γ 0 ,
where U t = v W t + m t is a Brownian Motion with drift m and volatility v. The part γ ( log P t - L 0 ) corresponds to the demand of speculators where L 0 is a constant, and ξ ( t , log P t ) is the cumulative demand of hedgers. The dynamics of log P t was then obtained by differentiating, using Ito’s formula, the equation D ( t , log P t , U t ) = c o n s t . , which is due to the market clearing condition.
Although there is no agreement yet what kind of market participants and what kind of behavior there are, it should be noticed that, besides the noisy demand in the market, there are usually more than one types of agents (traders or components) in the market. Observing the phenomenon that market participants often have expectations on their investment, either arbitrage or hedging, and they behave differently when they feel a gain vs. a loss of their investments, we present in this paper the prospect agents. Through this model, we are able to derive a stochastic price process that explains the volatility skew phenomena that is observed from real market data.

4.1. Prospect Agents

In this section, we shall derive the asset price dynamics through feedback effect via market clearing conditions. As in [19], we assume the noisy demand and trend chasing demand as part of the market component, yet our method is different from [19,24] in that we model the behavior of the other traders through a different point of view.
It has been widely observed that whenever there is a big jump or drop of a stock price, there is a large volume of transactions associated with it, see, e.g., [23], together with large volatilities, see, e.g., [33] where this phenomenon is “possibly explained by behavior factors influencing the stock markets on or near dates of extreme events”. It is commonly accepted that the decisions of the investors to buy or to sell the asset depend on the gains or losses of this asset, see, e.g., [34]. By means of gain and loss, we mean the difference between the return Y t = log P t - log P t - 1 of the asset and a reference rate of return, for example, a rate of return close to zero or comparable to the daily interest rate. That is, if Y t is bigger than a threshold, investors feel gains of the investment, and if Y t is less than this threshold, investors feel losses of their wealth. These different feelings result in different trading behaviors. If Y t is above the reference rate, the excessive demand of this asset will increase. However, the investors who long the asset feel a gain and some of them try to realize the profit by selling some shares. In other words, they show a risk-averse behavior. For investors who short the asset, some of them feel that they have already missed the chance to make profit, and therefore some of them do not want to buy the asset. As a result, the excessive demand of this asset does not increase linearly with Y t but shows a concave pattern.
When Y t is lower than the reference return, the excessive demand of this asset will decrease. However, the investors will show different behaviors. For holders, since they feel a loss, some of them tend to hold rather than bail out. This can be explained by their unwillingness to realize a loss, and so they hold in the hope that they will have a gain later. That is, they show a risk-seeking behavior. For those who short the asset, some of them tend to buy the asset due to the reduced price with the hope that the asset will gain soon. Consequently, the excessive demand does not decrease linearly with Y t but shows a convex pattern. A similar phenomenon has been observed in many articles, see, e.g., [35]. This phenomenon can be perfectly explained by the prospect theory [25].
Since Kahneman and Tversky [25] introduced the prospect theory to economics, it has been playing an important role in explaining many phenomena in economics and finance. The key idea of the prospect theory is that, in a risky situation, when people feel gains, they show risk-averse behavior and when people feel losses, they show risk-seeking behavior, and this theory can be readily applied to investment. In short, when the investors feel a gain, they show risk-averse behavior and when they feel a loss, they show risk-seeking behavior.
To illustrate this idea, consider the utility as a function of the yield, see Figure 20.
In the second sub figure in Figure 20, Scenario 1 corresponds to a positive yield of the investment. In Scenario 2, there is a possible further jump and a possible further drop of the return in the investment. By the prospect theory, this type of investors would show a risk averse behavior in this case, so the utility of Scenario 1 is bigger than the utility of Scenario 2. Therefore, once Scenario 1 happens, these investors tend to secure the gain by selling the risky asset, and as a result, the supply of this asset on the market increases.
In the third sub figure in Figure 20, Scenario 1 corresponds to a negative yield while Scenario 2 corresponds to a possible further jump and a possible further drop. Due to the prospect theory, these investors show a risk seeking behavior. Therefore, Scenario 2 has a bigger utility, and these type of investors tend to hold but not to sell the asset. As a result, the supply of this asset on the market decreases.
Many research works have discovered this phenomenon, for example, Fiegenbaum and Thomas [36] pointed out that “For returns below target, a large majority of individuals appear to be risk seeking; for returns above target, a large majority appear to be risk averse.” ; and Odean [37] showed that individual investors are reluctant to sell stocks with short-term prior losses, preferring to sell prior short-term winner stocks. This is also known as the disposition effect.
We shall extend the prospect theory to the extreme case to model the behavior of these traders, that is, when Y t is much bigger (or much lower) than the threshold. If Y t is much bigger than the threshold, for example, the stock price jumps by 8 % in one day, then the investors show extreme risk averse behavior: more holders tend to sell the shares to realize the profit, and less buyers are willing to buy the shares since they feel having missed the best chance. As a result, the net demand of this asset decreases. A similar argument can be given to the case when the yield is way below the threshold. Figure 21 illustrates a typical graph of the excessive demand as a function of return.
We use d 1 ( y ) to represent this instant demand as a function of the current return y, and the cumulative demand from this type of investors is thus given by 0 t d 1 ( Y s ) d s . Later, if we need a discrete model, we use index i to denote the sample at time t i , and then d 1 ( Y i ) will be the corresponding excessive demand function at time t i .
We believe that the types of investors that we have proposed so far, namely the noisy traders, trend chasers, and prospect investors, are not uncommon. Yao and Li [38] studied the market interaction between prospect theory-investors and regular CRRA investors and investigated their trading patterns. In this work, since we hope to replicate both the drift and volatility terms in the model (1), we need another type of investor. The actual market participants are certainly much more complicated, and this topic could go on for ever. In this paper, we want to contribute a model that could possibly explain what we have discovered from the real market data, and we hope this model could shed some light on future research in financial studies.
These last types of investors are assumed to behave in the following way. In one aspect, they are trend chasers on the return process (not the price process) since their instant demand of the asset is proportional to Δ Y i = Y i + 1 - Y i . On the other hand, they believe that the return process Y t should be mean reverting, thus their instant demand is assumed to have the form d 2 ( Y i ) ( Y i + 1 - Y i ) , where d 2 ( y ) achieves the maximum when y is close to zero and d 2 ( y ) decreases whenever y is away from zero. An illustrative graph on d 2 is shown in Figure 22.
With these proposed market participants, we shall construct an ARCH model for the return process Y i in the next section.

4.2. An ARCH Model from Prospect Theory

Let ν W t be the cumulative noisy demand and ξ log P t be the cumulative trend chasing demand as in [19], where W t is a Wiener process, and ν < 0 , ξ > 0 are constants. The market clearing condition states that
ν W t + ξ log P t + 0 t d 1 ( Y t ) d t + 0 t d 2 ( Y t ) d Y t = M ,
where M is a constant. Define Δ W i = W t i + 1 - W t i , Δ t = t i + 1 - t i , i = 0 , 1 , . . . , n , and Δ W i is assumed to follow the normal N ( 0 , Δ t ) distribution. Since we are interested in the daily yield process, we choose Δ t = 1 / 252 . The associated difference equation can be easily written as follows:
ν Δ W i + ξ Y i + 1 + d 1 ( Y i ) Δ t + d 2 ( Y i ) ( Y i + 1 - Y i ) = 0 , i = 0 , 1 , . . . , n , t = n · Δ t .
Now, it is a simple step to derive a recursive equation for Y i by rewriting (9) as (that is, solve this equation for Y i + 1 )
Y i + 1 = d 2 ( Y i ) Y i - d 1 ( Y i ) Δ t ξ + d 2 ( Y i ) + - ν ξ + d 2 ( Y i ) Δ W i = d 2 ( Y i ) Y i - d 1 ( Y i ) Δ t ξ + d 2 ( Y i ) + - ν 252 ( ξ + d 2 ( Y i ) ) 252 Δ W i = f ( Y i ) + g ( Y i ) ϵ i ,
where
f ( Y i ) = d 2 ( Y i ) Y i - d 1 ( Y i ) Δ t ξ + d 2 ( Y i ) , g ( Y i ) = - ν 252 ( ξ + d 2 ( Y i ) ) ,
and ϵ i = 252 Δ W i , i = 1 , 2 , . . . are i.i.d N ( 0 , 1 ) random variables.
With properly chosen functions of d 1 , d 2 that have the properties as shown in Figure 21 and Figure 22, respectively, we are capable of replicating the drift and volatility terms. For example, if we choose
d 1 ( y ) = - 352.1 - 504 y , y < - 0.0286 2.63 × 10 4 y + 5.12 × 10 5 y 2 , - 0.0286 y < 0 3.5 × 10 4 y - 4.5 × 10 5 y 2 + 1.44 × 10 6 y 3 , 0 y < 0.0648 435.46 + 1.85 × 10 4 y - 2.66 × 10 5 y 2 + 9.38 × 10 5 y 3 , 0.0648 y
and
d 2 ( y ) = 110 e - 150 | x - 0.008 | 1.5 , x 0.008 , 110 e - 40 | x - 0.008 | 1.3 , x > 0.008 ,
and the parameter settings ξ = 40 , ν = - 27 , then the graphs of f , g and g 2 are shown in Figure 23, compared to Figure 7, Figure 8, Figure 9, Figure 10, Figure 11 and Figure 12 as well as Figure 13.
There are many stochastic models that tried to explain the volatility skew/smile, although they lack the explanation of why the asset prices fluctuate in that way. For example, if the price process is modeled as regime switching process [15,16] that are due to the changing of market or economic states, then it is reasonable to construct the same model for price process before the year 1987 since market states kept changing all the time. However, the volatility skew/smile phenomenon is not seen from the market data before 1987. Because volatility smile/skew is only seen from the market data after 1987, it is reasonable to argue that the crash in 1987 fundamentally changed the behaviors of investors, and as a consequence, the evolution of the asset price has been changed. As we have mentioned, the actual market participants are much more complicated, and we hope that our model can bring new insights to future research in financial studies.

5. Conclusions

The contributions of this paper are as follows:
  • We proposed a level set analysis and performed a nonparametric analysis on the S&P 500 return, and the results showed that this return process has negative serial correlation and volatility clustering property.
  • We found new patterns on the S&P 500 return through local polynomial regression and constructed an ARCH model that replicates both the drift and volatility terms.
  • We brought in the prospect theory to explain the mechanism of our model and linked it to the volatility skew phenomenon observed in the actual stock market.
We believe that the application of our systematic methods to other financial instruments will bring in fruitful discoveries, and building stochastic models through behavioral finance via the microeconomic approach is very promising.

Acknowledgments

Author Yipeng Yang is supported by Faculty Research and Support Funds #1123 from University of Houston-Clear Lake.

Author Contributions

Author Yipeng Yang proposed the linkage between volatility skew(smile) and behavioral finance, and performed the data analysis, model calibration and the writing of this paper; author Allanus Tsoi gave important instructions such as to verify our theory through real market data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. R. Engle. “Autoregressive Conditional Heteroskedasticity with Estimates of the Variance of U.K. Inflation.” Econometrica 50 (1982): 987–1008. [Google Scholar] [CrossRef]
  2. T. Bollerslev. “Generalized Autoregressive Conditional Heteroscedasticity.” J. Econom. 31 (1986): 307–327. [Google Scholar] [CrossRef]
  3. E. Jacquier, N.G. Polson, and P.E. Rossi. “Bayesian Analysis of Stochastic Volatility Models.” J. Bus. Econ. Stat. 12 (1994): 371–417. [Google Scholar]
  4. S.J. Taylor. “Financial Returns Modelled by the Product of Two Stochastic Processes-A Study of Daily Sugar Prices, 1961–1979.” In Time Series Analysis: Theory and Practice. Edited by O.D. Anderson. New York, NY, USA: Elsevier/North-Holland, 1982, Volume 1, pp. 203–226. [Google Scholar]
  5. J.P. Fouque, G. Papanicolaou, and K.R. Sircar. Derivatives in Financial Markets with Stochastic Volatility. Cambridge, UK: Cambridge University Press, 2000. [Google Scholar]
  6. N. Shephard. “Statistical Aspects of ARCH and Stochastic Volatility.” In Time Series Models in Econometrics, Finance and Other Fields. Edited by D.R. Cox, D.V. Hinkley and O.E. Barndorff-Nielson. London, UK: Chapman and Hall, 1996, pp. 1–100. [Google Scholar]
  7. O.E. Barndorff-Nielsen, and N. Shephard. “Non-Gaussian Ornstein-Uhlenbeck-based models and some of their uses in financial economics.” J. R. Stat. Soc. Ser. B Stat. Methodol. 63 (2001): 167–241. [Google Scholar] [CrossRef]
  8. O.E. Barndorff-Nielsen, and N. Shephard. “Modelling by Lévy Processes for Financial Econometrics.” In Lévy Processes: Theory and Applications. Edited by O.E. Barndorff-Nielsen, T. Mikosch and S. Resnick. New York, NY, USA: Birkhäuser, 2001, pp. 283–318. [Google Scholar]
  9. J. Rosiński. “Tempering stable processes.” Stoch. Process Appl. 117 (2007): 677–707. [Google Scholar] [CrossRef]
  10. R.S. Tsay. Analysis of Financial Time Series, 3rd ed. Hoboken, NJ, USA: John Wiley and Sons, 2010. [Google Scholar]
  11. E. Bayraktar, H.V. Poor, and K.R. Sircar. “Estimating the Fractal Dimension of the S&P500 Index Using Wavelet Analysis.” Int. J. Theor. Appl. Financ. 7 (2004): 615–643. [Google Scholar]
  12. J. McCarthy, and A.G. Orlov. “Time-frequency Analysis of Crude Oil and S&P500 Futures Contracts.” Quant. Financ. 12 (2012): 1893–1908. [Google Scholar]
  13. R. Becker, J. Lee, and B.E. Gup. “An Empirical Analysis of Mean Reversion of the S&P500’s P/E Ratio.” J. Econ. Financ. 36 (2012): 675–690. [Google Scholar]
  14. Y.S. Kim, S.T. Rachev, M.L. Bianchi, I. Mitov, and F.J. Fabozzi. “Time series analysis for financial market meltdowns.” J. Bank. Financ. 35 (2011): 1879–1891. [Google Scholar] [CrossRef]
  15. N.P.B. Bollen. “Valuing Options in Regime-Switching Models.” J. Deriv. 6 (1998): 38–49. [Google Scholar] [CrossRef]
  16. A. Papanicolaou, and R. Sircar. “A Regime-Switching Heston Model for VIX and S&P 500 Implied Volatilities.” Quant. Financ. 14 (2013): 1811–1827. [Google Scholar]
  17. H. Chen, G. Noronha, and V. Singal. “The Price Response to S&P 500 Index Additions and Deletions: Evidence of Asymmetry and a New Explanation.” J. Financ. 59 (2004): 1901–1930. [Google Scholar]
  18. H. Föllmer, and M. Schweizer. “A Microeconomic Approach to Diffusion Models for Stock Prices.” Math. Financ. 3 (1993): 1–23. [Google Scholar] [CrossRef]
  19. U. Horst. “Financial Price Fluctuations in A Stock Market Model with Many Interacting Agents.” Econ. Theory 25 (2005): 917–932. [Google Scholar] [CrossRef]
  20. R. Frey, and A. Stremme. “Market Volatility and Feedback Effects from Dynamic Hedging.” Math. Financ. 7 (1997): 351–374. [Google Scholar] [CrossRef]
  21. P. Heemeijer, C. Homoes, J. Sonnemans, and J. Tuinstra. “Price Stability and Volatility in Markets with Positive and Negative Expectations Feedback: An Experimental Investigation.” J. Econ. Dyn. Control 33 (2009): 1052–1072. [Google Scholar] [CrossRef]
  22. A. Danilova. “Emergence of Stochastic Volatility from Informational Heterogeneity.” Doctoral Dissertation, Princeton University, Princeton, NJ, USA, 2005. [Google Scholar]
  23. G. Lyengar, and A.K.C. Ma. “A Behavioral Finance-based Tick-by-tick Model for Price and Volume.” J. Comput. Financ. 14 (2010): 57–80. [Google Scholar]
  24. E. Platen, and M. Schweizer. “On Feedback Effects from Hedging Derivatives.” Math. Financ. 8 (1998): 67–84. [Google Scholar] [CrossRef]
  25. D. Kahneman, and A. Tversky. “Prospect Theory: An Analysis of Decision under Risk.” Econometrica 47 (1979): 263–292. [Google Scholar] [CrossRef]
  26. W. Härdle, and A.B. Tsybakov. “Local Polynomial Estimators of the Volatility Function in Nonparametric Autoregression.” J. Econom. 81 (1997): 223–242. [Google Scholar] [CrossRef]
  27. E. Masry, and J. Fan. “Local Polynomial Estimation of Regression functions for mixing processes.” Scand. J. Stat. 24 (1997): 165–179. [Google Scholar] [CrossRef]
  28. W.S. Cleveland. “Robust Locally Weighted Regression and Smoothing Scatterplots.” J. Am. Stat. Assoc. 74 (1979): 829–836. [Google Scholar] [CrossRef]
  29. T. Bollerslev, U. Kretschmer, C. Pigorsch, and G. Tauchen. “A Discrete-time Model for Daily S&P 500 Returns and Realized Variations: Jumps and Leverage Effects.” J. Econom. 150 (2009): 151–166. [Google Scholar]
  30. P. Christoffersen, B. Feunou, and Y. Jeon. “Option Valuation with Observable Volatility and Jump Dynamics.” J. Bank. Financ. 61 (2015): S101–S120. [Google Scholar] [CrossRef]
  31. M.C. Mariani, I. SenGupta, and G. Sewell. “Numerical methods applied to option pricing models with transaction costs and stochastic volatility.” Quant. Financ. 15 (2015): 1417–1424. [Google Scholar] [CrossRef]
  32. I. SenGupta. “Option pricing with transaction costs and stochastic interest rate.” Appl. Math. Financ. 21 (2014): 399–416. [Google Scholar] [CrossRef]
  33. F.B. Slimane, M. Mehanaoui, and I.A. Kazi. “How Does the Financial Crisis Affect Volatility Behavior and Transmission Among European Stock Markets? ” Int. J. Financ. Stud. 1 (2013): 81–101. [Google Scholar] [CrossRef] [Green Version]
  34. B.M. Barber, and T. Odean. “The Behavior of Individual Investors.” In Handbook of the Economics of Finance. Edited by G.M. Constantinides, M. Harris and R.M. Stulz. North-Holland, The Netherlands: Elsevier, 2013, Volume 2, pp. 1533–1570. [Google Scholar]
  35. H. Shefrin, and M. Statman. “The Disposition to Sell Winners Too Early and Ride Losers Too Long: Theory and Evidence.” J. Financ. 40 (1985): 777–790. [Google Scholar] [CrossRef]
  36. A. Fiegenbaum, and H. Thomas. “Attitudes Toward Risk and the Risk-Return Paradox: Prospect Theory Explanations.” Acad. Manag. J. 31 (1988): 85–106. [Google Scholar] [CrossRef]
  37. T. Odean. “Are Investors Reluctant to Realize Their Losses? ” J. Financ. 53 (1998): 1775–1798. [Google Scholar] [CrossRef]
  38. J. Yao, and D. Li. “Prospect Theory and Trading patterns.” J. Bank. Financ. 37 (2013): 2793–2805. [Google Scholar] [CrossRef]
Figure 1. Market Data. (a) daily Returns; (b) histogram of Return Data.
Figure 1. Market Data. (a) daily Returns; (b) histogram of Return Data.
Ijfs 04 00003 g001
Figure 2. Normal probability plot.
Figure 2. Normal probability plot.
Ijfs 04 00003 g002
Figure 3. Box plots.
Figure 3. Box plots.
Ijfs 04 00003 g003
Figure 4. Scatter plots.
Figure 4. Scatter plots.
Ijfs 04 00003 g004
Figure 5. Histogram of waiting time.
Figure 5. Histogram of waiting time.
Ijfs 04 00003 g005aIjfs 04 00003 g005b
Figure 6. Run charts.
Figure 6. Run charts.
Ijfs 04 00003 g006
Figure 7. f ^ , S = 1000 .
Figure 7. f ^ , S = 1000 .
Ijfs 04 00003 g007
Figure 8. g ^ 2 , S = 1000 .
Figure 8. g ^ 2 , S = 1000 .
Ijfs 04 00003 g008
Figure 9. f ^ , S = 2000 .
Figure 9. f ^ , S = 2000 .
Ijfs 04 00003 g009
Figure 10. g ^ 2 , S = 2000 .
Figure 10. g ^ 2 , S = 2000 .
Ijfs 04 00003 g010
Figure 11. f ^ , S = 3000 .
Figure 11. f ^ , S = 3000 .
Ijfs 04 00003 g011
Figure 12. g ^ 2 , S = 3000 .
Figure 12. g ^ 2 , S = 3000 .
Ijfs 04 00003 g012
Figure 13. Polynomial Approximations of f , g , g 2 .
Figure 13. Polynomial Approximations of f , g , g 2 .
Ijfs 04 00003 g013aIjfs 04 00003 g013b
Figure 14. Sample Paths.
Figure 14. Sample Paths.
Ijfs 04 00003 g014
Figure 15. Density functions.
Figure 15. Density functions.
Ijfs 04 00003 g015
Figure 16. Terminal distribution.
Figure 16. Terminal distribution.
Ijfs 04 00003 g016
Figure 17. Standard deviation.
Figure 17. Standard deviation.
Ijfs 04 00003 g017
Figure 18. Call options prices.
Figure 18. Call options prices.
Ijfs 04 00003 g018
Figure 19. Implied volatility surface–S&P 500.
Figure 19. Implied volatility surface–S&P 500.
Ijfs 04 00003 g019
Figure 20. Prospect agents.
Figure 20. Prospect agents.
Ijfs 04 00003 g020
Figure 21. Excessive demand d 1 ( y ) as a function of return.
Figure 21. Excessive demand d 1 ( y ) as a function of return.
Ijfs 04 00003 g021
Figure 22. Excessive demand d 2 ( y ) as a function of return.
Figure 22. Excessive demand d 2 ( y ) as a function of return.
Ijfs 04 00003 g022
Figure 23. Drift and volatility functions.
Figure 23. Drift and volatility functions.
Ijfs 04 00003 g023
Table 1. Basic statistics on excessive returns.
Table 1. Basic statistics on excessive returns.
VariableObsMeanSE MeanStDevMedianSkewness
Ret-ex5880–0.0000000.0001520.0116690.000279–0.23
Return0.0024595–0.0000050.0001950.0131860.002211–0.20
Return0.0043516–0.0000420.0002530.0149830.004263–0.17
Return0.0062683–0.0002260.0003270.0169240.006083–0.13
Return0.0082055–0.0004280.0004180.0189440.008007–0.09
Return0.011570–0.0007860.0005320.021086–0.010137–0.04
Return0.02434–0.002500.001580.03291–0.020810.09
Return0.03152–0.003170.003660.04506–0.030310.09
Return0.0459–0.006450.007570.05815–0.042680.19
Return0.0528–0.01830.01290.0684–0.05400.63
Return0.0618–0.01650.01830.0775–0.06330.53
Return0.078–0.03640.03120.0883–0.07541.39
Return0.085–0.01390.04890.1093–0.09230.61
Return0.095–0.01390.04890.1093–0.09230.61
Return0.120.105760.003560.005030.10576N/A
Table 2. Durbin–Watson test.
Table 2. Durbin–Watson test.
VariableReturnReturn0.002Return0.004Return0.006Return0.008
DW2.116922.109422.081672.112012.11218
VariableReturn0.01Return0.02Return0.03Return0.04Return0.05
DW2.131842.201062.059842.754932.23021
Table 3. Durbin–Watson Test on 2008 Crisis.
Table 3. Durbin–Watson Test on 2008 Crisis.
VariableReturnReturn0.002Return0.004Return0.006Return0.008
DW2.271972.250192.251312.329102.34118
Obs505446385335290
VariableReturn0.01Return0.02Return0.03Return0.04Return0.05
DW2.360182.286812.021102.514612.60353
Obs249126643821
Table 4. Durbin–Watson Test on model (5).
Table 4. Durbin–Watson Test on model (5).
VariableReturnReturn0.002Return0.004Return0.006Return0.008
DW2.222492.237132.230292.217942.22877
Obs58805192449738073166
VariableReturn0.01Return0.02Return0.03Return0.04Return0.05
DW2.187042.199672.260812.104872.06198
Obs26388212135316
Table 5. Terminal distribution.
Table 5. Terminal distribution.
ModelNMeanSE-MeanStDevMinimumQ1MedianQ3MaximumSkewness
GARCH10,0001593.92.54253.8703.91427.71573.21736.03670.00.78
ARCH10,0001533.73.45345.1606.71284.91497.61739.53170.00.65

Share and Cite

MDPI and ACS Style

Yang, Y.; Tsoi, A. A Level Set Analysis and A Nonparametric Regression on S&P 500 Daily Return. Int. J. Financial Stud. 2016, 4, 3. https://doi.org/10.3390/ijfs4010003

AMA Style

Yang Y, Tsoi A. A Level Set Analysis and A Nonparametric Regression on S&P 500 Daily Return. International Journal of Financial Studies. 2016; 4(1):3. https://doi.org/10.3390/ijfs4010003

Chicago/Turabian Style

Yang, Yipeng, and Allanus Tsoi. 2016. "A Level Set Analysis and A Nonparametric Regression on S&P 500 Daily Return" International Journal of Financial Studies 4, no. 1: 3. https://doi.org/10.3390/ijfs4010003

APA Style

Yang, Y., & Tsoi, A. (2016). A Level Set Analysis and A Nonparametric Regression on S&P 500 Daily Return. International Journal of Financial Studies, 4(1), 3. https://doi.org/10.3390/ijfs4010003

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop