Next Article in Journal
The Evolution and Growth Patterns of the Road Network in a Medium-Sized Developing City: A Historical Investigation of Changchun, China, from 1912 to 2017
Next Article in Special Issue
Assessing Impacts of CAP Subsidies on Financial Performance of Enterprises in Slovak Republic
Previous Article in Journal
Consumer Perceptions Related to Clothing Repair and Community Mending Events: A Circular Economy Perspective
Previous Article in Special Issue
Does FDI Promote or Inhibit the High-Quality Development of Agriculture in China? An Agricultural GTFP Perspective
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Analysis of Agricultural Commodities Prices with New Bayesian Model Combination Schemes

by
Krzysztof Drachal
Faculty of Economic Sciences, University of Warsaw, 00-241 Warsaw, Poland
Sustainability 2019, 11(19), 5305; https://doi.org/10.3390/su11195305
Submission received: 31 August 2019 / Revised: 19 September 2019 / Accepted: 24 September 2019 / Published: 26 September 2019
(This article belongs to the Special Issue Trade Policy and Macroeconomic Impacts on Agricultural Sustainability)

Abstract

:
In the described research three agricultural commodities (i.e., wheat, corn and soybean) spot prices were analyzed. In particular, one-month ahead forecasts were built with techniques like dynamic model averaging (DMA), the median probability model and Bayesian model averaging. The common features of these methods are time-varying parameters approach toward estimation of regression coefficients and dealing with model uncertainty. In other words, starting with multiple potentially important explanatory variables, various component linear regression models can be constructed. Then, from these models an averaged forecast can be constructed. Moreover, the mentioned techniques can be easily modified from model averaging into a model selection approach. Considering as benchmark models, time-varying parameters regression with all considered potential price drivers, historical average, ARIMA (Auto-Regressive Integrated Moving Average) and the naïve forecast models, the Diebold–Mariano test suggested that DMA is an interesting alternative model, if forecast accuracy is the aim. Secondly, the interpretation of time-varying weights ascribed to component models containing a given variable suggested that economic development of emerging BRIC economies (Brazil, Russia, India and China) is recently one of the most important drivers of agricultural commodities prices. The analysis was made on the monthly data between 1976 and 2016. The initial price drivers were various fundamental, macroeconomic and financial factors.

1. Introduction

Forecasting of agricultural commodities is an important theoretical and practical topic. In 2008 a sudden spike in commodities (in general) prices was observed. Shortly afterwards, these prices had fallen. Nevertheless, since approximately 2000, it can be observed that there exists a rising trend in agricultural prices. This can be clearly seen from analyzing, for example, IMF (International Monetary Fund) commodity food price index, including cereal, vegetable oil, meat, seafood, bananas, sugar and orange prices [1].
These prices are important, because in some developing countries rising food prices or its shortages can lead to riots and increase political and economic uncertainty [2]. Moreover, these prices have a significant impact on farmers. As a result, they are an important component in policymaking and sustainable development [3].
There is also a discussion on what factors influence agricultural prices. It is quite natural to consider in this context natural conditions like the weather. Indeed, these prices are largely influenced by unpredictable droughts, floods and diseases. In this sense forecasting agricultural prices is a hard topic, because these factors are hard to include in the model [4].
Secondly, more conventional economic factors also influence agricultural prices. Of course, this starts from supply and demand forces, as for every tradable good. On the other hand, there exist arguments that interest rates and exchange rates also influence them. Moreover, in recent years there has been a discussion on “financialization” of commodities markets. In other words, the links between spot commodities prices and financial markets are tightening [5,6]. On the other hand, Amann et al. analyzed the causality in Granger sense between volume traded and open positions on derivatives markets and selected agricultural commodities prices (corn, wheat, rice and soybean). They did not find any evidence for such a relationship [7].
There are also arguments that rising crude oil prices impact the transportation costs, and therefore, impact also particular commodities [8].
As a result, building an econometric model for agricultural commodities prices is quite hard, because all the potentially important price drivers are volatile by their own. For example, modelling fluctuations of oil price itself consists of extensive literature. Economic growth in China and other Asian countries highly impacted the agricultural goods demand, etc. [9].
Most of the already applied methods of forecasting focus on a univariate time-series model, like ARIMA, or GARCH-type models (Generalized Auto-Regressive Conditional Heteroskedasticity) in case of volatility modelling [3,10]. More sophisticated approaches are based on neural networks methods, etc. [11]. Another approach is to use structural models [12]. However, in the case of models with multiple explanatory variables, classical regression analysis is mostly used [13].
These conventional econometric methods have two significant drawbacks. First, they require the narrowing of the set of potentially important explanatory variables to quite a small number. Secondly, they assume that during the whole analyzed period, there exists one “true” model. Of course, out of few competing models the final one can be selected. For example, based on fitting the data in the training period. Unfortunately, such an approach omits the fact that during different periods, the “true” model might change. Except that, for traditional models it is harder to capture possible non-linearities [14,15]. Besides, there is a need for more developed econometric models, as the already applied ones usually have poor forecasting performance [16]. In addition, the mentioned methods are not able to capture the time-varying influence of several possible factors on the agricultural prices.
Quite a natural solution comes from Bayesian econometrics. However, these methods have not been applied yet too extensively to modelling agricultural prices [17], despite the fact, that they seem to be quite useful in this context [18,19]. Therefore, the current paper was an attempt to fill this literature gap.
Herein, dynamic model averaging (DMA) introduced by Raftery et al. was applied to forecasting selected agricultural prices [20]. The advantage of this method is that it takes under consideration uncertainty about the “true” model. Therefore, in the beginning stage several potentially important explanatory variables could be inserted into the modelling scheme.
Secondly, the conventional limitations over the ratio of explanatory variables to the length of time-series do not apply for this method [21].
Third, this method is based on recursive computations, which resembles the real-market conditions, when forecasting can be done only on the basis on the past information; but in the next period the model can (and should) be updated (re-estimated), because new (more) information is available. For example, the mentioned impact of crude oil price on agricultural commodities can be asymmetric, i.e., positive and negative oil price shocks can differently influence agricultural commodities prices [22], which suggests that econometric model with time-varying regression coefficients might be desirable.
Finally, the weights ascribed to each of the component model are time-varying. This allows for certain economic interpretation of time-varying importance of the included explanatory variables [23]. Indeed, it seems that time-varying methods have still not been much applied to analysis of agricultural commodities markets [24].
In addition, this scheme can be easily modified into model selection. Herein, this was done in two ways. As commonly, the selected model was the one with the highest posterior probability. Additionally, the scheme of, so called, Median Probability Model (MPM) by Barbieri and Berger was considered [25].
This paper is organized in the following way. First, a literature review on agricultural commodities prices drivers is presented. Next, the dataset is described and the motivation for particular data proxies is given. For the reader’s convenience a short exposition of dynamic model averaging is sketched; and the used methodology is described. Finally, the obtained results are presented and discussed.

2. Literature Review

Rezitis and Sassi provided an extensive review of various drivers of agricultural commodities prices in the time-varying context [26]. They covered mostly the period between 1996 and 2008. During all this time fundamental factors (like supply and demand) played an important role in impacting agricultural prices. These come from a rise in demand due to growth of population, rapid economic growth and rise in per capita consumption.
Within this context usually the economic growth in BRIC countries is highlighted, as the individual welfare in these countries is rising, as well as their urban population, making significant changes in consumption patterns [27].
Of course, these changes were followed by the strong growth in production. However, around 2000 it was observed that the demand for stocks was declining. Since 2000 there is also growing concern about the impact of rising crude oil prices and U.S. dollar devaluation. Since 2006 researchers have also focused on rising input costs, adverse weather, aggressive purchases by importers, export bans and restrictions and import tariff reductions.
Rezitis and Sassi [26] presented also a discussion of complicated grains commodities price formation. In short, energy policies impact crude oil prices, which further impacts biofuels and ending stocks, which finally has an impact of the demand. For example, increase in demand for crops for biofuels can result in substitution effect and increase competition in using the land [28].
Bastianin, Galeotti and Manera [29] analyzed the relationship between ethanol prices and certain grain prices. They do not find a causal relationship from ethanol return to food prices. Moreover, Zilberman et al. also stated that biofuel prices do not impact food commodities price, but the introduction of biofuels does [30]. For example, Paris concluded that both non-linearities and regime switching should be considered when analyzing impact of biofuels on agricultural prices [31]. In particular, it was found that biofuels expansions can lead to higher agricultural commodities prices. Therefore, it can be said that the relationship between biofuel prices and agricultural commodities prices is not conclusively solved in literature. Sometimes, different outcomes are found depending on whether a global level or national one is analyzed [32,33].
Weather conditions, harvest area, cost of oil and fertilizers and beginning stocks have an impact on the supply side. Indeed, higher energy and fertilize prices result in higher production and transportation costs. However, the nature of links between crude oil price and agricultural commodities prices is disputable, whether it is mostly driven by common macroeconomic determinants of both prices, or it is a direct pass-through [34].
Both, conventional statistical methods and Bayesian ones, provide much evidence that crude oil price improves the forecasting performance of modelling agricultural prices [19]. Indeed, Nazlioglu and Soytas concluded that both crude oil price and exchange rates impact agricultural commodities prices in the sense of Granger causality [35]. They found that there are significant interactions between crude oil and corn prices, but not between crude oil and soybean prices and between crude oil and sugar prices. Such interdependencies were also found by Fernandez-Diaz and Morley [36].
Sukcharoen and Leatham [37] argued that regime switching should be included when analyzing the relationship between crude oil and agricultural commodities prices. In particular, they found that the statistical dependence between these commodities is high during market downturns, but not during market upturns.
The level of stocks is also impacted with both production shortfalls and political decisions. For example, some net exporting countries introduced restrictive trade policies. For example, around 2008 China, India, Russia, Ukraine and Argentina raised their export taxes [38].
Ribeiro and Oliveira [14] noticed the important role of stocks quotas in predicting agricultural commodities prices. However, they provided an analysis confirming that the convenience yield can be a more important explanatory variable in modelling these commodities prices. In fact, this factor is connected with supply shortages. It is a kind of a premium from holding the good over holding a derivative position [39].
Moreover, exchange rates have an important impact on foreign demand; whereas export restrictions impact upon foreign supply. Of course, adverse weather conditions result in production shortfalls, impacting lower level of supply and stocks. These can have also a relationship with precautionary purchases, which constitutes a part of demand. Besides, grain reports and future markets have impact on price expectations, which also impact both demand and supply. However, these expectations also impact directly both supply and demand [26].
Chen, Rogoff and Rossi [40] analyzed world agricultural prices. They found that factors like stock market indices and exchange rates can improve the predictability of agricultural prices. The theoretical argument in favor of this conclusion is that financial markets are more fluid than agricultural markets, so the information from them can be used as a signal of some future agricultural price dynamics. Also, Thiyagarajan et al. found that stock markets and exchange rates have a significant impact on the prices of agricultural commodities in India [3].
Fernandez-Diaz and Morley [36] found significant interdependence between soybean price and U.S. exchange rate and between sugar prices and global economic activity. Also, Alam and Gilbert observed the important role of global economic conditions for development of agricultural commodities prices [41]. However, Hatzenbuehler et al. found that stocks quotas and policy shifts are more important in affecting long-term agricultural prices than exchange rates, because they give rise to more inelastic market demand [42]. However, in the short-term the effect can be opposite, i.e., exchange rates can play a more important role.

3. Data

The analyzed period covered the span between July 1976 and July 2016. Such a choice was due to the data availability. Monthly or yearly frequency was taken depending also on data availability.
Prices of agricultural commodities were taken from The World Bank [43] in $/mt. Following, for example, Kapusuzoglu and Karacer Ulusoy, wheat (U.S. HRW), corn (maize) and soybean prices were collected (P_WHEAT, P_CORN, P_SOYBEAN) [44]. This data is provided in monthly frequency.
Production (prefixed with Q_), consumption (prefixed with D_) and (ending) stock quotas (prefixed with S_) estimates were taken from O’Brien [45], which is a collection of regularly published [46] reports. Production, consumption and stocks were taken in million metric tons. These data were provided in yearly frequency. Moreover, the marketing year for these agricultural commodities depends on country. However, for the world data it can be assumed that it goes from July to July.
Exchange rates were taken from Stooq [47] and FRED [48]. Following Chen et al. [40], Australian dollar (AUD) and Canadian dollar (CAD) to U.S. dollar rates were taken. The mnemonic time-series are AUDUSD from Stooq [47] for AUD and EXCAUS for CAD from FRED [48]. The motivation behind this was that these two countries are important producers (and exporters) of the analyzed commodities. They have had open markets and free-floating currency regimes for long time. Their economies are also developed ones and quite stable. On the other hand, other important exporters (Argentina, Brazil, China and Russia) suffered serious crises, hyper-inflation, etc. These time-series are provided in monthly frequency.
Market stress index (MS) was taken as Moody’s seasoned Baa corporate bond minus federal funds rate [48]. This data is provided in monthly frequency. FRED [48] mnemonic time-series is BAAFFM. The idea is to consider the difference between “risky” asset and a “risk-free” one [49].
As an interest rate (R) long-term government bond yields: 10-year: main (including benchmark) for the United States were taken. The mnemonic [48] time-series is IRLTLT01USM156N. This data is provided in monthly frequency.
Stock market index was taken as S and P 500 from Stooq [47]. The mnemonic time-series is SPX. This data is provided in monthly frequency.
Traditional biofuels production (in terrawatt-hours) was taken from Ritchie and Roser [50]. It was denoted by BIO. This data is provided in yearly frequency.
Energy prices (E) were also taken from The World Bank [43]. For the oil price (OIL) the average from Brent, Dubai and WTI (West Texas intermediate) was taken, as provided by The World Bank [43] in $/bbl. This data is provided in monthly frequency.
Fertilizer prices (F) were also taken from The World Bank [43] in nominal U.S. dollars. This data is provided in monthly frequency.
Global economic activity (KEI) was taken as the Kilian index [51,52]. This index is based on ocean bulk dry cargo freight rates [53,54]. Such an index has certain advantages in measuring real global economic activity over, for example, considering global real GDP or industrial production measures. The Kilian index was constructed to meet several goals. First, to cover global activity. Secondly, to account that in many countries service sector increases as a part of their GDP. Third, to construct a measure that captures real activity—not coincidental factors. In addition, the Kilian index is regularly updated and provided in monthly frequency, and dates back to 1968.
Annual growths of GDP per capita from BRIC countries were taken from The World Bank [55]. Then, the mean values from all four countries were taken to obtain one time-series (GDP). These data are provided in yearly frequency. Moreover, Russia is included in this collection since 1990.
Following, for example, Buncic and Moretto [39], convenience yield (prefixed with CY_) was computed as
CY   =   rf ( Fp P )   /   P   ,
where rf stands for risk-free rate, Fp—for futures price, and P—for spot price. Of course, futures price and risk-free rate should correspond to the same time horizon. Risk-free rate, for this computation, was taken as three-month treasury bill: secondary market rate [48]. The mnemonic time-series is TB3MS. Futures prices were taken from Stooq [47]. The mnemonic time-series are KE.F for wheat, ZC.F for corn and ZS.F for soybean. They are provided in ¢/bu. However, spot prices were collected in $/mt. Therefore, simple re-calculations were done, assuming that one bushel of wheat equals to 27.2155 kg, one bushel of corn equals to 25.4012 kg, and one bushel of soybean equals to 27.2 kg [56]. All these data are provided in monthly frequency.
Open interest (OP_) was also taken for each grain from Stooq [47]. It represents the sum of all contracts that have not expired, i.e., it shows the number of long (or equivalently: short) positions. This indicates the activity of futures market and can be used for measuring speculative pressures [7]. These data are provided in monthly frequency.
Because the time-series were in different frequencies, the yearly ones were disaggregated into monthly ones. Of course, disaggregation resulted in lower accuracy of the resulting time-series, but it seemed that such a bad time-series was preferable to switching all data into lower frequency. Indeed, the majority of time-series are in monthly frequency, including the dependent variables. On the other hand, the very interesting and important time-series (like, for example, production and consumption) are in yearly frequency. Therefore, it seems worth to make this procedure. It was done with the help of “tempdisagg” R package [57]. The Denton–Cholette method was used because the time-series contained trends and in order not to deal with stationarity or cointegration issue,.
In particular, BIO was disaggregated in a way to keep the yearly sums consistent with the original time-series. Similarly, time-series of production, consumption and stocks. However, for GDP the aim was to keep yearly means consistent with the original time-series. Actually, in this case geometric mean would be even more desirable, but such a disaggregation was not available in the applied package.
It was desirable for all variables to have more or less similar magnitudes. Therefore, KEI and production, consumption and stock quotas were divided by 100. Such a re-scaling does not change the economic meaning, just the quantities measures considered.
Moreover, prices and S and P 500 stock market index (SP500) were transformed to returns (i.e., monthly growths). For consumption, production and stocks the ordinary first differences (i.e., differences between subsequent months) were taken. For open interest, exchange rates, oil price, energy price, fertilizer prices and biofuels production the monthly growths were taken.
The descriptive statistics of the variables, after the above transformations, are presented in Table 1. In Figure 1 the time-series of the selected agricultural commodities are presented. In Figure 2—of all the variables, after the described transformations. For the sake of clarity and readability all variables abbreviations are listed in Table A1 in the Appendix A.

4. Methodology

The computations were done in R [58] with the help of “fDMA” and “forecast” packages [59,60].
Dynamic model averaging (DMA) was developed by Raftery et al. [20]. Assuming there are n potentially interesting explanatory variables, one can make 2n multilinear regression models out of them. In the case presented herein, n = 16 for each of the analyzed agricultural commodities. Let t denote the time index, and let yt be the dependent variable, and let xt(k) be the vector of explanatory variables in the k-th multilinear regression model. Of course, k = 1, …, 2n. For clarity, both yt and xt(k) represent the time-series after the transformations described in the previous section (i.e., those reported in Table 1 and presented in Figure 2). Moreover, independent variables are taken from the one period back comparing to the dependent variable (i.e., in their first lags). This is because the focus in this research was on forecasting the prices. The state space model is given by the following equations:
y t =   x t ( k ) θ t ( k ) +   ε t ( k ) ,
θ t ( k ) =   θ t 1 ( k ) +   δ t ( k )
with θt(k) denoting the regression coefficients of the k-th multiple linear regression model. Errors are normally distributed, i.e., εt(k) ~ N(0,Vt(k)) and δt(k) ~ N(0,Wt(k)).
The regression coefficients are recursively updated by Kalman filter. V0(k) and W0(k) need to be given the initial values. For specifying W0(k) the procedure proposed by Raftery et al. [20] was used. V0(k) were set to 1, because they should correspond to magnitudes of the variables, so such a value is in line with their variances reported in Table 1. Further updating depends on a certain forgetting factor λ, which is a number between 0 and 1. The details are presented in the paper by Raftery et al. [20].
Smaller values of λ correspond to allowing more abrupt updating (changes) of θt(k). However, too small values might result in “catching the noise”, instead of “catching the signal”. Therefore, here the most common values [61] were tested, i.e., λ = {1, 0.99, 0.98, 0.95, 0.90}.
As DMA is a model averaging scheme, each of the component k-th multiple linear regression model is given a time-varying weight. This is done through the following equations:
π t | t 1 , k =   (   π t 1 | t 1 , k α + c   ) (   i = 1 K π t 1 | t 1 , i α +   c   ) ,
π t | t , k =   [   π t | t 1 , k f k   ( y t | Y t 1 ) ] [   i = 1 K π t | t 1 , i f i   ( y t | Y t 1 )   ] ,
where fk(yt|Yt−1) is the predictive density of the k-th model at yt, under the assumption that the data up to time t is known (but not further), and K = 2n. Here, also the second forgetting factor α is used. For the economic interpretation it seems that α = λ is a reasonable choice, as then the “tempo” of updating is the same for all parameters of the DMA scheme.
The weights πt|t−1,k are called posterior predictive probabilities. Raftery et al. [20] advise the use of some small constant c to prevent these probabilities being reduced to 0. This can happen due to numerical approximations during software computations. They recommended setting c = 0.001/2n.
The values of π0|0,k need also be given the initial values. So-called non-informative prior was used, i.e., π0|0,k = 1/2n were taken. In other words, initially all component models were given the same equal weight.
The DMA prediction is obtained as an averaged forecast with the following equation:
y t D M A =   k = 1 K π t | t 1 , k y t ( k ) ^ ^
where y t ( k ) ^ is the prediction from the k-th multilinear regression model.
If α = 1 = λ, then the above scheme is a computationally efficient way to estimate Bayesian model averaging [20]. This is denoted by BMA.
The model averaging scheme described above can be easily switched to model selection scheme. The usual choice is to select in each time t the model with the highest posterior predictive probability in Equation (6), instead of weighted averaging over all component models. Such a scheme is called dynamic model selection (DMS). Similarly, for α = 1 = λ, the scheme is called Bayesian model selection (BMS).
On the other hand, Barbieri and Berger [25] argued that only under certain conditions the model with the highest posterior probability is the optimal one. Suppose that the weights πt|t−1,k from exactly those component models, which contain a given explanatory variable, are summed up. Such a sum is called posterior inclusion probability (of this explanatory variable).
Barbieri and Berger [25] proposed to select the component model, which contains exactly those explanatory variables, for which posterior inclusion probability is greater than or equal to 0.5. This scheme is called the median probability model (MED). Similarly, for α = 1 = λ this scheme is called herein the Bayesian median probability model (BMED).
Additionally, posterior inclusion probabilities together with the weighted regression coefficients can be used to describe the time-varying importance of a given explanatory variable in influencing the dependent variable [23]. More precisely, high values of posterior inclusion probability for a given variable should be confronted with the weighted regression coefficient for this variable. High posterior inclusion probability, when the weighted regression coefficient is around 0, does not indicate high importance of this variable as an explanatory variable. In other words, such a situation means that conclusions from individual component models in averaging scheme are opposite to each other.
Of course, it is necessary to compare the forecast accuracies of the analyzed model combination schemes with some benchmark models. Definitely, the common ARIMA model is the first choice. Actually, herein the automatic version of Hyndman and Khandakar [60] is used (ARIMA). In short this model is applied recursively, i.e., data up to time t − 1 are used to make the forecast for time t. Then, data up to time t are used to make the forecast for time t + 1, etc. Moreover, the model initially starts with no AR (Auto-Regressive) and MA (Moving Average) lags and searches if more lags give some improvement according to Akaike Information Criterion.
The next benchmark model is just a naïve forecast (NAÏVE), i.e., forecast for time t is simply the observed value at time t − 1. It is worth mentioning that sometimes, in case of commodities markets, it is a hard task even to make a more accurate forecast than the naïve method [62].
It is also interesting to compare the model averaging and model selection schemes with a single model with time-varying regression coefficients, but with all considered variables as explanatory ones. Indeed, the DMA scheme is described as averaging over K = 2n time-varying parameters regressions. The scheme can be also considered with no averaging, just as a way of computing time-varying parameters regression, if K = 1, i.e., just one component model is considered. For this model, denoted by TVP, the forgetting factor λ was set to 0.99.
The last benchmark model is to take historical average (HA). In other words, the forecast for time t is the mean from all observations up to time t − 1.
The whole sample was divided into in-sample consisting of the first 120 observations, and out-of-sample consisting of the remaining further observations. In such a way in-sample consisted of 25% of the whole data set. The estimated models were evaluated on the basis of the out-of-sample part. In particular, root mean squared error (RMSE), mean absolute error (MAE) and mean absolute scaled error (MASE) were computed, after deleting first 120 observations. However, the estimated models were directly forecasting the returns from the given agricultural commodity, i.e., it was previously described how the raw time-series were transformed. On the other hand, the forecast accuracy measures were applied to the prices of the analyzed agricultural commodities. It is an easy algebraic transformation, to obtain forecasted prices back from forecasted returns (annual changes).
RMSE and MAE are quite common forecast accuracy measures. RMSE penalizes more extreme discrepancies, as it is based on second power function, whether MAE treats them linearly. Hyndman and Koehler [63] advocated using MASE, which for T observations is defined by the following equation:
MASE =   t = 1 T | y t y t | ^ (   T T 1 t = 2 T | y t y t 1 |   )
where yt represents the real values, and y t ^ represents the forecasted values. The advantages of this forecast accuracy measure are: scale invariance, “good” behavior if the forecasted values are around 0, penalizing both positive and negative errors equally, penalizing both large and small differences between real and forecasted values equally. Additionally, it has an easy interpretation: values of MASE greater than one indicate that the naïve method is characterized by greater forecast accuracy than the evaluated method.
The estimated models were checked if they generated better forecast accuracy than the benchmark models. However, the stress was put on evaluating the models due to the criterion of minimizing MASE.
It was noticed that MASE, or more precisely the absolute scaled error (ASE) loss function, was compatible with the procedures of the common [64] test. Therefore, except comparing values of MASE for different models, it was also checked if the observed—greater or smaller—forecast accuracy was statistically significant for each case.

5. Results

From Table 2 it can be seen that, in the case of wheat prices, it was DMA with α = 0.99 = λ, which minimized MASE. Interestingly, this was also the model, which simultaneously minimized RMSE out of all the considered models, as well as, minimized MAE. For Diebold–Mariano test 5% significance level was assumed [65]. In such a case, it can be stated that the selected DMA model with α = 0.99 = λ produced significantly more accurate forecasts than the majority of the alternative models. The only exceptions were DMA with α = 0.98 = λ, BMA (i.e., DMA with α = 1 = λ) and MED with α = 0.99 = λ.
This meant that the selected model produced similarly accurate forecast if forgetting factors were slightly larger or smaller. It also seemed not to differ whether model averaging or model selection was applied for the selected forgetting factors. However, if the scheme of Barbieri and Berger [25] is considered, i.e., if not the highest posterior probability model was selected, then forecast accuracies do not differ significantly.
Usually, the time-varying parameter approach (i.e., α, λ < 1) leads to smaller forecast errors, while further lowering of the values of forgetting factors results in increasing forecast errors. This outcome quite confirms the suggestion of Raftery et al. [20] and Koop and Korobilis [66] that the combination of forgetting factors α = 0.99 = λ is preferable. On the other hand, there are applications of DMA, when more subtle combinations worked better [67,68].
From Table 3 it can be seen that the model which minimized MASE for corn was BMA. Therefore, in the case of corn it seems that the novel Bayesian model combination schemes did not lead to more accurate forecasts. Assuming 5% significance level, the selected model produced significantly more accurate forecasts than the majority of competing models. However, it cannot be said that it produced more accurate forecasts than DMA with α = 0.99 = λ, DMA with α = 0.98 = λ, DMS with α = 0.99 = λ and BMS. Interestingly, the minimum RMSE was produced by the BMS model, but the minimum MAE—also by BMA. Amongst various model combination schemes lower forgetting factors result in greater forecast errors. Usually, the smallest errors are for the combination of α = 1 = λ, meaning that time-varying approach in the case of corn was not so beneficial in a sense of forecast accuracy.
From Table 4 it can be seen that the model minimizing MASE for soybean was DMA with α = 0.99 = λ, similarly as in the case of wheat. This model simultaneously minimized MAE, but not RMSE, out of all competing models. The one which minimized RMSE was DMS with α = 0.99 = λ. Assuming 5% significance level, according to Diebold–Mariano test the DMA with α = 0.99 = λ produced significantly more accurate forecasts than the majority of the considered models.
However, it cannot be said that forecasts from this model were more accurate than those from DMA with α = 0.98 = λ, BMA, DMS with α = 0.99 = λ. MED with α = 0.99 = λ and BMED. This means that changing both forgetting factors slightly around the 0.99 value, as well as, the model combination scheme—from averaging to selection—did not significantly impact the forecast accuracy. Similarly, as in the case of wheat, the time-varying parameter approach (i.e., α, λ = 0.99 < 1) led to smaller forecast errors than if α, λ = 1, while further lowering of the values of forgetting factors (α, λ < 0.99) resulted in increasing forecast errors. In other words, there is a gain in a sense of forecast accuracy by switching from α = 1 = λ to α = 0.99 = λ, but lower values of α and λ made the forecast accuracy worse.
It happened that for the analyzed commodities the model averaging scheme produced the most accurate forecasts. However, it cannot be said the there is a statistically significant difference in forecast accuracy between model averaging and model selection schemes for the analyzed cases. On the other hand, the selected models produced significantly more accurate forecasts than conventional competing models (i.e., historical average—HA, ARIMA and NAÏVE method). Only in case of soybean it cannot be said that the selected model produced significantly more accurate forecast than those of ARIMA and NAÏVE methods. It is also interesting, that the selected models produced significantly more accurate forecasts than TVP models.
In other words, there was a gain in a sense of forecast accuracy from combining various component models (through model averaging or model selection). Narrowing only to the time-varying parameters approach, with dropping the problem of the model (variable) uncertainty was not enough, if more accurate forecasts were the aim.
In all analyzed cases α = 1 = λ or α = 0.99 = λ was preferred, while smaller values of forgetting factors led to lower forecast accuracy. However, the difference in forecast accuracies from these two forgetting factors combinations—as far as DMA is considered—was not statistically significant. As mentioned before, this was not a typical outcome, as some of the previous research applying DMA argued that the combination of α = 0.99 = λ is enough [19]; while some other advised to search over various combinations of α and λ, as lower values (and not necessarily α = λ) might be beneficial [67,68].
Because the forecasts accuracies indicated by MASE were quite similar, it iwa tempting to perform some equal predictive accuracy test for all the considered models. The one which seems to be consistent with ASE loss function is the one proposed by Mariano and Preve [69]. Assuming 5% significance level and standard parameters as in the paper of Wang et al. [68], the procedure eliminated only the HA method. However, in case of corn it excluded DMA with α = 0.90 = λ. This means that if the models were considered as a whole group, then it is quite hard to find the model which would significantly outperform—in a sense of forecast accuracy—all other models.
Figure 3 and Figure 4 present weighted regression coefficients and posterior inclusion probabilities for all the explanatory variables from the models minimizing MASE for the analyzed agricultural commodities. As aforementioned, these figures should be analyzed together. First of all, some first part of graphs from left (i.e., beginning period) should be considered as a period during which the model combination scheme “learns” and outcomes can be chaotic. Actually, this is the previously mentioned in-sample part of data set.
It can be seen that consumption was quite an important driver of soybean prices in the end of the 2000s. Production played an important role in the 1990s and in the beginning of the 2000s for soybean. Stock quotas played an important role for wheat and soybean in the end of the 1990s, also later there were some short periods when they played an important role as soybean price driver.
The role of open interest was important for corn between the beginning of the 1990s and end of the 2000s. Convenience yield was important driver of corn prices throughout the whole analyzed period, except the late 2000s. It was also an important driver of soybean prices in the 1990s and middle of the 2000s.
Biofuels production was an important driver for the prices of all three grain commodities in the middle of the 1990s. This result was actually a bit strange, because the corn-based biofuels growth started after the middle of 1990s. However, the sugarcane based Brazilian ethanol production had been well established even before the corn-based ethanol boom started. The role of GDP growth per capita in BRIC economies was rather marginal before 2000s, but since then it seemed to become a more and more important driver of all three grain commodities prices.
Market stress was an important driver of all three commodities prices until the middle of the 2000s. Global economic activity was an important driver of all three commodities prices in 1990s and in late 2000s. The role of stock market index was rather marginal for all the analyzed commodities throughout the whole analyzed period.
The role of interest rate decreased since the 1990s and it was marginal since the 2010s. The role of Australian dollar exchange rate increased since the 2000s, whereas the role of Canadian dollar was all the time marginal. The price of oil was an important driver of wheat price in a short period in the middle of the 1990s. A similar conclusion is valid for energy prices in the case of wheat price. However, energy prices were also quite an important driver of soybean price since the 1990s until now. Fertilizers prices were generally an important driver of all three grain commodities prices between 1990 and the middle of the 2000s, except some short periods during this time. Moreover, their role has been increasing recently.
Summarizing, it can be seen that fundamental factors (like consumption, production, stock quotas) were important price drivers, but in the past periods rather, and only for soybean. Open interest, usually considered in the context of speculative pressures, was important in the case of corn. This commodity was also influenced by convenience yield. Recently, it can be seen that the development of emerging economies is a very important price driver.
The general outlook of the results seems that fundamental factors were very important in the far past, whereas more recently the financial factors played a more important role. For the very recent period, the most important drivers were the growth of emerging economies and GDP per capita in these countries. Moreover, global economic activity, development of emerging economies, exchange rates, fertilizer prices and market stress seem to be the common price drivers for wheat, corn and soybean.
Soybean seems to be affected in certain periods quite much by fundamental factors, whereas corn by financial and speculative factors [70]. Wheat seems to be more linked with crude oil and, generally, energy prices.
It is interesting to compare the outcomes for these grain commodities with some other DMA analyses from the commodity market in general. Of course, other studies apply a completely different set of potential price drivers for other commodities, but some drivers were common. In the case of metal prices [71], it was found that supply–demand and stock quotas factors were indeed important price drivers, convenience yield was also found to be important driver, but global economic activity was found important only during short periods. Market stress and exchange rates were also found important.
If the DMA applied to crude oil price was considered, then global economic activity was also found as important driver during some periods. Similarly, exchange rates, market stress and emerging economies stock market prices [67].
It is interesting that for metals and crude oil supply–demand factors occurred to be important price drivers, whereas herein they happened not to play very important role. On the other hand, global economic activity, economic expansion of emerging economies, exchange rates and market stress seemed to be the common important price drivers for various commodities.
Finally, it should be stressed that the notion of “importance” should be considered in a relative manner [23]. In particular, it should not be understood in a sense of “absolute” manner, but rather as importance respective to the other variables considered (used) in the given model combination scheme. In other words, it can be understood that, for example, GDP growth in BRIC countries is a more important grains price driver than production, consumption, and all other factors considered in the research reported herein. Not that, for example, consumption quotas are a totally unimportant driver. Indeed, this sounds quite reasonable—the GDP growth results in increasing consumption, so it should be looked on as the most important driver, and consumption increase is its consequence. Anyway, it should be also remembered that model combination schemes are rather aimed at forecasting—not on detecting causality or explanatory relationship analysis tools [23].
As already discussed, Figure 3 and Figure 4 present averaged regression coefficients and posterior inclusion probabilities from DMA models with particular combination of forgetting factors. However, as mentioned previously, a few α and λ combinations were estimated. It can be asked, if other combinations would not result in different economic conclusions. Interestingly, the graphical analysis of plots, like in Figure 3 and Figure 4, but for all α and λ combinations used herein, indicates the same conclusions. In other words, there were some differences in case of the exact numerical values, but up to reasonably high degree the time-paths were the same. In other words, a posterior inclusion probability for the given variable increased for all considered α and λ combinations or decreased during a given period. In particular there was no situation that during some period one combination of α and λ indicated rising posterior inclusion probability for some variables, whereas some another combination—decreasing for the same variable. Similarly, this was for the averaged regression coefficients (including their signs). For clarity, the analyzed figures were not included herein. It is only mentioned, that it had been checked whether the reported economic interpretations would not depend on α and λ combinations.

6. Conclusions

The presented research was focused on applying some novel Bayesian model combination schemes to forecasting the spot prices of the selected grain commodities. First of all, it was interesting that these methods happened to produce, indeed, statistically significant more accurate one-month ahead forecasts than some conventional econometric models, like ARIMA, historical average or the naïve method. Only for soybean, it could not be said that ARIMA or NAÏVE forecasts were significantly less accurate.
It was also seen that not only the time-varying parameters approach (to regression coefficients) itself was beneficial, but also consideration of various models. In other words, combining several regression models through model averaging or model selection procedure improved forecast accuracy. Moreover, time-varying approach towards, simultaneously, both regression coefficients and model switching itself (i.e., dealing with model uncertainty) was beneficial. From the interpretative point of view it meant that the joining of these two features allowed interesting interpretation on how the importance of various grains price drivers change in time.
However, the notion of importance of a driver should be carefully understood as a relative importance, with respect to the other drivers considered in the whole forecasting scheme [23]. In other words, it should be understood rather as a factor improving forecast accuracy, not to be confused with the notion of some kind of a causal relationship. Actually, this was also some limitation of the current research. On the other hand, the performed identification of key factors influencing agricultural prices can serve as a starting point for some future research, dealing directly with studying the causal relationships.
Indeed, herein various factors were taken as potential price drivers: fundamental ones (like production, consumption, stock quotas), macroeconomic (GDP growth per capita, economic activity, etc.) and financial ones (like exchange rates, stock prices, market stress, etc.). In case of fundamental factors, they were found to be rather important ones in further periods in the past, whereas in recent years the applied econometric schemes stressed the higher role of more financial factors and, especially, GDP growth per capita in BRIC economies. This signals that the sustainable development in the studied area, and suitable policies, cannot focus solely on the production processes. They have to consider changing patterns in economies. For example, as the societies becomes richer, the consumption patterns and consumer choices can change. Secondly, links with financial instruments becomes tighter nowadays. Also, the role of derivatives becomes higher. As such, they can be used more intensively in speculation—not only for hedging purposes.
These results are in agreement with other research focusing on the role of new emerging economies, richer societies in these countries and, therefore, growing demand from these consumers. Indeed, the role of GDP growth per capita in BRIC economies was found rather marginal before the 2000s, but afterwards its role as a grain price driver started to increase. This was also an interesting point, for future considerations. In the context of the sustainable development, it can be asked whether these changes impact only the quantity of the demand, or also the quality of the product. For example, will the newly rich consumers care whether the product they buy was produced in a sustainable way [72,73,74].
The decreasing role of fundamental factors is generally in line with studies on commodities markets, indicating the rising role of, so called, financialization of commodities markets. This means that managerial decisions in the field of agriculture have to be more aware of this situation. The conventional approach based only on physical aspects, like production and distribution of the product are not enough nowadays. The market price can be impacted also by more “virtual” factors. For example, the price of an agricultural commodity can depend on both: the weather condition impacting the harvests, and general situation on financial markets—also those markets, which are not directly linked with agriculture.
Nevertheless, potential prices drivers linked with speculative pressures (like, for example, open interest) were found important only in case of corn prices in the late 2000s. However, convenience yield was found to be an important driver of corn and soybean prices in parts of the 1990s and 2000s. In recent periods, the impact of biofuels production was found rather marginal.
Stock market price index was not found as an important grain price driver. However, the index of market stress was an important driver. Overall, crude oil price, and energy prices, seemed to be important drivers of wheat prices. The financial and speculative factors seemed to be most important in case of driving corn prices, whereas fundamentals seemed to influence soybean prices the most.

Author Contributions

The whole article was written by the one author.

Funding

This research funded by the Polish National Science Centre grant under the contract number DEC-2015/19/N/HS4/00205.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A

Table A1. Abbreviations of variables used.
Table A1. Abbreviations of variables used.
AbbreviationMeaning
P_price
D_consumption
Q_production
S_stocks
OP_open interest
CY_convenience yield
BIOtraditional biofuels production
GDPannual growth of GDP per capital in BRIC countries
MSmarket stress index
KEIKilian index of global economic activity
SP500S and P 500 stock market index
Rlong-term interest rate
AUDAUD to USD exchange rate
CADCAD to USD exchange rate
OILcrude oil price
Eenergy prices
Ffertilizer prices

References

  1. International Monetary Fund. IMF Primary Commodity Prices; International Monetary Fund: Washington, DC, USA, 2019; Available online: https://www.imf.org/external/np/res/commod/index.aspx (accessed on 6 May 2019).
  2. Davenport, F.; Funk, C. Using time series structural characteristics to analyze grain prices in food insecure countries. Food Secur. 2015, 7, 1055–1070. [Google Scholar] [CrossRef]
  3. Thiyagarajan, S.; Naresh, G.; Mahalakshmic, S. Forecasting volatility in Indian agri-commodities market. Glob. Bus. Financ. Rev. 2015, 20, 95–104. [Google Scholar] [CrossRef]
  4. Kaaresvirta, J.; Mehrotra, A. Business surveys and inflation forecasting in China. Econ. Chang. Restruct. 2009, 42, 263–271. [Google Scholar] [CrossRef]
  5. Cheng, I.-H.; Xiong, W. Financialization of commodity markets. Annu. Rev. Financ. Econ. 2014, 6, 419–441. [Google Scholar] [CrossRef]
  6. Adämmer, P.; Bohl, M.T. Speculative bubbles in agricultural prices. Q. Rev. Econ. Financ. 2015, 55, 67–76. [Google Scholar] [CrossRef]
  7. Amann, S.; Lehecka, G.V.; Schmid, E. Does speculation drive agricultural commodity spot prices? J. Austrian Soc. Agric. Econ. 2013, 22, 131–140. [Google Scholar]
  8. Nazlioglu, S. World oil and agricultural commodity prices: Evidence from nonlinear causality. Energy Policy 2011, 39, 2935–2943. [Google Scholar] [CrossRef]
  9. Labys, W.C. Modeling and Forecasting Primary Commodity Prices; Routledge: London, UK, 2006. [Google Scholar]
  10. Assis, K.; Amran, A.; Remali, Y.; Affendy, H. A comparison of univariate time series methods for forecasting cocoa bean prices. Trends Agric. Econ. 2010, 3, 207–215. [Google Scholar] [CrossRef]
  11. Jha, G.K.; Sinha, K. Agricultural price forecasting using neural network model: An innovative information delivery system. Agric. Econ. Res. Rev. 2013, 26, 229–239. [Google Scholar] [CrossRef]
  12. Andrle, M.; Berg, A.; Morales, R.A.; Portillo, R.; Vlcek, J. On the sources of inflation in Kenya: A model-based approach. S. Afr. J. Econ. 2015, 83, 475–505. [Google Scholar] [CrossRef]
  13. De Gorter, H.; Drabik, D.; Just, D.R. The Economics of Biofuel Policies: Impacts on Price Volatility in Grain and Oilseed Markets; Palgrave Macmillan: Basingstoke, UK, 2015. [Google Scholar]
  14. Ribeiro, C.O.; Oliveira, S.M. A hybrid commodity price-forecasting model applied to the sugar-alcohol sector. Aust. J. Agric. Resour. Econ. 2011, 55, 180–198. [Google Scholar] [CrossRef]
  15. Mitra, D.; Paul, R.K. Hybrid time-series models for forecasting agricultural commodity prices. Model Assist. Stat. Appl. 2017, 12, 255–264. [Google Scholar] [CrossRef]
  16. Anjoy, P.; Paul, R.K. Comparative performance of wavelet-based neural network approaches. Neural Comput. Appl. 2019, 31, 3443–3453. [Google Scholar] [CrossRef]
  17. Andersson, M.K.; Karlsson, S. Bayesian Forecast Combination for VAR Models. In Bayesian Econometrics; Chib, S., Griffiths, W., Koop, G., Terrell, D., Eds.; Emerald Group Publishing Limited: Bingley, UK, 2008; pp. 501–524. [Google Scholar]
  18. Mazur, B. Cyclical Fluctuations of Global Food Prices: A Predictive Analysis. In Foundation of the Cracow University of Economics, Proceedings of the 12th Professor Aleksander Zelias International Conference on Modelling and Forecasting of Socio-Economic Phenomena, Cracow, Polska, 8–11 May 2018; Papież, M., Śmiech, S., Eds.; Wydawnictwo Uniwersytetu Ekonomicznego w Krakowie: Krakow, Poland, 2018; pp. 286–295. [Google Scholar]
  19. Osathanunkul, R.; Khiewngamdee, C.; Yamaka, W.; Sriboonchitta, S. The role of oil price in the forecasts of agricultural commodity prices. In Predictive Econometrics and Big Data; Kreinovich, V., Sriboonchitta, S., Chakpitak, N., Eds.; Springer: Berlin, Germany, 2018; pp. 422–429. [Google Scholar]
  20. Raftery, A.E.; Karny, M.; Ettler, P. Online prediction under model uncertainty via Dynamic Model Averaging: Application to a cold rolling mill. Technometrics 2010, 52, 52–66. [Google Scholar] [CrossRef] [PubMed]
  21. Onorante, L.; Raftery, A.E. Dynamic Model Averaging in large model spaces. Eur. Econ. Rev. 2016, 81, 2–14. [Google Scholar] [CrossRef]
  22. Zmami, M.; Ben-Salha, O. Does oil price drive world food prices? Evidence from linear and nonlinear ARDL modeling. Economies 2019, 7, 12. [Google Scholar] [CrossRef]
  23. Burnham, K.P.; Anderson, D.R. Model Selection and Multimodel Inference: A Practical Information-theoretic Approach; Springer: New York, NY, USA, 2002. [Google Scholar]
  24. Li, N.; Ker, A.; Sam, A.G.; Aradhyula, S. Modeling regime-dependent agricultural commodity price volatilities. Agric. Econ. 2017, 48, 683–691. [Google Scholar] [CrossRef]
  25. Barbieri, M.M.; Berger, J.O. Optimal predictive model selection. Ann. Stat. 2004, 32, 870–897. [Google Scholar] [CrossRef]
  26. Rezitis, A.N.; Sassi, M. Commodity food prices: Review and empirics. Econ. Res. Int. 2013, 2013, 694507. [Google Scholar] [CrossRef]
  27. Headey, D.; Fan, S. Anatomy of a crisis: The causes and consequences of surging food prices. Agric. Econ. 2008, 39, 375–391. [Google Scholar] [CrossRef]
  28. Wright, B.D. The economics of grain price volatility. Appl. Econ. Perspect. Policy 2011, 33, 32–58. [Google Scholar] [CrossRef]
  29. Bastianin, A.; Galeotti, M.; Manera, M. Ethanol and field crops: Is there a price connection? Food Policy 2016, 63, 53–61. [Google Scholar] [CrossRef]
  30. Zilberman, D.; Hochman, G.; Rajagopal, D.; Sexton, S.; Timilsina, G. The Impact of Biofuels on Commodity Food Prices: Assessment of Findings. Am. J. Agric. Econ. 2013, 95, 275–281. [Google Scholar] [CrossRef]
  31. Paris, A. On the link between oil and agricultural commodity prices: Do biofuels matter? Int. Econ. 2017, 155, 48–60. [Google Scholar] [CrossRef]
  32. Cabrera, B.L.; Schulz, F. Volatility linkages between energy and agricultural commodity prices. Energy Econ. 2016, 54, 190–203. [Google Scholar] [CrossRef] [Green Version]
  33. Janda, K.; Kristoufek, L. The relationship between fuel and food prices: Methods and outcomes. Annu. Rev. Resour. Econ. 2019, 11. [Google Scholar] [CrossRef]
  34. Baumeister, C.; Kilian, L. Do oil price increases cause higher food prices? Econ. Policy 2014, 29, 691–747. [Google Scholar] [CrossRef] [Green Version]
  35. Nazlioglu, S.; Soytas, U. Oil price, agricultural commodity prices, and the dollar: A panel cointegration and causality analysis. Energy Econ. 2012, 34, 1098–1104. [Google Scholar] [CrossRef]
  36. Fernandez-Diaz, J.M.; Morley, B. Interdependence among agricultural commodity markets, macroeconomic factors, crude oil and commodity index. Res. Int. Bus. Financ. 2019, 47, 174–194. [Google Scholar] [CrossRef]
  37. Sukcharoen, K.; Leatham, D. Analyzing extreme comovements in agricultural and energy commodity markets using a regular vine copula method. Int. J. Energy Econ. Policy 2018, 8, 193–201. [Google Scholar]
  38. Trostle, R. Fluctuating food commodity prices. A complex issue no easy answers. Amberwaves 2008, 6, 10–17. [Google Scholar]
  39. Buncic, D.; Moretto, C. Forecasting copper prices with dynamic averaging and selection models. N. Am. J. Econ. Financ. 2015, 33, 1–38. [Google Scholar] [CrossRef] [Green Version]
  40. Chen, Y.-C.; Rogoff, K.S.; Rossi, B. Predicting agri-commodity prices: An asset pricing approach. In Global Uncertainty and the Volatility of Agricultural Commodities Prices; Munier, B.R., Ed.; IOS Press: Amsterdam, The Netherlands, 2012; pp. 45–71. [Google Scholar]
  41. Alam, M.R.; Gilbert, S. Monetary policy shocks and the dynamics of agricultural commodity prices: Evidence from structural and factor-augmented VAR analyses. Agric. Econ. 2017, 48, 15–27. [Google Scholar] [CrossRef]
  42. Hatzenbuehler, P.L.; Abbott, P.C.; Foster, K.A. Agricultural commodity prices and exchange rates under structural change. J. Agric. Resour. Econ. 2016, 41, 204–224. [Google Scholar]
  43. The World Bank. Commodity Markets; The World Bank: Washington, DC, USA, 2019; Available online: http://www.worldbank.org/en/research/commodity-markets (accessed on 6 May 2019).
  44. Kapusuzoglu, A.; Karacaer Ulusoy, M. The interactions between agricultural commodity and oil prices: An empirical analysis. Agric. Econ. 2015, 61, 410–421. [Google Scholar] [CrossRef]
  45. O’Brien, D. Spreadsheets with WASDE Data; Kansas State University: Manhattan, KS, USA, 2017; Available online: https://www.agmanager.info/grain-marketing/grain-supply-and-demand-wasde (accessed on 6 May 2019).
  46. United States Department of Agriculture World. Agricultural Supply and Demand Estimates; U.S. Department of Agriculture: Washington, DC, USA, 2019. Available online: https://www.usda.gov/oce/commodity/wasde (accessed on 6 May 2019).
  47. Stooq. Historical Data; Stooq: Warsaw, Polska, 2019; Available online: https://stooq.com (accessed on 6 May 2019).
  48. FRED. Economic Research; Federal Reserve Bank: St. Louis, MO, USA, 2019; Available online: https://fred.stlouisfed.org (accessed on 6 May 2019).
  49. Kliesen, K.L.; Smith, D.C. Measuring financial market stress. Econ. Synop. 2010, 2, 1–2. [Google Scholar] [CrossRef]
  50. Ritchie, H.; Roser, M. Renewable Energy; Our World in Data: Oxford, UK, 2019; Available online: https://ourworldindata.org/renewable-energy#global-renewable-energy-consumption-over-the-long-run (accessed on 6 May 2019).
  51. Kilian, L. Updated and Corrected Version of the Index of Global Real Economic Activity in Industrial Commodity Markets. 2019. Available online: https://sites.google.com/site/lkilian2019/research/data-sets (accessed on 6 May 2019).
  52. Kilian, L. Not all oil price shocks are alike: Disentangling demand and supply shocks in the crude oil market. Am. Econ. Rev. 2009, 99, 1053–1069. [Google Scholar] [CrossRef]
  53. Kilian, L. Measuring Global Real Economic Activity: Do Recent Critiques Hold Up to Scrutiny? 2018. Available online: https://drive.google.com/file/d/1taCyNal2BJHqfxiid5-nTrkBaQCbRUmC/view (accessed on 6 May 2019).
  54. Kilian, L.; Zhou, X. Modeling fluctuations in the global demand for commodities. J. Int. Money Financ. 2018, 88, 54–78. [Google Scholar] [CrossRef] [Green Version]
  55. The World Bank. World Bank Open Data; The World Bank: Washington, DC, USA, 2019; Available online: https://data.worldbank.org (accessed on 6 May 2019).
  56. Murphy, W.J. Tables for Weights and Measurement: Crops; University of Missouri: Columbia, MO, USA, 2007; Available online: http://extension.missouri.edu/xplor/agguides/crops/g04020.htm (accessed on 6 May 2019).
  57. Sax, C.; Steiner, P. Temporal disaggregation of time series. R J. 2013, 5, 80–87. [Google Scholar] [CrossRef]
  58. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2018; Available online: https://www.R-project.org (accessed on 6 May 2019).
  59. Drachal, K. fDMA: Dynamic Model Averaging and Dynamic Model Selection for Continuous Outcomes. 2017. Available online: https://CRAN.R-project.org/package=fDMA (accessed on 6 May 2019).
  60. Hyndman, R.J.; Khandakar, Y. Automatic time series forecasting: The forecast package for R. J. Stat. Softw. 2008, 26, 1–22. [Google Scholar]
  61. Baur, D.G.; Beckmann, J.; Czudaj, R. A melting pot—Gold price forecasts under model and parameter uncertainty. Int. Rev. Financ. Anal. 2016, 48, 282–291. [Google Scholar] [CrossRef]
  62. Drachal, K. Some novel Bayesian model combination schemes: An application to commodities prices. Sustainability 2018, 10, 2801. [Google Scholar] [CrossRef]
  63. Hyndman, R.J.; Koehler, A.B. Another look at measures of forecast accuracy. Int. J. 2006, 22, 679–688. [Google Scholar] [CrossRef] [Green Version]
  64. Franses, P.H. A note on the Mean Absolute Scaled Error. Int. J. Forecast. 2016, 32, 20–22. [Google Scholar] [CrossRef] [Green Version]
  65. Diebold, F.X.; Mariano, R.S. Comparing predictive accuracy. J. Bus. Econ. Stat. 1995, 13, 253–263. [Google Scholar]
  66. Koop, G.; Korobilis, D. Forecasting inflation using Dynamic Model Averaging. Int. Econ. Rev. 2012, 53, 867–886. [Google Scholar] [CrossRef]
  67. Drachal, K. Forecasting spot oil price in a dynamic model averaging framework—Have the determinants changed over time? Energy Econ. 2016, 60, 35–46. [Google Scholar] [CrossRef]
  68. Wang, T.; Zhang, D.; Broadstock, D.C. Financialization, fundamentals, and the time-varying determinants of US natural gas prices. Energy Econ. 2019, 80, 707–719. [Google Scholar] [CrossRef]
  69. Mariano, R.S.; Preve, D. Statistical tests for multiple forecast comparison. J. Econ. 2012, 169, 123–130. [Google Scholar] [CrossRef]
  70. Etienne, X.L.; Irwin, S.H.; Garcia, P. Speculation and corn prices. Appl. Econ. 2018, 50, 4724–4744. [Google Scholar] [CrossRef]
  71. Drachal, K. Forecasting Prices of Selected Metals with Bayesian Data-Rich Models, 2019; Working Paper, Unpublished Work.
  72. Havranek, T.; Irsova, Z.; Janda, K. Demand for gasoline is more price-inelastic than commonly thought. Energy Econ. 2012, 34, 201–207. [Google Scholar] [CrossRef]
  73. Minten, B.; Murshid, K.A.S.; Reardon, T. Food quality changes and implications: Evidence from the rice value chain of Bangladesh. World Dev. 2013, 42, 100–113. [Google Scholar] [CrossRef]
  74. Augustin Franco, J. Principles of economethics from the Giffen demand. Technol. Econ. Dev. Econ. 2015, 21, 557–576. [Google Scholar] [CrossRef]
Figure 1. Prices of selected agricultural commodities.
Figure 1. Prices of selected agricultural commodities.
Sustainability 11 05305 g001
Figure 2. All the considered time-series after transformations.
Figure 2. All the considered time-series after transformations.
Sustainability 11 05305 g002
Figure 3. Weighted regression coefficients from the models minimizing mean absolute scaled error (MASE).
Figure 3. Weighted regression coefficients from the models minimizing mean absolute scaled error (MASE).
Sustainability 11 05305 g003aSustainability 11 05305 g003bSustainability 11 05305 g003c
Figure 4. Posterior inclusion probabilities from the models minimizing MASE.
Figure 4. Posterior inclusion probabilities from the models minimizing MASE.
Sustainability 11 05305 g004aSustainability 11 05305 g004bSustainability 11 05305 g004c
Table 1. Descriptive statistics of time-series after the transformations.
Table 1. Descriptive statistics of time-series after the transformations.
MeanSDVarianceMinMaxSkewKurtosis
P_WHEAT0.00170.05720.0033−0.19680.25750.78223.1160
D_WHEAT0.00060.00110.0000−0.00360.0044−0.02780.6676
Q_WHEAT0.00070.00320.0000−0.00830.00980.16770.0975
S_WHEAT0.00030.00180.0000−0.00430.00530.0432−0.3567
OP_WHEAT0.06380.36020.1298−0.71421.39800.67310.4122
CY_WHEAT0.10440.05660.0032−0.26550.2488−1.11404.3320
P_CORN0.00230.05870.0035−0.21720.34650.32733.6120
D_CORN0.00110.00160.0000−0.00300.00840.62661.9940
Q_CORN0.00110.00450.0000−0.01390.01530.16310.9350
S_CORN0.00150.00370.0000−0.01110.01690.27041.9660
OP_CORN0.01000.09750.0095−0.27420.48061.01402.5020
CY_CORN0.09280.06110.0037−0.12570.36780.51151.9460
P_SOYBEAN0.00250.05450.0030−0.23900.28900.31943.7330
D_SOYBEAN0.00050.00060.0000−0.00140.00210.12510.0728
Q_SOYBEAN0.00050.00140.0000−0.00340.00540.37550.4194
S_SOYBEAN0.00010.00060.0000−0.00230.0018−0.15990.8353
OP_SOYBEAN0.01730.23930.0573−0.76814.623014.8000283.5000
CY_SOYBEAN0.11290.05360.0029−0.11480.2782−0.50242.1460
BIO0.00020.00080.0000−0.00140.0018−0.3993−1.4340
GDP0.03880.02390.0006−0.00990.09210.0231−0.7001
MS0.03580.01980.0004−0.04050.0882−0.73341.2110
KEI0.01050.55370.3066−1.63401.87900.87151.2510
SP5000.00730.04290.0018−0.21760.1318−0.60032.0520
R0.06540.03170.00100.01500.15320.5573−0.2871
AUD−0.00050.03290.0011−0.17760.1048−0.73383.6420
CAD−0.00050.01520.0002−0.10680.0619−0.35585.5340
OIL0.00630.08470.0072−0.35520.53790.75516.9910
E0.00510.06990.0049−0.28380.41320.54775.5630
F0.00470.07400.0055−0.30530.88293.874045.0700
Table 2. Forecast accuracy measures and Diebold–Mariano test outcomes for wheat.
Table 2. Forecast accuracy measures and Diebold–Mariano test outcomes for wheat.
RMSEMAEMASEDM Test StatisticDM Test p-Value
DMA (α = 0.99 = λ)12.39008.00101.0360--
DMA (α = 0.98 = λ)12.53008.14501.05501.63300.0512
DMA (α = 0.95 = λ)13.61008.91801.15503.88200.0001
DMA (α = 0.90 = λ)15.21009.78201.26704.72200.0000
BMA12.49008.03401.04100.37660.3532
DMS (α = 0.99 = λ)12.73008.29701.07502.71200.0033
DMS (α = 0.98 = λ)13.11008.36201.08302.40100.0082
DMS (α = 0.95 = λ)14.17009.21901.19404.09600.0000
DMS (α = 0.90 = λ)16.400010.19001.31904.35400.0000
BMS12.99008.34401.08102.14800.0158
MED (α = 0.99 = λ)12.34008.03501.04100.37920.3523
MED (α = 0.98 = λ)12.76008.22901.06601.81500.0348
MED (α = 0.95 = λ)14.08009.17501.18804.06900.0000
MED (α = 0.90 = λ)15.83009.81701.27104.43100.0000
BMED12.70008.21401.06401.75200.0399
TVP (λ = 0.99)12.62008.27601.07202.00000.0228
HA73.550049.37006.395015.53000.0000
ARIMA13.96008.85501.14702.50700.0061
NAÏVE13.88008.88001.15002.57100.0051
Table 3. Forecast accuracy measures and Diebold–Mariano test outcomes for corn.
Table 3. Forecast accuracy measures and Diebold–Mariano test outcomes for corn.
RMSEMAEMASEDM Test StatisticDM Test p-Value
DMA (α = 0.99 = λ)9.39005.91201.00600.46630.3205
DMA (α = 0.98 = λ)9.59106.01301.02301.09900.1358
DMA (α = 0.95 = λ)10.97006.56501.11702.72400.0032
DMA (α = 0.90 = λ)12.50007.21801.22803.57800.0002
BMA9.33905.88101.0000--
DMS (α = 0.99 = λ)9.52206.01701.02401.41300.0788
DMS (α = 0.98 = λ)9.78706.17301.05001.80500.0355
DMS (α = 0.95 = λ)11.90006.81601.15902.97400.0015
DMS (α = 0.90 = λ)13.67007.61201.29503.96800.0000
BMS9.32605.93301.00900.71230.2382
MED (α = 0.99 = λ)9.58406.07201.03302.06400.0195
MED (α = 0.98 = λ)10.02006.16601.04901.90200.0286
MED (α = 0.95 = λ)11.57006.58801.12102.25100.0122
MED (α = 0.90 = λ)13.75007.48501.27303.70600.0001
BMED9.47205.97501.01601.88100.0300
TVP (λ = 0.99)9.58806.14601.04502.13200.0165
HA63.400039.74006.759013.84000.0000
ARIMA10.20006.46901.10002.61200.0045
NAÏVE10.16006.25801.06401.68100.0464
Table 4. Forecast accuracy measures and Diebold–Mariano test outcomes for soybean.
Table 4. Forecast accuracy measures and Diebold–Mariano test outcomes for soybean.
RMSEMAEMASEDM Test StatisticDM Test p-Value
DMA (α = 0.99 = λ)19.620012.55001.0060--
DMA (α = 0.98 = λ)19.660012.66001.01500.86440.1937
DMA (α = 0.95 = λ)20.670013.53001.08402.62700.0043
DMA (α = 0.90 = λ)22.830014.76001.18303.56600.0002
BMA19.980012.72001.01901.07500.1411
DMS (α = 0.99 = λ)19.520012.58001.00900.25610.3989
DMS (α = 0.98 = λ)20.130013.01001.04202.06600.0194
DMS (α = 0.95 = λ)22.840014.72001.18004.44000.0000
DMS (α = 0.90 = λ)24.680015.82001.26804.76400.0000
BMS20.440012.95001.03801.80900.0352
MED (α = 0.99 = λ)19.840012.64001.01300.67390.2502
MED (α = 0.98 = λ)20.030012.91001.03501.72800.0420
MED (α = 0.95 = λ)21.850014.05001.12603.40300.0003
MED (α = 0.90 = λ)23.880015.18001.21604.34900.0000
BMED20.150012.73001.02000.84770.1983
TVP (λ = 0.99)19.810013.03001.04402.47700.0066
HA123.800084.14006.744015.81000.0000
ARIMA20.330012.85001.03000.60870.2714
NAÏVE20.000012.76001.02300.47030.3191

Share and Cite

MDPI and ACS Style

Drachal, K. Analysis of Agricultural Commodities Prices with New Bayesian Model Combination Schemes. Sustainability 2019, 11, 5305. https://doi.org/10.3390/su11195305

AMA Style

Drachal K. Analysis of Agricultural Commodities Prices with New Bayesian Model Combination Schemes. Sustainability. 2019; 11(19):5305. https://doi.org/10.3390/su11195305

Chicago/Turabian Style

Drachal, Krzysztof. 2019. "Analysis of Agricultural Commodities Prices with New Bayesian Model Combination Schemes" Sustainability 11, no. 19: 5305. https://doi.org/10.3390/su11195305

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop