1. Introduction
As an important chemical raw material and environmentally friendly source of energy, the price fluctuation of propane not only directly affects the supply and demand balance of the energy market but also has a profound impact on global economic structure and the stability of the industrial chain [
1]. In recent years, the transformation of the energy structure and the intensification of geopolitical changes have led to significant price fluctuations in propane [
2], which exhibit highly nonlinear and complex characteristics. Traditional statistical methods face considerable challenges in analyzing the underlying dynamic mechanisms of these fluctuations. How to accurately capture the fundamental patterns of the price fluctuations and predict market trends [
3] has become a significant focus of interest in both academia and industry.
Early studies primarily utilized time series models to analyze the volatility characteristics of propane prices. For instance, the GARCH model, proposed by Hamilton [
4], is extensively utilized in the energy market. It effectively captures volatility clustering and leverage effects, thereby revealing the conditional heteroskedasticity of price fluctuations. Pindyck [
5] integrated supply and demand elasticity with inventory dynamics into the analytical framework, highlighting that propane prices are significantly influenced by interconnection with the crude oil market, and that fluctuations in this market had substantial cross-market contagion effects. Dias et al. [
6] conducted a comprehensive analysis of oil, gas, and electricity prices in the U.S. market using a multi-transformation model that effectively captured the typical characteristics of energy prices. Soohyeon et al. [
7] found that the decline in ethane price, attributed to the U.S. shale gas boom, resulted in lower ethylene prices and higher propylene prices, while propane continued to be the primary factor influencing the latter. Nazioglu et al. [
8] examined the existence of volatility transmission between oil prices and financial pressures using a volatility spillover test. Moul and Miller [
9] used instrumental variables to estimate the monthly propane demand and (inverse) supply equations, and believed that the propane distributors who usually adopt a low-price strategy have temporary market dominance. Zhang et al. [
10] employed various econometric models to empirically examine the price discovery and risk transfer functions within the crude oil and gasoline futures markets. According to the wavelet transform method, Hung [
11] examined the short-term and long-term dependencies between crude oil and propane prices during the period 2006–2018, and found that there was a strong correlation between the crude oil and propane series in the short and medium term, while, in the long term, propane returns had a unidirectional impact on the crude oil market. In summary, numerous scholars both domestically and internationally have conducted systematic research on issues related to energy price systems using various methodologies. They have established many practical models to examine the transmission relationships between energy prices and their associated factors, as well as to predict future trends and volatility in energy prices. However, traditional research methods for analyzing energy price fluctuations primarily rely on the development of various econometric models to explain the causes and mechanisms behind oil price fluctuations. It is important to note that the energy price system is fundamentally a complex, nonlinear, and non-stationary system. Traditional approaches seldom address how to construct complex network analysis models for energy prices or how to explore the intricate characteristics of the energy price system. In recent years, complex networks have emerged as a research hotspot, with the central idea of interpreting the connections between different components of the system as complex networks to gain a deeper understanding of the essence of real-world systems.
In order to address the limitations of traditional models, the complex network theory has increasingly been integrated into energy market research [
12,
13]. Chiroma et al. [
14] proposed an integration of CRT-RVFLN-SVM (Classification and Regression Tree (CRT), Random Variable Function Link Neural Network (RVFLN), and Support Vector Machine (SVM)) to enhance the robustness and effectiveness of propane price estimation. Emmert-Streib and Dehmer [
15] proposed mapping financial time series to network nodes and quantifying the transformation rules of market states through topological indicators, such as degree distribution and clustering coefficient. This method has led to significant breakthroughs in stock market research. For instance, Mantegna and Stanley [
16] utilized the minimum spanning tree algorithm to uncover the correlation structure of asset prices. Huang et al. [
17] utilized a GA-LightGBM model with genetic algorithm-optimized parameters to construct an oriented weighted network and investigated whether the centrality of the stock network aids in predicting company performance. In recent years, He and Chen [
18] applied symbolic dynamics to the construction of crude oil price networks and discovered that key nodes have a warning effect on systemic risks, thereby providing a methodological reference for propane research.
Symbolic methods are being increasingly utilized in economics [
19]. Narayan et al. [
20] examined five distinct types of oil futures contracts and analyzed the clustering of price fluctuations. Daw et al. [
21] proposed that coarse-graining can effectively reduce noise and extract the semantic features of time series data, and the results confirmed the superiority of symbolic processing in exchange rate forecasting. Wu et al. [
22] converted time series into price graphs and used graph embedding techniques to extract meaningful structural features. Zhou et al. [
23] further integrated the sliding window technique with the hidden Markov model, successfully predicting the mutation points of gold prices, which makes evident that symbolic network methods offer innovative tools for market forecasting and risk management by uncovering global interaction structures and the dynamics of the key nodes. Cheng et al. [
24] introduced a multi-modality graph neural network (MAGNN) for financial prediction using heterogeneous graphs, and the graph-centric view of market dynamics in their paper conceptually aligns with the method used in this paper.
At present, research on energy price fluctuations primarily focuses on time series analysis, volatility modeling, and the identification of influencing factors. For instance, researchers employ the GARCH model to characterize the clustering of energy price fluctuations [
25] and utilize cointegration to examine the dynamic relationships among international crude oil, natural gas, and coal price indices [
26]. However, these methods are often limited to local feature extraction and struggle to reveal the global interactive network and dynamic evolution of price fluctuations. At the same time, the successful application of complex network theory in financial time series analysis offers a novel perspective for energy market research. By mapping price series into network nodes and edges, researchers can systematically analyze the transition paths of market conditions, the influence of key nodes, and the topological robustness of the network. Despite this, existing research has primarily concentrated on oil prices [
27] or heating oil prices [
28]. In contrast, the exploration of propane price fluctuation networks remains inadequate, and there is a notable absence of the targeted methodologies regarding symbolic representation, the dynamic weight definition, and the core node identification. Choi and Kim [
29] present a graph-based risk analysis framework that considers inflation-adjusted dependence in global financial markets, and their approach to network construction and entropy-based evaluation offers conceptual support for your symbolic network methodology.
Inspired by symmetry theory, this study conceptualizes price fluctuations as a dynamic symmetry-breaking process. And the sliding window technique introduces time translation symmetry, ensuring that the generation process of the fluctuation mode adheres to the translation invariance constraint of the time series, and this symmetry framework not only reduces the dimensionality of the data but also preserves the essential dynamic characteristics of price fluctuations. Therefore, this paper constructs a PPOWN based on a specific framework and explores the network’s topological structure. This study presents three innovative contributions as follows.
(1) We transform the fluctuation sequence of propane price into a feature set consisting of five symbols {R, r, e, f, F}, which differs from the previous literature [
30,
31] that utilized three symbols {Y, O, N}. This transformation method more effectively captures the complexity of the propane price fluctuation.
(2) A PPOWN is constructed based on the fluctuation sequence and analyzed for comparison, which confirms that the PPOWN is a scale-free network.
(3) In constructing the network, the time factor was taken into account, and the evolution of nodes over time was examined. This approach not only identified important nodes but also investigated the distribution characteristics of the appearance times of these significant nodes. This method not only overcomes the limitations of traditional models in effectively representing nonlinear relationships, but it also quantifies the significance and clustering characteristics of fluctuation patterns by utilizing topological indicators such as node strength, betweenness centrality, and clustering coefficients.
The framework structure of this paper is organized as follows (as illustrated in
Figure 1):
Section 2 introduces the data sources, data processing methods, and the model construction process;
Section 3 introduces the topological properties of the PPOWN;
Section 4 provides a comprehensive discussion that integrates the previous sections; and, finally, the summaries and insights are given in
Section 5.
From the research framework of this paper, the study can be divided into three levels. First, we convert the propane price fluctuation series into a symbolic sequence. Next, we construct a PPOWN based on the transformations among different fluctuation modes, and we discuss the topological properties of this network. Subsequently, we derive the dynamic characteristics of the propane price fluctuation model from the topological properties of the PPOWN. Following this, we analyze the volatility behavior characteristics of propane price based on the dynamic features of the fluctuation model. Finally, we examine the linear characteristics of the cumulative time of outliers in the propane price fluctuation process and identify the time distribution characteristics of significant patterns that emerge during this process.
3. Empirical Analysis
By constructing the PPOWN, we aim to elucidate the patterns of these fluctuations. This approach not only offers a comprehensive understanding of the market dynamics and the underlying causes of the price changes but also enables the evaluation of propane price variations at different time points. Consequently, it assists companies in optimizing resource allocation on a global scale. Furthermore, this analysis provides scientific decision-making support for companies in production, procurement, sales, and other areas, thereby enhancing their competitiveness in a complex market environment. This section will analyze the PPOWN, focusing on key indicators such as node strength and distribution, the regularity of new node emergence, the node conversion cycle, node betweenness, and the clustering coefficient.
3.1. Strength and Distribution of the Nodes
By recognizing the significance of nodes and the strength of connections within the PPOWN, companies can effectively adjust their production and inventory strategies in a timely manner when confronted with market fluctuations. This proactive approach helps to mitigate risks and capitalize on market opportunities. In order to further elucidate the temporal distribution characteristics and power-law distribution properties of significant nodes within the PPOWN, this section examines the node strength and distribution characteristics of the PPOWN, as illustrated in
Figure 5.
The strength and distribution of nodes illustrate the correlation among the fluctuation modes of propane price. A greater strength of nodes within a modality indicates a higher frequency of transformation into other modes (including the current mode) and an increased probability of occurrence within the network. The left axis in
Figure 5 represents the strength of the corresponding node, the right axis illustrates the cumulative strength distribution of the nodes, and the horizontal axis denotes the corresponding nodes. As shown in
Figure 5, most nodes in PPOWN display low strength, whereas only a few nodes demonstrate high strength, which indicates that the likelihood of most nodes appearing in the PPOWN is relatively low. Meanwhile, the distribution of strength reflects that the PPOWN appears the typical characteristics of a scale-free network.
In addition, there are 2289 nodes in the PPOWN. Among these, 22 nodes have a strength greater than or equal to 20, resulting in a total degree of 572. These nodes account for 7.18% of the overall strength, which is 7963. In other words, 0.96% of the nodes contribute to 7.18% of the total strength, as illustrated in
Table 3.
In the PPOWN, the average contribution rate of the connections between the important nodes and node strength is 55.77%. The result indicates that the PPOWN exhibits clear positive correlation characteristics. Specifically, nodes with a large strength tend to connect with other nodes of high strength. From the analysis above, it is evident that, while the transitions between fluctuation states of the propane price occur frequently and exhibit considerable complexity, the initial 0.96% of the nodes effectively represent the core fluctuation state of the propane price, which suggests that the future fluctuation state of the propane price is likely to resemble the fluctuation state observed during a specific period in the past. Therefore, by identifying the fluctuation patterns of the initial 0.96% of the nodes and the relationships between their transformations, we can provide an approximate description of the fundamental characteristics of propane price.
The strength of the node can reflect the influence or importance of a node within the entire network to a certain extent. Therefore, we examined the nodes with a node strength greater than or equal to 20 in the PPOWN. We obtained the node names, strengths, and appearance times (as shown in
Figure 6). In this figure, the red lines indicate positive direction, the blue lines indicate reverse direction, the characters represent the names of the nodes, and the numbers on the lines between nodes indicate the weights.
In
Figure 6, it is evident that, from the moment significant nodes emerge, the PPOWN reveals that nodes with high point strength are closely interconnected. While the nodes with high point strength tend to be those that appeared earlier, it is important to note that their earlier appearance does not guarantee a high point strength. In the PPOWN, the 22 nodes with a node strength of 20 or greater range from the 12th to the 182nd nodes in the network. Among these, the node ‘ffrrr’, which has the highest strength of 33, first appeared on 18 August 1992, making it the 24th node to be introduced in the PPOWN. Among the first 182 nodes in the PPOWN, 7 nodes exhibit a strength of one. The earliest node is ‘rfssF’, which was introduced on 30 November 1992. This node is the 79th to appear in the PPOWN. It is evident that the nodes appearing early in the PPOWN do not necessarily possess strong point strength.
According to the Formulas (5)–(7), the double logarithmic representations of the node strength distribution for the PPOWN are presented below, as illustrated in
Figure 7.
In
Figure 7, the red triangles represent the fitted data points, specifically the logarithm of the strength and its corresponding proportion.
Figure 7 illustrates that, in the PPOWN, the majority of the nodes exhibit a low point strength, while only a small number of the nodes demonstrate high point strength. Using the least squares method for straight line fitting, the double logarithmic curve of the node strength in the PPOWN is subjected to linear regression. The resulting regression equation is denoted as
, and the correlation coefficient of the trend line along with the power exponent is presented in
Table 4.
As illustrated in
Table 4, the results demonstrate a high degree of credibility, indicating that the PPOWN follows a power-law distribution overall. The result also indirectly suggests that the PPOWN mentioned above is a scale-free network. For a scale-free network, a higher degree of power-law distribution corresponds to a larger power exponent. The power-law exponent (
) of the PPOWN is 2.3051, the value of which indicates that the PPOWN exhibits high complexity and regularity, which indirectly reflects the intricate intrinsic dynamic characteristics of the network.
3.2. Regularity of the New Node Appearance
According to the PPOWN, we begin by studying the cumulative time intervals of new nodes that appear in the network to analyze their occurrence patterns, as illustrated in
Figure 8.
In
Figure 8, the solid red line represents the curve of equal time intervals, in which it can be observed that, as time progresses, the intervals at which new nodes appear in the evolution of the PPOWN are not uniform; instead, they gradually increase, exhibiting a linear growth trend. Least squares regression analysis is conducted on the cumulative time of the new nodes appearing in the PPOWN. The resulting regression equation is denoted as
, and its trend line correlation coefficient is represented as
. The findings indicate a high degree of credibility, demonstrating that the cumulative time of the new nodes in the PPOWN exhibits a linear growth trend. This trend is characterized by a consistent pattern, which reflects the predictability of the changes in the propane price based on the temporal distribution characteristics.
In summary, the emergence of new nodes in the PPOWN indicates the presence of abnormally fluctuating price nodes, which differ from previous fluctuations. Additionally, the cumulative duration of these abnormally fluctuating price nodes exhibits a linear growth trend. This pattern can be utilized to effectively identify the onset of the abnormally fluctuating price nodes, enabling price decision-makers to respond promptly to the new price fluctuations. Consequently, this approach enhances the accuracy of the propane price forecasts and mitigates price risks.
3.3. Conversion Cycle of the Nodes
According to Formulas (8) and (9), the distance between any two nodes in the PPOWN is calculated using the Floyd algorithm [
33]. The results are presented in
Figure 9a, while the distribution of various path lengths is illustrated in
Figure 9b.
As illustrated in
Figure 9, the diameter and average path length of the PPOWN are
and
,respectively, but the distance between nodes ranges from 6 to 11, accounting for 82.86% of the total number of nodes; this indicates that the price fluctuation pattern in the PPOWN is predominantly short-range related, with a conversion cycle of the approximately 8 to 9 days, suggesting that conversions occur relatively frequently. This analytical result provides a theoretical foundation for predicting the periodic conversion patterns of propane prices in the future.
3.4. Betweenness of the Nodes
According to the definitions and formulas of the node betweenness in the network [
35,
36], the evolving relationship between the betweenness and strength of the nodes in the PPOWN over time is analyzed, as illustrated in
Figure 10.
According to
Figure 10, the betweenness of the nodes in the PPOWN, along with the strength, name, first appearance time, and location of their corresponding nodes, has been calculated. The nodes are then arranged in descending order based on their betweenness, as presented in
Table 5, which includes only the top eight nodes.
According to
Table 5, the relationship between the betweenness and strength of the node in the PPOWN reveals that nodes with a high betweenness exhibit low strength, which indicates that nodes with a lower strength play a crucial role in connecting the network. Their presence often signifies that the market has entered a transitional phase, which aligns with Borgatti’s [
40] theory on node betweenness centrality. The nodes with high strength are linked to other nodes through those with low betweenness. When the betweenness of the node is high and shows a continuous upward trend, it suggests that other nodes must pass through this node to connect with additional nodes, marking a significant period. Identifying these nodes during this phase can enable us to effectively predict the volatility of the propane price in the subsequent period.
3.5. Clustering Coefficient of the Nodes
This section primarily analyzes the evolutionary relationship between the clustering coefficient and strength of the node in the PPOWN with the time, which is shown in
Figure 11.
The clustering coefficient quantifies the degree of aggregation among the nodes in a network and reflects the network’s grouping characteristics. A higher clustering coefficient indicates a greater likelihood of pattern formation among the nodes. As illustrated in
Figure 11, the clustering coefficient of the nodes ‘sRRRR’ and ‘FRRRR’ are both 0.5, which suggests a high likelihood that two nodes connected to the same node are also interconnected. This result suggests that there is a significant likelihood of the sharp rising state occurring in the future, at which point decision-makers are highly likely to develop effective strategies to cope with the propane price increases. According to the calculation result, the average clustering coefficient of the PPOWN is 0.0015. Notably, only 24 nodes exhibit a clustering coefficient greater than zero, which are presented in the accompanying
Table 6, arranged from largest to smallest.
By comparison, the clustering coefficient and average clustering coefficient of the PPOWN are both small, which indicates that the likelihood of two nodes, which are connected to a common node, also being connected to each other is minimal. Furthermore, the nodes with non-zero clustering coefficients in the PPOWN typically exhibit low strength, but a small subset of these nodes also demonstrates high strength, which suggests that the PPOWN is not entirely random and possesses certain characteristics of a “community” to some extent. However, the more pronounced clustering characteristics within its network may manifest in either small or large communities. The time distribution characteristics of the node clustering coefficient, which is not equal to zero, indicate that the clustering coefficient of the nodes in the PPOWN first emerged on 30 July 1992, and last appeared on 8 November 2017. The clustering coefficients may vary depending on the time scale, indicating that the volatility of propane prices can sometimes be observed on both larger and smaller time scales. Analyzing the clustering coefficient of the nodes in the PPOWN can provide valuable insights for future research on the clustering of the propane price fluctuations.
5. Summaries and Insights
This paper utilizes the U.S. Energy Information Administration’s propane spot prices from 9 July 1992 to 22 July 2024 as sample data and defines the fluctuation pattern through a coarse-grained method. We converted the fluctuation sequence of the propane price into a character string consisting of five symbols: {R, r, e, f, F}. Using a model with a duration of 5 days and a sliding step size of 1 day, we constructed a sliding window that can be obtained. In the corresponding propane price time series, the fluctuation modes are represented as the nodes in chronological order, while the transitions between the fluctuation modes serve as edges, with the number of transitions assigned as weights, then the PPOWN is established. According to the PPOWN, we then analyzed the evolution of new nodes appearing in the PPOWN. We calculated the strength, strength distribution, clustering coefficients, and the average shortest path of the PPOWN, identifying important nodes and their occurrence times. The results are as follows:
(1) Through the analysis of the strength and distribution of nodes in the PPOWN, we find that the core fluctuation state is represented by the first 0.96% of nodes, the average contribution rate of the interconnections among the important nodes is 55.77%. Most nodes exhibit low strength, while only a few possess high strength, which is a characteristic feature of scale-free networks. Additionally, the connections between the nodes with high strength are very close. However, while the nodes with high strength tend to be those that appeared earlier, it is not necessarily the case that the earlier nodes possess high strength.
(2) According to the Floyd algorithm, the distance between any two nodes is calculated to determine the average path length in the PPOWN. The results indicate that the fluctuation mode conversion period of the PPOWN is 8 to 9 days, which suggests a short-range correlation between the price fluctuation modes within the PPOWN, as well as the complexity and diversity of its price fluctuation states.
(3) Although propane price fluctuations exhibit complex nonlinearity, the cumulative times when the new nodes appear in the PPOWN is not nonlinear. Through the analysis of the regularity in the emergence of new nodes within the PPOWN, it was discovered that the cumulative time for the emergence of the new nodes exhibited a strong linear growth trend. This result is beneficial for predicting both the emergence of new nodes and the time of their corresponding appearance in the network.
(4) The PPOWN indicates that nodes with low strength have a considerable ability to influence communication among other nodes. The presence of these low-strength nodes suggests that the network is undergoing a transitional phase. Identifying and analyzing these nodes can effectively predict the fluctuation of the propane price in the upcoming period. Through the examination of the network’s clustering effect, it has been observed that certain nodes can serve as the shortest path between other pairs of nodes within the entire network. This result offers valuable insights and guidance for future studies on the clustering of the propane price. Finally, through a comprehensive analysis of the strength, betweenness, and clustering coefficient for the node in relation to the time distribution characteristics, it was found that the PPOWN exhibits a low betweenness, clustering coefficients, and strength for the node. This behavior is distinct from that of random and chaotic networks.
The propane market price is influenced by various factors, including market demand, supply levels, fluctuations in crude oil prices, geopolitical events, seasonal variations, macroeconomic conditions, changes in the financial markets, the policy environment, and transportation and logistics conditions. Together, these factors determine the trends and levels of propane price fluctuations, making the study of these characteristics highly valuable in practical terms. Therefore, the research presented in this paper offers theoretical support for various participants in the energy market to forecast future propane price trends within the context of overarching market dynamics. This has significant implications for understanding systemic risk behaviors in the energy market and for informing investment strategies within this sector. On the one hand, future research will focus on constructing a multi-level interdependent network model driven by the multi-factor synergy, and this model aims to elucidate the core mechanisms of the price transmission among various factors, and it will integrate the patterns of the new node emergence with the identification of key nodes to predict the states of the nodes. On the other hand, future research will concentrate on developing a multi-level interdependent network model driven by synergistic interactions among multiple factors, and this model will aim to clarify the fundamental mechanisms of price transmission across various elements. Meanwhile, it will incorporate the patterns of new node emergence while identifying key nodes to forecast the states of network nodes.