1. Introduction
According to data released by the Civil Aviation Administration of China, as of early 2024, China has over 250 airports and more than 5000 flight routes, making it the world’s second-largest aviation market. With rapid economic development and increasing travel demand, aviation networks have evolved from traditional local transportation modes to national and dense network systems [
1]. During this process, the complexity of aviation networks has continuously increased, with the evolution of network structures and node connection patterns exhibiting strong dynamic and nonlinear characteristics, urgently requiring systematic theoretical analysis and practical guidance [
2]. The stability, resilience, and optimal design of aviation networks directly impact the national and regional transportation efficiency, economic development, and public welfare [
3]. Aviation networks face a fundamental paradox: why does sparse connectivity persist despite dense node distribution? This contradicts traditional cost-driven explanations (Li & Dawood [
4]). Therefore, in-depth research on the evolutionary characteristics of aviation networks can provide theoretical support for network management and planning.
Existing studies on aviation networks exhibit several notable limitations. Firstly, they suffer from static bias, as structural metrics typically fail to account for temporal evolution, leading to incomplete representations of the dynamic nature of these networks (Freeman [
5], Barrat [
6]). Secondly, many studies oversimplify the mechanisms by attributing network sparsity to a singular set of factors, thereby neglecting the multifactorial nature of sparsity in aviation networks (Leng [
7], Sun [
8]). Lastly, there is a significant methodological gap due to the absence of comprehensive multidimensional complexity metrics. This absence impedes a deeper understanding of the intricate processes governing network behavior (Bilotkach, 2019 [
9]). As a result, the majority of current research on aviation networks tends to focus narrowly on static structural analyses or the optimization of dynamic evolution models, overlooking a more systematic exploration of the complexity evolution process. This gap is especially evident in the lack of in-depth studies on the dynamic relationship between network redundancy and information entropy, which are essential factors influencing the network’s adaptability and efficiency over time.
This study integrates a network sequence analysis with a density matrix theory to provide a comprehensive framework for capturing the dynamic behavior of aviation networks. Specifically, the analysis focuses on three key dimensions:
(1) Structural evolution, assessed through temporal variations in degree distribution and clustering coefficients (Equations (1)–(4));
(2) Complexity dynamics, evaluated via the coupling relationship between network redundancy and information entropy (Equations (5)–(7));
(3) Economic-network coordination, modeled through the integration of macroeconomic factors with network performance indicators (Equations (13)–(17)).
By incorporating the temporal dimension into the density matrix framework, this approach enables a systematic investigation of how aviation network structures and complexities evolve over time. The overarching objective is to identify an optimal trade-off between transportation efficiency and response diversity, thereby enhancing the adaptability and robustness of aviation networks under dynamic operational conditions.
The key findings of this study include the following: ① The scale-free topology of the network intensifies over time. ② The sparsity–efficiency trade-off follows phase transition rules. ③ Eastern China leads in coupling coordination. The theoretical and practical contributions are threefold: firstly, the development of a time-embedded density matrix theory that explains sparse connectivity as an equilibrium between efficiency and diversity (Equations (9)–(11)), addressing the theoretical gap identified by Ghavasieh [
10]; secondly, the introduction of a redundancy–entropy coupled metric to quantify complexity evolution (Equation (7)), advancing beyond Barrat’s [
6] single-efficiency indicator; and lastly, the identification of regional disparities in economic-network coordination, providing valuable insights for the development of PPP models for airports in western China.
3. Data Sources and Aviation Network Reconstruction
This study investigates the structural evolution characteristics of China’s aviation network from 1990 to 2024 using a 27-year period of data as the research object. The research data are sourced from the Official Aviation Guide (OAG) [
25] and Civil Aviation Administration of China (CAAC) [
26] from a statistical perspective. The data at the beginning of each year serve as the basis for the annual aviation network data.
Graph theory is an effective tool for analyzing the evolution of aviation networks. In aviation network analyses, cities with airports are considered nodes, while flight paths are considered edges. This study constructs aviation networks for various years, denoted as GT, based on data collected on Chinese cities with air service and airline flight routes. is a set , consisting of the number of cities with airports () and the number of flight routes between cities (), representing the aviation network, where T denotes the year. As T varies, generates a series of aviation networks over different years. Data for cities with multiple airports are combined; for example, Shanghai Pudong Airport and Shanghai Hongqiao Airport are combined to form “Shanghai”. Due to data collection constraints, Hong Kong, Macau, Taiwan, and other locations are not included in the analysis.
Network sequence diagrams were created every five years from 1990 to 2024, as shown in
Figure 1. The black nodes in the figure represent major Chinese cities with air service as of 1990. Newly added cities with air service between 1990 and 1995 are indicated by red nodes; cities added between 1995 and 2000, 2000 and 2005, 2010 and 2015, and 2015 and 2020 are represented by orange, yellow, green, cyan, blue, and purple nodes, respectively. The figure reveals the following two distinct features:
- (1)
Larger cities, with more flight routes, attract newly added cities to initiate air service. In other words, the edges of new nodes tend to connect with high-degree core nodes. This characteristic indicates some scale-free properties within the aviation network.
- (2)
Over time, new routes connecting existing cities also emerge, alongside new routes introduced by recently added cities. Consequently, the network structure becomes increasingly complex.
4. Methodology
This research examines the temporal evolution of the aviation network structure. Utilizing network sequence analysis techniques, it systematically reveals the intricate characteristics and relationships exhibited by aircraft networks over time. Firstly, the study employs the node degree distribution, average path length, and network clustering coefficient of the aviation network sequence to explore the dynamic evolution of the network structure. Secondly, it analyzes dynamic changes in network complexity through redundancy, network efficiency, and network information entropy of the network sequence. Thirdly, to investigate the balance between aircraft network efficiency and associated diversity, the time dimension is incorporated into the theory of network density matrices. Finally, the coordinated and paired relationship between economic development and aviation networks is examined.
4.1. Analysis of the Structural Evolution of Aviation Networks
The selected metrics address two core characteristics of aviation networks: ① Scale-free property verification. Degree distribution is essential because >85% of transportation networks exhibit power-law patterns [
27], and our observation of Beijing-centric growth requires quantifying hub dominance. ② Hierarchical connectivity dynamics [
28]. The average path length measures the global efficiency of passenger flow, while the clustering coefficient captures the regional cluster formation. Therefore, we calculate the degree distribution, average path length, and clustering coefficient of China’s aviation network, respectively.
The number of aircraft routes in a city, denoted by
, represents the node degree of the aviation network. The degree distribution of the route network is represented by
, which is the probability distribution of the node degrees of all cities [
29]. This indicates the importance of each city within the overall network. The following is the calculation formula:
In Equation (1),
indicates the number of nodes with degree
k, and
N indicates the total number of nodes in the network. When the condition
is satisfied, and a logarithmic transformation is applied to both
and
, resulting in a linear correlation, it indicates that the aviation network follows a power-law distribution [
30].
The average length of the shortest paths between every pair of nodes in the aircraft network is referred to as the average path length, denoted as
L [
31,
32]. It measures the overall connectedness and efficiency of the aviation network. The calculation formula is as follows:
In Equation (2),
represents the shortest path length between nodes
i and
j, while
L denotes the average path length. The local clustering coefficient of the network, denoted by
, is defined as the ratio of the actual number of edges connected to node
i to the number of possible edges between the neighbors of node
i [
33]. The formula for this calculation is as follows:
In Equation (3),
denotes the number of actual edges between the neighbors of node
i. The local clustering coefficient
in aviation networks can be employed to determine whether a specific airport (node) has a high degree of connectivity with its neighboring airports. The average value of the local clustering coefficients across the network is represented by the global clustering coefficient, or
C, calculated using the following formula:
A higher clustering coefficient typically indicates the presence of numerous local groups or highly connected airports within specific regions.
4.2. Analysis of the Complexity Evolution of Aviation Networks
Here, we choose network redundancy and information entropy as the methods to solve the core paradox. This choice is both necessary and insightful, as it can comprehensively address two fundamental challenges in the network theory:
① Why do sparse connections persist despite dense node distribution (
Figure 1): Sparse connections in a network, despite a high density of nodes, suggest an optimization process that balances cost and efficiency. Network redundancy, quantified by Equation (5), helps explain how networks maintain robustness and fault tolerance without the need for excessive connections. By introducing redundancy, the network ensures that alternative paths exist in the case of a failure while still minimizing the number of direct connections. This ensures that the network remains resilient without incurring high costs, such as increased wiring or connectivity in dense node regions [
31].
② How networks balance functionality and cost:
Information entropy, as represented in Equation (7), is a measure of uncertainty and distribution of resources across a network. This metric allows for the assessment of how efficiently the network is utilizing its resources (such as bandwidth or connections) while maintaining optimal functionality. A network that efficiently balances functionality with minimal redundancy, or vice versa, can be better optimized both in terms of performance and cost. Information entropy enables the network design to dynamically adjust the distribution of connections based on the level of necessary functionality, ensuring minimal cost while maintaining overall network resilience [
34].
The concept of network redundancy refers to the availability of alternative routes within a network, which ensures that connectivity is maintained even if a node or edge fails. Our previous research has proposed a method to quantify the network redundancy from a comprehensive perspective [
32].The specific procedure is as follows: (1) Firstly, construct an adjacency matrix
A by mapping the aviation network. (2) Next, calculate the covariance and eigenvalues of the adjacency matrix, rank the eigenvalues by magnitude, and determine the proportion of variance explained by each eigenvalue. (3) Finally, as shown in
Figure 2, plot the percentage curve on a Cartesian coordinate system, where the redundancy is represented by the shaded area. The calculation procedure is detailed below:
The covariance between nodes
i and
j in the network is denoted by
. Next, determine the eigenvalues
of the covariance matrix
and their corresponding eigenvectors. Then, sort the eigenvalues by magnitude.
Lastly, calculate each node’s cumulative contribution rate. Plot a curve with
as the abscissa and
λ as the cumulative percentage of the first
n eigenvalues on the ordinate. The redundancy
R in
Figure 2 is represented by the area of the shaded region; the larger the value of
R, the greater the redundancy.
The complexity of the aircraft network, or the unpredictability of the information source, is quantified by the network’s information entropy, denoted as
H(
k). The network information entropy is calculated using the likelihood of degree distribution [
18]. The formula for information entropy is as follows:
Higher information entropy indicates a more complex aviation network with additional unknown factors influencing the network. According to the information entropy formula, an aircraft network with higher entropy has a more intricate structure and more unpredictable elements.
By adopting these two methods, network redundancy and information entropy, the approach offers a comprehensive framework to resolve the paradox of sparse connections in dense networks and the trade-off between cost and functionality. The combination of these methods provides insights into the structural design of resilient, efficient, and cost-effective networks, making them indispensable tools for understanding complex network behaviors.
4.3. Study on Sparsity and Efficiency in Aviation Network Evolution
A theoretical framework based on the network density matrix is presented here [
35]. Using this approach, we compare phase transitions in physical systems to the formation process of aviation networks. Specifically, employing basic diffusion dynamics, we first elucidate how information spreads in aviation networks, where “information” encompasses elements such as passenger demand and emergency signals. Information (or disruptions) within the network propagates from one node to its neighboring nodes, akin to physical systems. The Laplacian matrix L can characterize the phase transition process of information diffusion paths, which can be interpreted as the increase or reduction in edges:
If there is air service between cities i and j, then ; otherwise, . Here, A represents the adjacency matrix.
M is a diagonal matrix, where for and for . This matrix considers each node’s degree of connectivity and the connections between the nodes.
The statistical propagation factor,
, is then determined using the following formula:
where
represents the propagation scale, a very small value implies that information primarily propagates between nearby nodes, whereas a large value indicates long-range propagation. The network density matrix
is defined as follows:
The network development process can be understood as a phase transition from disorder to order. In this context, the information flow gain (
W), which is analogous to “work” in physical systems, signifies the network’s information transmission capability gained through link establishment. This is one of the two key physical quantities in this process:
The incremental information flow of the network, from a state of zero edges to various time points, is represented by
. Analogous to “heat” in physical systems, the response diversity loss (
Q) indicates the loss of free response modes as the system forms connections.
denotes the amount of response diversity loss in the network from an initial state of zero edges to various time points. The efficiency
indicates how well the network maintains response diversity as it acquires the ability to transmit information.
Here, we measure the network sparsity (
V) by the ratio of the total number of edges to the maximum possible number of edges.
where
E represents the actual edge count of the network. A network is considered sparser if
V is smaller and denser if
V is larger. We investigate the relationship between sparsity and the network’s propagation effectiveness at different levels of sparsity.
4.4. Study on the Coupling Coordination Degree Between Aviation Network Evolution and Economic Development
We examine the potential impact of economic factors on the evolution of aviation networks [
36,
37]. Firstly, we develop an indicator system for aviation networks and comprehensive evaluation indicators for urban economic capabilities, as depicted in
Table 1, to construct the dataset.
Specifically, Dataset Y consists of the evaluation indicators for urban economic development, while Dataset X represents the indicator system for aviation networks.
Subsequently, the data were processed using Min−Max normalization. The formula for Min−Max normalization is as follows:
Here, represents the minimum value of the indicator, and represents the maximum value of the indicator. This method scales all indicators to the range of [0, 1], eliminating the impact of dimensional differences and ensuring comparability across the variables.
Next, the coupling degree was calculated using the formula:
where
C denotes the coupling degree.
The comprehensive evaluation index
T was then computed to measure the overall development level of the coupled systems, defined as follows:
and are the importance weights of the two systems. In this study, the two systems are considered equally important, so .
The coupling coordination degree model further evaluates the balance between system development levels. A common formulation is the following:
Next, we employ spatial autocorrelation techniques to determine the spatial distribution pattern of data values and assess whether these values are influenced by geographic environmental factors. Subsequently, spatial autocorrelation analysis is applied within the coupling coordination model. Specifically, Moran’s I is used to identify the spatial distribution pattern of the values and detect whether there are clustering, dispersion, or random distribution patterns. Moran’s I is particularly valuable for quantifying the global spatial autocorrelation pattern in the aviation-economy coupling coordination system, aligning precisely with the core objective of this study: to verify whether regions with high or low coordination exhibit significant clustering trends, rather than modeling causal mechanisms or local spillover effects. The formula is as follows:
In this formula, Z denotes the standardized variable vector, W is the spatial weight matrix constructed from the association intensity between locations, and tr represents the trace of the weight matrix. The result of is in the range of [−1, 1]. A value of indicates positive spatial autocorrelation, meaning that high-value nodes in space tend to be adjacent to other high-value nodes, while low-value nodes tend to be adjacent to other low-value nodes. A value of signifies negative spatial autocorrelation, where high-value and low-value nodes are alternately connected. When , the spatial distribution is random with no significant pattern. In the analysis using Moran’s I, both the P-value and Z-statistic are comprehensively considered. If , the null hypothesis is rejected, indicating significant spatial autocorrelation. Moreover, a larger absolute value of the Z-statistic implies stronger spatial autocorrelation.
5. Analysis of China’s Aviation Network
The degree distribution of the aviation network at various time periods is illustrated in
Figure 3. This series of log–log scatter plots systematically demonstrates a statistically grounded transition toward scale-free topology. Each annual distribution depicts degree
against probability
, where the empirical data (black dots) increasingly align with fitted power-law models (orange dashed line) across decades. Crucially, three quantitative metrics validate the progressive convergence to power-law distribution: the absolute value of the scaling exponent
exhibits a marked rise from 0.5773 (1990) to 0.7524 (2024), quantitatively confirming the steepening skewness toward hub dominance. Simultaneously, Pearson’s correlation coefficient
strengthens from −0.7688 to −0.8463, indicating tighter linear alignment in the log–log space, a diagnostic requirement for robust power-law behavior. Complementary goodness-of-fit is evidenced by a coefficient of determination (
) improvement from 0.5910 to 0.7506, decisively rejecting visual subjectivity. The temporal trend graph explicitly documents this structural shift: the monotonic ascent of
and
alongside rising
between 1990 and 2025 corroborates how the network’s hierarchical organization intensifies through airline deregulation and hub-centric growth policies. Such systematic numerical progression—driven by preferential attachment to major hubs (e.g., Beijing/Shanghai)—establishes China’s aviation network as a canonical example of spontaneous scale-free emergence in real-world infrastructure.
Figure 4 shows the structural evolution of China’s aviation network across four critical topological metrics over four decades (1990–2024). The L plot (average path length) exhibits a distinctive pattern: following an initial dip to approximately 2 in the late 1990s, values rise steadily to 2.20 by 2025, signaling increasing global separation between the nodes despite a network expansion, which aligns with global aviation network evolution patterns quantified by Guimera et al., 2005 [
16], where route densification among existing.
Cities consistently reduce topological distances ( for ). Conversely, the post-2000 L increase correlates with the Chinese airport expansion rates exceeding the route growth (CAAC data: 82 new airports vs. 48% route growth from 2000 to 2020). The stabilization post-2020 matches OECD’s infrastructure maturity thresholds, where indicates balanced growth. Concurrently, the C plot (clustering coefficient) demonstrates monotonic growth from 0.66 to 0.71, evidencing intensified regional cluster formation, a phenomenon aligned with empirical observations of airport alliance development (e.g., Beijing–Shanghai–Guangzhou triangle consolidation). The R curve (redundancy) reveals strategic inflection: after declining to a 2000 nadir (0.25) reflecting efficiency-focused optimization, the values rebound to 0.33 by 2025, confirming deliberate resilience enhancement through alternative route expansion, as documented in CAAC infrastructure reports. Most significantly, the H metric (information entropy) climbs persistently from 5.60 to 6.00, quantifying escalating topological complexity commensurate with network scale growth, with each 0.1 increment representing of an operational uncertainty rise per Shannon’s framework.
Collectively, these trends establish China’s aviation system as a paradigm of adaptive scale-free evolution. The synchronous ascent of C (local cohesion) and L (global separation) manifests hierarchical restructuring, where regional hubs absorb connections at the expense of peripheral accessibility, consistent with Barabási’s [
20] preferential attachment principle. The R-H divergence pre/post-2000 captures strategic reprioritization: an initial redundancy reduction in economic efficiency (matching IATA’s 1990s deregulation trends), subsequently reversed to bolster robustness post-2010, exemplified by the “13th Five-Year Plan” mandating secondary route redundancy. Critically, entropy (H) escalation at 0.09/decade exceeds the 0.05 theoretical threshold for spontaneous scale-free emergence, mathematically confirming the network’s progression toward heterogeneous, hub-dominated architecture. These quantitative signatures—particularly R’s rebound and H’s surge—empirically validate China’s deliberate transition from linear expansion to strategically resilient topology.
Figure 3 illustrates that China’s aviation network exhibits clear scale-free characteristics. Therefore, we compare it with a scale-free network model (a BA network model).
Table 1 presents the structural development data for both the aviation and BA networks, where “edge” in the BA network denotes the number of edges introduced for each new node during the BA model’s growth process. The final expansion scale for all BA networks is 260 nodes. The
Table 2 shows that while the clustering coefficient and network information entropy of China’s aviation network are much larger than those of the BA network, the average path length and redundancy are comparable to the BA model when the edge count is 9 or 11.
We examine the information propagation process within the aviation network from the perspective of diffusion dynamics to determine the optimal balance between operational efficiency and network adaptability. We limit the BA model’s scale to 260 nodes, corresponding to the number of major Chinese cities with air services at the beginning of 2024. By continuously adding edges, we reduce the network model’s sparsity, creating 17 network sequences with decreasing sparsity
V, denoted as
.
Figure 5 illustrates the relationship between network sparsity and propagation efficiency over time at various propagation scales. The findings indicate that the scale of information propagation significantly impacts the propagation efficiency of both the aviation network and the BA model, and this trade-off evolves with the propagation time. As the time scale increases, the complex network structure increasingly affects the information propagation efficiency. We find that in the BA network, optimal efficiency under different propagation scales corresponds to different levels of sparsity, with network propagation efficiency initially increasing and then decreasing with the sparsity. Sparser networks are required for optimal propagation efficiency at larger propagation scales.
Figure 4 demonstrates the gradual decrease in the network sparsity and the tightening of node connections in the aviation network. Over time, the network’s structural efficiency progressively decreases as this clustering effect intensifies. While dense networks can improve local connections, they may also reduce the information propagation efficiency due to increased complexity and redundancy. The advantages of using sparser structures become evident. For example, China’s aviation network achieved higher information propagation speed and efficiency in 2005 due to its simplified connection patterns and relatively low redundancy. This phenomenon reflects the network’s phased adaptive characteristics during the development of aviation—rapid expansion with minimal redundancy. Conversely, high density can lead to inefficiency and resource waste as the network grows, while excessive sparsity can cause node overload. Therefore, it is crucial to strike an appropriate balance between network density and sparsity at various stages of development to maximize the information propagation and enhance the operational efficiency of aviation networks.
The expansion of cities with air services and the quantity of flight routes typically coincide with the development of aviation networks. The overall efficiency of the network is significantly influenced by the relationship between the growth rate of cities and routes.
Figure 6(1) illustrates how edges and nodes changed as the aviation network and BA model developed. The number of edges (
E) and nodes (
N) in the BA network are linearly correlated. However, there is no straightforward linear positive association between the number of cities with air services and the flight routes in the aviation network (see
Figure 6(2)). Instead, this relationship is characterized by an exponent
and follows an exponential distribution. In other words, the growth rate of flight routes increases with the number of cities, indicating that during network expansion, route growth significantly outpaces city growth. This outcome aligns with concepts of generic network efficiency. The relevant literature [
18] suggests that networks with optimal efficiency usually maintain a specific sparsity, where
represents sparsity under general optimal efficiency. This implies that, under optimal circumstances, the sparse network structure is greatly outpaced by the growth rate of China’s aviation network. Various factors, such as topographical features, regional economic development, and airline operational strategies, may contribute to this phenomenon. Route configuration and network density are influenced by the level of economic development, the demand for transportation, and the market structure of airlines in different geographical regions. For example, airline routes may expand more rapidly between major cities or economically developed areas, whereas cities in remote areas may experience slower growth in route connections but gradual expansion in air services. Consequently, numerous interrelated factors contribute to the network’s nonlinear development pattern, reflecting the complexity and diversity encountered by aviation networks as they evolve.
Next, we analyzed and calculated the coupling coordination between the number of urban air routes and urban economic development during the evolution of the aviation network. The classification of the coupling coordination level [
37] is detailed in
Table 3.
An empirical analysis based on the coupling coordination degree model shows that, as shown in
Figure 7, combined with the coupling coordination level in
Table 3, it can be known that the coupling coordination between China’s aviation network and economic development is continuously optimized with the growth of both, presenting a trend of enhanced coordinated development. This reflects a significant bidirectional interaction mechanism between economic growth and the aviation network. However, the coupling coordination degree remained largely unchanged between 2020 and 2024, primarily due to the severe impact of the COVID-19 pandemic on the aviation industry. Under the backdrop of restricted global mobility, a temporary mismatch occurred between the aviation network’s service capacity and urban economies’ demand for factor circulation. This mismatch weakened the momentum for economic development to support air transport upgrading. Conversely, the proactive infrastructure layout of the aviation network caused its development to temporarily outpace the economic growth during this period.
The degree of collaborative development between the two exhibits significant spatial agglomeration characteristics. The global Moran’s I values of the coupling coordination degree (
Table 4), ranging from 0.1158 to 0.3152, are all positive (
) and show a trend of first increasing and then decreasing. This indicates that cities within the network are typically connected to regions with similar coordination levels, and both low-coordination and high-coordination areas have obvious clustered distribution patterns. This further explains why the growth rate of the sparse network structure is far outpaced by that of China’s aviation network under ideal circumstances.
Continuing the analysis from a spatial perspective, as shown in
Figure 7, the overall coupling coordination degree exhibits a gradient distribution characterized by “strong in the east, weak in the west”. The eastern region took the lead in transitioning from “serious disharmony” to “primary coordination” in a coupling coordination. The central region completed a phased upgrade from “moderate disharmony” to “marginal coordination” over the same period. In contrast, the western and northeastern regions remained in the “mild disharmony” stage in 2020, with their coupling coordination degrees lagging behind.
This regional disparity is attributed to economic and geographical factors. The eastern region has formed a benign interactive cycle between air transportation and the regional economy, relying on its port-airport integrated transportation network and high-density economic activities. By contrast, the western and northeastern regions are restricted by vast geographic spaces, low population density, and heavy industrial structures, which have hindered the full release of the network’s economic effects of air transportation. As a result, their development has lagged behind the demand for economic scale expansion.