1. Introduction
The construction of the Grand Canal was completed in the Sui Dynasty (c. AD581–619), and it has long been a Chinese imperial symbol [
1]. The Grand Canal was a functional regional infrastructure. It is notable for its
caoyun transportation system (grain transportation and strategic logistics). The
caoyun system made significant contributions to establishing a foundation for maintaining the centralization of authority and unity within the multinational country [
2,
3]. Each dynasty’s core region was located in China’s eastern plain, and the Grand Canal ran through the plain from south to north, connecting northern and southern China. The regions surrounding the Grand Canal were relatively prosperous and populated.
In the historical studies of the Grand Canal, Huang et al. [
4] figure out that the rise and fall of Kaifeng mirrored the rise and fall of the Grand Canal from the Sui to the Qing Dynasties. Moreover, many ancient cities located near major waterways were primary hubs of rice cultivation for both armies and residences [
5]. People traditionally lived near canals to secure water for agriculture and daily needs, making the canals more populated than the other areas [
6]. On the other hand, the Grand Canal was frequently hit by wars. In the Tang Dynasty, An Lushan sparked the An–Shi Rebellion (AD755–763), undermining central authority and sparking both regionalism and separatism, while An Lushan’s forces damaged the Grand Canal’s accessibility [
7].
Given the historical background, we can see that the areas along the Grand Canal were populated but suffered from frequent wars. Such a phenomenon was related to the “population-pressure-led social contradiction” [
8], which might be attributable to the population growth along the Grand Canal. Nevertheless, the relationship between the population, wars, and the Grand Canal has not been investigated systematically. It is worth noting that there are various studies examining the geographical shifting [
9], cultural influence [
10,
11], historical uniqueness [
2,
3], and infrastructure sustainability of the Grand Canal [
12]. The special socio-ecological landscape created by the Grand Canal [
13] and the making of the Grand Canal as a heritage site [
14] are also highlighted. However, the main focus is still put on the role of the Grand Canal in facilitating regional development via increasing accessibility [
4,
15,
16,
17,
18,
19], at least in English literature. Briefly, there have been abundant research findings on the Grand Canal’s contribution to regional development and how the canal system differentiated the region from the others. On the other hand, the canal had a significant impact on the people who lived along with it. The Grand Canal had also changed the geographic pattern of the population significantly, forming some new densely-populated, yet conflict-prone, regions. As the role of the Grand Canal in influencing regional stability is insufficiently explored in the literature, we seek to address this research gap. There are two main objectives for us to fulfill in this study. First, we figure out the spatial distribution of population and wars. Second, we quantitatively measure the relationship between wars, population, and the distance from waterways.
It is worth noting that the most common research approach in sustainability studies is to make changes to existing facilities, monitor the changes, and compare the results to the initial conditions. One of the most prominent disadvantages of this approach is that people tend to conclude too quickly and on too small geographical scales [
12]. Therefore, we base on the historical and geographic knowledge of the Grand Canal, which is one of the largest infrastructures in Chinese history, to examine the interconnection between population, wars, and waterways in the Grand Canal Area across different historical periods. Our research focus is on how the Grand Canal changed regional stability in the long term. We use war data combined with the corresponding spatial distribution of the population and the waterways to analyze their interrelationship. Using past experiences as a guide, our findings may contribute to a better understanding of the impact of large-scale infrastructures on regional sustainability.
2. Scope of Research and Data
2.1. Study Period and Study Area
Our data include the population, wars, and waterway systems in the Grand Canal Area. Considering the availability of the fine-grained population data, the periods of AD752–820, AD1270–1393, and AD1820–1910, which included the four Chinese dynasties, namely, Tang (c. AD618–907), Yuan (c. AD1271–1368), Ming (c. AD1368–1644), and Qing (c. AD1636–1912), are examined. The starting years of the three periods are the years of prosperity and stability of the dynasty, characterized by population growth and fewer wars. The end years of the chosen periods are marked by social recovery and resilience. In between, there are times of social unrest and frequent wars.
The Grand Canal Area is relatively large. The canal flows through the provinces and cities in the east of China, such as Beijing, Tianjin, Hebei, Shandong, Henan, Anhui, Jiangsu, and Zhejiang. Nevertheless, the administrative boundaries of the Grand Canal Area and its sub-regions varied in different dynasties. Considering the scope of this study, we take the first-level administrative regions covering the Sui–Tang Canal and the Beijing–Hangzhou Canal of the corresponding dynasties as the Grand Canal Area. Moreover, the area’s second-level administrative areas are also vectored to match the spatial resolution of the population data. The geographic coverage of the administrative areas refers to the administrative divisions of the corresponding dynasties according to the
Chinese Historical Atlas [
20] and the
Chinese Civilization in Time and Space [
21].
Table 1 lists the first-level administrative divisions of the various dynasties concerned, while
Figure 1 shows the administrative areas of the Grand Canal Area in different periods. The first-level administrative areas of the Grand Canal Area include mainly the North China Plain, Huanghuai Plain, and the middle and lower reaches of the Yangtze River. The basic form of the canal area is relatively constant, which is crucial for us to extract and process war and population data over different periods.
2.2. Population Data
We obtain our population data from the
Chinese Population History [
22], covering the six time-sections, namely AD752, 820, 1270, 1393, 1820, and 1910. Population data for the Tang Dynasty cover the 11th year of Tianbao (AD752) and the 15th year of Yuanhe (AD820). The original AD752 population records come from the geographical section of the
Old Book of Tang. In contrast, the original AD820 population records come from the
Yuanhe Maps and Records of Prefectures and Counties. Temporally, the population size ratio between AD752 and AD820 is used to interpolate the missing values. Spatially, the missing data are interpolated by averaging the values of the neighboring cells.
The population data of the Yuan Dynasty cover the beginning of the Yuan Dynasty (AD1270). The original data come from the geographical book of the History of Yuan. The population estimation and adjustment methods are the same as the Tang Dynasty. Population data for the Ming Dynasty cover the 23rd year of Hongwu (AD1393). The population size in the early Ming Dynasty was precisely recorded in the History of Ming due to the population registration system. Despite changes in the administrative divisions during the period, there were few changes in the second-level administrative areas. Therefore, we re-organize the population data based on the second-level administrative areas and map the population through the administrative divisions of the late Ming Dynasty.
Population data of the Qing Dynasty cover the 25th year of Jiaqing (AD1820) and the 2nd year of Xuantong (AD1910). The original AD1820 population records come from the Jiaqing Yitongzhi. The original AD1910 population records come from the household registration survey conducted by the Ministry of the Interior in the late Qing period. There were no major changes in the administrative divisions in the Grand Canal Area during the Qing Dynasty, and the population records are relatively complete. As the accuracy of the data is higher than in other historical periods, data correction and interpolation are not required.
2.3. War Data
Our war records are obtained from the
Chinese Military History [
23]. The records cover the period from 800BC to AD1911, with information such as the year the war began, the participants, the locations, and its causes included. Wars are counted based on the number of battles. The spatial locations of the wars are geo-referenced according to the current county-level administrative boundaries in China. The war data are converted to vector data. The attributes of the war data include the outbreak years of the wars and their geographical locations at the county level.
2.4. Data Aggregation
Based on the time sections of the population data, the population and the war data are aggregated in three periods. The first period is AD752–820, extending from the Tang to the late Tang Dynasties, including the An–Shi Rebellion that affected the entire country. The second period is AD1270–1393, spanned between the heyday of the Yuan Dynasty and the stable period of the early Ming Dynasty. This period covers the wars during the transition from the Yuan to the Ming Dynasties. The third period is AD1820–1910, from the time before the outbreak of the Opium War to the establishment of the Republic of China, covering the Taipei Rebellion. The timespan of each period is approximately 100 years and includes major war disturbances. Such consistency facilitates the comparison between different periods.
3. Methods
This research consists of two parts. The first part is to figure out the spatial distribution of population and wars. It is more accurate to use population density and war density in figuring out their patterns. The distribution patterns in different periods are then compared to trace their spatio-temporal evolution. The second part is to quantitatively measure the relationship between wars, population, and the distance from waterways. We calculate the Pearson correlation coefficients between wars, population, and the distance from waterways. The result is further verified by the curve estimation method to reveal the possible non-linear effect of the population density and the distance from waterways on wars.
3.1. Population
The population density is calculated based on the population data of the second-level administrative areas and the area of the corresponding administrative divisions. The unit is the number of people per square kilometer. The calculation method is as follows:
where D represents the number of people per km
2, P is the total number of people in the secondary administrative region, and A is the area of the secondary administrative region in km
2.
When classifying population density, we need to consider the difference between the total population in different periods and the expression of the difference in population density [
24]. The number of grades and the range of each level should fully reflect the population density characteristics. Regarding the number of grades, the population density of the Tang Dynasty is fewer than that of other periods due to the relatively smaller population size during the time. Therefore, the population density in the Tang Dynasty is divided into seven grades, and that in the other periods is divided into nine grades. In the three time-sections in the Tang and Yuan Dynasties, 5, 10, and 20 are used to indicate the differences in population level. In the three time-sections of the Ming and Qing dynasties, 5, 10, 20, and 100 are used to indicate the differences in population level.
Figure 2 shows the final population density in different periods.
3.2. War Distribution and War Belt Identification
The war data are at the county level, and their digitization is made according to the online maps provided by the Tiandi Map. According to the center points of the main urban areas corresponding to the county-level units, we obtain the geographic coordinates and add them with the outbreak years of the wars to the attribute tables [
25]. The vectored contents are AD752–820, 1270–1393, and 1820–1910. There are 1679 war records, with 577 war points in AD752–820, 498 war points in AD1270–1393, and 604 war points in AD1820–1910. The principal wars of the three periods are the An–Shi Rebellion in AD752–820, the rebellion in AD1270–1393, and the Taiping Rebellion in AD1820–1910. When fitting the population–war–river relationship, war points in the Grand Canal Area are selected for further analysis. Since the wars during the transition between the Yuan and the Ming Dynasties involved two different administrative divisions, their overlapped parts in the two dynasties are chosen for the analysis.
Since the war data are in the point layer, population density is in the surface layer corresponding to each secondary administrative area, and the waterway data are in the line layer, we need to standardize the data format before exploring their relationship. The spatial distribution of wars is expressed by the density of war spots. We calculate the point density by using the Kernel Density Function, assuming that a smooth curved surface covers each point and that the surface value is the highest at the location of the point. The surface value gradually decreases when the distance from the point increases, and the distance from the point equals the search radius. The surface value at the search radius boundary is zero. Only circular neighborhoods are allowed. The volume of the space enclosed by the curved surface and the plane below is equal to the value of the weight field assigned to this point. When we specify the value of this field as none, the volume is one. The density of each output raster cell is the sum of the values of all the core surfaces superimposed at the center of the raster cell. The kernel function is based on the Quadratic Kernel Function [
26] shown as follows:
where K stands for the kernel, which is a non-negative function, the value after integration is 1, and h is the smoothing parameter, also called bandwidth, which is the search radius mentioned above.
The calculation of the kernel density is based on the war points in the three periods, and the resulting raster layer is expressed by the number of floating points. The value is determined by the raster calculation as an integer for subsequent data extraction. The war density range of the three periods obtained is 0–1080 in AD752–820, 0–785 in AD1270–1393, and 0–819 in AD1820–1910. From the numerical values of the war density and the number of war spots, the highest war density dropped from the Tang to the Qing Dynasties, indicating that the war distribution became more extensive. Finally, the aggregation of the Grand Canal regional wars is made within the current administrative divisions. The integration of the war density in the Grand Canal and the waterways’ locations is shown in
Figure 3.
The above analysis is based on the overall war density. To further express the spatial distribution of the wars and eliminate data noise, the study extracts the hot zone of wars that passes through the center point of the war hot zone. The definition of the war hot zone is unified. The classification of the level of war density is expressed as quantiles, each of which contains an equal number of elements. It is assumed that the change in the density of the war points is linearly distributed, and the war points are divided into ten levels. The quantile assigns the same number of data values to each class and finally extracts the points of the first two classification sets (i.e., extracts the top 20 percent of the data points). There are no empty classes in the quantile classification, and there are no classes with too many or too little data. According to the results of quantile extraction and the basic situation of historical urban development along the Grand Canal [
27], high-value bands of war distribution are extracted (
Figure 4).
3.3. Statistical Analysis
The spatial relationship between the waterways and population is measured by extracting the spatial pattern of wars, which considers war density the dependent variable. In contrast, population density and distance from the waterway are the explanatory variables. As mentioned above, wars are expressed in the point layers, and the calculation of the kernel density of wars is associated with other surrounding war points. In contrast, the secondary rivers (waterways) in the Grand Canal Area are expressed in the line layers. The population density in the secondary administrative area is presented as the surface layers. As two time-sections are involved, the difference in population density between the two time-sections is used to represent the changes in population density. Therefore, when integrating data, each war point is used to extract changes in population density at its location. The distance from each war point to the nearest secondary river is calculated. Finally, the attributes of population density and the war–river distance are combined with the war density for subsequent correlation and regression analyzes.
We calculate the correlation between the three variables (i.e., war, population, and distance from the waterway). All of the skewness and kurtosis statistic values are less than ±1, indicating that our data are normally distributed (see
Appendix A). Hence, Pearson correlation is applied [
28] (
Table 2). To reveal the possible non-linear effect of the population density and the distance from waterways on wars, we employ ten different curve-fitting models to generate their results. A separate model is generated for each dependent variable. Those models include Linear, Logarithmic, Inverse, Quadratic, Cubic, Compound, Power, S-curve, Growth, and Exponential. The details of those models can be found at
https://www.ibm.com/docs/en/spss-statistics/28.0.0?topic=estimation-curve-models (accessed on 20 April 2022). The Logistic model is excluded in this study because our data are interval data. In evaluating different models, their R
2-values and F-values are compared. The sum of squares, Akaike information criterion (AIC), and Bayesian information criterion (BIC) values of those models are also provided for reference (
Table 3 and
Table 4). The curve estimation in different periods is also presented in the tables.
5. Discussion
This study uses the population and war data together with the location of the waterways in the Grand Canal Area to explore their spatio-temporal dynamics. We find that the densely populated areas concentrated along the canal channels while they were subsequently affected by the shifting of the canals. In the Tang and the Yuan Dynasties, there were two population clusters located along the Sui and Tang Canals, and the branch line occurred in the eastern and southern regions of Shandong. After the Ming Dynasty, the population of this area increased continuously, forming an apparent belt running through the north and south and spreading along the canal (
Figure 3). This phenomenon might be due to a significant increase in the regional population brought by migration from the early Ming Dynasty, which moved the population of Shanxi to the Huanghuai Plain to support economic recovery during the time [
51]. Moreover, the development of trans-shipment canal trade along the shipping cities boosted the socio-economic development along with the canal areas. Canal cities and populations continued to grow on this basis, making the population concentrated along with the distribution of the waterways [
52].
On the other hand, the Grand Canal Area had more frequent wars in our study periods. This phenomenon was manifested in Luoyang, as the center stretches from the north to the south to form a war hot zone along the canal. This war zone exhibited different characteristics in different periods. The war hot zone was branched along the Fen River in the Tang Dynasty, while the war hot zone was branched along the Yangtze River after the Yuan Dynasty. With the change of the canal, the war hot zone gradually shifted eastward. Although the spatial distribution of high-density war areas had a historical contingency, it was also constrained by the physical environment and social development [
53]. This is evident in the Grand Canal Area, where most of the Tang Dynasty wars were distributed in the northern part of the canal, which was closely linked to the geographical distribution of population and economic centers at that time. With the gradual shift of China’s economic center to the south, the war hot zones also moved south, forming stable high-density war areas in the lower reaches of the Yangtze River [
54]. It can be seen from the overlay of war belts of different periods in
Figure 4 that war belts in the northern part of the Grand Canal were located along the canal. In contrast, in the southern region, due to more complicated waterway systems, the spatial distribution of war points is a bit different. The specific manifestation is that a branch line along the Yangtze River was formed after the Yuan Dynasty.
The war density in the Grand Canal Area was positively correlated with the change in population density. Every citizen has an equal and continual opportunity to participate in a rebellion [
43]. The concept of ‘conflict proclivity per capita’ states that the higher the population, the greater the risk of wars [
55]. Populations were the source of war resources since there would possibly be a strong recruitment group when there was a large population. This group might also tend to destroy populated areas to minimize the agricultural and economic bases.
On the other hand, the war density was negatively correlated with the distance from the waterway. Those states with finite reach might find it difficult to regulate activity outside their existing infrastructure [
56]. Locations far from the capital, on the other hand, may be prone to conflict because the population’s preferences were likely to diverge significantly from those of the emperor and because it was hard for the government to manage remote territory [
57]. Even if the government created military bases around the region to mitigate the effects of distance, these bases were backed by shaky supply lines and became targets for rebel groups [
58]. This may explain why the Grand Canal Area was conflict-prone.
Our statistical results show that the correlation coefficient between war density and population density in the Grand Canal Area was ~0.5, while the one across China was ~0.3, indicating that the population-war relationship in the Grand Canal Area was stronger. Whether nationwide or in the Grand Canal Area, the correlation coefficient of war density and river distance was approximately −0.3. The population–war correlation increased over the three periods, while the river–war relationship remained relatively constant. This shows that wars in the Grand Canal Area were more closely linked to the population, higher than average. This echoes the previous studies about wars in Chinese history, which were mainly brought by population pressure rather than by the environmental and climatic variables [
59,
60]. This study further extends it to regional infrastructure. In addition, the effects of population density and war–river distance on war density are curvilinear. This indicates that the inter-relationship between population, wars, and the canal was interactive. Moreover, it might be about the threshold concept, while the human–environment relationship became strongly significant once the threshold had been reached [
61]. Further investigation should be made along with this rubric.
From the perspective of military geography, the North–South regime confrontation in terms of the united empires south of the Great Wall versus nomadic tribes/polities in the steppes of Inner Asia drove the secular war–peace cycles of Chinese history [
62,
63,
64]. Specifically, China had nine strategic areas, including four corners (Guanzhong, Hebei, Southeast China, and Sichuan), four foci (Shanxi, Shandong, Hubei, and Hanzhong), and the heartland (the Central Plain) [
65]. Furthermore, war hotspots were located in the Loess Plateau and the North China Plain during warm and wet periods, but in the Central Plain, Jianghuai region, and lower reaches of the Yangtze River Delta during cold and dry periods [
8,
66]. Compared to previous studies, we narrow the scope of the analysis from the national to the regional levels. Our study area is overlapped with the Central Plain, the Jianghuai region, and the lower reaches of the Yangtze River Delta. That region was quantitatively identified as a war hotspot in recent studies [
8,
66]. Here we supplement that the war hot spot was located precisely along the Grand Canal (i.e., the Beijing–Luoyang–Nanjing region), which is attributable to the population growth in the Grand Canal Area. This finding may help provide a more nuanced picture of the military geography of historical China.
As our research findings also reveal the apparent importance of space and geographic locations, the application of spatial econometrics [
67] and spatial analysis [
8,
66] in examining the nexus between population, wars, and the canal can be explored.