Next Article in Journal
Changes and Challenges in Human Resources Management: An Analysis of Human Resources Roles in a Bank Context (after COVID-19)
Previous Article in Journal
Modelling and Environmental Profile Associated with the Valorization of Wheat Straw as Carbon Source in the Biotechnological Production of Manganese Peroxidase
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Population Prediction of Chinese Prefecture-Level Cities Based on Multiple Models

1
School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China
2
Key Laboratory of Big Data Mining and Knowledge Management, Chinese Academy of Sciences, Beijing 100190, China
*
Authors to whom correspondence should be addressed.
Sustainability 2022, 14(8), 4844; https://doi.org/10.3390/su14084844
Submission received: 12 March 2022 / Revised: 8 April 2022 / Accepted: 13 April 2022 / Published: 18 April 2022

Abstract

:
In recent years, the population growth rate has been gradually declining in China. As the population problem becomes increasingly significant, the accurate prediction of population development trends has become a top priority, used to facilitate national scientific planning and effective decision making. Based on historical data spanning a period of 20 years (1999–2018), this article presents predictions of the populations of 210 prefecture-level cities using the Malthusian model, Unary linear regression model, Logistic model, and Gray prediction model. Furthermore, because the gray prediction model exhibited the highest degree of accuracy in formulating predictions, this study uses the model to predict and analyze future population development trends. The results reveal that the population gap between cities is gradually widening, and the total urban population shows a pattern of rising in middle-tier cities (second-tier cities and third-tier cities) and declining in high-tier cities (first-tier cities and new first-tier cities) and low-tier cities (fourth-tier cities and fifth-tier cities). From the viewpoint of geographical distribution, the population growth rate is basically balanced between the northern part and the southern part of China. In addition, the population growth of the high-tier cities is gradually slowing while the low-tier cities are experiencing a negative growth of population, but middle-tier cities are experiencing skyrocketing population growth. From the viewpoint of regional development, although the development of regional integration has been strengthened over the years, the radiative driving effect of large urban agglomerations and metropolitan areas is relatively limited.

1. Introduction

As the world’s most populous country, population has always been an important issue in the social development of China. The latest statistics from China’s National Bureau of Statistics, calculated at of the end of 2019, revealed that China is facing the lowest birth rate and natural growth rate among its population (10.48‰ and 3.34‰) since the years 1949 and 1961, respectively. The total population of Mainland China is 1.405 billion, and the population growth rate has increased by 0.33% year on year. The labor force participation rate dropped to a historical low (68%) in 2019, indicating that the demographic dividend may soon be exhausted. In terms of demographic structure, as the proportion of the aged population continues to increase, China has already begun to suffer from the rapid aging problem [1]. The urbanization rate and aging rate exhibit similar growth patterns [2], while the fertility rate has been decreasing year on year [3]. China has been an aging society since 2000, requiring continuous improvements in healthcare and provisions for ensuring the welfare of aged people, and the proportion of the aged population will continue to rise. Furthermore, the fertility level and fertility pattern of young women of childbearing age, as well as the proportion of the total population who are classed as women of childbearing age, are decreasing at significant rates [4]. The number of children per woman of childbearing age is only 1.047; thus, China may now be approaching the peak of its population size. In terms of migration, due to a high degree of difference in the opportunities and welfare offered by top-tier cities, the inflow of migrants from low-tier cities into high-tier cities continues.
As the driver of social development, population is closely related to development-related issues such as economic development, resource consumption, and ecological environmental protection. Apart from being the reflection of the fundamental status of the country, as well as its regions and cities, population is also a core element for further research on microenvironmental development [5]. Social problems, such as the overall slowdown of the population growth rate in China, the increasingly serious problems of sub-replacement fertility and an aging population, the year-on-year decrease in the fertility rate [6], and the large quantities of people who have travelled from rural areas and into cities, have brought heavy pressure upon and presented severe challenges to social care and welfare systems. These issues may also bring with them risks that are yet unforeseen and may threaten the operation of the country’s entire socioeconomic system.
Recently, the competition among the regions and cities has shifted from attracting highly educated and talented people to their areas to vying for young labor instead. The competition to attract a young labor force has intensified, which indicates that the economically developed regions are attracting more people who do not yet possess a household registration for that region (China’s household registration system is a state-implemented population management policy by which the state collects, confirms, and registers citizens’ births, deaths, familial relationships, legal addresses, and other basic information as part of population management. It informs policy decisions in areas such as employment, education, and social welfare provision) [7]. Most developed cities have been able to maintain their economic growth by absorbing young laborers, whose migration has left rural areas with an ever-greater proportion of aged inhabitants [8]. Therefore, there is an urgent need to formulate urban economic and social development plans and regional population development policies that are efficient and reasonable. To formulate relevant plans and policies scientifically, it is particularly important to accurately analyze and predict the development trends of the urban population.
Through a literature review, it is found that the mainstream population theories include Malthus’ population theory, sociological population theory, biological population theory, moderate population theory, population transition theory, population explosion theory, zero population growth theory, and so on [9,10]. Does China’s current population growth conform to the previous population theories? Based on the population agglomeration effect, economic siphon effect, the dialectical materialism of the whole and the part, mean regression, and other theories, this paper proposes a hypothesis about population theory: the future population growth of prefecture-level cities in China generally conforms to the zero population growth theory.
In the following, four models are used to predict the population development trend of 210 prefecture-level cities in China, the most accurate model data is used for further analysis, and some suggestions are put forward for future development.

2. Methodology

The prediction of population involves estimations of the scale, level, and trends of population development at a certain time in the future, using scientific methods based on the current population status and factors that may influence population development. Population prediction provides information that aids decision making. Numerous methods and models have been proposed for population prediction. Seven widely used types of population prediction methods are listed in Table 1 [11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27]. Among them, the most common used methods are historical trend analysis, classic population analysis, the regression model, and the gray prediction model.
China’s unique national conditions include extensive geographical coverage (land area of 9.6 million square kilometers, ranking third in the world) [28], large number of cities (34 provincial-level administrative regions and 333 prefecture-level city administrative regions) [29], and a large population base (1.443 billion people, ranking first in the world) [30]. By organically combining China’s national conditions with the advantages and disadvantages of the model (see Table 1 for details), the model was filtered.
With the continuous improvement of science and technology, the timeliness and accuracy of statistical data are getting higher and higher. Therefore, using data close to the current year (short- and medium-term data) for prediction can avoid data prediction errors to a certain extent and the analysis results will be more accurate. The classical population analysis and population prediction models are not suitable for this study due to their high requirements for data and longtime accumulation; the BP neural network and the system dynamics approach meet these difficulties in implementation. However, the former requires extensive data for the training procedure [31], and the latter has a higher degree of subjectivity in its parameter-setting process and is prone to problems such as the omission of important variables [32].
However, the Malthusian model, the unary linear regression model, the logistic competitive biological model (or logistic model), and the gray prediction model are suitable for China’s national conditions and the analysis requirements of this study due to the characteristics of requiring small amounts of data, appropriate suitability for short- and medium-term prediction, and strong explanatory ability.
Therefore, this paper proposes a hypothesis about the model: by assuming that these four models can achieve decent results in population prediction, among them, gray prediction model has the highest prediction accuracy. Next, prediction and analysis will be carried out through these four methods. The following is an introduction to these four models.

2.1. Malthusian Model

This model was proposed by British demographer Thomas Malthus in 1798 and is based on the principle that, under normal circumstances, the population growth rate within a certain period can be approximately regarded as a certain value, and the population is predicted by an exponential growth function. The model is as follows:
P t = P 0 1 + r n ,
where P t is set as the population size at the end of the prediction target year, P 0 is set as the population size of the prediction base year, r is set as the average annual population growth rate, and n is set as the prediction period.

2.2. Unary Linear Regression Model

The unary linear regression model is as follows:
Y = a + b X ,
Among the terms above, the estimated values of the parameters are:
a ^ = Y ¯ b ^ X ¯ ,
b ^ = X i X ¯ Y i Y ¯ X i X ¯ 2 = X i Y i n X ¯ Y ¯ X i X ¯ 2 ,
where time X is set as the control variable and the population Y is set as the state variable. The parameters a and b can be calculated by Equations (3) and (4) using historical data or by utilizing the ordinary least squares (OLS) method.
In the processing of population growth, unary linear regression can be used for population estimation when the population development rates of each period are relatively close; in other words, when the slopes of the tangent lines at all points on the population development curve are the same and approximate a linear growth. Subject to the limitations of the estimation conditions, this model is more suitable for short-term predictions. When used for long-term predictions, the errors caused by population changes would be gradually amplified thus affecting the accuracy of the prediction results.

2.3. Logistic Model

The logistic competitive biological model considers the limitations of population growth and proposes a law of population growth, which says that as the total population grows, the population growth rate will gradually decrease.
Let p t be the population size at time t . When the population is large, p t can be considered continuous or even differentiable. Let b t , p represent the birth rate of a single living being in the population per unit of time at time t , while d t , p represents the death rate in the population per unit of time at time t . Consequently, r t , p = b t , p d t , p represents the natality rate of a single individual at time t . The model is as follows:
Δ P = P t + Δ t P t = r t , P P t Δ t ,
Let Δ t 0 , then we have:
d p d t = r t , p P ,
When r t ,   p = a , then:
P = a P b P 2 , P t 0 = P 0 ,
This is the idea behind the Malthusian model.
Owing to environmental constraints, competition between individuals results in a decrease in r t , P . Therefore, the Dutch biologist Verhulst proposed to add the competition term b p 2 ( b > 0 ) . Thus, we get:
P = a P b p 2 , P t = P ,
The above equation is the logistic competitive biological model.

2.4. Gray Prediction Model

Due to certain conditions that are particular to some cities (such as a unique culture, special policies, etc.), the development of the population in some urban areas will be a more complicated, versatile, and nonstationary random process. Thus, static models such as linear regression models and exponential growth models, cannot guarantee accurate prediction results. On certain occasions, the confidence interval of the prediction result is extremely large due to errors in the model; thus, the results are of no practical significance. The gray prediction model generates a dynamic or nondynamic white module from a known data sequence according to a certain rule, explores the inner law from the noisy raw data, and builds a gray model according to certain changes and solutions to start the prediction. Characteristics including a low modeling information requirement, high operation, and high accuracy, make the model significantly better than the others in improving the randomness of data and accuracy of prediction.
With reference to Wang [33], Camelia [34,35], and Gao et al. [36], the GM (1, 1) model, which is a method of the gray prediction model, is suitable for population prediction. The prediction steps taken by this study are as shown below.
  • Set the original sequence { X t } = X 1 , X 2 , X 3 , , X n , t = 1 , 2 , 3 , , n and conduct accumulative generation on { X t } to obtain the new sequence Y t and the mean-value sequence Z t :
    Y t = i = 1 t X i , Z t = Y t + Y t 1 2 ,
  • Establish a first-order differential equation:
    d Y t d t + a Y t = u ,
    where a is the gray developmental coefficient and u is the gray action quantity.
  • Conduct parameter estimation using OLS:
    a ^ = B T B 1 B T λ t ,
    B = Z 2 Z 3 Z n 1 1 1 T ,
  • Obtain the time function:
    Y ^ t t + 1 = X t μ e x a t + u a ,
    and then calculate the estimated value of X t :
    X ^ t = Y ^ t Y ^ t 1 t = 1 , 2 , 3 , , n ,
  • Test the fitting effect using the after-test rule.

3. Results

The vast majority of previous studies predicted China’s total population on the national level and/or the provincial level, while a small minority of studies focused on the prediction of city populations using large samples. While the prefecture-level city unit is not only the smallest administrative division in China (smaller units include county and district levels), prefecture-level cities are important driving forces behind China’s economic development and policy decisions made at the city level, having a large effect upon wider regional development. Therefore, it is necessary to analyze the data of prefecture-level cities.
This study focuses on the analysis of urban population data at the level of cities using data from the China City Statistical Yearbook from 1999 to 2018. First, the population data of cities with missing values or abnormal population structure changes from year to year were identified and eliminated from this study. Next, cities that have been merged, split, or renamed during the selected years were eliminated. Finally, the population data of 210 prefecture-level cities from 1999 to 2018 were screened from 333 prefecture-level cities. Predictions were conducted using the four above mentioned models and methods, while the data with the highest degree of accuracy were chosen from the prediction results as the result for subsequent analysis. Owing to the large number of cities involved in this research, it is impossible to list the prediction data of all 210 cities. Therefore, only the prediction results of four first-tier cities are listed: Beijing, Shanghai, Guangzhou, and Shenzhen. (The city classification system adopted in this paper mirrors China’s new city classification list. The list evaluates prefecture-level cities according to five dimensions: business resource concentration, urban hub, urban activity, lifestyle diversity, and future plasticity. These dimensions decide the city’s tier: first-tier, new first-tier, second-tier, third-tier, fourth-tier, or fifth-tier. It is sometimes useful to groups these city tiers into the following groupings: the high-tier group includes first- and new first-tier cities, the middle-tier group includes second- and third-tier cities, and the low-tier group includes fourth- and fifth-tier cities.).

3.1. Population Prediction Results

3.1.1. Malthusian Model

According to the population development pattern of the four first-tier cities, based on population data on these cities from 1999 to 2018, three scenarios that utilize high, middle, and low natural population growth rates for prediction were selected.
With reference to the model used by Zhang et al. [5] for population prediction, the specific settings of growth rates for the high-, low- and middle-value scenarios are as follows. The high-value scenario uses the maximum natural population growth rate in 1999–2018, and the low-value scenario uses the minimum natural population growth rate in 1999–2018. The middle-value scenario uses the mean value of the growth rates of the high- and low-value scenarios as the annual population growth rate. According to the Malthusian population model, with 2010 as the base period, the predicted values of each city’s total population from 2011 to 2030 can be obtained from the high-, middle- and low-value scenarios. The prediction results are shown in Table 2 below.

3.1.2. Unary Linear Regression Model

With reference to the approach of Jia et al. [37], the population data from 1999 to 2018 are divided into two data sequences of 2009–2018 (10-year sample) and 1999–2018 (20-year sample) according to different sample sizes. The final prediction results are as shown in Table 3.

3.1.3. Logistic Model

Differentiation and separation of variables are conducted in Equation (6). Subsequently, a logistic population prediction model for the 210 cities is established. We again use two data sequences: the 10-year sample from 2009–2018 and the 20-year sample from 1999–2018. The prediction results are shown in Table 4.

3.1.4. Gray Prediction Model

Following the step 1 to step 5 listed in Section 2.4, the prediction results of the GM (1, 1) model are shown in Table 5.

3.2. Error Tests and Analysis

3.2.1. Malthusian Model

To get as accurate a forecast as possible, based on the previous prediction results of the Malthusian model, the predicted values of the high-, middle- and low-value scenarios for each of the cities were compared with the existing population data from 2011 to 2018. Thus, the results of the error tests on the three scenarios were obtained (Table 6).
The comparison result reveals that all the population prediction results of the listed first-tier cities in the high-value scenario are more accurate. However, when the sample size expanded to 210 cities, the average relative errors of the predictions for the 210 cities in the high-, middle- and low-value scenarios are 3.88%, 0.82%, and −2.12%, respectively. Therefore, considering the overall prediction results, the predicted value of the middle-value scenario is slightly lower than the actual populations and has the smallest overall error. Under this model method, the middle scheme is the most reliable.

3.2.2. Unary Linear Regression Model

With the prediction steps introduced in Section 3.1, the prediction results of the unary regression model are as follows (Table 7).
The results of the error tests in Table 7 reveal that the prediction model of the 10-year sample has a smaller error (both error and relative error) than that of the 20-year sample, possibly owing to irregularities caused by a longer data sequence. It also shows that such a model is better suited to short-term population prediction, as the accuracy of the prediction results may decline along with the prediction term of the model.

3.2.3. Logistic Model

The following prediction results are obtained from the logistic biological competition model (Table 8).
Table 8 shows that the prediction results of the four first-tier cities fit better with the populations in reality. The average relative errors of the 10-year sample size and the 20-year sample size are 0.371% and −0.303%, respectively. The overall prediction accuracy of the logistic competitive biological model is high, but the accuracy of the 20-year sample size is much higher. This shows that the characteristics of the data cannot be fully reflected when a shorter time series is used, and the use of a longer time series will improve the model’s accuracy.

3.2.4. Gray Prediction Model

The average prediction error of the GM (1, 1) model is shown in Table 9 below.
The results in Table 9 show that, compared to the three models and methods above, the average relative error of the gray prediction method is the lowest of the four. For the first-tier cities, being popular immigration destinations, the errors increased slightly with time over the last few years. This is probably caused by increasing population flow into first-tier cities. Through error testing and numerical prediction, the prediction results of the 10-year sample size prove to be more accurate, with an average relative error of only 0.005%, which confirms that the GM (1, 1) model has high accuracy while requiring less data.
As shown in Table 10, by comprehensively comparing the errors of the four prediction models mentioned above, it is found that the 10-year sample size of the GM (1, 1) model has the smallest error: only 0.005%. Among the hypotheses about the model mentioned above, the four models can basically make appropriate predictions, and GM (1, 1) has the highest accuracy. Thus, the GM (1, 1) model passes the hypothesis.
Therefore, the prediction data of the GM (1, 1) model were selected for analysis of the future development trends in the remainder of this article.

4. Discussion

The calculations in this section refer to the data analysis methods of Langford [38] and Koenker and Bassett [39]. Based on the quartile analysis method, the box plots of the four sets of data for the years 2000, 2010, 2020, and 2030 are shown to facilitate intuitive analysis, as well as the comparison of the average levels and degrees of variation among multiple data sets. The box plots are shown in Figure 1 below.
The analysis shows that in the selected four years (2000, 2010, 2020, and 2030), among the 210 cities selected, Chongqing has the largest population (3091.09, 3303.45, 3432.93, and 3552.76, respectively), while Jiayuguan has the smallest population (15.96, 21.8, 21.34, and 22.02, respectively). The median of the urban population was 425.06, 465.71, 491.16, and 522.56, respectively. The data dispersion in each year is relatively high, while all population values are right-skewed, and all of them contain extreme values. This is because, in different years, the population values of the cities of Chongqing, Shanghai, Chengdu, and Nanyang are extremely large in their respective groups and are judged to be outliers, which increased the average population of each year thus leading to the right skewness of data. The population growth pattern in each city exhibits spatial difference over the years, while the degree of population dispersion in each city maintains an increasing momentum, resulting in enormous numerical gaps between cities in terms of their populations.
The growth in populations has accelerated the speed at which cities change their spatial layout, adopt different development models, and adapt in other ways to accommodate this growth. Cities are also profiting due to the economic benefits of population growth, which drives up demand for goods and services while providing additional labor. Consequently, there is an interaction between population growth and city expansion. As the public calls for regional development, the role of metropolitan areas in delivering development have become increasingly important [22,40].
For a clearer and more intuitive analysis of the development trends of the cities’ populations, the latitude and longitude coordinates of the above mentioned 210 prefecture-level cities were imported to the geo module of pyecharts (v1.9.0) in Python (v3.7). The highly accurate population prediction results from the GM (1, 1) model mentioned above were matched with the imported cities to draw the spatial distribution of the cities’ populations in 2000, 2010, 2020, and 2030.
As shown in Figure 2 and Figure 3, the populations of the sample cities have grown steadily in general but the population growth rates have slowed down significantly. In terms of specific cities, the total populations of the first-tier cities and new first-tier cities are stabilizing, while the growth rates of these cities all show a downward trend. The population growth rates of all cities fell below 1%, except for Guangzhou, Shenzhen, Chengdu, Hangzhou, Xi’an, Changsha, and Dongguan. Among them, the population growth rate of Zhengzhou City was −1.05%. For some cities, including Beijing, due to the implementation of population regulations, as well as factors such as housing prices and living costs, there is a noticeable growth in reverse migration.
As for total population, most first-tier and new first-tier cities have populations between 5–10 million. It is expected that the population of Xi’an and Guangzhou will exceed 10 million in 2023 and 2025, respectively. In addition, China has 19 first-tier and new first-tier cities, and it is expected that the population of seven of these cities will exceed 10 million by 2030.
The population growth rates of second- and third-tier cities are increasing steadily, and those of most cities fluctuate around 0.5%. Among them, the population growth rates of Zhuhai, Foshan, Huizhou, Zhongshan, Guiyang, Handan, Langfang, Putian, Linyi, and Yinchuan are relatively higher than other second- and third-tier cities.
As for population growth, only three second- or third-tier cities (Baoding, Linyi, Nanyang) had a population exceeding 10 million in 2000. Owing to the rapid development of middle-tier cities (second-tier cities and third-tier cities), it is expected that there will be nine second- or third-tier cities (Shijiazhuang, Xuzhou, Handan, Baoding, Fuyang, Ganzhou, Linyi, Nanyang, and Shangqiu) with a population exceeding 10 million in 2030. Many cities have a population that exceeds 1 million; the only city with less than 1 million people is Sanya (643,600 people). The population sizes of the fourth- and fifth-tier cities are significantly smaller, but there are numerous low-tier cities that have a population in the range of 2–5 million. In recent years, population growth has slowed down, and the population growth rates of most cities have been negative. It is expected that in 2030, Yichun (969,000 people), Tongchuan (763,000 people), Shizuishan (739,200 people), Jinchang (431,500 people), Wuhai (316,300 people), Jiayuguan (217,800 people), and Karamay (169,600 people) will still each have a population under 1 million.
Owing to the ever-expanding economic advantages and talent introduction policies of core cities, their agglomeration effects have been continuously enhanced [40]. To further analyze the trend of inter-regional population development in the future, Figure 4 and Figure 5 are drawn by overlaying the original urban agglomeration distribution data upon a map with latitude and longitude coordinates for each city, sourced from Google Maps, to show the distribution of urban agglomerations in China.
The analysis of cities shows that the overall population development of the north and south of the country is relatively stable, but the city disparity is obvious. (The geographical boundary between north and south China is the Qinling-Huai River line, along the southeast side of the Qinghai-Tibet Plateau, which coincides with the 800 mm isothermal precipitation line and the January zero isothermal line. The area north of this line is called Northern China, and the area south of this line is called Southern China.) In the sample data of 210 cities, the sample range from 1999 to 2018 show that the average annual population growth of southern cities (totaling 106) was 3.862 million people, and that of northern cities (totaling 104) was 2.652 million people. The development characteristic can be described as fast in the southern cities and slow in the northern cities. In the prediction data of 2021–2030, the average annual population growth in the southern and northern cities is expected to be 2.0451 million and 1.9081 million, respectively, indicating a population growth trend that is roughly equal in the southern cities and the northern cities.
As for the urban agglomerations, as shown in Figure 4 and Figure 5, the overall population growth rate of the sample city data is steady while slowing down. The population growth rates of the northern cities have gradually surpassed those of the southern cities. The ratio of the population growth in the northern urban agglomerations to that of the southern urban agglomerations is expected to increase from 0.73 from 1999–2018 to 0.79 from 2021–2030. Among them, the population growth of the urban agglomeration in the middle reaches of the Yangtze River Delta has a significantly slowing growth rate. The population growth rates of Yichang, Jingmen, Xiaogan, Huanggang, Jingzhou, Xiangtan, and Changde are all negative, and the average annual population growth is expected to be 259,600 from 2021 to 2030. The population growth of the Central Plains urban agglomeration and the Yangtze River Delta urban agglomeration has been progressing steadily. The growth rates are stable at around 1% and 0.5% in the long term, and it is expected that the average annual population growth in the next 10 years will reach 774,900 and 420,500, respectively.
Regarding the metropolitan areas, the core metropolitan areas, including the Yangtze River Delta interlocking the metropolitan areas, Guangdong–Hong Kong–Macao Greater Bay Area, and the capital, Beijing, have attracted talent from across the nation to migrate there due to these areas offering a solid economic base, being well-located, and presenting policy advantages. Consequently, the population has grown especially rapidly in these areas. It is expected that the average annual population growth in 2021–2030 will exceed 400,000. Among these areas, the average annual population growth of the Guangdong–Hong Kong–Macao Greater Bay Area is expected to exceed 600,000. For the city circles represented by Shenyang, Changchun, Xi’an, and Chengdu, the population growth rates are negative. In particular, the city population growth rates in the northeastern regions have decreased rapidly due to economic depression and the harsh climate there, and there is a serious outflow of population.
Generally speaking, the future population growth of prefecture-level cities in China generally meets the theory of zero population growth, and the hypothesis of population theory above is basically valid. This study finds that population growth exhibits inconsistent patterns between different city tiers. The population growth of high-tier cities (first-tier cities and new first-tier cities) is shrinking, the populations of low-tier cities (fourth-tier cities and fifth-tier cities) are experiencing negative growth, and the populations of middle-tier cities are exploding. With the continuous enhancement of the regional integration of large urban agglomerations and metropolitan areas, the layout with scattered groups has been replaced by a large landscape with a uniform pace. However, the promotional effect of core urban agglomerations and metropolitan areas on the surrounding areas is still not significant.

5. Conclusions

Population growth is a source of economic and social vitality, the foundation of innovation and entrepreneurship, and a core element in encouraging urban development. China is now facing population problems such as the overall slowdown of the population growth rate, the intensifying of sub-replacement fertility and an aging population, the decrease in the fertility rate, and the overly rapid mobilization of the population. Furthermore, the competition to attract labor has turned into a competition to retain labor as migration becomes ever easier.
Based on the data from the cities’ statistics bureaus, statistical yearbooks over the years, and population censuses, this study utilized the Malthusian model, unary linear regression model, logistic model, and gray prediction model to perform population predictions on 210 prefecture-level cities and analyzed the characteristics of urban population distribution and development. This study has found that the total urban population has changed from fast growth in developed cities and slow growth in underdeveloped cities to slow growth in high- and low-tier cities, and fast growth in middle-tier cities. The growth rates of city populations have changed from a fast pattern in the southern cities and a slow pattern in the northern cities to being basically balanced between the northern and southern cities. Comparing the populations of cities at different tiers, the high-tier cities had once dominated the top rankings, but the middle- and low-tier cities are gradually catching up and weakening the dominance of high-tier cities. The target of the urban agglomerations and the metropolitan areas is gradually shifting from integration to urban cohesion, although the population gaps between cities tend to widen; as the growth rate of the population declines, China’s total population is likely to be relatively stable in the future. In addition to the population aggregation effect, economic siphon effect, and other reasons, fertility decline may also be an important reason for this phenomenon. Cities with faster economic development attract highly educated people [40], and their high expectations for potential marriage partners coupled with the huge marriage costs of China’s skyrocketing housing prices may hinder the willingness of highly educated people to marry [41]. Those who are married, especially highly educated married women, often choose to have only one child or give up work because it is difficult to balance work and family [42], also leading to slow urban population growth.
In the future, the state should accelerate the reform of the household registration system and optimize the employment promotion mechanism. It should conform to the pattern of urbanization development and guide the population to migrate to small- and medium-sized cities surrounding the densely populated urban agglomerations and metropolitan areas. The purpose of this is to coordinate the populations of the regions and to connect different parts of the nation. The state must also proactively intensify the leading effect of core cities and regions while at the same time facilitate the harmonious development of regional economies. It must determine the largest scope of common interests, target the highest common factors that affect mutual benefit, promote industrial development, upgrade and renew the regional populations, and move high-quality talents to the core regions. Further, the state should focus on building a policy system and social environment conducive to elderly care and filial piety, and cancel all policy constraints on childbirth. This should encourage people to get married and reproduce at an appropriate age, let families control their own reproduction, and further accelerate the construction of a support system for accelerating reproduction.
This study used microdata on the population of prefecture-level cities in China to make population predictions and used visualization software to more intuitively show the development and change of the future population, and combines prefecture-level cities and urban agglomerations to provide suggestions for the government to help cities develop better. This study certainly has reference value to the research of urban population development, urban agglomeration, and urban agglomeration development. Due to the limitation of population data system construction, the data granularity in this study can only reach prefecture-level cities. After obtaining more detailed data, its practical and theoretical significance will be more prominent.

Author Contributions

All the authors equally contributed to this research. Conceptualization, J.D. and L.C.; methodology, J.D., X.L. and L.C.; writing—original draft, L.C. and T.M.; writing—review and editing, L.C. and T.M.; visualization, T.M.; data curation, L.C. and T.M.; investigation, L.C.; supervision, J.D. and X.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (grant number 71573244, 71850014, 71974180).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data can be obtained by email from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Chen, W. China’s aging population from the international perspective. J. Peking Univ. 2016, 53, 82–92. (In Chinese) [Google Scholar]
  2. Chen, Q.; Dietzenbacher, E.; Los, B. The effects of ageing and urbanisation on China’s future rural and urban populations. Asian Popul. Stud. 2017, 13, 172–197. [Google Scholar] [CrossRef]
  3. Murphy, M. Use of Counterfactual Population Projections for Assessing the Demographic Determinants of Population Ageing. Eur. J. Popul. 2021, 37, 211–242. [Google Scholar] [CrossRef] [PubMed]
  4. Wang, Y.; Zhong, F. Changes and prediction of birth level in China since 1990. Popul. Econ. 2017, 1, 1–12. (In Chinese) [Google Scholar]
  5. Zhang, H.; Yang, P.; Li, C.; Zhou, Q.; Gao, L.; Chen, Q. Prediction of population size in Xining city based on multiple models. J. Arid. Areas Geogr. 2013, 36, 955–962. (In Chinese) [Google Scholar]
  6. Li, J.; Liu, R. The analysis of the differences of provincial population shrinks. J. Popul. 2020, 42, 35–48. (In Chinese) [Google Scholar]
  7. He, C.; Mao, X. Population dynamics and regional development in China. Camb. J. Reg. Econ. Soc. 2016, 3, 3. [Google Scholar] [CrossRef]
  8. Zheng, Z.; Yang, G. Internal Migration in China: Changes and Trends. In Contemporary Demographic Transformations in China, India and Indonesia; Springer International Publishing: Cham, Switzerland, 2016; pp. 223–237. [Google Scholar] [CrossRef]
  9. Zeng, Y. The past, present and future of demography. Popul. Res. 2009, 33, 8–22. (In Chinese) [Google Scholar]
  10. Guo, F. Comparison between ancient Chinese population thought and Western population theory in history. Heilongjiang Hist. Rec. 2020, 11, 37–40. (In Chinese) [Google Scholar]
  11. Hauer, M.E. Population projections for U.S. counties by age, sex, and race controlled to shared socioeconomic pathway. Sci. Data 2019, 6, 190005. [Google Scholar] [CrossRef] [Green Version]
  12. Huang, M. Ordinary Differential Equation Model and its Application in the Prediction Control of Population. Appl. Mech. Mater. 2014, 631–632, 714–717. [Google Scholar] [CrossRef]
  13. Zhang, L.; Wang, G. Research on the age structure of Ethnic Minority population in China: Based on stable Population Theory. China Popul. Sci. 2019, 1, 88–100, 128. (In Chinese) [Google Scholar]
  14. Song, J. Inheritance and evolution of demography method -- on the development of demography discipline in China. Popul. Econ. 2020, 4, 1–14. (In Chinese) [Google Scholar]
  15. Miranda, L.; Limab, C. On the logistic modeling and forecasting of evolutionary processes: Application to human population dynamics. Technol. Forecast. Soc. Chang. 2010, 77, 699–711. [Google Scholar] [CrossRef]
  16. Xuan, H.; Zhang, A.; Yang, N. A Model in Chinese Population Growth Prediction. Appl. Mech. Mater. 2014, 556–562, 6811–6814. [Google Scholar] [CrossRef]
  17. Zili, A.; Mardiyati, S.; Lestari, D. Forecasting Indonesian mortality rates using the Lee-Carter model and ARIMA method. In Proceedings of the 3rd International Symposium on Current Progress in Mathematics and Sciences (ISCPMS2017), Bali, Indonesia, 26–27 July 2017. [Google Scholar] [CrossRef]
  18. Chen, M.; Lu, C.; Hao, Y.; Wang, X. China’s population projections based on gm(1,1) metabolic model. Kybernetes 2019, 38, 417–425. [Google Scholar] [CrossRef]
  19. Xie, N.; Wang, R.; Chen, N. Measurement of shock effect following change of one-child policy based on grey forecasting approach. Kybernetes 2018, 47, 559–586. [Google Scholar] [CrossRef]
  20. Ding, C.; Shi, X.; Niu, Y.; Cui, C. Prediction and its significance in urban planning, urban population in Beijing, for example. J. Urban Plan. 2018, 9, 21–27. (In Chinese) [Google Scholar]
  21. Frederick, A.; David, Y.; Pappoe, A. A systems dynamics approach to explore traffic congestion and air pollution link in the city of Accra, Ghana. Sustainability 2010, 2, 252–265. [Google Scholar] [CrossRef] [Green Version]
  22. Qiu, Y.; Shi, X.; Shi, C. A system dynamics model for simulating the logistics demand dynamics of metropolitans: A case study of Beijing, China. J. Ind. Eng. Manag. 2015, 8, 783–803. [Google Scholar] [CrossRef] [Green Version]
  23. Folorunso, O.; Akinwale, A.; Asiribo, O.; Adeyemo, T. Population prediction using artificial neural network. Acad. J. 2010, 3, 155–162. [Google Scholar] [CrossRef]
  24. Liu, E.; Yan, X. New parameter-free mobility model: Opportunity priority selection model. Phys. A Stat. Mech. Its Appl. 2019, 526, 121023. [Google Scholar] [CrossRef] [Green Version]
  25. Yan, X.; Wang, W.; Gao, Z.; Lai, Y. Universal model of individual and population mobility on diverse spatial scales. Nat. Commun. 2017, 8, 1639. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Guo, A.; Ding, X.; Zhong, F.; Cheng, Q.; Huang, C. Predicting the future Chinese population using shared socioeconomic pathways, the sixth national population census, and a pde model. Sustainability 2019, 11, 3686. [Google Scholar] [CrossRef] [Green Version]
  27. Wei, Y.; Wang, Z.; Wang, H.; Li, Y.; Jiang, Z. Predicting population age structures of China, India, and Vietnam by 2030 based on compositional data. PLoS ONE 2019, 14, e0212772. [Google Scholar] [CrossRef] [PubMed]
  28. Central People’s Government of the People’s Republic of China. Territory of the People’s Republic of China. Available online: http://www.gov.cn/test/2005-06/15/content_18252.htm (accessed on 15 June 2005).
  29. National Bureau of Statistics of China. China City Statistical Yearbook-2018; China Statistics Press: Beijing, China, 2018. [Google Scholar]
  30. National Bureau of Statistics of China. Bulletin of the Seventh National Census (No. 1). Available online: http://www.stats.gov.cn/tjsj/tjgb/rkpcgb/qgrkpcgb/202106/t20210628_1818820.html (accessed on 11 May 2021).
  31. Guo, T.; Shen, P.; Wang, W.; Shi, L.; Yang, Y.; Zhang, L.; Chen, Y.; Hu, Y.; Luo, F. Lightning occurrence recognition method based on atmospheric electric field value. J. Comput. Appl. Softw. 2018, 35, 86–90. (In Chinese) [Google Scholar]
  32. Wu, J. Study on the influencing factors of Guangxi’s low-carbon economic development. Acad. Forum 2013, 36, 105–109. (In Chinese) [Google Scholar]
  33. Wang, Y. A Forecasting Research on Population Size of Jilin Province Based on Grey GM (1,1) Model. Appl. Mech. Mater. 2014, 667, 273–276. [Google Scholar] [CrossRef]
  34. Camelia, D. Grey systems theory in economics—Bibliometric analysis and applications’ overview. Grey Syst. Theory Appl. 2015, 5, 244–262. [Google Scholar] [CrossRef]
  35. Camelia, D. Grey systems theory in economics—A historical applications review. Grey Syst. Theory Appl. 2015, 5, 263–276. [Google Scholar] [CrossRef]
  36. Gao, H.; Yao, T.; Kang, X. Population forecast of Anhui province based on the gm(1,1) model. Grey Syst. Theory Appl. 2017, 7, 19–30. [Google Scholar] [CrossRef]
  37. Jia, H.; Xie, Z.; Gao, Y. Population prediction and development trend analysis of Gansu province. J. Northwest Popul. 2018, 39, 118–126. (In Chinese) [Google Scholar]
  38. Langford, E. Quartiles in Elementary Statistics. J. Stat. Educ. 2006, 14, 1–20. [Google Scholar] [CrossRef]
  39. Koenker, R.; Bassett, G. Regression quantiles. Econometrica 1978, 46, 33–50. [Google Scholar] [CrossRef]
  40. Thisse, J. Human capital and agglomeration economies in urban development. Dev. Econ. 2018, 56, 117–139. [Google Scholar] [CrossRef]
  41. Brinton, M.C.; Oh, E. Babies, Work, or Both? Highly Educated Women’s Employment and Fertility in East Asia. Am. J. Sociol. 2019, 125, 105–140. [Google Scholar] [CrossRef] [Green Version]
  42. Hu, M.; Sun, Z. Too educated to be married? An investigation into the relationship between education and marriage in China. J. Fam. Stud. 2021, 1, 1–19. [Google Scholar] [CrossRef]
Figure 1. Population box plots for the selected four years.
Figure 1. Population box plots for the selected four years.
Sustainability 14 04844 g001
Figure 2. The spatial distribution of the population of the cities, population growth rate: the spatial distribution diagrams of prefecture-level urban populations in 2000, 2010, 2020, and 2030 (Unit: %). Note: this diagram uses the base diagram that comes with the Python pyecharts package. The base map is not modified. The bold gray lines in the figure indicate the geographical boundary between Northern and Southern China.
Figure 2. The spatial distribution of the population of the cities, population growth rate: the spatial distribution diagrams of prefecture-level urban populations in 2000, 2010, 2020, and 2030 (Unit: %). Note: this diagram uses the base diagram that comes with the Python pyecharts package. The base map is not modified. The bold gray lines in the figure indicate the geographical boundary between Northern and Southern China.
Sustainability 14 04844 g002
Figure 3. The spatial distribution of the population of the cities—population: the spatial distribution diagrams of prefecture-level urban populations in 2000, 2010, 2020, and 2030 (Unit: ten thousand people). Note: this diagram uses the base diagram that comes with the Python pyecharts package. The base map is not modified. The bold gray lines in the figure indicate the geographical boundary between Northern and Southern China.
Figure 3. The spatial distribution of the population of the cities—population: the spatial distribution diagrams of prefecture-level urban populations in 2000, 2010, 2020, and 2030 (Unit: ten thousand people). Note: this diagram uses the base diagram that comes with the Python pyecharts package. The base map is not modified. The bold gray lines in the figure indicate the geographical boundary between Northern and Southern China.
Sustainability 14 04844 g003
Figure 4. Spatial distribution of the urban agglomeration areas. Note: this diagram uses the administrative map of the People’s Republic of China from https://beijing.tianditu.gov.cn/, (Access date 31 May 2022). The base map is not modified. The urban agglomeration showed in this diagram is the approximate range of urban agglomeration in China, covering all the prefecture-level cities but not differentiating county-level cities, so it might be larger or smaller than the exact area of the urban agglomerations. The bold black lines in the figure indicate the geographical boundary between Northern and Southern China.
Figure 4. Spatial distribution of the urban agglomeration areas. Note: this diagram uses the administrative map of the People’s Republic of China from https://beijing.tianditu.gov.cn/, (Access date 31 May 2022). The base map is not modified. The urban agglomeration showed in this diagram is the approximate range of urban agglomeration in China, covering all the prefecture-level cities but not differentiating county-level cities, so it might be larger or smaller than the exact area of the urban agglomerations. The bold black lines in the figure indicate the geographical boundary between Northern and Southern China.
Sustainability 14 04844 g004
Figure 5. Urban agglomeration areas. Note: this diagram uses the administrative map of the People’s Republic of China from https://beijing.tianditu.gov.cn/, (Access date 22 May 2022). The base map is not modified. Due to the unavailability of data and the boundary problem of urban agglomeration, the figure shows the approximate range of urban agglomeration area. The bold gray lines in the figure indicate the geographical boundary between Northern and Southern China. The cities are sorted in no particular order.
Figure 5. Urban agglomeration areas. Note: this diagram uses the administrative map of the People’s Republic of China from https://beijing.tianditu.gov.cn/, (Access date 22 May 2022). The base map is not modified. Due to the unavailability of data and the boundary problem of urban agglomeration, the figure shows the approximate range of urban agglomeration area. The bold gray lines in the figure indicate the geographical boundary between Northern and Southern China. The cities are sorted in no particular order.
Sustainability 14 04844 g005
Table 1. Summary of mainstream population prediction methods.
Table 1. Summary of mainstream population prediction methods.
MethodCore ContentHypothesisProsCons
Historical trend methodHistorical growth trends, demographic characteristics, inertial measurement of population, Malthus modelFuture population would show the same pattern as population in the pastThe method is classical and widely usedLong-term prediction accuracy will decline
Classical population analysisLife table technology, hypothetical cohort analysis, population prediction technology and total fertility rateAccounting for birth and death of people, the measure of population is not the only factorHigh prediction accuracyThe demand for data is high and requires long-term accumulation
Regression modelLogistic model, ARIMA model, Leslie matrix, unitary and multiple linear regression modelSociology investors believe that population growth may have linear or quasi-linear relationship on certain factorsLess data is required and the method is simpleLong-term prediction error fluctuation is relatively large
Gray system prediction modelGM (1, 1) modelThe trend of population is a fuzzy system, which means the status in the past cannot be used to predict the accurate population growth, only the probability of the status in the futureLess required data, high accuracy of predictionOnly for short- and medium-term forecasts
Population prediction modelGravity model, radiation model, Song Jian population equation model, population flow model, etc.Spatial interaction among cities reflects the growth pattern of populationThe model form is more intuitive and explicableThe data requirements are strict and the model is complex
BP neural network, artificial intelligenceNonlinear dynamics systems, mathematical equations, etc.High-order statistical rule reflections from big data might be suitable for population prediction, which cannot be calculated by small amounts of dataStrong ability of nonlinear mapping and generalizationThe convergence rate of the algorithm is slow, the interpretability is poor
System dynamicsDifferential dynamical equations and nonlinear stochastic processesPopulation is embedded into the whole system of the society, and might be correlated with a number of factorsCan find the root cause from the internal structure of the systemThe subjective color is heavy
Table 2. Population prediction results of cities based on the Malthusian model.
Table 2. Population prediction results of cities based on the Malthusian model.
The Results of Population Prediction of Beijing
YearHigh SchemeMiddle SchemeLow SchemeYearHigh SchemeMiddle SchemeLow Scheme
20111268.531259.521250.5220211380.981276.891179.97
20121279.351261.251243.2820221392.761278.631173.14
20131290.261262.981236.0820231404.641280.391166.35
20141301.271264.711228.9220241416.631282.141159.59
20151312.371266.441221.8120251428.711283.901152.88
20161323.561268.171214.7320261440.901285.661146.20
20171334.851269.911207.7020271453.191287.421139.57
20181346.241271.651200.7120281465.581289.181132.97
20191357.721273.391193.7520291478.081290.951126.41
20201369.301275.141186.8420301490.691292.721119.89
The Results of Population Prediction of Shanghai
YearHigh SchemeMiddle SchemeLow SchemeYearHigh SchemeMiddle SchemeLow Scheme
20111413.011410.361407.7220211419.951390.951362.49
20121413.701408.411403.1320221420.651389.031358.05
20131414.401406.461398.5520231421.341387.101353.62
20141415.091404.511393.9920241422.041385.181349.21
20151415.781402.571389.4520251422.741383.261344.81
20161416.481400.621384.9220261423.431381.351340.43
20171417.171398.681380.4020271424.131379.431336.06
20181417.871396.751375.9020281424.831377.521331.70
20191418.561394.811371.4220291425.531375.611327.36
20201419.261392.881366.9520301426.231373.711323.03
The Results of Population Prediction of Guangzhou
YearHigh SchemeMiddle SchemeLow SchemeYearHigh SchemeMiddle SchemeLow Scheme
2011818.71813.33807.952021955.68888.81826.23
2012831.47820.58809.762022970.58896.74828.08
2013844.43827.89811.572023985.71904.73829.93
2014857.60835.27813.3920241001.08912.80831.79
2015870.97842.72815.2120251016.69920.94833.66
2016884.55850.23817.0420261032.54929.15835.52
2017898.34857.81818.8720271048.63937.43837.39
2018912.34865.46820.7020281064.98945.79839.27
2019926.57873.18822.5420291081.58954.22841.15
2020941.01880.96824.3820301098.45962.72843.03
The Results of Population Prediction of Shenzhen
YearHigh SchemeMiddle SchemeLow SchemeYearHigh SchemeMiddle SchemeLow Scheme
2011266.41264.37262.322021341.63313.85288.13
2012273.12268.94264.792022350.23319.28290.85
2013280.00273.60267.292023359.05324.80293.59
2014287.05278.33269.812024368.09330.42296.36
2015294.28283.15272.362025377.36336.14299.16
2016301.69288.05274.922026386.86341.96301.98
2017309.28293.03277.522027396.61347.88304.83
2018317.07298.10280.132028406.59353.90307.70
2019325.06303.26282.782029416.83360.02310.60
2020333.24308.51285.442030427.33366.25313.53
Note: unit is ten thousand people.
Table 3. Population prediction results of first-tier cities based on the unary linear regression model.
Table 3. Population prediction results of first-tier cities based on the unary linear regression model.
The Results of Population Prediction of Beijing
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
20111279.831282.8420211429.271415.97
20121294.781296.1520221444.211429.29
20131309.721309.4620231459.151442.60
20141324.661322.7820241474.101455.91
20151339.611336.0920251489.041469.23
20161354.551349.4020261503.981482.54
20171369.491362.7220271518.931495.85
20181384.441376.0320281533.871509.17
20191399.381389.3520291548.811522.48
20201414.321402.6620301563.761535.79
The Results of Population Prediction of Shanghai
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
20111418.001412.0620211482.121494.55
20121424.411420.3120221488.541502.80
20131430.821428.5620231494.951511.05
20141437.241436.8120241501.361519.30
20151443.651445.0620251507.771527.55
20161450.061453.3120261514.191535.80
20171456.471461.5620271520.601544.05
20181462.891469.8120281527.011552.29
20191469.301478.0520291533.421560.54
20201475.711486.3020301539.841568.79
The Results of Population Prediction of Guangzhou
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
2011812.52819.022021947.45932.48
2012826.02830.372022960.94943.83
2013839.51841.722023974.43955.17
2014853.00853.062024987.92966.52
2015866.49864.4120251001.42977.87
2016879.99875.7520261014.91989.21
2017893.48887.1020271028.401000.56
2018906.97898.4420281041.891011.90
2019920.46909.7920291055.381023.25
2020933.95921.1420301068.881034.59
The Results of Population Prediction of Shenzhen
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
2011274.06292.122021511.43463.50
2012297.80309.262022535.17480.64
2013321.53326.402023558.91497.78
2014345.27343.542024582.64514.91
2015369.01360.672025606.38532.05
2016392.75377.812026630.12549.19
2017416.48394.952027653.85566.33
2018440.22412.092028677.59583.47
2019463.96429.222029701.33600.60
2020487.69446.362030725.07617.74
Note: unit is ten thousand people.
Table 4. Population prediction results of first-tier cities based on the logistic competitive biological model.
Table 4. Population prediction results of first-tier cities based on the logistic competitive biological model.
The Results of Population Prediction of Beijing
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
20111299.63 1294.9820211411.371333.79
20121315.161298.8120221418.241337.73
20131329.571302.6520231424.521341.69
20141342.921306.5020241430.271345.65
20151355.261310.3720251435.521349.63
20161366.661314.2420261440.321353.62
20171377.161318.1320271444.701357.63
20181386.831322.0220281448.701361.64
20191395.721325.9320291452.351365.67
20201403.881329.8520301455.671369.71
The Results of Population Prediction of Shanghai
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
20111425.821413.4720211474.001487.02
20121432.831421.2520221476.801493.86
20131439.251428.9320231479.331500.61
20141445.111436.5220241481.641507.26
20151450.461444.0220251483.741513.82
20161455.351451.4220261485.651520.28
20171459.801458.7320271487.381526.65
20181463.871465.9420281488.951532.93
20191467.571473.0620291490.381539.12
20201470.941480.0920301491.671545.22
The Results of Population Prediction of Guangzhou
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
2011824.27814.382021931.32912.58
2012834.40824.692022942.76921.73
2013844.65834.902023954.34930.76
2014855.02845.012024966.07939.64
2015865.53855.012025977.94948.39
2016876.16864.912026989.95957.00
2017886.93874.6820271002.12965.47
2018897.82884.3420281014.43973.79
2019908.85893.8820291026.89981.97
2020920.02903.2920301039.51990.01
The Results of Population Prediction of Shenzhen
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
2011299.93274.812021581.05548.75
2012320.44294.492022620.77588.04
2013342.34315.582023663.21630.15
2014365.74338.172024708.54675.27
2015390.75362.392025756.98723.62
2016417.46388.342026808.73775.43
2017446.00416.142027864.02830.95
2018476.49445.942028923.09890.45
2019509.06477.872029986.19954.21
2020543.87512.0820301053.611022.53
Note: unit is ten thousand people.
Table 5. Population prediction results of first-tier cities based on the GM (1, 1) model.
Table 5. Population prediction results of first-tier cities based on the GM (1, 1) model.
The Results of Population Prediction of Beijing
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
20111281.701277.9320211430.311446.31
20121295.841293.8520221446.091464.32
20131310.131309.9620231462.041482.55
20141324.581326.2820241478.161501.02
20151339.191342.7920251494.471519.71
20161353.971359.5220261510.951538.64
20171368.901376.4520271527.621557.80
20181384.001393.5920281544.471577.20
20191399.271410.9420291561.511596.84
20201414.701428.5220301578.731616.73
The Results of Population Prediction of Shanghai
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
20111419.651411.3820211480.541497.13
20121425.621419.7320221486.771505.98
20131431.621428.1320231493.031514.89
20141437.651436.5820241499.311523.85
20151443.701445.0820251505.621532.87
20161449.771453.6220261511.961541.94
20171455.871462.2220271518.321551.06
20181462.001470.8720281524.711560.23
20191468.151479.5720291531.131569.46
20201474.331488.3320301537.571578.74
The Results of Population Prediction of Guangzhou
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
2011809.11816.872021957.91943.15
2012822.89828.702022974.22956.80
2013836.90840.692023990.81970.66
2014851.15852.8720241007.68984.71
2015865.64865.2120251024.83998.97
2016880.37877.7420261042.281013.43
2017895.36890.4520271060.031028.10
2018910.61903.3420281078.071042.99
2019926.11916.4220291096.431058.09
2020941.88929.6920301115.091073.41
The Results of Population Prediction of Shenzhen
YearPopulation
(10 Years)
Population
(20 Years)
YearPopulation
(10 Years)
Population
(20 Years)
2011268.32273.742021569.60558.38
2012289.29293.972022614.14599.63
2013311.91315.692023662.15643.94
2014336.30339.022024713.92691.52
2015362.59364.062025769.74742.61
2016390.94390.962026829.92797.48
2017421.50419.852027894.80856.40
2018454.46450.872028964.76919.68
2019489.99484.1820291040.19987.63
2020528.30519.9620301121.521060.60
Note: unit is ten thousand people.
Table 6. Error test of high-, middle- and low-value scenarios.
Table 6. Error test of high-, middle- and low-value scenarios.
Error Test of Each Scheme in Beijing
YearHigh SchemeMiddle SchemeLow Scheme
ErrorRelative ErrorErrorRelative ErrorErrorRelative Error
2011−9.37−0.73%−18.38−1.44%−27.38−2.14%
2012−18.15−1.40%−36.25−2.79%−54.22−4.18%
2013−26.04−1.98%−53.32−4.05%−80.22−6.09%
2014−32.13−2.41%−68.69−5.15%−104.48−7.84%
2015−32.83−2.44%−78.76−5.85%−123.39−9.17%
2016−39.44−2.89%−94.83−6.96%−148.27−10.88%
2017−24.15−1.78%−89.09−6.56%−151.30−11.13%
2018−29.76−2.16%−104.35−7.58%−175.29−12.74%
Error Test of Each Scheme in Shanghai
YearHigh SchemeMiddle SchemeLow Scheme
ErrorRelative ErrorErrorRelative ErrorErrorRelative Error
2011−6.39−0.45%−9.04−0.64%−11.68−0.82%
2012−13.20−0.92%−18.49−1.30%−23.77−1.67%
2013−17.90−1.25%−25.84−1.80%−33.75−2.36%
2014−23.61−1.64%−34.19−2.38%−44.71−3.11%
2015−27.19−1.88%−40.40−2.80%−53.52−3.71%
2016−33.52−2.31%−49.38−3.41%−65.08−4.49%
2017−37.83−2.60%−56.32−3.87%−74.60−5.13%
2018−44.13−3.02%−65.25−4.46%−86.10−5.89%
Error Test of Each Scheme in Guangzhou
YearHigh SchemeMiddle SchemeLow Scheme
ErrorRelative ErrorErrorRelative ErrorErrorRelative Error
20114.110.50%−1.27−0.16%−6.65−0.82%
20129.171.12%−1.72−0.21%−12.54−1.53%
201312.131.46%−4.41−0.53%−20.73−2.49%
201415.201.80%−7.13−0.85%−29.01−3.44%
201516.781.96%−11.47−1.34%−38.98−4.56%
201614.551.67%−19.77−2.27%−52.96−6.09%
20170.340.04%−40.19−4.48%−79.13−8.81%
2018−15.66−1.69%−62.54−6.74%−107.30−11.56%
Error Test of Each Scheme in Shenzhen
YearHigh SchemeMiddle SchemeLow Scheme
ErrorRelative ErrorErrorRelative ErrorErrorRelative Error
2011−1.49−0.55%−3.53−1.32%−5.58−2.08%
2012−14.48−5.03%−18.66−6.49%−22.81−7.93%
2013−30.50−9.82%−36.90−11.89%−43.21−13.92%
2014−45.15−13.59%−53.87−16.22%−62.39−18.78%
2015−60.71−17.10%−71.84−20.24%−82.63−23.28%
2016−83.31−21.64%−96.95−25.18%−110.08−28.59%
2017−125.72−28.90%−141.97−32.64%−157.48−36.20%
2018−137.93−30.31%−156.90−34.48%−174.87−38.43%
Note: unit is ten thousand people.
Table 7. Error test of prediction results of the unary regression model.
Table 7. Error test of prediction results of the unary regression model.
Error Test of Each Scheme in BeijingError Test of Each Scheme in Shanghai
year10 Years Sample20 Years Sample10 Years Sample20 Years Sample
ErrorRelative ErrorErrorRelative ErrorErrorRelative ErrorErrorRelative Error
20111.930.15%4.940.39%−1.40−0.10%−7.34−0.52%
2012−2.72−0.21%−1.35−0.10%−2.49−0.17%−6.59−0.46%
2013−6.58−0.50%−6.84−0.52%−1.48−0.10%−3.74−0.26%
2014−8.74−0.66%−10.62−0.80%−1.46−0.10%−1.89−0.13%
2015−5.59−0.42%−9.11−0.68%0.680.05%2.090.14%
2016−8.45−0.62%−13.60−1.00%0.060.00%3.310.23%
201710.490.77%3.720.27%1.470.10%6.560.45%
20188.440.61%0.030.00%0.890.06%7.810.53%
Error Test of Each Scheme in GuangzhouError Test of Each Scheme in Shenzhen
Year10 Years Sample20 Years Sample10 Years Sample20 Years Sample
ErrorRelative ErrorErrorRelative ErrorErrorRelative ErrorErrorRelative Error
2011−2.08−0.25%4.420.54%6.162.30%24.229.04%
20123.720.45%8.070.98%10.203.55%21.667.53%
20137.210.87%9.421.13%11.033.55%15.905.12%
201410.601.26%10.661.27%13.073.93%11.343.41%
201512.301.44%10.221.20%14.023.95%5.681.60%
20169.991.15%5.750.66%7.752.01%−7.19−1.87%
2017−4.52−0.50%−10.90−1.21%−18.52−4.26%−40.05−9.21%
2018−21.03−2.27%−29.56−3.18%−14.78−3.25%−42.91−9.43%
Note: unit is ten thousand people.
Table 8. Error test of prediction results of logistic biological competition model.
Table 8. Error test of prediction results of logistic biological competition model.
Error Test of Each Scheme in BeijingError Test of Each Scheme in Shanghai
Year10 Years Sample20 Years Sample10 Years Sample20 Years Sample
ErrorRelative ErrorErrorRelative ErrorErrorRelative ErrorErrorRelative Error
201121.731.70%17.081.34%6.420.45%−5.93−0.42%
201217.661.36%1.310.10%5.930.42%−5.65−0.40%
201313.271.01%−13.65−1.04%6.950.48%−3.37−0.24%
20149.520.71%−26.90−2.02%6.410.45%−2.18−0.15%
201510.060.75%−34.83−2.59%7.490.52%1.050.07%
20163.660.27%−48.76−3.58%5.350.37%1.420.10%
201718.161.34%−40.87−3.01%4.800.33%3.730.26%
201810.830.79%−53.98−3.92%1.870.13%3.940.27%
Error Test of Each Scheme in GuangzhouError Test of Each Scheme in Shenzhen
Year10 Years Sample20 Years Sample10 Years Sample20 Years Sample
ErrorRelative ErrorErrorRelative ErrorErrorRelative ErrorErrorRelative Error
20119.671.19%−0.22−0.03%32.0311.96%6.912.58%
201212.101.47%2.390.29%32.8411.42%6.892.40%
201312.351.48%2.600.31%31.8410.25%5.081.64%
201412.621.50%2.610.31%33.5410.10%5.971.80%
201511.341.33%0.820.10%35.7610.07%7.402.08%
20166.160.71%−5.09−0.59%32.468.43%3.340.87%
2017−11.07−1.23%−23.32−2.60%11.002.53%−18.86−4.34%
2018−30.18−3.25%−43.66−4.70%21.494.72%−9.06−1.99%
Note: unit is ten thousand people.
Table 9. Error test of the GM (1, 1) model prediction results.
Table 9. Error test of the GM (1, 1) model prediction results.
Error Test of Each Scheme in BeijingError Test of Each Scheme in Shanghai
Year10 Years Sample20 Years Sample10 Years Sample20 Years Sample
ErrorRelative ErrorErrorRelative ErrorErrorRelative ErrorErrorRelative Error
20113.800.30%0.030.00%0.250.02%−8.02−0.56%
2012−1.66−0.13%−3.65−0.28%−1.28−0.09%−7.17−0.50%
2013−6.17−0.47%−6.34−0.48%−0.68−0.05%−4.17−0.29%
2014−8.82−0.66%−7.12−0.53%−1.05−0.07%−2.12−0.15%
2015−6.01−0.45%−2.41−0.18%0.730.05%2.110.15%
2016−9.03−0.66%−3.48−0.26%−0.23−0.02%3.620.25%
20179.900.73%17.451.28%0.870.06%7.220.50%
20188.000.58%17.591.28%0.000.00%8.870.61%
Error Test of Each Scheme in GuangzhouError Test of Each Scheme in Shenzhen
Year10 Years Sample20 Years Sample10 Years Sample20 Years Sample
ErrorRelative ErrorErrorRelative ErrorErrorRelative ErrorErrorRelative Error
2011−5.49−0.67%2.270.28%0.420.16%5.842.18%
20120.590.07%6.400.78%1.690.59%6.372.21%
20134.600.55%8.391.01%1.410.45%5.191.67%
20148.751.04%10.471.24%4.101.23%6.822.05%
201511.451.34%11.021.29%7.602.14%9.072.56%
201610.371.19%7.740.89%5.941.54%5.961.55%
2017−2.64−0.29%−7.55−0.84%−13.50−3.10%−15.15−3.48%
2018−17.39−1.87%−24.66−2.66%−0.54−0.12%−4.13−0.91%
Note: unit is ten thousand people.
Table 10. Comparison of precision of urban population prediction in 210 prefecture-level cities.
Table 10. Comparison of precision of urban population prediction in 210 prefecture-level cities.
ModelMalthusian Model
(High Growth Rate)
Malthusian Model
(Middle Growth Rate)
Malthusian Model
(Low Growth Rate)
Unary Linear Regression Model (10 Years)Unary Linear Regression Model (20 Years)
Average relative error rate (%)3.885%0.825%−2.122%−0.018%0.166%
ModelLogistic biological competition model (10 years)Logistic biological competition model (20 years)GM (1, 1) model
(10 years)
GM (1, 1) model
(20 years)
Multi-model
average
Average relative error rate (%)0.371%−0.303%0.005%1.327%0.460%
Note: all models take year 2010 as the base year for prediction, the prediction duration is 20 years, and the error test period is 2011–2018.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chen, L.; Mu, T.; Li, X.; Dong, J. Population Prediction of Chinese Prefecture-Level Cities Based on Multiple Models. Sustainability 2022, 14, 4844. https://doi.org/10.3390/su14084844

AMA Style

Chen L, Mu T, Li X, Dong J. Population Prediction of Chinese Prefecture-Level Cities Based on Multiple Models. Sustainability. 2022; 14(8):4844. https://doi.org/10.3390/su14084844

Chicago/Turabian Style

Chen, Lixuan, Tianyu Mu, Xiuting Li, and Jichang Dong. 2022. "Population Prediction of Chinese Prefecture-Level Cities Based on Multiple Models" Sustainability 14, no. 8: 4844. https://doi.org/10.3390/su14084844

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop