Distribution Dynamics, Regional Di ﬀ erences, and Convergence of Elderly Health Levels in China

: Despite e ﬀ orts to improve national health levels, inequalities in health have endured in China. Most prior studies have been conducted from the perspective of inﬂuencing factors with respect to regional di ﬀ erences or focus on comparative analyses of health development levels. This paper reviews regional characteristics to examine dynamic changes in health levels within regions, measures and distinguishes inter-regional di ﬀ erences, and explores the evolution of these di ﬀ erences. Results show that elderly health levels in the overall system are polarized to varying degrees, but within regions, agglomeration continues. There were decreasing ﬂuctuations in overall di ﬀ erences during the sample period: Regional di ﬀ erences inﬂuenced ﬂuctuations the most, and inner-regional di ﬀ erences the least. In the short term, inequalities in health will continue to exist, but the gap has been closing in the long-term in China. The health development of neighborhoods has a signiﬁcant driving e ﬀ ect in narrowing the di ﬀ erences in the eastern and northeast regions, while economic and health security levels in the central and western regions should be strengthened. These ﬁndings can help decision-makers to create appropriate health policies that are more targeted to actual needs, and provide policy implications for promoting the development of better life quality in China by considering regional di ﬀ erences.


Introduction and Literature Review
Health is considered a basic human right by the World Health Organization (WHO, 1946) in its strategic goals for health planning. This represents not only improved human health, but also the promotion of inter-regional and inter-population health equity to help WHO members to improve the health of their citizens. Since China's economic reform and deregulation, with increased medical accessibility, the national health level has also significantly improved, but the issue of health differences between different regions is becoming increasingly prominent. For example, in the year 2000, Shanghai residents had the longest life expectancy (78 years), while residents of Tibet had the shortest life expectancy (64 years). In 2018, the life expectancy in Shanghai had improved to 84 years, while in Tibet the life expectancy had only improved to 68 years, with the difference between the regions increasing from 14 to 16 years. Of the top 15 provinces (cities and districts) with the longest life expectancy, 13 are in the eastern (southern) coastal areas, followed by the inland provinces (according to the China Statistical Yearbook 2018). Clearly, China's rapid economic development has not reduced health differences between different regions. Despite endogeneity in the formation and development of health geography, no country or region has a proper balance of health. However, from the perspective convergence model with spatial factors can be used for analysis to provide more scientific, accurate, and convincing results.
This paper attempts to use the above mentioned models to analyze the regional differences and dynamic evolution of the EHLs in China. Firstly, Kernel density estimation is used to analyze the distribution shape and the trend of health levels. The DG coefficient is then adopted to measure the regional differences and further reveal the differences sources. Finally, the convergence model with spatial effect is used to test the evolution trend of regional differences. The contributions of this paper lie in several aspects: The application of Kernel density estimation to reflect the health distribution and shape is conducive to a deeper understanding of health development in each region. DG coefficients can not only describe the size and source of regional differences, but also effectively solve the problem of cross and overlapping between the research samples. In addition, the convergence model with spatial factors can not only examine the evolution of health differences, but also reflect the mutual influence in adjacent regions, so as to formulate appropriate strategies to promote the development of regional health.

Kernel Density Estimation
Kernel density estimation is a non-parametric estimation method, which is widely used to fit sample data by smoothing the peak function and describing the distribution form of random variables by using a continuous density curve. Based on its strong robustness, it has also been often used to solve the dynamic evaluation problem including distribution location, trend, polarization trend.
The density function of random variable X is as follows: where N is the number of observations, and X i is the independent identical distribution for i ∈ {1, . . . , N}, x is the average value, K is the kernel density, and h is the bandwidth. As a weighting function or smooth function, the density usually is required to meet the following conditions: The main kernel functions are the triangular, quadratic, and Gaussian. This paper uses the Gaussian kernel function to analyze the distribution dynamics of health levels over the observation period. The kernel density is usually sensitive to bandwidth, with larger (smaller) bandwidth leading to smoother (rougher) density function, but lower (higher) accuracy. So, for higher accuracy, a smaller bandwidth should be used. According to the kernel density estimation results, the distribution position, shape, and ductility of random variables can be observed in the curve image.

DG Coefficient and Decomposition Method
Gini coefficient is a common measure of inequality, and DG coefficient was proposed by Dagum [21] to solve the problem of overlapping between sample data and the sources of regional differences. It has been widely used to investigate regional differences. In this paper, the DG coefficient and its decomposition method are used to analyze regional differences of EHLs in China. To save space, the solution process is not provided (see Liu [22] for more details).

Convergence Method
To further investigate the trends of regional differences in EHLs, this paper mainly considers two aspects: σ convergence and β convergence. The main differences between them are that σ convergence is based on the absolute levels, while β convergence is based on increments. Their test methods are descripted as follows.
(1) σ convergence represents an intuitive understanding of convergence, and it is used to test whether the difference degree will continue to decline over time. This paper uses the coefficient of variation (CV) as an indicator of σ convergence, and its computational formula is shown in Equation (3): Here, j ( j = 1, 2, 3, 4) represent China's four regions including east, central, west, and northeast, i represents ith province (or city); n j represents the number of provinces (centrally administered municipalities) located in j region; and y j represents mean health score in the j region.
(2) β convergence theory originated from the "Iron Law of Convergence" in the Neo-Classical Theory of Economic Growth, i.e., the Law of Diminishing Marginal Returns. The convergence refers to the health difference between the backward regions and the developed regions that will gradually narrow and eventually reach the same level. β convergence currently has been extensively applied in studies on convergence, as it can disclose the convergence mechanism [23,24].
According to preconditions, the β convergence include absolute β convergence and conditional β convergence. Absolute β convergence is a convergence without considering a series of control variables, such as regional economic levels, health, and medical services, and its basic model can be expressed as Equation (4): ln y j,t+1 − ln y j,t = α + β ln(y j,t ) + ε jt Here, ln y j,t+1 − ln y j,t+1 represents the health development rate in j region; β is the convergence coefficient, if β < 0, then it illustrates that absolute β convergence exists, and convergence rate is λ = − 1 T ln(1 + β); on the contrary, if β > 0, then the health difference has no convergence. Assume ε jt is the independent identically distributed stochastic disturbance.
To investigate the impact of other regional health developments on this region, spatial factors should be incorporated in the model to make the research more rigorous [25]. The spatial Durbin model (SDM) is constructed as shown in Equation (5). ln y j,t+1 − ln y j,t = α + β ln(y j,t ) + ρ k h=1 (w jh (ln y h,t+1 − ln y h,t )) + θ k h w jh ln(y h,t ) + ε jt (5) Here, ρ represents the endogenous interaction effect, and reflects the effect of other regions on this region in development rate. θ is the exogenous interaction effect, and reflects the effect of other regions on this region in development level. w jh is the factor of the spatial weight matrix.
Conditional β convergence can make the estimate results more convincing and accurate by adding control variables X that are highly correlated with health level [26,27]. It also conforms to the new economic growth theory that convergence may differ according to the external environment. The corresponding models are shown as Equations (6) and (7): ln y j,t+1 − ln y j,t = α + β ln(y j,t ) + δ ln X j,t + ε jt (6) = α + β ln(y j,t ) + ρ k h=1 (w jh (ln y h,t+1 − ln y h,t )) + θ k h w jh ln(y h,t ) + δ ln X j,t + ε jt It should be noted that the conditional β convergence test requires initially setting control variables, but the selection of control variables would vary according to real situations [28]. Since the control variables are not the focus of this paper, and excessive control variables would incur endogeneity, this paper selected disposable income per capita, hospitals per capita, and doctors per capita as the control variables of regional characteristics, which are related to both health level and regional characteristics. This paper also verifies and modifies whether there are omitted variables in the model.

Data Source and Regional Division
Two types of data are mainly used in this paper. The first type of data is self-reported health, which comes from the latest Chinese Longitudinal Healthy Longevity Survey (CLHLS, 2002-2014) by Healthy Aging and Development Studies of Peking University. CLHLS adopts the multi-stage unequal proportional random sampling method, covers more than 800 counties and cities in 23 provinces across China, and has been adopted by the academic community due to its high quality. This paper targeted that the group aged 65 and above were enrolled in the most recent 5 surveys. Self-reported health, like the Likert scale, is quantified by a sequencing variable, 1 = "very good", 2 = "good", 3 = "average", 4 = "poor", 5 = "very poor". Since differences between sequencing variable items cannot be clearly defined, we use the treatment method proposed by Du [14], to assign 1, 2, 3, 4, and 5 to sequencing variable items from "very good" to "very poor" in sequence, respectively. The second type of data including disposable income per capita, hospital per capita, and doctor per capita, which were collected from the China Statistical Yearbooks of corresponding years for related provinces (centrally administered municipalities).
Regional division: Based on the Chinese government report in 2015, the new space expansion strategic are of the "four big plates" in the government work report and based on data availability, this paper divided China into east, central, west, and northeast regions containing 22 provinces (centrally administered municipalities) to investigate elderly health status.

Factual Description
In order to reveal spatial distribution of the ELHs during the observation periods, this paper measured the health scores of the overall system and the four regions, and the overall horizontal characteristics can be seen from the results. In general, the EHLs distribution in China is unbalanced, with obvious differences between the four regions. For instance, in 2014, the top 5 provinces by health level are Tianjin, Hebei, Beijing, Shandong, and Jiangsu, with an average score of 2.40, and the last 5 provinces are Fujian, Hubei, Shanxi, Jiangxi, and Chongqing, with average score of 2.79, for a difference of 0.39. In 2002, the health differences are also obvious, and the value reaches the difference of 0.43. Figure 1 shows health scores for the four regions over time, in which horizontal axis represents time, and the vertical axis represents the score. As seen in Figure 1, the health levels in east and northeast are always higher than the overall average level and the central region and the west region are always lower than overall average level, among which the difference between the highest health level (east: Average score 2.49) and the lowest health level (west: Average score 2.71) is 0.22. In fact, the regional differences can be deduced from the health level ranking of provinces (centrally administered municipalities): The top 5 in 2014 belong to the east region, and the last 5 belong to the central region or west region. Similar to the situation in 2002, the top 5 belong to the east region, and the last 5 do not include eastern provinces. The east region has the highest health level, and west region has the lowest health level. This also reflects that the EHLs are highly related to the geography pattern of economic development. The east region, with relatively developed economy, has more medical resources and higher economic growth to improve EHLs and reduces elderly health vulnerability. the last 5 do not include eastern provinces. The east region has the highest health level, and west region has the lowest health level. This also reflects that the EHLs are highly related to the geography pattern of economic development. The east region, with relatively developed economy, has more medical resources and higher economic growth to improve EHLs and reduces elderly health vulnerability.

Distribution Dynamic
This paper uses the kernel density estimation to analyze dynamic change of health level in the overall system and four regions in China. In order to ensure the stability of the analysis results, we combine the northeast and east into one. The principle of the combination is to combine some regions with similar properties into one to enlarge the sample size and make the regional differences greater. Northeast only includes three provinces, and its health levels are similar to the east (as shown in Figure 1). Details of kernel density estimation results are shown in Figure 2.

Distribution Dynamic
This paper uses the kernel density estimation to analyze dynamic change of health level in the overall system and four regions in China. In order to ensure the stability of the analysis results, we combine the northeast and east into one. The principle of the combination is to combine some regions with similar properties into one to enlarge the sample size and make the regional differences greater. Northeast only includes three provinces, and its health levels are similar to the east (as shown in Figure 1). Details of kernel density estimation results are shown in Figure 2. region has the lowest health level. This also reflects that the EHLs are highly related to the geography pattern of economic development. The east region, with relatively developed economy, has more medical resources and higher economic growth to improve EHLs and reduces elderly health vulnerability.

Distribution Dynamic
This paper uses the kernel density estimation to analyze dynamic change of health level in the overall system and four regions in China. In order to ensure the stability of the analysis results, we combine the northeast and east into one. The principle of the combination is to combine some regions with similar properties into one to enlarge the sample size and make the regional differences greater. Northeast only includes three provinces, and its health levels are similar to the east (as shown in Figure 1). Details of kernel density estimation results are shown in Figure 2.  The center position of the overall distribution curve has a slight left-slight right shift and eventually returned to the starting level, and the main peak height mainly shows a slight but significant rise. The main peak gradually but significantly narrowed, especially the boundary on the right clearly tends to converge, which indicates that the overall EHLs in China do not rise significantly, and the absolute difference shows a narrowing trend, and the growth rate of some provinces may be accelerated significantly. From the perspective of polarization, the overall system gradually evolved from a weak bimodal to a relatively obvious bimodal   height mainly shows a slight but significant rise. The main peak gradually but significantly narrowed, especially the boundary on the right clearly tends to converge, which indicates that the overall EHLs in China do not rise significantly, and the absolute difference shows a narrowing trend, and the growth rate of some provinces may be accelerated significantly. From the perspective of polarization, the overall system gradually evolved from a weak bimodal to a relatively obvious bimodal distribution, and the height of the two peaks has changed from weak initial difference to significant difference. The left peak value is significantly lower, indicating that the overall EHLs of the overall system had an obvious gradient effect over time, showing a polarization trend. Since individual province had obvious heterogeneity in some factors, such as policy support, government fiscal capacity, resource endowment, economic development level, and population density, thus the low health level provinces have difficulty to quickly catch up with the high health level provinces, so the gap is likely to continue to widen. Secondly, from the perspective of distribution pattern, the main peak height of the eastern distribution curve declines sharply to rise slightly, and its width widens slightly, which indicates that the absolute difference in the eastern region tended to expand. Thus, the EHLs between provinces are gradually dispersed, and the number of provinces deviating from the mean gradually increase. In the central region, the obvious lowering of the main peak and significant narrowing indicate that the absolute differences are clearly smaller and the health development speed of some provinces are obviously accelerated. The main peak height in the west continues to rise and to narrow, which also shows that the absolute differences become smaller, provinces with lower health level have a strong catch up momentum, and the differences with higher health levels are becoming smaller.
Finally, from the point of polarization, the east and central have an initial state of "one main and one side" bimodal distribution that gradually evolves into the "single-peak" state. Then, the east peak values of the right side gradually fall to a smooth, which means that the lead of healthier provinces is weakening; in the central region, the left side peak value tends to smooth, which indicates that the provinces with lower health levels develop faster, the gap with those with higher health levels gradually narrows, then form a gathering trend. The west retains a single-peak, and the peak value increases. Overall, the decentralization agglomeration characteristics in the four regions are gradually obvious, the comprehensive agglomeration characteristics in each region increased significantly.

Spatial Differences and Decomposition
Health scores could relatively intuitively reflect the facts of health differences, but cannot reflect the degree and structure of differences. This paper applied DG coefficient to estimate and decompose to the spatial difference, as shown in Tables 1 and 2.

Spatial Differences
(1) The overall differences. In Table 1, the DG coefficients are between 0.0296 and 0.0392, and the largest and smallest differences occur in 2008 and 2014, respectively. Based on the difference's evolution trend, the DG coefficient will not increase or decrease continuously, and the periodic change is obvious. The DG coefficient increases from 0.0352 in 2002 to 0.0392 in 2008, for an annual growth rate of 1.81%; and then decreases from 0.0392 in 2008 to 0.0296 (minimum) in 2014, for an annual decline rate of 4.79%. This indicates that the overall elderly health difference is gradually reduced in recent years.
(2) Regional differences. According to the the regional DG coefficient shown in Table 1, intra-regional differences within four regions showed fluctuations and a rise-fall trend in east, central, and northeast regions, except for the west region. In particular, 2011 showed a significant inflection point. The east region had the smallest fluctuation (0.0081), while northeast had the largest fluctuation (0.0305), indicating that health differences among provinces in east region remained relatively stable, while in northeast region, they fluctuated significantly. In respect of coefficient values, the largest value has always been observed in the east region, followed by the central and west, with the northeast lowest, indicating that heath inequality is most prominent in east region, while in west and northeast regions, there are no differences.
(3) Inter-regional differences. As shown in Table 2, inter-regional differences throughout the survey period changed little, though there were significant periodic fluctuations. For example, the east-central DG value increased at rate of 1.02%, from 0.0392 in 2002 to 0.0431 in 2014, while it reached 6.92% between 2008 and 2011. Central-northeast difference increased from 0.0305 in 2002 to 0.0486 in 2014, which demonstrates that the differences between the two regions is expending. During the entire observation period, east-central, northeast-central, and east-west differences remained at high levels in most years, while central-west and east-northeast differences were relatively small, indicating that the difference between higher level region and lower level region is larger. The order by inter-regional health difference is the same as the order by regional heath grouping characteristics.

Difference Decomposition
The recourse and contribution of spatial differences are listed in Table 3. As listed in Table 3, inter-regional difference contribution, transvariation intensity, and inner-regional difference during the survey period were 39.62%, 36.48%, and 23.90%, respectively, and inter-regional difference is the major cause of overall difference, while inner-regional difference made the minimum contribution.  Figure 3 shows difference contribution trends, and the contribution of the intra-regional difference changed little, remaining within 22.68-24.98%. The contribution rate of inter-regional difference remained at a high level and changed little during the sample period. Contribution rates of inter-regional difference, intensity of transvariation, and intra-regional differences and their trends are substantially consistent with the above mentioned difference characteristics that inter-regional difference is large and intra-regional difference is small.  Figure 3 shows difference contribution trends, and the contribution of the intra-regional difference changed little, remaining within 22.68-24.98%. The contribution rate of inter-regional difference remained at a high level and changed little during the sample period. Contribution rates of inter-regional difference, intensity of transvariation, and intra-regional differences and their trends are substantially consistent with the above mentioned difference characteristics that inter-regional difference is large and intra-regional difference is small.

Convergence Analysis of EHLs
Based on the discussion on the evolution of dynamic distribution and spatial difference decomposition of EHLs in China, to more accurately investigate the trends of health level spatialtemporal pattern, this paper explores this issue in depth to further reveal its spatial convergence characteristics.

Convergence Analysis of EHLs
Based on the discussion on the evolution of dynamic distribution and spatial difference decomposition of EHLs in China, to more accurately investigate the trends of health level spatial-temporal pattern, this paper explores this issue in depth to further reveal its spatial convergence characteristics.

σ Convergence Test
This paper uses CV coefficient to analyze σ convergence. Table 4 shows the CV dynamic change trend of the overall and four regions during the observation period. From the perspective of the trend, the overall system shows a slight decline, steady rise, and gradual decline, with no consistent convergence or divergence over time. In particular, a steady decline occurred after the peak in 2008, which means spatial differences continue to narrow from then. The CV evolution trend in thr four regions varies: The east remained relatively stable with no σ convergence, the central region fluctuated greatly without convergence, the northeast region had a slight decline, and west declined after 2005. The CV values for the east and central regions remained relatively high, which demonstrating that intra-regional health differences within these two regions are large, while north and west regions are relatively small. In general, the difference trends for the overall and the regional levels measured from the absolute difference show no significant convergence over all the sample periods, but the differences gradually narrowed after 2011. This is generally consistent with the above analysis results of factual analysis and difference decomposition.

β Convergence Test
Before β convergence test, Moran's I index was used to investigate spatial correlations of health levels, and the spatial weights adopted first-order binary proximity matrix. If the two regions are proximal, the weight is taken as 1. Otherwise, it is taken as 0. In order to guarantee the measurement consistency and model balance, standardized processing was made in the matrix. The calculation results from Geoda software show that Moran's I index is between 0.05-0.23, with greater than 5% significance, illustrating that the health level has the spatial reliance. Though there is not a strong correlation, it has still passed the significance test, so this paper adopts the SDM for convergence analysis.
(1) Absolute β convergence test. Table 5 reports the absolute β convergence results for the overall system and regional levels. According to the estimated results, it is clear that the β coefficients of the overall system and the four regions are all less than 0, and all of them have passed the significance level tests, which indicate that the EHLs in the overall system and the four regions all have absolute β convergence. That is, the health level will eventually converge to the same stable level. Provinces with lower health level have faster growth rate than those with higher health level, and the regional differences are gradually narrowed. According to the convergence rates of the four regions, the western region has the fastest convergence rate, followed by the central region, and east regions, with the northeast region being slowest. From the perspectives of convergence mechanism, the spatial interaction has the significant driving on difference convergence for the overall system and the east region, but has no significant effect in other regions. It is noteworthy that the central and the west spatial effect is negative, but still has convergence, indicating that competition within the central and the west regions are fierce, and the provinces with lower level develop relatively faster.  (2) Conditional β convergence analysis. Conditional β convergence, including the regional characteristics variable related to health level, are listed in Table 6. Stepwise regression method is used to avoid collinearity among three variables, and the variable of hospitals per capita is not included in models because of its insignificance. δ 1 is the disposable income coefficient, δ 2 is doctors per capita coefficient, and the results are listed in Table 6. It is clear that the convergence coefficients of the overall system and the four regions are all negative, and all of them have passed the significance level test of 1%, which indicates that conditional β convergence exists under the condition of considering the regional heterogeneity. It also means that the EHLs are all developing towards their steady state level, and their rates of convergence are faster than the absolute β convergence. In respect of convergence mechanisms, the spatial effects have significantly narrowed the overall health differences, also have positive but weaker or insignificant effects in east and northeast, and are negative and significant effects in central and west. The coefficients and significance levels of the control variables in the overall system and the four regions are different. The disposable income coefficient δ 1 is significant in the west and the central regions, indicating that higher disposable income per capita has narrowed the differences in both regions. But the doctors per capita δ 2 is an important factor to most regions, especially to the central regions.  It should be noted that the effect of the control variables on difference conditional β convergence shows apparent regional heterogeneity. Disposable income per capita is significant to the central and west regions, and insignificant to the others, which may be attributable to insignificant marginal utility of income in east due to a relatively higher health level. The doctors per capita is helpful to improve overall health level in most regions, especially to the central region, which should be enhanced due to its large populations and poor economic level. However, specific analysis is not implemented since influence of the control variable is not a focus of this paper, and further exploration is needed in the future. Regional differences for disposable income per capita, doctors per capita, and hospitals per capita are shown in Figure 4. helpful to improve overall health level in most regions, especially to the central region, which should be enhanced due to its large populations and poor economic level. However, specific analysis is not implemented since influence of the control variable is not a focus of this paper, and further exploration is needed in the future. Regional differences for disposable income per capita, doctors per capita, and hospitals per capita are shown in Figure 4.

Conclusions and Policy Enlightenments
This paper, using self-evaluation health indicators in CLHLS (2002-2014), including 22 provinces (centrally administered municipalities) in four regions, depicts Chinese EHLs regional differences, kernel density estimation is used to investigate distribution dynamic and characteristics,

Conclusions and Policy Enlightenments
This paper, using self-evaluation health indicators in CLHLS (2002-2014), including 22 provinces (centrally administered municipalities) in four regions, depicts Chinese EHLs regional differences, kernel density estimation is used to investigate distribution dynamic and characteristics, and then the DG coefficients and decomposition methods are used to provide scientific measurement and decomposition for the relative differences. Based on the analysis of regional differences, adopting σ convergence and β convergence methods, different evolution characteristics are empirically tested. Specifically, we investigate the effect of spatial factors and economic level on convergence. The main conclusions of this paper are as follows: First, from the typical facets, during the observation period, there was no significant improvement in the EHLs in China, but there were significant spatial differences among regions and within regions. There was a declining trend gradually from east to west, with the east and northeast levels always higher than the other regions, the central and western levels lower, and the central area average slightly Sustainability 2020, 12, 2288 13 of 15 higher than the western area. The distribution pattern of the health level is closely related with regional economy and endowment.
Second, from the perspective of absolute differences, the distribution curves for health level in the overall and the four regions vary. Specifically, during the observation period, the absolute difference in EHLs in the overall system and the central and western regions gradually decreased, while the east and northeast showed expanding trends, indicating that the development of some provinces in the central and west regions accelerated significantly, while the provinces in the east and northeast regions developed in parallel. Polarization phenomenon shows that the overall EHLs had a weak but clear bimodal distribution, while the east and central mainly evolved to bimodal-unimodal, whereas the west remained unimodal. The overall health level clearly has a gradient effect, and a polarization trend, but clear agglomeration characteristics exist in regions.
Third, from the sources and contributions of the overall difference, the inter-regional differences are the first source of the overall spatial differences, with the highest contribution rate, followed by the supervariable density, and finally, the intra-regional differences, whose contributions to the overall differences is relatively small. The internal imbalance in the east was the most prominent, followed by that in the central. The differences between east-central and east-west are the biggest, and the differences between central-west is the smallest.
Finally, from the dynamics of health difference, the differences are still present in a short period, but tend to narrow due to faster development speed of the low health level regions. There is heterogeneity in the convergence mechanisms. According to spatial variation coefficients, we conclude that the provinces with higher health levels have a driving effect to narrow the difference in the east and northeast, but in the central and west, the health differences convergence are more dependent on the improvement of the economic and health security levels.
Based on the above conclusions, we can draw the following enlightenment.

1.
We should pay more attentions to the spatial imbalance of EHLs in China, especially the fact that the inter-regional difference is the main source of the overall differences, so as to realize the coordinated development among regions. For the central and west regions with relatively low health level, based on the fact that the economic development level is relatively low, we should allocate more medical resources, especially health technical personnel to bridge medical personnel "lowland", increase order-oriented training, and increase incentives on salary and promotion channels. These actions will make up for the shortage of doctors per capita, and more hospitals can be built to accommodate the supplements and further improve the medical standard. 2.
Regional heterogeneity of health differences should be paid attention to for further development. The health levels, dynamic distributions, and convergence characteristics are different in different regions. Therefore, the formulations and guidances of health policies should vary from region to region and cannot be generalized. For example, for the east and northeast, it is necessary to further strengthen the driving and radiation effects of other provinces and cities with higher health levels, while in the central and west, due to the competition and compression phenomena between provinces and cities, coordinated development and improvement of the overall health level are feasible solutions.

3.
Convergence test shows that the short-term health unbalance state will remain, but the area will eventually achieve their stable state as time goes by. So respecting the natural endowment characteristic of health levels and the regional endogenous of health geographic patterns and development, healthy policies in light of local conditions are formulated to ensure the regional health equivalent development.
This study is subject to three limitations: First, there is the issue of sample selection bias, since follow-up survey data were used in this study. Generally, people exposed to a high health risk are more likely to die, and follow-up survey would be discontinued, whereas people in the samples who are still alive may have a better health level. Therefore, our research may overestimate the health level of elderly group; second, the logic of choosing the first-order binary-valued neighboring space weight matrix has not been explained in detail when using spatial econometric analysis. Third, only one control variable was selected in difference convergence research, and lacking discussion on regional natural endowments. The influence directions and intensity of some indicators are also not discussed too much, and these issues should be overcome in future research.
Author Contributions: The conceptualization, structure, and flow were designed by C.-T.C., the writing, information collection, data processing, and analysis of the paper were completed by X.C. All authors have read and agreed to the published version of the manuscript.
Funding: This study was not funded by any organization.