Fast Identification of Urban Sprawl Based on K-Means Clustering with Population Density and Local Spatial Entropy

As urban sprawl is proven to jeopardize the sustainability system of cities, the identification of urban sprawl is essential for urban studies. Compared with previous related studies which tend to utilize more and more complicated variables to recognize urban sprawl while still retaining an element of uncertainty, this paper instead proposes a simplified model to identify urban sprawl patterns. This is a working theory which is based on a diagram interpretation of the classic urban spatial structure patterns of the Chicago School. The method used in our study is K-means clustering with gridded population density and local spatial entropy. The results and comparison with open population data and mobile phone data verify the assumption and furthermore indicate that the accuracy of source population data will limit the precision of output identification. This article concludes that urban sprawl is mainly dominated by population and surrounding unevenness. Moreover, the Floating Catchment Area (FCA) local spatial entropy method presented in this research brings about an integration of Shannon entropy, Tobler’s first law of geography and the Moore neighborhood, improving the spatial homogeneity and locality of Batty’s Spatial Entropy model which can only be used in a general scope.


Introduction
Urban spatial expansion is described as a process in which the city encroaches surrounding open space and extends the territory of urban areas, due to its growing population, increasing incomes and decreasing commuting costs [1].As a disfavored type of expansion, urban sprawl refers to excessive spatial growth of cities, which results in negative consequences such as rapid loss of farmland, despoiled ecological system and swelling traffic congestion in both the West and the East [2][3][4][5][6][7][8][9][10], which resulted in a wide range of economic [2], social [3], health [4] and environmental problems [11], jeopardizing the sustainability system of cities and becoming a major concern for urban studies.Urban sprawls have attracted considerable attention from various perspectives, such as its definition, measurement, cause, effect, simulation and solutions [12][13][14][15][16][17][18]; however, it still has no universal definition [19][20][21], which has made it very difficult to quantify and model the sprawl [22].
Most methods for identification of urban sprawl can be categorized as relative and absolute methods [23].Relative measures compare urban sprawl indicators among different times usually with data of population density or satellite map [23][24][25], which is often limited by the resolution of data and faces problems of the definition and recognition of urban built-up areas.Relative measurements allow analysts to conclude whether the study area is sprawling or not, while absolute assessments clearly distinguish compact cities from sprawled ones [26], with spatial metrics of landscape ecology, multiple statistic indices such as employment, resource consumption, population, living quality and architecture aesthetics.Similarly, the lack of a threshold value between harmful sprawl and benign expansion still challenge the outcomes of such methods.Contrary to the uncertainty in such a threshold value, the entropy method presents a clear value of urban sprawl measurement on a general scale [27][28][29][30][31], due to the entropy being able to measure the uneven degree of particle distribution in the system.Bhatta argues that the entropy method is more spatial, robust and static than others [20,23].However, the entropy method cannot provide a spatial result that indicates which area is sprawling.This article hereby argues that the combination of the entropy method and urban sprawl indices presents an ideal way of identifying urban sprawl.
As Ewing claims, they "know it when they see it" [32] or can "identify urban sprawl based on their judgment" [33]-a simple and universally accepted sprawl indicator deserves exploration, one that could be implemented for a wide range of locations and explained straightforwardly to decision makers [34].Although the driving forces underlying urban sprawl differentiate throughout the world [35,36], there still exists well-accepted common sense on spatial features of urban sprawl: leapfrog development, low-density and unlimited outward expansion [8].Phelps and Silva indicate that urban interstices at four geographic scales will influence the measure of urban sprawl, wherein the so-called empty lands are not inert at all [37].Desalvo and Qing argue that the undeveloped land around an average dwelling could be the main determinant of sprawl, based on the testing of the traditional monocentric model in which population density plays a dominant role [38].
Based on the reinterpretation of urban spatial structure models of the Chicago School with gridded population density map, this paper concludes that there exist five combination patterns of spatial population density in Moore neighborhood units, which helps to identify whether the central unit is in a sprawling area.This paper hereby presents a simplified urban sprawl measuring method based on K-means clustering with merely the population density of gridded units and their local spatial entropy which calculates the uneven distribution degree of population density of surrounding units.With testing of the case study in Wuhan, located in China, the result clearly shows the urban expansion pattern of Wuhan with the identification of the sprawling area.
The paper is organized as follows: the first section reviews research on methods of measuring urban sprawl.Section 2 provides the methodology for identifying urban expansion with the population density map and FCA local spatial entropy, followed by the introduction of the case study area and source data in Section 3. Section 4 presents the findings on urban sprawl analysis, further discussion is in Section 5.The final section summarizes the main findings and reflects on the methodology.

Urban Expansion Diagrams of Gridded Population
Batty indicated that urban modeling and planning methods had been evolving with the development of cities [39], while a monocentric city grew into a multicentric city, connected with regional cities, weaving a more complicated system in the global socioeconomic networks.Classical urban spatial structure diagrams were first revealed by the Thünen ring model in Isolated State theory and then by Ernest Burgess 's Concentric zone model [40].More importantly, Alonso's Bid Rent curve theory perfectly explained the monocentric city model with the distance decay pattern of population density away from the city center [41].As the city keeps expanding outward, the Hoyt's Sector Model [42] and Harris and Ullman's Multiple Nuclei Model [43] indicate that the transportation system and emerging subcenters generated sector and multi-nuclei model, which still locally obeyed the distance decay pattern in population density near the expanding core or corridor.According to common sense on urban sprawl, sprawling unit is leapfrog, discontinuous away from the main city with a population density lower than central area but still relatively higher than its surrounding area.Thus, based on the different pattern of population density distribution, the diagram of the three classical urban spatial structure models by the Chicago School and the fourth pattern as urban sprawl could be concluded (Figure 1).In the previous figure of gridded population density, it is obvious that population density plays the key role in urban expansion analysis, which is consistent with what Ewing and Angel claimed [32,33].With a Moore neighborhood perspective, this study focused on the land unit and its neighbor units (NU) which could be categorized into five kinds of population density distribution pattern: central area (CA), edge area (EA), rural area (RA), sprawl area (SA) and inner ecological area (IA) (Figure 2).Rather than merely utilizing population density of land unit as a single variable in plenty of related research, this category took the surrounding condition into account.CA located in the main urban area has relatively higher population density as well as its surrounding units, showing a high evenness in the local neighborhood.Similarly, the RA units possess a same even state but only with lower population density.As SA is defined as a discontinuous unit from the main urban area, it would have a higher population density than its neighbor units, In the previous figure of gridded population density, it is obvious that population density plays the key role in urban expansion analysis, which is consistent with what Ewing and Angel claimed [32,33].With a Moore neighborhood perspective, this study focused on the land unit and its neighbor units (NU) which could be categorized into five kinds of population density distribution pattern: central area (CA), edge area (EA), rural area (RA), sprawl area (SA) and inner ecological area (IA) (Figure 2).Rather than merely utilizing population density of land unit as a single variable in plenty of related research, this category took the surrounding condition into account.In the previous figure of gridded population density, it is obvious that population density plays the key role in urban expansion analysis, which is consistent with what Ewing and Angel claimed [32,33].With a Moore neighborhood perspective, this study focused on the land unit and its neighbor units (NU) which could be categorized into five kinds of population density distribution pattern: central area (CA), edge area (EA), rural area (RA), sprawl area (SA) and inner ecological area (IA) (Figure 2).Rather than merely utilizing population density of land unit as a single variable in plenty of related research, this category took the surrounding condition into account.CA located in the main urban area has relatively higher population density as well as its surrounding units, showing a high evenness in the local neighborhood.Similarly, the RA units possess a same even state but only with lower population density.As SA is defined as a discontinuous unit from the main urban area, it would have a higher population density than its neighbor units, CA located in the main urban area has relatively higher population density as well as its surrounding units, showing a high evenness in the local neighborhood.Similarly, the RA units possess a same even state but only with lower population density.As SA is defined as a discontinuous unit from the main urban area, it would have a higher population density than its neighbor units, showing an obvious unevenness state in its local area.EA and IA show a middle even state comparing to the uneven state of SA and the even state of CA/ RA, wherein the EA units can be seen as the natural expansion of urban core.This article supposed to categorize all the units by their population density and the different evenness state in terms of local situation, which included the unit and surrounding neighbor units.If grouped into 3, the urban area, rural area and expanding area can be recognized, where the expanding area will contain the edge area, sprawl area.While as grouped into 5, all the above units should be further recognized (Table 1).As the population was already acquired in gridded population map, this experiment only need to calculate the local evenness and define the grouping method, which were defined as FCA local spatial entropy and K-means respectively in this study.

FCA Local Spatial Entropy
The concept of entropy was first defined in Thermodynamics by Clausius, describing the energy unable to do work during the transferring procedure of energy.Then Boltzmann used entropy as a measurement of disorder in Thermodynamic statistics, which was introduced into information theory and then defined as uncertainty by Shannon [30].According to the definition of Shannon Entropy, Shannon Entropy can measure the evenness of probability distribution.In a probability set of K (n 1 ,n 2 , . . ., n k ), wherein ∑n i = 1, the evenness of K can be calculated as: H(k) ranges from 0 to LogK, while one of the P i equals 1, H(k) = 0; while all the Pi equals 1/k, H(k) will be the maximum value of LogK.In other words, H(k) is related to the amount of K, so when comparing entropy value among data sets of a different amount, the relative entropy is always utilized and written as: R(k) = − ∑ P i log P i /logK, (2) While applying the entropy method for land units to calculate the local evenness, this study presented a method similar to floating catchment area (FCA) method, searching surrounding area within a threshold distance for land unit m, calculating the Shannon Entropy value of the units inside as the evenness value of the central unit m (Figure 3).
Although Batty introduced entropy theory in space-related studies and defined it as Spatial Entropy [27], spatial entropy only can compute the total evenness of population distribution, which had seldom been used locally.While the First Geographic Law of Tobler pointed out, the nearer the lands exist, the more related their attributes are.Because the FCA method with Shannon entropy applied the spatial correlation rule of Tobler's law and is capable to measure the uncertainty locally, this study defined this method as Local Spatial Entropy.The local spatial entropy for gridded units can be calculated in ArcGIS by three steps: 1) Use FEATURE TO POINT to generate centroids for every unit; 2) Use POINT DISTANCE within a threshold searching radius to generate local matrix for each centroid, ensuring merely including the surrounding 8 centroids; 3) Calculate the local spatial entropy for the central point based on Shannon Entropy (Equations 1 and 2) with the population of all the 9 centroids.

K-means Clustering
Since urban sprawl was supposed to be defined by clustering with population density and local spatial entropy, a handy tool suitable for area classification would be preferred.Although there exist many statistical packages, the commonly used K-means clustering was chosen according to Vickers's comparison on cluster analysis methods [44].K-means clustering is capable to partition units by spatial correlation, aiming to make the differences among the units in a group, overall groups, is minimized [45].Furthermore, the K-means algorithm is suitable for the situation that the number of clusters has already been designated [46] , since the previous assumption has divided the cells into 3 or 5 groups.Besides, the K-means algorithm is distance-based, taking distance as the evaluation index of similarity, that is, the closer the distance between two objects is, the larger the similarity is.The algorithm considers the cluster to be composed of objects that are close together, so the compact and independent cluster is the ultimate target.Considering the division of the data set X into k groups using Euclidean distance, the minimum of Sum of the Squared Error (SSE) is the objective of clustering algorithm: Wherein, k is the number of clusters, x is a set of value in a certain cluster with a mean of Ci .Admittedly, the uncertainty in defining the number of clusters is the main challenge for K-means clustering, if the number is unknown.Normally, the higher Pseudo F-statistic is, the number would be better, however, when the F-Statistic reaches its max value, the algorithm can result in a local optimum rather than a global one.That is to say, with k increasing, the sample partitioning will be more refined, SSE will definitely become smaller.Wherein, when k is smaller than an ideal number of clusters, the SSE will decrease largely, while k is larger than that, SSE will decrease slowly.Such pattern constructs an elbow shape of SSE and k, or F-Statistic and k ( Figure 4), which is also called  1) and ( 2)) with the population of all the 9 centroids.

K-Means Clustering
Since urban sprawl was supposed to be defined by clustering with population density and local spatial entropy, a handy tool suitable for area classification would be preferred.Although there exist many statistical packages, the commonly used K-means clustering was chosen according to Vickers's comparison on cluster analysis methods [44].K-means clustering is capable to partition units by spatial correlation, aiming to make the differences among the units in a group, overall groups, is minimized [45].Furthermore, the K-means algorithm is suitable for the situation that the number of clusters has already been designated [46], since the previous assumption has divided the cells into 3 or 5 groups.Besides, the K-means algorithm is distance-based, taking distance as the evaluation index of similarity, that is, the closer the distance between two objects is, the larger the similarity is.The algorithm considers the cluster to be composed of objects that are close together, so the compact and independent cluster is the ultimate target.Considering the division of the data set X into k groups using Euclidean distance, the minimum of Sum of the Squared Error (SSE) is the objective of clustering algorithm: Wherein, k is the number of clusters, x is a set of value in a certain cluster with a mean of C i .Admittedly, the uncertainty in defining the number of clusters is the main challenge for K-means clustering, if the number is unknown.Normally, the higher Pseudo F-statistic is, the number would be better, however, when the F-Statistic reaches its max value, the algorithm can result in a local optimum rather than a global one.That is to say, with k increasing, the sample partitioning will be more refined, SSE will definitely become smaller.Wherein, when k is smaller than an ideal number of clusters, the SSE will decrease largely, while k is larger than that, SSE will decrease slowly.Such pattern constructs an elbow shape of SSE and k, or F-Statistic and k (Figure 4), which is also called the Elbow method [47].It is an ideal choice to select the k value at elbow point or slightly larger than that, rather than at the maximum value while using F-statistic.the Elbow method [47].It is an ideal choice to select the k value at elbow point or slightly larger than that, rather than at the maximum value while using F-statistic.K-means clustering can be performed in ArcGIS with a Pseudo F-statistics report, wherein the Group Analysis in toolbox has a built-in module of K-means algorithm while setting NO SPATIAL CONSTRAINT for the Spatial Constraints parameter, which makes the process more convenient.

Study Area
The study area is the Wuhan city, located in central China (Figure 5).As the capital of Hubei province and one of the nine National Central Cities of China, Wuhan is the most populous city in Central China.The city boasts abundant mountain and water resources and is divided into the "Three Towns" of Wuchang, Hankow and Hanyang by the Yangtze River and Han river.Limited by the complicated geographic condition, Wuhan keeps expanding along the Yangtze River and the main inter-provincial roads, especially the north mountain area and south lake area extremely affect the expansion.As the same as other Chinese cities [35,36], the urban expansion of Wuhan has been largely driven by government, making more enclaves outside the urban core.

Open Gridded Population Data of GeodataCn
The gridded population datasets in 2010 was obtained from National Earth System Science Data Sharing Infrastructure of China (http://www.geodata.cn),which covered mainland China with a spatial resolution of 1 km, with land use regression model based on census data at the county level and land use data at a scale of 1:100,000 in national scale.Land use data contained urban and rural built-up area, grassland, forest land and etc., in which built up area made an important role in disaggregating population, hence the GeodataCn represented a static combining population density K-means clustering can be performed in ArcGIS with a Pseudo F-statistics report, wherein the Group Analysis in toolbox has a built-in module of K-means algorithm while setting NO SPATIAL CONSTRAINT for the Spatial Constraints parameter, which makes the process more convenient.

Study Area
The study area is the Wuhan city, located in central China (Figure 5).As the capital of Hubei province and one of the nine National Central Cities of China, Wuhan is the most populous city in Central China.The city boasts abundant mountain and water resources and is divided into the "Three Towns" of Wuchang, Hankow and Hanyang by the Yangtze River and Han river.Limited by the complicated geographic condition, Wuhan keeps expanding along the Yangtze River and the main inter-provincial roads, especially the north mountain area and south lake area extremely affect the expansion.As the same as other Chinese cities [35,36], the urban expansion of Wuhan has been largely driven by government, making more enclaves outside the urban core.the Elbow method [47].It is an ideal choice to select the k value at elbow point or slightly larger than that, rather than at the maximum value while using F-statistic.K-means clustering can be performed in ArcGIS with a Pseudo F-statistics report, wherein the Group Analysis in toolbox has a built-in module of K-means algorithm while setting NO SPATIAL CONSTRAINT for the Spatial Constraints parameter, which makes the process more convenient.

Study Area
The study area is the Wuhan city, located in central China (Figure 5).As the capital of Hubei province and one of the nine National Central Cities of China, Wuhan is the most populous city in Central China.The city boasts abundant mountain and water resources and is divided into the "Three Towns" of Wuchang, Hankow and Hanyang by the Yangtze River and Han river.Limited by the complicated geographic condition, Wuhan keeps expanding along the Yangtze River and the main inter-provincial roads, especially the north mountain area and south lake area extremely affect the expansion.As the same as other Chinese cities [35,36], the urban expansion of Wuhan has been largely driven by government, making more enclaves outside the urban core.

Open Gridded Population Data of GeodataCn
The gridded population datasets in 2010 was obtained from National Earth System Science Data Sharing Infrastructure of China (http://www.geodata.cn),which covered mainland China with a spatial resolution of 1 km, with land use regression model based on census data at the county level and land use data at a scale of 1:100,000 in national scale.Land use data contained urban and rural built-up area, grassland, forest land and etc., in which built up area made an important role in disaggregating population, hence the GeodataCn represented a static combining population density

Open Gridded Population Data of GeodataCn
The gridded population datasets in 2010 was obtained from National Earth System Science Data Sharing Infrastructure of China (http://www.geodata.cn),which covered mainland China with a spatial resolution of 1 km, with land use regression model based on census data at the county level and land use data at a scale of 1:100,000 in national scale.Land use data contained urban and rural built-up area, grassland, forest land and etc., in which built up area made an important role in disaggregating population, hence the GeodataCn represented a static combining population density of both work-time and non-worktime.The GeodataCn had been tested having an estimation accuracy of above 50% in a general national scope [48], which may result in uncertainty in a city level.

Gridded Population Generated by Mobile Phone Data
To further verify the assumption, this study took a gridded population data generated by mobile phone data.Data of phone call records used in the present study was provided by a partner telecommunication operator whose market share was about 60%, verified for representing whole population distribution proportionally in Wuhan [49].Mobile phone data of 7,300,000 users in November 2015 in Wuhan City was used in the study.Data was pre-processed, eliminating all privacy-related information.With a similar land use regression model and two-step floating catchment area method (2SFCA), the population of work time was disaggregated to gridded land units, in which the accuracy of the gridded population was verified with an adjusted R 2 of about 80% [50].The mobile phone data in 2015 provided a verification of the urban sprawl pattern of 2010 and a comparison of the different accuracy of data source (Figure 6).Theoretically, the work time population was more capable to help identify the structure of urban expansion, however, the residential area with few populations in the daytime may influence the result of identifying urban sprawl.
Sustainability 2018, 10, x FOR PEER REVIEW 7 of 16 of both work-time and non-worktime.The GeodataCn had been tested having an estimation accuracy of above 50% in a general national scope [48], which may result in uncertainty in a city level.

Gridded Population Generated by Mobile Phone Data
To further verify the assumption, this study took a gridded population data generated by mobile phone data.Data of phone call records used in the present study was provided by a partner telecommunication operator whose market share was about 60%, verified for representing whole population distribution proportionally in Wuhan [49].Mobile phone data of 7,300,000 users in November 2015 in Wuhan City was used in the study.Data was pre-processed, eliminating all privacy-related information.With a similar land use regression model and two-step floating catchment area method (2SFCA), the population of work time was disaggregated to gridded land units, in which the accuracy of the gridded population was verified with an adjusted R 2 of about 80% [50].The mobile phone data in 2015 provided a verification of the urban sprawl pattern of 2010 and a comparison of the different accuracy of data source (Figure 6).Theoretically, the work time population was more capable to help identify the structure of urban expansion, however, the residential area with few populations in the daytime may influence the result of identifying urban sprawl.

Results
Extracted from gridded population map of GeodataCn, the population density map of Wuhan could be easily symbolized, the white area inside the map were units with no population, most of which existed river, lakes and mountains.With the calculation of local spatial entropy base on population, local evenness of grid units could be mapped.However, the units along the border of Wuhan showed obviously lower value than nearby units, because the border units have fewer neighbor units, make the k and LogK smaller.Thus, the relative entropy was further calculated to diminish the influence of k, the number of nearby units.It can be seen in the result mapping that the relative entropy shows a more continuous distribution of evenness value than original Shannon Entropy, especially, the border effect has been eliminated (Figure 7).While applying Group Analysis in ArcGIS with a value of population and local spatial entropy, original population density map was categorized into three groups and five groups (Figure 8) with summary reports (Tables 2 and 3).

Results
Extracted from gridded population map of GeodataCn, the population density map of Wuhan could be easily symbolized, the white area inside the map were units with no population, most of which existed river, lakes and mountains.With the calculation of local spatial entropy base on population, local evenness of grid units could be mapped.However, the units along the border of Wuhan showed obviously lower value than nearby units, because the border units have fewer neighbor units, make the k and LogK smaller.Thus, the relative entropy was further calculated to diminish the influence of k, the number of nearby units.It can be seen in the result mapping that the relative entropy shows a more continuous distribution of evenness value than original Shannon Entropy, especially, the border effect has been eliminated (Figure 7).While applying Group Analysis in ArcGIS with a value of population and local spatial entropy, original population density map was categorized into three groups and five groups (Figure 8) with summary reports (Tables 2 and 3).Table 2 shows that in 3-group clustering, the green units in the left picture of Figure 6 share relative high mean population density of 14278 people/km 2 and high relative entropy of 0.94, reflecting the feature of Central Area.The gray units with a lower mean population of 449 people/km 2 and relative entropy of 0.96 also reflect the feature of Rural Area.Obviously, the red ones are the expanding area and it is not difficult to distinguish the urban sprawl area and edge area from whether it is continuous from the central area, which is aligned to Ewing's claim of "knowing it by seeing it." Table 3 shows that in 5-group clustering, the green and yellow units in the right picture of Figure 6 reflect features of central area with high population and entropy value, which be recognized as urban area and suburban area.The other ones share lower population but with different entropy value, wherein the red ones reflect an expanding pattern of edge area, gray ones reflect the pattern of rural area, the purple ones could be supposed to be in the sprawling area.Table 2 shows that in 3-group clustering, the green units in the left picture of Figure 6 share relative high mean population density of 14278 people/km 2 and high relative entropy of 0.94, reflecting the feature of Central Area.The gray units with a lower mean population of 449 people/km 2 and relative entropy of 0.96 also reflect the feature of Rural Area.Obviously, the red ones are the expanding area and it is not difficult to distinguish the urban sprawl area and edge area from whether it is continuous from the central area, which is aligned to Ewing's claim of "knowing it by seeing it." Table 3 shows that in 5-group clustering, the green and yellow units in the right picture of Figure 6 reflect features of central area with high population and entropy value, which could be recognized as urban area and suburban area.The other ones share lower population but with different entropy value, wherein the red ones reflect an expanding pattern of edge area, the gray ones reflect the pattern of rural area, the purple ones could be supposed to be in the sprawling area.Table 2 shows that in 3-group clustering, the green units in the left picture of Figure 6 share relative high mean population density of 14,278 people/km 2 and high relative entropy of 0.94, reflecting the feature of Central Area.The gray units with a lower mean population of 449 people/km 2 and relative entropy of 0.96 also reflect the feature of Rural Area.Obviously, the red ones are the expanding area and it is not difficult to distinguish the urban sprawl area and edge area from whether it is continuous from the central area, which is aligned to Ewing's claim of "knowing it by seeing it." Table 3 shows that in 5-group clustering, the green and yellow units in the right picture of Figure 6 reflect features of central area with high population and entropy value, which could be recognized as urban area and suburban area.The other ones share lower population but with different entropy value, wherein the red ones reflect an expanding pattern of edge area, the gray ones reflect the pattern of rural area, the purple ones could be supposed to be in the sprawling area.Although Figure 9 indicates that the maximum of group number is 13 while the Pseudo F-Statistic gets the maximum value, according to the Elbow method, the 5-group and 3-group clustering are preferred because 3 is the ideal k value at the elbow point.Meanwhile, 5-group also shows a relative high F-statistic value and R 2 with an overlapped value of media and mean.In conclusion, the 3-group clustering can indicate the expanding area, the 5-group with a higher R 2 of 0.94 and 0.84 can further point out the units in urban sprawl area.
Comparing the grouping result map of Figure 8 and the right image of Figure 6, in other words, the expanding trend recognized in 2010 and the existing population distribution in 2015, it can be concluded that the expanding area near the main core has transferred into central area, while those far away from the center still keeping undeveloped.The comparison confirms that the methodology can be used to identify the pattern of urban sprawl.This study applied the same method to the gridded population 2015 of Wuhan generated by mobile phone data, clustering the land units into 3 and 5 groups (Figure 10) with output summary tables (Tables 4 and 5).Table 4 and Table 5 show the same pattern as Table 2 and Table 3, as the group 1 is located in the rural area, group 4 and 5 represents the central area.The red units in Group 3 indicate the expanding area, while in 5-Group the red units and purple units show the expanding pattern.Interestingly, Figure 11 indicates that the ideal group number is 3, the same as the previous result for the Geodatacn population.This study applied the same method to the gridded population 2015 of Wuhan generated by mobile phone data, clustering the land units into 3 and 5 groups (Figure 10) with output summary tables (Tables 4 and 5).Tables 4 and 5 show the same pattern as Tables 2 and 3, as the group 1 is located in the rural area, group 4 and 5 represents the central area.The red units in Group 3 indicate the expanding area, while in 5-Group the red units and purple units show the expanding pattern.Interestingly, Figure 11 indicates that the ideal group number is 3, the same as the previous result for the Geodatacn population.This study applied the same method to the gridded population 2015 of Wuhan generated by mobile phone data, clustering the land units into 3 and 5 groups (Figure 10) with output summary tables (Tables 4 and 5).Table 4 and Table 5 show the same pattern as Table 2 and Table 3, as the group 1 is located in the rural area, group 4 and 5 represents the central area.The red units in Group 3 indicate the expanding area, while in 5-Group the red units and purple units show the expanding pattern.Interestingly, Figure 11 indicates that the ideal group number is 3, the same as the previous result for the Geodatacn population.To further illustrate the difference and similarity in these two data, a four-quadrant diagram was provided with the mean value of population and local spatial entropy in all the groups (Figure 12).As the clustering number increases from 3 to 5, both kinds of data show three features: relatively stable rural units with high entropy value, subdivision of urban units and expanding units.Derived from the separating of urban units, the suburban units are refined with a relatively large amount of mean population.Meanwhile, the expanding units are divided into two groups, one with lower To further illustrate the difference and similarity in these two data, a four-quadrant diagram was provided with the mean value of population and local spatial entropy in all the groups (Figure 12).As the clustering number increases from 3 to 5, both kinds of data show three features: relatively stable rural units with high entropy value, subdivision of urban units and expanding units.Derived from the separating of urban units, the suburban units are refined with a relatively large amount of mean population.Meanwhile, the expanding units are divided into two groups, one with lower local spatial entropy and larger population, the other with higher entropy and a smaller population, both could be identified as sprawling units in term of different definitions of urban sprawl.In this article, the scattered units are defined as sprawling ones, thus both the east and west side of Wuhan show an obvious sprawling trend, wherein most sprawling units are located along the main roads.Instead, the north side in Wuhan shows a classic Hoyt's sector pattern, a strip or belt expansion.To further illustrate the difference and similarity in these two data, a four-quadrant diagram was provided with the mean value of population and local spatial entropy in all the groups (Figure 12).As the clustering number increases from 3 to 5, both kinds of data show three features: relatively stable rural units with high entropy value, subdivision of urban units and expanding units.Derived from the separating of urban units, the suburban units are refined with a relatively large amount of mean population.Meanwhile, the expanding units are divided into two groups, one with lower local spatial entropy and larger population, the other with higher entropy and a smaller population, both could be identified as sprawling units in term of different definitions of urban sprawl.In this article, the scattered units are defined as sprawling ones, thus both the east and west side of Wuhan show an obvious sprawling trend, wherein most sprawling units are located along the main roads.Instead, the north side in Wuhan shows a classic Hoyt's sector pattern, a strip or belt expansion.A detailed expansion map of Wuhan provided by local planning bureau with the built-up area information in 2010 and 2015 (Figure 13) was compared to further testify the result of the proposed method.Not only most of the strip-growth pattern in the north of Wuhan but also the expansion area in east and south side is overlapped, which show the applicability of the proposed fast identification method.Moreover, the scatter sprawling units detected by this article shows more information about the urban sprawling pattern of Wuhan.Generally, Wuhan is expanding along the Yangtze River, keeping the restraints on the development in the direction of the south and north, where exist ecological barriers of mountains and lakes.As the east side of Wuhan has been planned as a new city with the high-tech industrial park, the sprawling state can be seen as a result of upside down driving force.However, it is noteworthy that the massive sprawling area also emerges in the west direction, where is located in the plains and near Hanyang, which is a city with a famous automobile industrial park.
the Yangtze River, keeping the restraints on the development in the direction of the south and north, where exist ecological barriers of mountains and lakes.As the east side of Wuhan has been planned as a new city with the high-tech industrial park, the sprawling state can be seen as a result of upside down driving force.However, it is noteworthy that the massive sprawling area also emerges in the west direction, where is located in the plains and near Hanyang, which is a city with a famous automobile industrial park.

Discussion
The research we have done suggests that gridded population can be the only source data to identify the pattern of urban sprawl with K-means clustering based on population and local spatial entropy.These findings are understandable because grid units located in different location have the corresponding level of population and unevenness of surrounding situation, wherein urban sprawl area shows a relatively high population and low local spatial entropy, different from other units, making it easy to be identified.
The ideal cluster number of 3 at the Pseudo F-statistics elbow point in both GeodataCn and mobile phone data further verifies that K-means clustering is suitable to identify the expanding land units from the urban and rural area.On the other hand, the coincidence that Pseudo F-statistics reaches its max value at the elbow point in Figure 9 indicates that the higher resolution population data provides stronger support for the method.
These results agree with Ewing [32] and Angel [33]'s statement, urban sprawl pattern can be recognized by seeing it, moreover, the two variables of population and local spatial entropy support what Joseph [38]and Burchfield [51] have pointed out that the determinant of urban sprawl are population and the amount of undeveloped land around an average dwelling.However, the procedure and data utilized in this research are more simple and straightforward than previous

Discussion
The research we have done suggests that gridded population can be the only source data to identify the pattern of urban sprawl with K-means clustering based on population and local spatial entropy.These findings are understandable because grid units located in different location have the corresponding level of population and unevenness of surrounding situation, wherein urban sprawl area shows a relatively high population and low local spatial entropy, different from other units, making it easy to be identified.
The ideal cluster number of 3 at the Pseudo F-statistics elbow point in both GeodataCn and mobile phone data further verifies that K-means clustering is suitable to identify the expanding land units from the urban and rural area.On the other hand, the coincidence that Pseudo F-statistics reaches its max value at the elbow point in Figure 9 indicates that the higher resolution population data provides stronger support for the method.
These results agree with Ewing [32] and Angel [33]'s statement, urban sprawl pattern can be recognized by seeing it, moreover, the two variables of population and local spatial entropy support what Joseph [38] and Burchfield [51] have pointed out that the determinant of urban sprawl are population and the amount of undeveloped land around an average dwelling.However, the procedure and data utilized in this research are more simple and straightforward than previous studies.This approach corresponds to the classical urban spatial structure theory of the Chicago School and is highly consistent with Alonso-Muth-Mills' work, which explained the importance of population density in identifying urban structures [52].On the other hand, the interstices in the scattered sprawling units verify the pending feature of undeveloped land in urban expansion [37], which should be further explored.
Based on the comparison of output result by open population data and mobile phone data, the finding of this study is restricted to the resolution and accuracy of the data source.Moreover, the FCA local spatial entropy method applied in this research only considers the surrounding 8 units, further study of which threshold distance should be selected is needed.

Conclusions
This study suggests the identification of urban sprawl based on K-means clustering with the merely gridded population and its local spatial entropy, wherein the Elbow method is used to verify the group number.Based on the result of the experiments, such an assumption is verified, which further indicates that the role of population density deserves more exploration.The other important contribution of this study is the application of FCA local spatial entropy, which combines Shannon entropy, Tobler's first law of geography and Moore neighborhood, focusing on the measurement of local spatial unevenness.
The present study focuses on the measurement based on human activities, other than physical environment emphasized by most remote sensing method, which may result in a malfunction in identifying the typical sprawling pattern of Ghost towns [53], where few populations live in newly developed cities.Another limitation is that the accuracy of population data which may influence the discrimination of urban sprawl units, however, with the quick development of urban data environment, population data will be more precise, real-time, thus the method of fast identification for urban sprawling would help government and planners makes quicker and more flexible decisions.In prospect studies, comparison among cities or sprawling areas within the same city-region could be conducted, exploring the driving force underlying urban sprawl.

Figure 2 .
Figure 2. Population Density Distribution Pattern of Land Unit and Neighbor Units.

Figure 1 .
Figure 1.Urban expansion diagrams based on gridded population distribution.

Figure 2 .
Figure 2. Population Density Distribution Pattern of Land Unit and Neighbor Units.

Figure 2 .
Figure 2. Population Density Distribution Pattern of Land Unit and Neighbor Units.

Figure 4 .
Figure 4. Elbow Method for determining the number of clusters.

Figure 5 .
Figure 5. Location of Wuhan, Hubei province in China.

Figure 4 .
Figure 4. Elbow Method for determining the number of clusters.

Figure 4 .
Figure 4. Elbow Method for determining the number of clusters.

Figure 5 .
Figure 5. Location of Wuhan, Hubei province in China.

Figure 5 .
Figure 5. Location of Wuhan, Hubei province in China.

Figure 8 .
Figure 8. K-means Clustering in Group 3 and Group 5 of Population 2010.

Figure 8 .
Figure 8. K-means Clustering in Group 3 and Group 5 of Population 2010.

Figure 8 .
Figure 8. K-means Clustering in Group 3 and Group 5 of Population 2010.

Figure 9 .
Figure 9. Pseudo F-Statistic Plot of Different Groups.

Figure 10 .
Figure 10.K-means Clustering in Group 3 and Group 5 of population 2015.

Figure 9 .
Figure 9. Pseudo F-Statistic Plot of Different Groups.

Figure 10 .
Figure 10.K-means Clustering in Group 3 and Group 5 of population 2015.Figure 10.K-means Clustering in Group 3 and Group 5 of population 2015.

Figure 10 .
Figure 10.K-means Clustering in Group 3 and Group 5 of population 2015.Figure 10.K-means Clustering in Group 3 and Group 5 of population 2015.

Figure 11 .
Figure 11.Pseudo F-Statistic Plot of Different Groups.

Figure 11 .
Figure 11.Pseudo F-Statistic Plot of Different Groups.

Figure 11 .
Figure 11.Pseudo F-Statistic Plot of Different Groups.

Figure 12 .
Figure 12.Mean value of Population and Local Spatial Entropy.

Figure 12 .
Figure 12.Mean value of Population and Local Spatial Entropy.

Figure 13 .
Figure 13.Comparison with Built-up area changes.

Figure 13 .
Figure 13.Comparison with Built-up area changes.
Author Contributions: L.L. and H.W. conceived and designed the experiments; L.L. performed the experiments; Z.P. and J.Z. acquired and analyzed the data; Y.Y. and H.J. contributed reagents/materials/analysis tools; H.W. and L.L. wrote the paper.Funding: The research is funded by China Postdoctoral Science Foundation (No. 2016M600609); National Science Fund for Young Scholars (No. 51708425); and China Postdoctoral Science Foundation (No. 2016M602357).

Table 1 .
Groups with Population Density and Local Evenness.

Table 2 .
Variable-Wise Summary of Group 3 with K-means Algorithm.

Table 3 .
Variable-Wise Summary of Group 5 with K-means Algorithm.

Table 4 .
Variable-Wise Summary of Group 3 with K-means Algorithm.

Table 5 .
Variable-Wise Summary of Group 5 with K-means Algorithm.