A Spatial-Territorial Reorganization Model of Rural Settlements Based on Graph Theory and Genetic Optimization

Rural China has experienced rapid urbanization and industrialization, accompanied with rural–urban migration since 1978. This tremendous transition has caused a series of negative consequences, necessitating a spatial-territorial reorganization of rural settlements. Previous studies on the restructuring of rural settlements are insufficient for inter-settlement connection consideration and practical and dynamic decision-making techniques. To overcome these concerns, a dynamic spatial-territorial reorganization model (SRM) of rural settlement is proposed herein based on graph theory and genetic algorithm (GA). The model involves two parts. In Part 1, consolidated settlements are identified according to the socio-economic network performance under four types of attack. In Part 2, GA model is repeatedly executed to scientifically resettle consolidated settlements into nearby townships or central settlements with objectives of suitability, compactness, and local connectivity under the control of the constraints. This paper presents an application of SRM to Chengui Town, Hubei Province. Empirical results suggest that: (1) removing settlements in order of node degree is the least efficient way to destroy the entire functional system; and (2) the proposed model can yield satisfactory solutions in terms of spatial reorganization of settlements. The SRM may also serve as a valuable reference for planners in devising plans and making decisions.


Introduction
Since 1978, rural China has experienced rapid urbanization and industrialization, accompanied with rural-urban migration [1][2][3].Owing to this tremendous transition, in China a unique rural settlement morphology has emerged; that is, hollowed villages [4].The "hollowed villages" is a phenomenon of depopulation leading to abandonment of buildings and land in rural communities, due to the dual-track structure of rural-urban development (i.e., urban land is state owned, whereas farmland is collectively owned) and the restriction of hukou (i.e., household registration system) [4,5].Such phenomenon has caused several negative consequences, such as the weakening function of critical rural organizations, the fragility of structures and networks, the chaotic flow of rural development elements, and the lack of economies of scale and output efficiency [5,6].Hence, the current layout of China's rural settlements, which can be described as "scattered, massy, small, and hollowed," demands emergency measures.For example, the consolidation of the hollowed villages primarily aims to promote the spatial-territorial reorganization, which is accompanied with administrative reorganization in rural restructuring [1,4].This type of rural restructuring encourages concentrating the rural population in communities or central settlements and merging settlements [1,4].Nevertheless, the majority of regional studies are concerned with urban studies or theoretical research on rural settlements (e.g., policy analysis, strategic decision and planning) [1,3,[5][6][7].Existing studies on rural settlement restructuring remain insufficient in terms of two aspects: consideration of inter-settlement connections and absence of dynamic and practical decision-making techniques.To address these research gaps, we propose a dynamic spatial-territorial reorganization model (SRM) of rural settlements based on graph theory and genetic optimization.
The traditional village system consists of relatively independent villages or rural settlements [7].Recently, inter-settlement interactive scope and content are expanding with population as well as social, economic, and traffic flows [7][8][9].The spatial-territorial reorganization or restructuring of rural settlements (involving settlement removal and incorporation) should consider inter-settlement connections [10].Moreover, this consideration should entail two aspects.First, consolidated settlements should exert the least effect on the entire village system to maximally maintain the functionality of system and the stability of the villagers' life when they are removed.Second, consolidated settlements should be relocated to adjacent high-related central settlements to reduce the separation between villagers after consolidation.In response to these requirements, the robustness of network and local connectivity are introduced in this paper.Similar to other complex systems, inter-settlement connection system can be modeled as a network, in which settlements are nodes and interactions and activities among settlements are edges [9,11,12].The robustness of inter-settlement network refers to its ability to maintain the functionality under attacks or failures [11].Accordingly, settlement importance can be evaluated based on how much the removal of the node "disrupts" the graph structure [11,13,14].Specifically, if removing a settlement exerts no noticeable effect on the network structure, the settlement is a good option to serve as a settlement to be absorbed or relocated.Local connectivity represents the frequency of all types of connections and activities (working, visiting, shopping, and entertainment) from consolidated settlements to central settlements.Local connectivity acts as an important objective in the SRM to search for high-related relocated settlements in the following sections.This paper provides a system perspective to realize the reorganization of rural settlements rather than a simple individual analysis [6,10].
Existing studies on rural settlement restructuring mainly concern theoretical research, policy analysis, and macro and static planning [1,3,[5][6][7].Practical and dynamic decision-making techniques are in demand to scientifically realize reorganization and optimization.Several decision-making techniques have been proposed for land-use planning [15][16][17].In particular, most of them make use of linear programming when a single clear objective or even multi-objective problems can be identified [17].Although the linear programming models can quickly lead to optimal solutions [16,17], they cannot cope with large combinatorial optimization problems within reasonable time [15,18,19], incommensurable and/or conflicting objectives [19], and spatial optimization [16,20].To overcome these concerns, various heuristic algorithms have been developed, such as simulated annealing algorithm [15,21], particle swarm algorithm [22], and genetic algorithm (GA).GA, as introduced by Holland [23] and described in detail by Goldberg [24], optimizes by mimicking the genetic procedures of natural selection and reproduction observed in populations for adaptation and survival.As one of the most robust heuristics [25], GA has been applied to provide optimization solutions for different spatial optimization problems, such as land-use planning [16,18,25,26], optimal location search [15], forestry management [19], urban planning [27], and water allocation planning [28], and confirmed effective.To our knowledge, such an approach has rarely been used in the research on the reorganization and optimization of rural settlements.This paper therefore presents a GA to dynamically realize the reorganization of rural settlements.
Combining the above two points, this paper provides a spatial-territorial modeling technology of rural settlements based on graph theory and GA.The SRM is expected to solve the existing land use problems of rural settlements (e.g., scattered, massy, small, and hollowed).This approach may serve as a valuable reference for planners in devising plans and making decisions.Section 2 describes the spatial-territorial reorganization in detail.Section 3 provides the details of the SRM.Section 4 introduces the study area and relevant data.Section 5 describes and analyzes the results, and the final part gives conclusions.

Spatial-Territorial Reorganization of Rural Settlements
Spatial-territorial reorganization is a type of rural restructuring in which villagers are encouraged to relocate in communities or central settlements and incorporate settlements [1].Spatial-territorial reorganization aims to rejuvenate dispersed, abandoned, and idle rural settlements to improve the effectiveness and efficiency of local governance in rural communities and restructure suitable living space [4].Long et al. [4] reported three modes of spatial-territorial reorganization depending on geographical context.These modes are: (1) settlement to city, to incorporate urban and peri-urban settlements to cities; (2) settlement to township, to resettle villagers to nearby small towns; and (3) settlement to settlement, to relocate residents from scattered settlements to central settlements (Figure 1).Accordingly, the problem of spatial-territorial reorganization involves two main parts: identification of settlements to be relocated (i.e., which settlements need consolidation) and consolidated settlement relocation (i.e., where to reallocate these settlements).Section 4 introduces the study area and relevant data.Section 5 describes and analyzes the results, and the final part gives conclusions.

Spatial-Territorial Reorganization of Rural Settlements
Spatial-territorial reorganization is a type of rural restructuring in which villagers are encouraged to relocate in communities or central settlements and incorporate settlements [1].Spatial-territorial reorganization aims to rejuvenate dispersed, abandoned, and idle rural settlements to improve the effectiveness and efficiency of local governance in rural communities and restructure suitable living space [4].Long et al. [4] reported three modes of spatial-territorial reorganization depending on geographical context.These modes are: (1) settlement to city, to incorporate urban and peri-urban settlements to cities; (2) settlement to township, to resettle villagers to nearby small towns; and (3) settlement to settlement, to relocate residents from scattered settlements to central settlements (Figure 1).Accordingly, the problem of spatial-territorial reorganization involves two main parts: identification of settlements to be relocated (i.e., which settlements need consolidation) and consolidated settlement relocation (i.e., where to reallocate these settlements).

Specifications of SRM of Rural Settlements
SRM is a modeling technology for generating compact and contiguous settlement pattern based on graph theory and GA.The model involves two parts (Figure 2).In Part 1, consolidated settlements are identified according to the network performance under attack.In Part 2, GA model, equipped with objectives of suitability (S), compactness (C), and local connectivity (L) under the control of the constraints, is repeatedly executed to scientifically resettle consolidated settlements into cities, nearby townships, or central settlements.

Specifications of SRM of Rural Settlements
SRM is a modeling technology for generating compact and contiguous settlement pattern based on graph theory and GA.The model involves two parts (Figure 2).In Part 1, consolidated settlements are identified according to the network performance under attack.In Part 2, GA model, equipped with objectives of suitability (S), compactness (C), and local connectivity (L) under the control of the constraints, is repeatedly executed to scientifically resettle consolidated settlements into cities, nearby townships, or central settlements.

Consolidated Settlement Identification
To identify consolidated settlements with the least effect on the village system, we generate a series of experiments to investigate the network performance under numerous removals of settlements.First, we design a certain order of settlements.Next, we successively remove settlements following the corresponding strategy order until the network only has one settlement.Finally, the reactions and divergences of the network after each attack are recorded.If the divergence increases slowly, then the corresponding strategy is treated as a reliable node-importance measure.

Node-Removal Strategies
We consider two types of hypothetical node-removal strategies: random and targeted attacks.For targeted strategies, the two most widely used node-importance characters, namely, degree and betweenness, are chosen in this paper [11].We also use the single node-importance order evaluated by single attack as our fourth strategy to figure out the relationship between single and successive attacks.
Random.No strategy; we randomly choose attacked settlements.Minimum-degree first.We remove nodes in increasing in-degree order.Degree is defined as the number of connections of a given node [12,14].A high degree demonstrates that the node connects to more nodes [11].Boldi, Rosa and Vigna [14] reported that this strategy is a baseline as the degree is the first shot at centrality in a network.
Minimum-betweenness first.We remove node in increasing betweenness order.The betweenness index is computed by identifying the shortest paths linking pairs of nodes and counting the number of times these paths cross each Freeman node [11,12].Betweenness is a type of global factor [11,12].
Minimum single node-removal importance first.This strategy includes two steps.First, we randomly remove a settlement at each step and obtain the node-removal importance rank according to the record after single attack.Then, we successively remove settlements in increasing single node-removal importance order until the network only has one settlement.

Measures of Divergence
Previous literature often used the diameter or analogous measures to establish whether the network structure has significantly changed after deleting some nodes [13,14,29].In this paper, we choose six widely used measures to evaluate the network structure: the number of directly reachable pairs (RP), the number of indirectly reachable pairs (IRP), the average length of the shortest paths (l), the cluster coefficient (C), the number of components (NC), and the relative size of the largest

Consolidated Settlement Identification
To identify consolidated settlements with the least effect on the village system, we generate a series of experiments to investigate the network performance under numerous removals of settlements.First, we design a certain order of settlements.Next, we successively remove settlements following the corresponding strategy order until the network only has one settlement.Finally, the reactions and divergences of the network after each attack are recorded.If the divergence increases slowly, then the corresponding strategy is treated as a reliable node-importance measure.

Node-Removal Strategies
We consider two types of hypothetical node-removal strategies: random and targeted attacks.For targeted strategies, the two most widely used node-importance characters, namely, degree and betweenness, are chosen in this paper [11].We also use the single node-importance order evaluated by single attack as our fourth strategy to figure out the relationship between single and successive attacks.
Random.No strategy; we randomly choose attacked settlements.Minimum-degree first.We remove nodes in increasing in-degree order.Degree is defined as the number of connections of a given node [12,14].A high degree demonstrates that the node connects to more nodes [11].Boldi, Rosa and Vigna [14] reported that this strategy is a baseline as the degree is the first shot at centrality in a network.
Minimum-betweenness first.We remove node in increasing betweenness order.The betweenness index is computed by identifying the shortest paths linking pairs of nodes and counting the number of times these paths cross each Freeman node [11,12].Betweenness is a type of global factor [11,12].
Minimum single node-removal importance first.This strategy includes two steps.First, we randomly remove a settlement at each step and obtain the node-removal importance rank according to the record after single attack.Then, we successively remove settlements in increasing single node-removal importance order until the network only has one settlement.

Measures of Divergence
Previous literature often used the diameter or analogous measures to establish whether the network structure has significantly changed after deleting some nodes [13,14,29].In this paper, we choose six widely used measures to evaluate the network structure: the number of directly reachable pairs (RP), the number of indirectly reachable pairs (IRP), the average length of the shortest paths (l), the cluster coefficient (C), the number of components (NC), and the relative size of the largest connected component (S).Boldi, Rosa and Vigna [14] believed that the RP and IRP are the most immediate global features that are computationally approachable.To evaluate the efficiency behavior of network under attack from both global and local viewpoints, l and C are used [11].NC and S are the most efficient means to measure network fragmentation under failures [11].
The number of directly reachable pairs.It is the number of settlement pairs <x, y> which have a direct path from settlement x to y [14].
The number of indirectly reachable pairs.It is the number of pairs <x, y> which have a path from settlement x to y through other settlements.
The average length of the shortest paths.It measures the separation between settlements in a network.
where n is the number of settlements, and d(v, w) denotes the length of the shortest path between settlements v and w.The cluster coefficient.It is defined as the probability that two settlements are directly connected to a third settlement and each other.
where n is the number of settlements, and where N i is the set that contains all neighbors of settlement v, and k v is the size of set N i .Thus, k v (k v − 1)/2 is the maximal number of edges between k v settlements, and |e uw | is the number of edges existing in the network between all k v neighbor of settlement v. E is all edges existing in the network.A higher C indicates a more clustered neighborhood [12].
The number of components.Components are sub-graphs that are connected within but disconnected among sub-graphs [30].This measure is a helpful tool to reveal the intermediate scales of network organization [12].
The relative size of the largest connected component.S is defined as: where n s is the number of settlements in the largest connected component, and n is the number of settlements in the initial network.
Once the metrics that can reflect the network structure are determined, the divergence of these metrics can be evaluated by relative change r [14,29].
where x original is the metric value in the original network, and x new is the metric value after node removal.

Chromosome Representation
The most common representation in land-use optimization is the vector and grid chromosome [27].To effectively represent the land use and conveniently manipulate the land units, we select the grid chromosome.The problem of spatial-territorial reorganization of rural settlements can be defined as a search for the most suitable central settlements for consolidated settlements within a spatial dimension (M × N grids).Accordingly, we introduce the concept of "source" and "sink" to the genotype representation (Figure 3).The "source" represents consolidated settlements, and the "sink" denotes central settlements.All grid cells of source have a value of 1; all sink cells have a value of 2; and others are 0. All settlement patches are encoded.A patch is a set of cells that are allocated to the same land-use type [16].Grid cells in a settlement patch share the same number.The chromosome has m genes, and m is the number of sources.Each gene represents a relocation strategy for a source.
In rural China, farmers are limited to selling their poor-returning farmland due to the restrictions on the sale of property and ambiguous property rights [4,31].To protect the farmland, we apply migration radius d during reorganization.The central grid cell of each source can be calculated by Equations ( 6) and (7).The final migration zone of each source can be identified by four values: I − d, I + d, J − d, and J + d (Figure 3a).The migration radius d will increase in multiples if no central settlement exists in the current migration region.The central settlements within the migration region form an alternative solution data set S of each source (e.g., the S of source 3 has two alternatives: 1 and 4).Each source randomly chooses a sink from S as initialization value.Once all solutions are founded, sources will be incorporated to corresponding sinks with a scale coefficient α (0 < α < 1, the new area A 2 of source equals the original area A 1 multiplied by α) (Figure 3b).
where I and J are the row and column number of central grid cells, respectively; N is the number of grid cells in a source; i n is the row number of grid cell n; and j n is the column number of grid cell n.
Sustainability 2017, 9, 1370 6 of 18 grid chromosome.The problem of spatial-territorial reorganization of rural settlements can be defined as a search for the most suitable central settlements for consolidated settlements within a spatial dimension (M × N grids).Accordingly, we introduce the concept of "source" and "sink" to the genotype representation (Figure 3).The "source" represents consolidated settlements, and the "sink" denotes central settlements.All grid cells of source have a value of 1; all sink cells have a value of 2; and others are 0. All settlement patches are encoded.A patch is a set of cells that are allocated to the same land-use type [16].Grid cells in a settlement patch share the same number.The chromosome has m genes, and m is the number of sources.Each gene represents a relocation strategy for a source.In rural China, farmers are limited to selling their poor-returning farmland due to the restrictions on the sale of property and ambiguous property rights [4,31].To protect the farmland, we apply migration radius d during reorganization.The central grid cell of each source can be calculated by Equations ( 6) and (7).The final migration zone of each source can be identified by four values: I − d, I + d, J − d, and J + d (Figure 3a).The migration radius d will increase in multiples if no central settlement exists in the current migration region.The central settlements within the migration region form an alternative solution data set S of each source (e.g., the S of source 3 has two alternatives: 1 and 4).Each source randomly chooses a sink from S as initialization value.Once all solutions are founded, sources will be incorporated to corresponding sinks with a scale coefficient α (0 < α < 1, the new area A2 of source equals the original area A1 multiplied by α) (Figure 3b).
where I and J are the row and column number of central grid cells, respectively; N is the number of grid cells in a source; in is the row number of grid cell n; and jn is the column number of grid cell n.

Objectives
Suitability S, compactness C, and local connectivity L are the objectives considered in the SRM.Improving the settlement suitability is conducive to the rational use of land resource [16,32].Compactness is a basic desirable feature in land use management [28].The landscape shape index is chosen as the measure for land-use compactness [16].The objective of local connectivity is an effective way to comprehensively consider social and economic connections between source settlements and

Objectives
Suitability S, compactness C, and local connectivity L are the objectives considered in the SRM.Improving the settlement suitability is conducive to the rational use of land resource [16,32].Compactness is a basic desirable feature in land use management [28].The landscape shape index is chosen as the measure for land-use compactness [16].The objective of local connectivity is an effective way to comprehensively consider social and economic connections between source settlements and sinks in the relocation.Local connectivity represents the frequency of all types of activities (working, visiting, shopping, and entertainment) from consolidated settlements to central settlements.This objective is based on the idea that high-related relocation (i.e., relocate sources to adjacent high-related central settlements) will be helpful for the integration of villagers between sources and sinks after resettlement (Figure 3b).To satisfy these objectives, we integrate the three objectives by a weighted sum.The performance of each solution or chromosome is assessed by the fitness function, as shown in Equation (11).The values of three objectives are normalized by Equation (12).
Maximize : where s ij is the suitability of rural settlements in the cell indexed by i and j. u ij is a binary-state variable that is 1 if the cell indexed by i and j is located in rural settlement; otherwise, the value is 0. In Equation ( 9), H is the number of rural settlement patches in the chromosome, P h is the perimeter of a patch, and A h is the area of a patch.A small value of C indicates the chromosomes with compact spatial pattern of settlements.In Equation (10), L is defined as the sum of the weights of all edges from source nodes to sink nodes in a network after processing the SRM.T is the number of source patches, l t is the local connectivity value from source t to corresponding sink after consolidation or relocation.f s norm (S), f c norm (C) and f L norm (L) are normalized variables regarding objective S, objective C, and objective L, and w 1, w 2 and w 3 are the weights of these variables.These weights represent the preference of each variables [15], and they should satisfy the following constraints: (1) w 1 + w 2 + w 3 = 1; and (2) 0 ≤ w 1 ≤ 1, 0 ≤ w 2 ≤ 1, and 0 ≤ w 3 ≤ 1. S max and L max are the ideal values for objective S and objective L, and S min , C min and L min are the worst values for each objective.This paper only uses the worst value C min to normalize the objective C due to that it is challenging to estimate the ideal value of objective C (see the study of Liu et al. [16] for details).

Constraints
Normally the constraints can be divided into two types: (1) area constraints; and (2) land-use constraints [16].The area constraints aim to macroscopically maintain a rational land use structure in optimization, such as the maximum and minimum number of cells for different land uses [16].The land-use constraints restrict land-use conversion within specific grid cells.Here, we focus primarily on the land-use constraints.Especially, in SRM, land in basic farmland zones, reserved green open spaces, or other high-cost conversion areas (e.g., water, industrial and mining land) should restrict the conversion to settlements.Moreover, each cell can only have one land use type [28].Within the model, the land-use constraints are considered by a restricted layer.In the restricted layer, the restricted land is encoded as 1 and the other land is 0. Only the land with code 0 can be converted to settlements in the process of resettlement of sources.

Genetic Operators
Roulette-wheel technique is applied as the selection operator in our SRM [33].A single-point crossover with random point-selection is implemented [27].Specifically, we randomly cut two parent chromosomes right after the crossover gene and exchange the genes following the crossover gene with the crossover probability P c [32].The "mutation" operator changes each one of the genes with a probability P m , i.e., the gene g(i), to be mutated, loses its current value, and receives a random value from data set S depending on the part of the chromosome to which it belongs.

Study Area
Chengui Town (114 • 43 to 114 • 49 E, 30 • 03 to 35 • 30 N) is located in the Hubei Province, middle of China (Figure 4).In 2014, the town has a total area of 160.40 km 2 and a population of 66,185 people.Chengui has pioneered the province in economic development.Its administrative hierarchy is town (township) and village, and the corresponding settlements are market town and rural settlements.Land-use data in 2014 in vector format were obtained from local government through Huangshi's land consolidation planning project.One township (Chengui township) and 335 rural settlements patches were identified in the vector map (Figure 4).The total area of rural settlement reaches 1073.55 hm 2 , and it is inhabited by a rural population of 60,727.Per capita rural settlement land is approximately 176.78 m 2 , which is 26.78 m 2 above the national standard (defined by the National Village and Township Planning Code).The land-use data were converted to a 1225 × 1797 grid with a resolution of 10 × 10 m.
layer, the restricted land is encoded as 1 and the other land is 0. Only the land with code 0 can be converted to settlements in the process of resettlement of sources.

Genetic Operators
Roulette-wheel technique is applied as the selection operator in our SRM [33].A single-point crossover with random point-selection is implemented [27].Specifically, we randomly cut two parent chromosomes right after the crossover gene and exchange the genes following the crossover gene with the crossover probability Pc [32].The "mutation" operator changes each one of the genes with a probability Pm, i.e., the gene g(i), to be mutated, loses its current value, and receives a random value from data set S depending on the part of the chromosome to which it belongs.

Study Area
Chengui Town (114°43′ to 114°49′ E, 30°03′ to 35°30′ N) is located in the Hubei Province, middle of China (Figure 4).In 2014, the town has a total area of 160.40 km 2 and a population of 66,185 people.Chengui has pioneered the province in economic development.Its administrative hierarchy is town (township) and village, and the corresponding settlements are market town and rural settlements.Land-use data in 2014 in vector format were obtained from local government through Huangshi's land consolidation planning project.One township (Chengui township) and 335 rural settlements patches were identified in the vector map (Figure 4).The total area of rural settlement reaches 1073.55 hm 2 , and it is inhabited by a rural population of 60,727.Per capita rural settlement land is approximately 176.78 m 2 , which is 26.78 m 2 above the national standard (defined by the National Village and Township Planning Code).The land-use data were converted to a 1225 × 1797 grid with a resolution of 10 × 10 m.

Data Resources
Information about socio-economic ties among 336 settlements came from face-to-face interviews within Chengui in autumn 2014.In each settlement, selected questions were asked about the relationships between settlements.For instance, "How many people and households live in this settlement?","Where do you go to work, attend school, recreate, visit or something else?", and "How many times do you go to that settlement or place in a year?"Based on these questions, we could identify whether villagers tended to move from the responding settlements to other settlements and, if so, how often.During the survey, ties with eight settlements outside of Chengui were also noted.A total of 2085 respondents (3.15% of all population) were survey using a random sampling method.The total number of valid questionnaires was 2056, with 55 respondents in Chengui township.Overall, 2001 respondents were selected in the 335 rural settlements.The average number of respondents in a rural settlement was six, with a sample density (the proportion of respondents to total rural population) of 3.30%.During the survey, we tried to categorize relationships according to different activity types (working, visiting, shopping, and entertainment).Unfortunately, only 454 questionnaires covered this.Therefore, socio-economic ties represented all types of activities among settlements.A given pair of settlements had multiple frequency values due to more than 1 questionnaire in a settlement.To address this, we used mean values to represent the flows among settlements.Then, we standardized the frequency of ties by dividing it by 365.The final values were on a scale of 0-1 (daily = 1, no directed edge connecting = 0).Finally, a weighted adjacency matrix A was constructed.The weights w i described the frequency of socio-economic activities between settlements i and j.We then imported the matrix as a data layer into ARCGIS 10.2 (Figure 5).The lines with arrows indicate the presence of frequent socio-economic interactions between settlements.

Data Resources
Information about socio-economic ties among 336 settlements came from face-to-face interviews within Chengui in autumn 2014.In each settlement, selected questions were asked about the relationships between settlements.For instance, "How many people and households live in this settlement?","Where do you go to work, attend school, recreate, visit or something else?", and "How many times do you go to that settlement or place in a year?"Based on these questions, we could identify whether villagers tended to move from the responding settlements to other settlements and, if so, how often.During the survey, ties with eight settlements outside of Chengui were also noted.A total of 2085 respondents (3.15% of all population) were survey using a random sampling method.The total number of valid questionnaires was 2056, with 55 respondents in Chengui township.Overall, 2001 respondents were selected in the 335 rural settlements.The average number of respondents in a rural settlement was six, with a sample density (the proportion of respondents to total rural population) of 3.30%.During the survey, we tried to categorize relationships according to different activity types (working, visiting, shopping, and entertainment).Unfortunately, only 454 questionnaires covered this.Therefore, socio-economic ties represented all types of activities among settlements.A given pair of settlements had multiple frequency values due to more than 1 questionnaire in a settlement.To address this, we used mean values to represent the flows among settlements.Then, we standardized the frequency of ties by dividing it by 365.The final values were on a scale of 0-1 (daily = 1, no directed edge connecting = 0).Finally, a weighted adjacency matrix A was constructed.The weights wi described the frequency of socio-economic activities between settlements i and j.We then imported the matrix as a data layer into ARCGIS 10.2 (Figure 5).The lines with arrows indicate the presence of frequent socio-economic interactions between settlements.

Consolidated Settlement Identification
We initially measured settlement importance based on the performances of network under different failures.Through this approach, we can find satisfied consolidated settlements with the least effect on the entire village system.This part involves two steps: evaluating the network structure changes under single removal and assessing network reactions under four successive attacks.The result of single attack is shown in Table 1.The single strategy provides little guidance on the identification due to the minor changes in the network structure.Note that IRP suffers relatively dramatic changes.We suppose that the fluctuation is largely due to the exaggeration and overlap of the damage according to the definition of IRP.
Figure 6 shows the structural changes of socio-economic network in function of the number of removed settlements under four successive attacks.For ease of interpretation, we also calculated relative change r of RP, IRP, L, and C when half of settlements were removed (we did not report the relative change r of NC and S because these metrics concentrate on the final network structure rather than the relative change) (Table 2).According to Figure 6 and Table 2, our major findings are as follows.First, the network is relatively stable against degree-based and betweenness-based attacks but fragile to random and single node-removal importance attacks.In detail, the random strategy and single node-removal importance strategy disconnect nearly half of direct (r(RP random ) = 44%, r(RP Single ) = 47%) or 70% of indirect pairs (r(IRP random ) = 76%, r(IRP sin gle ) = 71%) of network by removing 50% of settlements, whereas two centrality-based strategies disconnect only roughly 30% of direct (r(RP degree ) = 26%, r(RP betweenness ) = 51%) or 50% of indirect pairs (r(IRP degree ) = 41%, r(IRP betweenness ) = 49%).As for L and C, the results of all strategies remain stable even when 30% of settlements are removed.When f gets larger (i.e., half of settlements are removed), two centrality-based strategies show higher stability especially for betweenness-based strategy (r(L random ) = 15.87%,r L sin gle = 16.38%,r L degree = 11.58%,r(L betweenness ) = 8.06%, r(C random ) = −21.95%,r(C sin gle ) = −3.03%,r(C degree ) = −13.99%,r(C betweenness ) = 15.17%).Regarding NC, centrality-based strategies consistently attack isolated settlements or small components, whereas random and single node-removal importance strategies continually damage large components.This observation is based on the finding that the network under centrality-based failures keeps two large components even when 28 settlements are removed, whereas the NC of random and single node-removal importance strategies changes all the time.As for S, centrality-based strategies can also achieve enhanced results (S random = 42.44%,S sin gle = 41.28%,S degree = 49.71%,S betweenness = 50.19%by removing half of settlements).Accordingly, centrality-based strategies can efficiently identify structurally unimportant or important settlements in a socio-eonomic network.The low efficiency of single node-removal importance strategy suggests that the removal of multiple nodes is a complex problem rather than a simple effect superposition of single node-removal.The second important observation is that two centrality-based strategies are similar and diveregent with each other.On one hand, both of them show high efficient to maintain the functionality of the network under attacks.The rank provided by degree is highly correlated to betweenness rank in our empirical data (r = 0.77).On the other hand, degree-based strategy is good at maintaining the accessibility and connectivity of network (RP and IRP), whereas betweenness-based strategy shows high potential in the efficiency behavior of network (L and C).The abnormal situation in L and C (i.e., significantly decrease in L and increase in C) using random, single node-removal, or degree-based strategies is largely caused by the separation and fragmentation of network (i.e., the network splits into several clusters).Considering the good performance on the basic raw datum (i.e., RP and IRP) and the stability of L and C in the first half of f, the degree centrality is chosen as the reliable node-removal importance measure in our settlement study.Apart from the robustness of network, we also considered four traditional factors: (a) suitability (on a scale of 1-5); (b) dynamic change degree of source (see the study of Yang et al. [34] for details); (c) the level of hollowing (the proportion of unoccupied dwellings in all dwellings in each settlement); and (d) villagers' receptiveness of consolidation (the proportion of people who are willing to consolidate in all people in each settlement).The area of consolidated settlements in the target year (i.e., 2020) was obtained according to the population data from 1988 to 2014 derived from Daye Statistical Yearbook (see more in the study of Xuesong et al. [35]).
Through comprehensive consideration of the aforementioned five factors, we finally derived a classification map of source and sink (Figure 7b).To test the validity of the SRM, we also conducted a traditional classification (i.e., considering four traditional factors) (Figure 7a).Through contrastive analysis, we found that sources in traditional classification are mainly concentrated in the northern part of Chengui.Conversely, the spatial pattern of sources in optimized classification is dispersed and homogeneous.This divergence is caused by the discrepancy between traditional factors and socio-economic interactions.Specifically, although settlements in the north are relatively insufficient in suitability and villagers' receptiveness, they play an important role in the socio-economic village system (Figure 5).
We also conducted a detailed statistic comparison of two classifications (Table 3).The optimized classification shows higher potential in terms of maintaining the function of network system.The optimized strategy causes less harm to the entire village system than the traditional strategy especially for network connectivity and fragmentation (e.g., RP and CN).As for traditional factors, the optimized strategy also can obtain satisfied results except for villagers' receptiveness of consolidation (0.72 vs. 0.55).This exception indicates that the consolidation using optimized strategy faces challenges (e.g., the resistance of farmers).The local government may be able to effectively overcome these challenges (in terms of coordination of villagers' interests and sustainable development of the countryside).Apart from the robustness of network, we also considered four traditional factors: (a) suitability (on a scale of 1-5); (b) dynamic change degree of source (see the study of Yang et al. [34] for details); (c) the level of hollowing (the proportion of unoccupied dwellings in all dwellings in each settlement); and (d) villagers' receptiveness of consolidation (the proportion of people who are willing to consolidate in all people in each settlement).The area of consolidated settlements in the target year (i.e., 2020) was obtained according to the population data from 1988 to 2014 derived from Daye Statistical Yearbook (see more in the study of Xuesong et al. [35]).
Through comprehensive consideration of the aforementioned five factors, we finally derived a classification map of source and sink (Figure 7b).To test the validity of the SRM, we also conducted a traditional classification (i.e., considering four traditional factors) (Figure 7a).Through contrastive analysis, we found that sources in traditional classification are mainly concentrated in the northern part of Chengui.Conversely, the spatial pattern of sources in optimized classification is dispersed and homogeneous.This divergence is caused by the discrepancy between traditional factors and socio-economic interactions.Specifically, although settlements in the north are relatively insufficient in suitability and villagers' receptiveness, they play an important role in the socio-economic village system (Figure 5).
We also conducted a detailed statistic comparison of two classifications (Table 3).The optimized classification shows higher potential in terms of maintaining the function of network system.The optimized strategy causes less harm to the entire village system than the traditional strategy especially for network connectivity and fragmentation (e.g., RP and CN).As for traditional factors, the optimized strategy also can obtain satisfied results except for villagers' receptiveness of consolidation (0.72 vs. 0.55).This exception indicates that the consolidation using optimized strategy faces challenges (e.g., the resistance of farmers).The local government may be able to effectively overcome these challenges (in terms of coordination of villagers' interests and sustainable development of the countryside).

Implementation of GA to Relocate Consolidated Settlements
The typical parameter values of the GA are listed in Table 4.The population size was set to 100, and the number of generations to 300 (as the improvement of the best fitness value is stabilized after 200 generations).The crossover rate was set to 0.9, and mutation occurred in 5% of cases.For fully considering the socio-economic interactions and avoiding simple nearest reconstruction, d was set to 1000 m [36].Based on the study of Yu [36], the scale coefficient α was set to 0.8.S, C, and L were all taken into consideration in optimized scenario (w S = 0.3, w c = 0.3, w l = 0.4), whereas the traditional scenario only considered S and C with equal weights.The number of genes in optimized scenario was 137, whereas the length of chromosome in traditional scenario was 111.The solutions of spatial-territorial reorganization in different scenarios are presented in Figure 7.The moving distance (i.e., the distance from sources to relocated central settlements) of optimized solution is significantly longer than that of the traditional result (674.05 vs. 844.85).The moving direction of traditional scenario is more dispersed than that of the optimized solution.These differences suggest that the traditional strategy is a closer and dispersed integration, whereas the optimized strategy is wide-range and concentrated reconstruction.The optimized solution also shows more potential in terms of township development (i.e., the increased area of township, 15.37 hm 2 vs. 22.41 hm 2 ).
We also found two further advantages of the SRM through the detailed spatial comparison (Figure 8).First, the optimized scenario shows enhanced performance on the identification of administrative boundaries (Figure 8a,b).Specifically, the relocation of optimized strategy is under control of administrative system, whereas the traditional scenario seems to be under the control of distance.The reorganization solution of optimized scenario is always controlled within a village (the smallest administrative hierarchy in China), although it has longer moving distance.The cross-boundary relocation in traditional scenario may cause a separation between villagers even though they later live together.As reflected in our survey data, villagers in Wanjia rarely connected with others as they mostly came from other villages after natural disasters.This sense of belonging shared among villagers (located within a village) is formed by many factors (e.g., administrative and geographic context) in the long-term process of historical evolution.Second, the optimized scenario shows enhanced performance on the recognition of high-quality public service (e.g., shop, office, and school) (Figure 8c,d).The resettlement areas in optimized scenario are roughly twice as likely to own public services as the resettlement areas in traditional scenario (15.45% vs. 39.16%).Existing service facilities will largely reduce the relocation cost in a later period.
Sustainability 2017, 9, 1370 14 of 18 own public services as the resettlement areas in traditional scenario (15.45% vs. 39.16%).Existing service facilities will largely reduce the relocation cost in a later period.Moreover, the optimized and traditional solutions were compared with certain metrics to objectively prove the rationality of the SRM (Table 5).Significant improvements are observed in objective L and spatial metrics using SRM, whereas the traditional scenario has relatively better performance on objective S (4.12 vs. 4.06) and objective C (1.27 vs. 1.33).The optimized solution provides a local connectivity that is 40% higher than that of the traditional scenario (38.61 vs. 53.95).This efficiency demonstrates that the optimized solution has more demand for socio-economic interactions (working, visiting, shopping, and entertainment) from sources to sinks.The efficiency will be conducive for integration (e.g., lifestyle) between villagers in the resettlement area after consolidation.The receivable L value in traditional scenario is largely due to the propinquity effect-people who are located closer together in space show a higher probability of forming relationships [9,[37][38][39][40].Interactions among settlements are always bi-direction.In this context, we focused on the villagers' acceptability toward the central settlements because it is more crucial to the rural restructuring.Compared to traditional scenario, the NP, PD, MPS, MPI, and MNN of optimized solution have improved by 8.02%, 7.69%, 9.02%, 3.90%, and 2.23% respectively.In sum, the SRM helps to improve local connectivity and spatial pattern of rural settlements (e.g., bigger, more adjacent, and more concentrated) at the expense of a small reduction in suitability and compactness.Moreover, the optimized and traditional solutions were compared with certain metrics to objectively prove the rationality of the SRM (Table 5).Significant improvements are observed in objective L and spatial metrics using SRM, whereas the traditional scenario has relatively better performance on objective S (4.12 vs. 4.06) and objective C (1.27 vs. 1.33).The optimized solution provides a local connectivity that is 40% higher than that of the traditional scenario (38.61 vs. 53.95).This efficiency demonstrates that the optimized solution has more demand for socio-economic interactions (working, visiting, shopping, and entertainment) from sources to sinks.The efficiency will be conducive for integration (e.g., lifestyle) between villagers in the resettlement area after consolidation.The receivable L value in traditional scenario is largely due to the propinquity effect-people who are located closer together in space show a higher probability of forming relationships [9,[37][38][39][40].Interactions among settlements are always bi-direction.In this context, we focused on the villagers' acceptability toward the central settlements because it is more crucial to the rural restructuring.Compared to traditional scenario, the NP, PD, MPS, MPI, and MNN of optimized solution have improved by 8.02%, 7.69%, 9.02%, 3.90%, and 2.23% respectively.In sum, the SRM helps to improve local connectivity and spatial pattern of rural settlements (e.g., bigger, more adjacent, and more concentrated) at the expense of a small reduction in suitability and compactness.The aforementioned scenarios have highlighted the significance of socio-economic interaction consideration in the process of reorganization.For different preferences/weights, the final solution may be understandably different [26].To assess whether different combinations of the weight values would change the effectiveness of the model or solution significantly, we finally performed a preference analysis using different weight strategies.Herein, each objective preferred solution and solution with equal weights of object S and object C were taken as examples of the effectiveness of the SRM (Table 6).The classification result of source and sink came from the optimized scenario.The comparison table shows that all weight strategies demonstrate similar effectiveness in terms of object S, object C, as well as spatial pattern of rural settlements.This result implies the robustness of the SRM [26].The obj-S and obj-C preferred solutions may not always reach the best scores with respect to their preferred single objectives, due to the small extent of study area and the characteristics of the objective C [26].For obj-L preferred solution, the value of obj-L is much higher than the other solutions.Through contrastive analysis between four solutions and the aforementioned optimized scenario, we find that the optimized scenario has most balanced metric values.It is worth noting that it is arguably the best among these solutions because the aim of our reorganization method is to help planners or policy makers in finding suitable solutions according their preferences [26].Regarding the metrics, we find that object L and moving distance are more sensitive to the weight strategy than suitability, object C, and landscape metrics.Meanwhile, we suppose that the pursuit of objective C and object L maybe face the problem of the increasing of moving distance.

Relevant Policies/Practices
As the rural hollowing has become a major problem facing China's rural development, a series of policies, regulations, and practices have been introduced, such as "increasing vs. decreasing balance" land-use policy [4], "one family, one house" policy [3], and rural residential land consolidation and allocation (RRLCA) [6].The aim of "increasing vs. decreasing balance" land-use policy is to achieve equilibrium in the supply of land through balancing the increases in urban construction land with decreases in rural construction land [4].This policy has been developed at national and provincial levels to address the perceived rural problems [4].The Constitutional policy of "one family, one house" has been proposed to address the problem of "outward expansion while inside hollowing" and "one family, more houses" [3].RRLCA is an integrated approach to coordinate the numeric change of rural settlement and population [6].It has been widely used and showed significant progress on the intensive utilization of rural residential land [6].These policies are driven by the central state, represented by the Ministry of Land and Resources, and normally a top-down rural restructuring strategy [4].They are insufficient for enrolling local actors into the planning and decision making [4].The RRLCA can provide a gradually shift towards bottom-up endogenous development in the rural development strategies [6].The SRM in this paper can serve as a local practice and participatory restructuring and planning method.With the help of related macro-policy platform and various RRLCA practices, the local practice of spatial-territorial reorganization can be advanced by local government [1].Conversely, our local practice and SRM can provide an effective reorganization technique and optimization tool to realize these policies and practices.As such, accompanied by "top-down" policies/elements and platform provided by land consolidation, the local practice (SRM) and other "bottom-up" restructuring strategy are conducive to smoothly pushing forward spatial-territorial reorganization of rural settlement in China.The reorganization involves a diverse set of actors, such as local government, private enterprise, villagers, and other powerful actors, reflecting the complex and hybrid process of rural restructuring.The enrollment and motivation of these actors and participants in the process of consolidation could help to accelerate the reorganization/restructuring, safeguard farmers' benefits, and promote sustainable development in rural areas.

Conclusions and Future Work
This paper proposes a SRM of rural settlements to overcome the existing rural problems (e.g., scattered, massy, small, and hollowed).The proposed model is constructed based on graph theory and GA, and it involves two parts.In Part 1, we generate a series of experiments to investigate the network performance under numerous successive removals of settlement.Through this approach, we expect to find satisfied consolidated settlements with the least effect on the entire village system.In Part 2, GA model is repeatedly executed to optimize the objectives of suitability (S), compactness (C), and local connectivity (L) under the control of the constraints.The primary goal of this part is to scientifically resettle consolidated settlements into cities, nearby townships, or central settlements.
To verify the validity of the SRM, the proposed method has been applied in Chengui Town, Hubei Province.Two major findings are summarized as follows.First, removing settlements in order of node degree is the least efficient way to destroy the entire village system.Second, the proposed model can produce satisfactory solutions for spatial reorganization of rural settlements.The SRM helps to improve local connectivity and spatial pattern of rural settlements at the expense of a small reduction in suitability and compactness.The model also shows great potential in recognizing administrative boundary and high-quality public services.
The case study of Chengui is only a straightforward application of the SRM.It does not imply that the actual planning can be replaced.The optimization solution is merely a planning scenario based on different preferences.Moreover, the spatial-territorial reorganization of rural settlements is a complex systematic problem rather than a simple technological process.The reorganization may involve many aspects, such as the protection of basic farmland, industrial reorganization (off-farm employment), administrative reorganization, and land legal and managerial system [4].The SRM may also be extended by including more objectives and/or constraints.Future research may analyze these perspectives in detail.

Figure 1 .
Figure 1.Conceptualization of the three modes of spatial-territorial reorganization.

Figure 1 .
Figure 1.Conceptualization of the three modes of spatial-territorial reorganization.

Figure 3 .
Figure 3. Chromosome structure (a); and the resettlement of sources (b).

Figure 3 .
Figure 3. Chromosome structure (a); and the resettlement of sources (b).

Figure 4 .
Figure 4. Location and land use of the study area, Chengui.Figure 4. Location and land use of the study area, Chengui.

Figure 4 .
Figure 4. Location and land use of the study area, Chengui.Figure 4. Location and land use of the study area, Chengui.

Figure 6 .
Figure 6.Typical behavior of socio-economic network under four successive node-removal attacks in function of the number of removed settlements f (RP: the number of directly reachable pairs, IRP: the number of indirectly reachable pairs, L: the average length of the shortest paths, C: the cluster coefficient, NC: the number of components, S: the relative size of the largest connected component).

Figure 6 .
Figure 6.Typical behavior of socio-economic network under four successive node-removal attacks in function of the number of removed settlements f (RP: the number of directly reachable pairs, IRP: the number of indirectly reachable pairs, L: the average length of the shortest paths, C: the cluster coefficient, NC: the number of components, S: the relative size of the largest connected component).

Figure 7 .
Figure 7. Classification and reorganization solutions in: (a) traditional; and (b) optimized scenarios.Figure 7. Classification and reorganization solutions in: (a) traditional; and (b) optimized scenarios.

Figure 7 .
Figure 7. Classification and reorganization solutions in: (a) traditional; and (b) optimized scenarios.Figure 7. Classification and reorganization solutions in: (a) traditional; and (b) optimized scenarios.

Figure 8 .
Figure 8. Different behaviors on: village boundary (a,b); and public services (c,d) (note: (a,c) represent the traditional scenario, while (b,d) represent the optimized scenario).

Figure 8 .
Figure 8. Different behaviors on: village boundary (a,b); and public services (c,d) (note: (a,c) represent the traditional scenario, while (b,d) represent the optimized scenario).

Table 1 .
Performance of socio-economic network under a single attack.

Table 1 .
Performance of socio-economic network under a single attack.

Table 2 .
Relative change r of four metrics when half of settlements are removed.

Table 2 .
Relative change r of four metrics when half of settlements are removed.

Table 3 .
Comparison of two classification strategies.

Table 4 .
Parameter values of the genetic algorithm (GA).

Table 5 .
Comparison of traditional and optimized scenarios.

Table 6 .
Attribute information associated with four objective preferred solutions.