Analyzing the Driving Factors Causing Urban Expansion in the Peri-Urban Areas Using Logistic Regression : A Case Study of the Greater Cairo Region

The peri-urban area (PUA) of the Greater Cairo Region (GCR) in Egypt has witnessed a rapid urban expansion during the last few years. This urban expansion has led to the loss of wide, areas of agriculture lands and the annexation of many peripheral villages into the boundary of the GCR. This study analyzed the driving factors causing the urban expansion in the GCR during the period 2007–2017 using the logistic regression model (LRM). Eight independent variables were applied in this model: distance to the nearest urban center, distance to the nearest center of regional services, distance to water streams, distance to the main agglomeration, distance to industrial areas, distance to nearest road, number of urban cells within a 3 × 3 cell window and population density. The analysis was conducted using LOGISTICREG module in Terrset software. This research showed that the population density and distance to the nearest road have the highest regression coefficients, 0.540 and 0.114, respectively, and were the most significant driving factors of urban expansion during the last 10 years (2007–2017). Moreover, based on the results of the LRM, a probability map of urban expansion in the PUA was created, which shows that most urban expansion would be around the existing urban areas and near roads. The relative operating characteristic (ROC) value of 0.93 indicates that the probability map of urban expansion is valid.


Introduction
Urban expansion (UE) is one of the urgent matters in developing countries, especially in light of the rapid increase of population [1][2][3].Most of the UE in developing countries occurs in the peri-urban area (PUA) [4][5][6].The PUA is generally known as a transitional zone between city and countryside [7][8][9][10][11].It can be found at the urban fringe along the edges of the built-up area and extends to encompass a scattered pattern of lower density villages within the rural area [12,13].Managing of the urban expansion in the PUA needs a comprehensive understanding of the UE process, particularly the driving factors of UE [14,15].
The issue of driving factors of the UE in the PUAs has been discussed from different perspectives through various case studies in the developing countries to understand the key factors of the UE and its influences.For example, Webster and Muller focused on the role of public policy, which supports the dispersal of manufacturing towards the PUAs as one of the driving factors of UE in these areas [9].Bhatta et al. and Siddiqui et al. argued that population growth is the main factor of the UE in developing countries [16][17][18].However, Osman et al. found that the proximity to urban centers was the most significant factor of the UE in the PUA around Giza city in Egypt [19].Lawanson et al. and Braimoh and Onishi mentioned that migration from rural areas and affordable rent in the PUAs were the most important factors of UE in the PUAs of Lagos in Nigeria [20,21].Appiah et al. found two different factors as the most significant in Ghana: the increased demands for new housing in the city and the good accessibility of the settlements in the PUAs [22].In addition, Mahamud et al. found that the distance to public amenities, affordable housing, and the distance to the workplace are the most important factors in the George Town conurbation in Malaysia [23].McGregor et al. and Ramachandra et al. argued that speculation in the housing market is a very important factor [3,4].Although all mentioned studies are from developing countries, the results are different from case to case.Thus, the driving factors that influence UE in PUAs should be studied for each case on its own [23,24].Therefore, the main goal of this study was defining the driving factors of the UE within the PUAs in the Greater Cairo Region (GCR), and then analyzing their influence on the UE process.
The GCR in Egypt is the largest metropolitan region in the Middle East with a population of 20 million.The UE issue is considered one of the urgent matters in the GCR.During the last 30 years, most UE of the GCR occurred in the PUAs.Nevertheless, the driving factors of UE in the PUAs had not yet been analyzed in a separate study.The previous studies of the PUAs in the GCR either focused on UE within parts of the region or mentioned a few factors without analysis of their impacts.For example, Osman et al. investigated the driving forces in the west part of the GCR only [19], while Mohamed examined a few of the driving factors in the urban peripheries within 5 km only around the GCR [25].Sims and Salem et al. discussed a set of driving factors that influence the UE in the GCR without measuring the impact of these factors on the UE in the GCR [26,27].Therefore, this study sought to fill the gap by defining and analyzing the influences of driving factors causing the UE for the whole study area using one of the urban expansion models, in addition to building a probability map of UE based on the results of this model.
Models are one of the tools used by researchers for studying the behavior of UE and examine the driving factors which influence this expansion [17,28,29].For instance, structure simulation model (SEM) has been used to model the UE in many case studies.SEM is a specific type of regression analysis and explains the relationships between independent and dependent variables.However, SEM is not applicable when dependent variables have spatial characteristics [28].Rule-based simulation models (RSMs) such as cellular automata (CA) can be used with this kind of variables.However, CA model can only simulate the spatial pattern without interpretation of urban expansion driving forces [30].Logistic regressions Model (LRM) can be used to analyze driving forces that have spatial characteristics as well as to give an interpretation of the influence of driving factors [29,31,32].Moreover, LRM can incorporate socioeconomic variables such as population density with other spatial variables [33,34].Thus, the LRM was applied to determine and analyze the driving factors of urban expansion in the study area.
This paper is divided into six sections.Section 1 includes the introduction and the background.Section 2 includes the boundary of the PUA in the GCR, the dataset types and sources.Section 3 presents the methodology of the study and explaining the utilized model in detail.Section 4 presents the results of the LRM and the validation of the model.Section 5 discusses the findings of the LRM, explanation for the unexpected results, the limitations of the model, suggestions for developing the model and future research.Finally, Section 6 highlights the overall significance of the study.

Study Area
The GCR in Egypt is located at latitude 30 • 06 N and longitude 31 • 28 E. Based on the study of the Strategic Urban Development Master Plan for GCR, the region boundary comprises five main parts: Main agglomeration, New Urban Communities (NUC), Peri-urban area (PUA), Agriculture lands, and Desert [35].The PUAs are located around the main agglomeration and include many villages and towns (see Figure 1).The population size of PUAs' settlements is widely different; there are small settlements with populations of less than 5000 inhabitants as well as huge settlements of more than 100,000 inhabitants.Currently, the total population in the PUAs exceeds 5 million and the annual population growth rate in these areas is 3%.In addition, the percentage of the PUAs' population to the total population increased from 21.6% to 25.5% during the last 20 years [10] (see Table 1).

Data
The data used in this study were collected from various sources.First, Landsat imagery for 2007 and 2017 was downloaded from the United States Geological Survey (USGS) and used in calculating the urban expansion from 2007 to 2017.Second, shapefiles of roads, regional services, water streams, industrial areas and urban centers were derived from the GCR database from General Organization of Physical Planning (GOPP) in Egypt.Third, the data source of population density was the Central Agency for Public Mobilization and Statistics (CAPMAS) in Egypt (see Table 2).All maps in this study were geometrically referenced to the WGS 1984, UTM zone 36 projection systems.The population size of PUAs' settlements is widely different; there are small settlements with populations of less than 5000 inhabitants as well as huge settlements of more than 100,000 inhabitants.Currently, the total population in the PUAs exceeds 5 million and the annual population growth rate in these areas is 3%.In addition, the percentage of the PUAs' population to the total population increased from 21.6% to 25.5% during the last 20 years [10] (see Table 1).

Data
The data used in this study were collected from various sources.First, Landsat imagery for 2007 and 2017 was downloaded from the United States Geological Survey (USGS) and used in calculating the urban expansion from 2007 to 2017.Second, shapefiles of roads, regional services, water streams, industrial areas and urban centers were derived from the GCR database from General Organization of Physical Planning (GOPP) in Egypt.Third, the data source of population density was the Central Agency for Public Mobilization and Statistics (CAPMAS) in Egypt (see Table 2).All maps in this study were geometrically referenced to the WGS 1984, UTM zone 36 projection systems.

Classification of Satellite Images
ArcMap10.3GIS software was used in image processing and classifying land use land cover (LULC) maps.The PUA of the GCR was extracted from the Landsat images using Extract by Mask in Arc GIS.Supervised classification method was conducted for the image classification process.In total, 150 training sites (signatures) were chosen in each image to represent four LULC types: (1) Urban; (2) Water; (3) Agriculture; and (4) Desert.The maximum likelihood (MLC) was applied for the classification process.Then, 100 random points were selected at each classified image on a stratified random approach and compared digitally with the corresponding pixels of the original images in Google Earth Pro as a reference data.Kappa index was applied to compute the accuracy, which showed an accuracy rate of 0.88 and 0.92 for 2007 and 2017, respectively.

Identifying the Variables of the Logistic Regression Model
In this model, a map of the urban expansion between 2007 and 2017 was used as a dependent variable to perform the logistic regression analysis.This map has two land use classes: urban expansion (changed cells from agricultural to urban) where y = 1 and no urban expansion (still agricultural land) where y = 0.
Thirteen independent factors (variables) were derived from the literature review and discussion with experts to include in the model [17,19,26,27,36,37].Then, three factors (income per capita, economic activities and housing rent) were eliminated from the model due to lack of data for the whole study area.In addition, two other factors (distance to Nile River and slope) were eliminated from the model because of the low value of R 2 statistics with these factors.This process was carried out for each set of variables to assess their reliability.Finally, the total number of independent (variables) decreased from thirteen to eight factors (variables) (see Table 3).The number of urban cells within a window was decided according to the most widely used window sizes in dynamic simulation models where sizes are usually 3 × 3, 5 × 5, or 7 × 7 [30,36,38].Cell windows of 7 × 7, 5 × 5, and 3 × 3 cell windows were investigated during the initial analysis and the best results were found using 3 × 3 cell window.This is probably because the smallest cell could capture the reality of small urban expansion in the study area.

Logistic Regression Model (LRM)
A logistic regression model was used to analyze the driving factors causing urban expansion in the PUAs of the GCR.The logistic model was utilized to estimate the probability of urban expansion (dependent variable) based on independent variables (X).The dependent variable (Y) in this model is the developed cells (change from non-urban to urban) presented as a binary raster where a value of 1 indicates urban expansion (change on the specific pixels) while 0 indicates no urban expansion (no change on the specific pixels) within the period of 2007-2017.
The analysis was conducted using LOGISTICREG module in Terrset software.The basic assumption is that the probability of the dependent variable taking the value of 1 follows the logistic curve and its value can be calculated with the following Equation (1): where P is the probability of the dependent variable being 1; X is the independent variables, x = (x 0 , x 1 , x 2 . . .The state of a cell is dichotomous: either the presence of urban expansion or absence of urban expansion.The logistic regression model employs a Maximum Likelihood Estimation (MLE) procedure to find the best fitting set of coefficients.The maximum likelihood function used by LRM is the following: where L is the likelihood; µ i is the predicted value of the urban expansion (dependent variable) for sample i; and y i is the observed value of the urban expansion (dependent variable) for sample i. Maximizing Equation ( 2) thus requires the solution for the following simultaneous nonlinear equations: where X ij is the observed value of the independent variable j for sample i.The rest is the same as for the likelihood function.In solving the above equations, LOGISTICREG uses the Newton-Raephson algorithm.
In a logistical regression, the odds of Y being 1 is calculated using the equation φ = eα + ∑ βiXi and the odds ratio for the dichotomous variable X is demonstrated as a relation between each of the variables with urban expansion (Y).By fitting the logistic regression model to the variables, the study calculated the respective odds ratios of the independent variables to predict the outcome using Equation ( 4): where P is the the probability that a case is in a particular category for the response variable; exp = the base of natural logarithms (approximately 2.72); a is the constant of the equation in particularly explicit in step-wise models; and β is the coefficient of the predictor variables.
The study used Terrset software to conduct the LRM using stratified sampling.Sampling can significantly decrease the processing time as well as reduce the negative impact of spatial interdependence [31].The study used a mask file to select pixels of interest for the analysis as well as apply this mask to the output image.

Calibration of the Logistic Regression Model
The aim of the calibration of the LRM is detecting the ideal resolution for modeling.This study used the same fractal method as Hu and Sefidi to determine the optimal scale of modeling [28,39].Fractal dimensions were calculated for the probability maps predicted using the LRM calibrated in different resolutions from 30 m to 360 m.Fractal dimension increases almost linearly with the change of resolution from 30 m to 300 m, and then decreases after 300 m.The scale at which the highest fractal dimension is measured might be the scale at which the model performs best [39,40].To test if the model indeed performs best at the turning point of 300 m, a series of probability maps of urban expansion generated from the LRM was compared against an actual urban growth map and ROC values were calculated (for details of model validation, see Section 4.6).The highest ROC value was achieved at the resolution of 300 m.Therefore, the resolution of 300 m was selected as the scale at which the LRM best represents the dynamics of urban expansion.To reduce the impact of spatial dependency between observations, the study specified a stratified random sampling by taking a 10% portion of total cells.At the resolution of 30 m, the number of cells within the PUA is 4,648,192, of which 444,012 cells have been sampled.The number of cells that have changed from non-urban to urban (Y = 1, urban expansion), is 123,230, accounting for 2.65% of the total number of cells.For the random sample, there are 11,749 points whose cell values are 1 in the Y variable layer.The percentage of urban expansion (Y = 1) in the sample is 2.64%, matching very well with the percentage of 2.65% for the full dataset, which demonstrates the representativeness of the stratified random sampling.The outputs proved that the eight driving factors are significant enough.Cox and Snell R 2 value for the model was 0.76, which refers to a very good fit in the study area and the R 2 value was 0.90, which reflects the accuracy of the model (see Table 4).

Urban Expansion
The urban expansion (UE) in the PUAs was about 5400 ha with annual loss of 540 ha.The number of changed cells from agricultural to urban (Y = 1, UE) was 123,230, accounting for 2.65% of the total number of cells as shown in Figure 2. The direction of UE was mainly toward north and northeast area, especially along or near roads network.In addition, there was a significant UE around existing urban areas.

Driving Factors (Independent Variables)
Distance to nearest road, distance to the nearest center of regional services, distance to water streams, distance to the main agglomeration, distance to industrial areas, and distance to nearest urban centers were analyzed in Terrset software using distance command from Idrisi Gis analysis.Number of urban cells within a 3 × 3 cell window was calculated using pattern tool from Context

Driving Factors (Independent Variables)
Distance to nearest road, distance to the nearest center of regional services, distance to water streams, distance to the main agglomeration, distance to industrial areas, and distance to nearest urban centers were analyzed in Terrset software using distance command from Idrisi Gis analysis.Number of urban cells within a 3 × 3 cell window was calculated using pattern tool from Context Operators menu in Terrset software (see Figure 3).

Driving Factors (Independent Variables)
Distance to nearest road, distance to the nearest center of regional services, distance to water streams, distance to the main agglomeration, distance to industrial areas, and distance to nearest urban centers were analyzed in Terrset software using distance command from Idrisi Gis analysis.Number of urban cells within a 3 × 3 cell window was calculated using pattern tool from Context Operators menu in Terrset software (see Figure 3).

Multicollinearity Analysis for Independent Variables
To eliminate useless driving factors from our model and preserve the reliability of our coefficients, a multicollinearity test was performed for each set of factors in the study area.VIF is the most frequently utilized test to find multicollinearity in statistical analysis.VIF computes how much

Multicollinearity Analysis for Independent Variables
To eliminate useless driving factors from our model and preserve the reliability of our coefficients, a multicollinearity test was performed for each set of factors in the study area.VIF is the most frequently utilized test to find multicollinearity in statistical analysis.VIF computes how much the variances of the applied regression coefficients are distended compared to when driving factors are not linearly related.The analysis was performed in SPSS by regressing one of the independent variables against the residual seven driving factors in a repeated mechanism.The analysis shows that all driving factors give VIF < 5, which is a good outcome (see Table 5).

Influence of Driving Forces on Urban Expansion
The findings explained the nature of the relationship between urban expansion and the driving factors in the study area.It can be explained that the pixels near any independent variable having higher coefficient value have more development probability for change [18].The sign of the coefficient (±) indicates a positive or negative correlation to the response of the dependent variable [32].For example, a negative (−) coefficient index means that, if the distance increased, the urban expansion decreased.The factors of population density (X7) and the number of urban cells within a 3 × 3 cell window (X8) had a positive correlation with urban expansion, while the correlation was negative with the other factors: the distance to nearest road (X1), distance to the nearest center of regional services (X2), distance to water streams (X3), distance to the main agglomeration (X4), distance to industrial areas (X5) and distance to nearest urban centers (X6) (for more details, see Table 6).i Odds ratio meaning the ratio of the probability of success to the probability of failure for the variable, which represents the effect of each variable in the urban expansion process.
The research results show that the eight factors affected the urban expansion to different degrees, as indicated by odds ratios.The odds ratio for distance to the nearest road (X1) is 0.861, which mean that the predicted urban expansion in an area close to a road is estimated as 1.16 times more than the predicted urban expansion in an area 1 km further away from a road.The odds ratio was 1 for distance to nearest center of regional services (X2), distance to water stream (X3), distance to the main agglomeration (X4) and distance to industrial areas (X5), which mean that there is no impact of these factors in the urban expansion process within the study area.The odds ratio of urban expansion probability around existing urban centers (X6) is 1.955, which mean that the odds of urban expansion will decrease by 0.5 if the distance to nearest existing urban area has increased by 1 km.The odds ratio for the population density (X7) is 0.110, which mean that the probabilities of urban expansion in the areas with high density of population are nine times more than the probabilities of urban expansion in the areas with low population density.The odds ratio for the number of urban cells within a neighborhood (measured by a window of 3 × 3 cell size) (X8) is 1.909, which mean that, with an increase of one urban cell within the neighborhood, the probability of urban expansion will increase by 0.5.Therefore, if a land lot includes many urban cells, it is more likely to witness more urban expansion

Prediction of Urban Expansion
The probability of urban expansion was predicted by using the coefficients of the logistic regression model encompassing the eight variables (driving factors) into the following equation: where P (Y = 1|X 1 , X 2 , . . ., X k ) is the probability of the dependent variable Y being 1 given (X 1 , X 2 , . . ., X k ), i.e., the probability of a cell being urbanized; Xi is an independent variable representing a driving force of urban expansion; and β i is the coefficient for variable X i .
The probability map of urban expansion for the PUAs in Figure 4 is a 0-0.73 color classification of urban expansion probability values.The darker tones indicate higher probabilities of urban expansion.The future pattern of urban expansion is easy to identify from this map, and is expected to be around main existing urban areas and adjacent to roads, particularly in northern areas.
Infrastructures 2019, 4, x FOR PEER REVIEW 10 of 14 The probability map of urban expansion for the PUAs in Figure 4 is a 0-0.73 color classification of urban expansion probability values.The darker tones indicate higher probabilities of urban expansion.The future pattern of urban expansion is easy to identify from this map, and is expected to be around main existing urban areas and adjacent to roads, particularly in northern areas.This probability map can be used to predict where urban expansion will happen.Additionally, the study highlighted the major factors of UE, which can be adapted by appropriate policies to promote or discourage UE in the predicted areas.This probability map can be used to predict where urban expansion will happen.Additionally, the study highlighted the major factors of UE, which can be adapted by appropriate policies to promote or discourage UE in the predicted areas.

Model Validation
The study used the relative operating characteristic (ROC) curve to examine the efficacy of the model for the probability of urban expansion in the PUA.The ROC computes the area under the curve, which varies between 0.5 and 1.0., where 1 indicates a perfect fit and 0.5 indicates a random fit.The ROC calculates the percentages of false positives and true positives for a range of thresholds or cut-off values, relating them in a chart.
Previous studies show that a perfect spatial agreement can exist between the real urban expansion and the simulated urban expansion by the LRM [34,38,39].In this study, the model validation was conducted by comparing the predicted urban expansion of the probability map with the actual urban expansion map of 2017 using random samples of 1000 cells in the two maps.The ROC curve is based on several two-by-two contingency tables.The contingency table is based on the comparison between the actual and the predicted probability image.
Table 7 shows the contingency table."A" is the number of true positive cells, which are predicted as urban expansion and agree with the actual image."B" is the number of false positive cells, which are predicted as urban growth but disagree with the actual image."C" is the number of false negative cells, which are predicted as non-urban growth but disagree with the actual map."D" is the number of true negativecells, which are predicted as non-urban growth and agree with the actual image.From every contingency table, a single data point (x, y) is created, where x and y are the rate of false positives and the rate of true positives, respectively.The true positive percent value is derived from A/(A + C), while the false positive percent value is derived from B/(B + D), where A, B, C, and D are pixel counts in the contingency table for each threshold.The ROC value of the model was 0.93, which indicates that the probability map of urban expansion is valid (see Figure 5).
The true positive percent value is derived from A/(A + C), while the false positive percent value is derived from B/(B + D), where A, B, C, and D are pixel counts in the contingency table for each threshold.The ROC value of the model was 0.93, which indicates that the probability map of urban expansion is valid (see Figure 5).

Discussion
Similar to most cities in developing countries, the expected urban expansion in the PUA of GCR will be a kind of chaotic expansion, which probably will establish slums areas on agriculture lands.The findings explain the driving factors that affect the urban expansion in different degrees, which are discussed as follows: The findings show a lesser effect than expected for the roads factor (distance to nearest road) in comparison with results of similar case studies [27,32,39].This is probably due to most of the urban expansion occurring on specific regional roads in the study area, and the study investigated all types of roads in the study area.For the neighborhood factor (the number of urban cells within a neighborhood), the findings show a lower effect than common in comparable metropolitan regions [28,39].However, the expansion will probably continue to take place in locations near developed areas due to the other factors, which will lead to more informal areas.
On the other hand, the findings show that the population density has the greatest effect on urban expansion in the study area and this result is compatible with the findings of most previous studies [25,27].The findings of this study can be useful to manage expansion in the study area by using appropriate techniques for dealing with the significant driving factors.

Discussion
Similar to most cities in developing countries, the expected urban expansion in the PUA of GCR will be a kind of chaotic expansion, which probably will establish slums areas on agriculture lands.The findings explain the driving factors that affect the urban expansion in different degrees, which are discussed as follows: The findings show a lesser effect than expected for the roads factor (distance to nearest road) in comparison with results of similar case studies [27,32,39].This is probably due to most of the urban expansion occurring on specific regional roads in the study area, and the study investigated all types of roads in the study area.For the neighborhood factor (the number of urban cells within a neighborhood), the findings show a lower effect than common in comparable metropolitan regions [28,39].However, the expansion will probably continue to take place in locations near developed areas due to the other factors, which will lead to more informal areas.
On the other hand, the findings show that the population density has the greatest effect on urban expansion in the study area and this result is compatible with the findings of most previous studies [25,27].The findings of this study can be useful to manage expansion in the study area by using appropriate techniques for dealing with the significant driving factors.
The study suggests some recommendations for dealing with urban expansion in the case study and similar areas in the metropolitan regions of developing countries.First, the results obtained from LRM should be used to identify the tendencies of urban expansion in local municipalities.Second, decision makers must recognize and track the rapid population growth in the PUAs and the increasing urban expansion.Accordingly, they should utilize restricted policies to adjust urban expansion on agriculture lands.Third, local urban municipalities in the PUAs should identify the potential growth areas around the high-density urban edges, near roads, and near urban centers, as well as prepare urban plans with strict building regulations to prevent future informal settlements.
Despite the LRM's potentials in comprising many variables in the analysis, some important factors still cannot be included in the model, for instance government policy factor and real estate speculation factor, which probably influence urban expansion.In addition, the probability map of urban expansion produced by the model indicates where urban expansion will happen without clarification of the temporal dimension.Therefore, it is recommended for future research to integrate Cellular Automata (CA) model with the logistic regression model to predict urban expansion.Finally, the study suggests a separate research on the mechanisms of land management in PUAs, especially for the lands situated along major roads.

Conclusions
This study identified eight driving factors causing the urban expansion within the PUAs in the GCR, and then analyzed their influence on the urban expansion process using LRM.The population density and distance to the nearest road were the most significant driving factors in the study area during the last 10 years (2007-2017).Moreover, the probability map generated from the model can be used to predict where urban expansion will occur.The ROC value was used to examine the efficacy of the model for the probability of urban expansion, which indicated that the model is valid.Despite the LRM's strengths, this study has shown some limitations of the model, most notably the inability to include all types of variables.
On the other hand, the results of this study are not only useful to manage UE in the study area, but also in other PUA in developing countries.The study identified the major factors of urban expansion, which can be controlled by using the appropriate regulations and policies.
The study suggests combining Cellular Automata (CA) model with LRM to overcome the weakness of LRM in dealing with temporal dynamics.In addition, the future research should address the role of government policies and real estate speculation in the urban expansion process in the GCR.Moreover, the study suggests more future research on the mechanisms of land management in PUAs, especially for lands situated along major roads.
x k ) (representing driving factors of urban expansion); and b is the estimated parameters, b = (b 0 , b 1 , b 2 . . .b k ) (representing variable coefficients)

3. 3 . 2 .
Goodness of Fit of the Model The −2log(likelihood) was computed to measure the goodness of fit of the logistic regression model.The −2log(likelihood) (1744.354) is the value of the likelihood function for the full model as fitted.Cox and Snell R 2 and Nagelkerke R 2 were computed to further test the goodness of fit of the model.The Cox and Snell R 2 values and the Nagelkerke R 2 values are analogous indicators to the R 2 statistics of the linear regression.R 2 values closer to 1 indicate the model is closer to certainty.

Figure 2 .
Figure 2. Dependent variable (Y): the change to urban area from 2007 to 2017.

Figure 2 .
Figure 2. Dependent variable (Y): the change to urban area from 2007 to 2017.

Figure 4 .
Figure 4. Probability map of urban expansion in the PUAs.

Figure 4 .
Figure 4. Probability map of urban expansion in the PUAs.

Table 1 .
Peri-urban areas share of Greater Cairo's population.

Table 2 .
Dataset types and sources.

Table 1 .
Peri-urban areas share of Greater Cairo's population.

Table 2 .
Dataset types and sources.

Table 3 .
List of factors (variables) included in the logistic regression model.

Table 4 .
Goodness-of-fit statistics of the logistic regression in the study area.

Table 5 .
Results of multicollinearity analysis.

Table 6 .
Coefficients and odds ratios for the LRM containing the eight independent variables.

Table 7 .
Two by two contingency (the number of grid cells in actual map versus a predicted map).