1. Introduction
As more than 75% of the global population is expected to live in urban areas by 2050 [
1], there is an urgent need to assess the risk of natural hazards through a future-focused lens so that adequately informed spatial planning decisions can be made to define preventive risk policies in the upcoming decades.
Disaster risk [
2] is defined as the combination of three determinants, such as hazard, exposure, and vulnerability [
3,
4,
5,
6]. These three components change over time due to natural and human influences impacting future risk levels. Among them, exposure is the most important because it represents the elements that can be affected by a particular hazard in a specific area.
Exposure is defined as “the people, property, systems, or other elements present in hazard zones subject to potential losses” [
7]. Exposure analysis is fundamental (i) to organize emergency response and relief to people affected or potentially affected; (ii) to provide information to decision-makers and the crisis management operators that manage resources and logistical infrastructure to support reconstruction and rehabilitation; and (iii) to reduce potential future hazard impact through disaster risk reduction strategies, by taking more risk-informed decisions.
Therefore, assessing exposure levels is a topic of great interest among scientists and researchers. Exposure to natural hazards has been evaluated using different methods and tools. Some authors used traditional technologies, while others applied new remote sensing technologies and techniques.
In the study developed by Duan et al. [
8], aimed at assessing the waterlogging risk, the authors combined multi-criteria decision analysis and geographic information systems to determine, among other components, such as hazard, vulnerability, and emergency response and recovery capability, the exposure level to that risk. Boni et al. [
9] developed a remote-sensing-based procedure to update the population exposed to natural hazards assessment quickly. A relationship between satellite nightlight intensity and urbanization density from globally available cartography is assessed and used to extrapolate urbanization data at different time steps, updating exposure each time. To test the reliability of the proposed methodology, the number of people exposed to riverine floods in Italy is assessed, deriving a probabilistic relationship between defense meteorological program nightlight intensity and urbanization density from the global urban footprint database for the year 2011. In another scientific work, Geiß et al. [
10] combined a supervised remote sensing model, such as rotation forest, and open street map data for the automatic estimation of crucial exposure components, such as the number of buildings and population, with a high level of spatial detail referring to the tsunami hazard. Bhuyan et al. [
11] applied a deep learning method, the ResUNet model, to detect the footprints of the buildings and used auxiliary open data to examine their exposure to floods in two areas in Kerala, India. Ehrlich et al. [
12] combined the global built-up area and the global population density data with five global hazard maps to produce global layers of built-up area and population exposure to every single hazard for the epochs 1975, 1990, 2000, and 2015 to assess changes in exposure to each hazard over forty years. Freire et al. [
13] evaluated and characterized the worldwide population distribution from 1975 to 2015 concerning the volcanism phenomenon using the population grids produced in the Global Human Settlement Layer project frame. Torres et al. [
14] proposed a scalable and transferable procedure to assess urban exposure and vulnerability to seismic risk by integrating remote sensing technologies and machine learning techniques. Jitt-Aer et al. [
15] applied a multi-stage spatial interpolation in the ArcGIS software to determine the population exposed to a tsunami impact in Patong (Phuket, Thailand).
All these existing studies on urban exposure to natural hazards rely on past or static observation. Therefore, they do not consider the evolutionary nature of risk, confirming what was stated by the authors of the reference [
16]. Although the need to consider the long-term implications of natural hazards is widely recognized by organizations such as the United Nations, the European Union [
17], and the World Bank [
18], exposure applications focused on that issue are still rare. In the meantime, the emphasis of scientific works on disaster risk reduction is still on individual hazards. Even if it is clearly defined by the Sendai Framework 2015–2030 [
19] that it is essential to promote comprehensive multi-risk analyses, only some studies consider combining more hazards.
Multi-risk methodology integrates multiple hazards and assesses their potential impacts on a specific area. It comprehensively explains the risks of specific land use and cover (LULC) patterns. LULC plays a crucial role in determining the vulnerability and exposure of an area to different hazards. Land use refers to the human activities and purposes for which land is employed (such as residential, commercial, industrial, agricultural, or recreational purposes); land cover refers to the physical and biological cover of the Earth’s surface, including forests, wetlands, urban areas, water bodies, and agricultural fields.
Land use choices are the main drivers of future exposure to natural hazards [
20]. In disaster management planning, future exposure to natural risks is typically related to analyzing and predicting the effect of urban growth and LULC changes [
21,
22] regarding built-up area development and population growth. These analyses on land use changes are a valuable means to determine the impact of urbanization on disaster risk reduction policies.
Many methods to assess land use transformations have been developed recently, especially using remote sensing and geographic information systems (GIS) technologies. This analysis reached significant consideration in different fields, including defining rural and urban plans [
23], identifying biodiversity hotspot landscapes [
24], desertification dynamics, and the management planning process.
Future land cover transformations can be assessed by applying different LULC models. These are generally characterized by defined operational phases, such as data collection, calibration, simulation, validation, and prediction [
25]. The data collection consists of deriving raw data about the historical LULC transformation and the driving factors data. LULC data can be retrieved from different sources, such as orthoimages, satellite imagery, and LULC change surveys, or can be obtained from the application of image classification techniques, both traditional and innovative. Driving factors data represent bio-physical, proximity, demographic, socio-economic, economic, and institutional factors, which can be defined as being responsible for the land cover changes over time. These data are available in public and administrative datasets. The calibration of the prediction model based on the LULC data and driving variable values at different time points is needed to parameterize the model. This latter is then applied to simulate future changes by generating the transition probability maps. The model validation phase estimates the model accuracy in predicting the LULC changes. This phase compares the LULC prediction with the LULC data available at a specific time. After checking the model prediction accuracy and validation, the last stage of LULC modeling consists of predicting future LULC.
According to the scientific literature, these LULC prediction models can accurately predict LULC changes for a short time, such as twenty or thirty years, because they depend on historical patterns. Different LULC prediction models have been developed in the scientific literature over time. Each model has specific characteristics.
The statistical models are the LULC prediction models based on a mathematical relationship between the LULC historical data and explanatory variables, defined as driving factors. These models generate transition probability maps, representing the suitability of specific areas to change. The most popular statical models are logistic regression [
26], generalized additive models [
27], and stochastic models, such as Markov chains [
28,
29,
30]. These models are characterized by their easy implementation and generalization. However, they could perform better if stakeholder perception about land cover changes is considered.
The spatial dynamics of land cover change models, based on defined transition rules obtained considering neighborhood interaction effects or expert knowledge, are cellular automata (CA). These models simulate spatial analysis changes by generating suitability maps and are helpful for decision-making. Nevertheless, they lag in the temporal dynamic of change. Some examples of CA models employed to predict LULC changes are CLUES [
31,
32], CA [
33,
34], Slope, Land cover, Excluded region, Urban land cover, Transportation and Hillshade (SLEUTH) [
35,
36,
37,
38,
39,
40], and GEOMOD [
41].
Another family of LULC simulation models is agent-based (ABMs). These models consent to simulate the LULC changes by considering the stakeholders’ considerations and elicitations as they are characterized by an environment through which decision-making entities can interact to define the relationship between them and their environment [
42,
43,
44]. Due to this characteristic, the low generalizability of these models represents their main limitation.
All these models are often combined to strengthen the potentiality and overcome the limitations of each one. An example of an integrated model is CA-Markov [
45,
46,
47]. These models generally well predict land cover changes but are challenging to implement and validate.
Land cover prediction modeling generates credible information about the LULC changes and provides valuable information about future urban development patterns. This is crucial for the sustainable urban planning of cities and disaster management planning in the uncertainty of tomorrow’s future. Therefore, the accurate geospatial modeling of urbanization to assess the multi-hazard exposure of urban areas is a fundamental prerequisite for enhancing urban resilience and disaster risk reduction.
Moving from this background, the authors provide a disaster management planning method to predict future land use changes and assess, through a census analysis, the future exposure of urban areas considering different hazards. To this end, after applying remote sensing technologies to map built-up areas, the authors predicted future urban growth dynamics, considering the main driving factors under different scenarios through a single averaged binary logistic regression model (BLRM). Based on the prediction results, the authors assessed the future multi-hazard exposure of urban areas by evaluating indicators related to the built-up area and socio-economic conditions. They applied the proposed methodology to the municipality of Rende (Calabria, Italy) to test and validate it.
The main objectives of this research are to (i) propose an innovative and transferable methodology that, using innovative methods and models connected to remote sensing and GIS techniques, could analyze the pattern of urban growth for a determined period and predict its trends by considering different driving factors; and to (ii) develop a decision support system able to predict the effects of territorial land use planning scenarios on the exposure to multiple natural hazards.
By employing this method, urban planners and decision-makers can model, predict, and quantify the future impact of risk from natural multi-hazards, according to which proper disaster management policies for tomorrow’s world can be defined.
The structure of this research includes five sections. The first section covers the introduction and the literature review. The second section presents the study area and the dataset used in the research, while the third section presents the overall methodology. The fourth section includes the results of the research. The fifth section discusses the results, while the sixth section gives the research conclusions.
3. Methodology
3.1. Methodological Framework
Applying a quantitative methodology (
Figure 2), the authors assessed the future multi-hazard exposure of urban areas under different socioeconomic growth scenarios by predicting the expected built-up area development through a single averaged BLRM calibrated considering the urban growth dynamics during a past reference period. In the proposed study, the authors applied the methodology to the case study described in
Section 2, considering 2000–2022 as the past reference period to identify the urban growth patterns and 2040 as the future time point associated with the future urban growth prediction. The authors analyzed a 20-year observation period as the most significant territorial transformations linked to planning policies generally occur within 15/20 years.
The authors obtained the built-up cover dynamics between 2000 and 2022 from the land use information detected by applying remote sensing and GIS technologies. Specifically, the authors employed the experimental Artificial Intelligence (AI) platform developed by the Laboratory of Environmental and Territorial Planning (LabPAT) of the Department of Civil Engineering of the University of Calabria. This experimental AI platform is based on the U-Net model. It is trained and validated to classify and segment the built-up area from satellite and aerial images. The results obtained from the AI platform are then refined manually by dividing the study area into a grid system of approximately 60,000 cells of 30 m × 30 m resolution to improve the accuracy of the prediction results.
After selecting the driving factors that potentially determined the urban growth dynamic, the grid system is used to perform the BLRM, obtaining the transition probability map for 2000 and 2022 and, therefore, to evaluate the coefficients’ driving factors values for the two years and the model’s goodness of fit.
Then, the authors evaluated a single averaged BLRM valid for 2000–2022 by averaging the coefficients’ value to the inverse of the respective variances and selecting only the transferable coefficients by performing a transferability test. The authors assessed the accuracy of the single averaged BLRM by comparing the results obtained from its application to 2022 and the actual built-up area pattern detected through the application of the AI platform, and the following refinement of the results using the grid system. The single averaged BLRM prediction accuracy is quantitatively assessed through the Kappa coefficient.
Once the accuracy of the built-up area prediction through the single averaged BLRM is demonstrated, the authors employed it to predict future urban growth patterns. Specifically, the single averaged BLRM is employed to predict urban growth in 2040 by considering different socioeconomic growth scenarios: business as usual (BAU), slow, and fast socio-economic growth. The prediction of the single averaged BLRM for these different scenarios in 2040 is combined with the socio-economic data and the hazard maps to assess future multi-hazard exposure at the census level by generating a future exposure map for each hazard.
3.2. Image Classification and Segmentation
A multistage procedure is applied to identify the urban growth dynamics of the study area from 2000 to 2022 through the classification and segmentation of built-up areas for each year.
First, the orthoimages referred to 2000 and S-2 satellite images for 2022 are processed employing the experimental AI platform, which combines Deep Learning (DL) technologies to classify and segment building heritage from satellite and orthoimages, with GIS techniques to import, use, and process its output to analyze the urban growth over time [
59,
60].
The experimental AI platform uses a U-Net model. The U-Net model is trained and validated for classifying and segmenting six different classes of features as buildings, vegetation, land, roads, water, and unlabeled. For this study, only buildings are considered. The platform is implemented using Python language using Pytorch with a TensorFlow backend.
A RELU activation function is used for the U-Net model, and the loss function considered is the IoU-based loss function. Adam model optimizer, with an initial learning rate of 0.001 and default hyper-parameters β1 = 0.9 and β2 = 0.999, is used. The validation task is carried out during the training stage, and its loss is calculated and monitored. The training is stopped to prevent overfitting if the validation loss does not improve within 100 epochs. The U-Net model is trained and validated using the “Semantic Segmentation of aerial imagery” [
61], reaching an F1-Score of 0.72. The training and validation tasks are carried out on the Google Colaboratory Pro Plus Environment, using 52 GB of RAM and the GPU NVIDIA Tesla P100. The NVIDIA Tesla P100 Graphics Card provides 3584 CUDA cores and 16 GB of HBM2 vRAM linked via a 4096-bit interface. Moreover, the performance is 9.3 TFLOPS at single precision, 18.7 TFLOPS at half precision, and 4.7 TFLOPS at double precision.
The AI platform is employed to detect the built-up coverage in 2000 and 2022. The output of the DL model is imported into the GIS environment, which, in this case, coincides with the open-source QGIS software [
62], obtaining an initial classification and segmentation of the built-up area cover changes in the 2000–2022 period. The accuracy of this first attempt to identify the built-up area cover is assessed by comparing the classification and segmentation obtained for 2022 with the information on land use land cover available from the DBSN. The accuracy level of the classification and segmentation through the AI platform is demonstrated through the F1-Score.
Then, the authors improved the accuracy of the results obtained automatically with the AI platform for each reference year. Specifically, the authors divided the study area into a grid of approximately 60,000 cells of 30 m × 30 m dimensions inside the QGIS environment. They manually refined the results obtained from the DL classification and segmentation of the built-up area, employing the orthoimages and the satellite images.
Therefore, the built-up and non-built-up cells are identified by exploiting the potential of remote sensing, artificial intelligence algorithms, and GIS tools through a two-step semi-automatic procedure.
After identifying the built-up area in 2000 and 2022, the authors calibrated a BLRM for each reference year, determining the influence of a selected cluster of driving factors on the urban growth dynamics of the study area for each year.
3.3. Selection and Assessment of Driving Factors
The urban growth driving factors widely used in the scientific literature are mainly related to the environmental conditions of the study area, the specification of its urban environment attributes, and the socio-economic condition of the resident population [
63,
64,
65]. The consideration of these driving factors varies according to the geographic conditions and their relevance to the specific context where the method is applied [
66].
For the environmental conditions, many studies considered terrain factors [
67], such as slope, elevation, and aspect, and climate factors, such as temperature, precipitation, and moisture. Considering urban environment attributes, the most popular driving factors considered in the scientific literature are related to proximity to transportation services, such as roads, highways, railways, and public transportation infrastructures [
68], proximity to urban and territorial services, such as hospitals, institutions, and proximity to job opportunities, such industrial, agricultural, farmland, and business areas [
69]. Socioeconomic conditions include population density, household number, job density, employment rate, and gross domestic product per capita [
70].
The authors considered nine driving factors in this study. The driving factors selected for this study include terrain factors, such as slope and elevation; socio-economic data, such as population and employees; and proximity factors, such as distance to the main roads, distance to railway stations, distance to main rivers, distance to built-up areas, and distance to territorial services. In the proposed study, the authors considered the university campus as a territorial service for higher education because of its presence in the study area. The authors selected these driving factors because they consider them to have the main explanatory power for built-up area changes and because of their easy calculation linked to their general availability in public and accessible datasets.
The authors assessed each driving factor employing the 30 × 30 m grid system. Indeed, they retrieved driving factors data and reclassified them using QGIS software functions. Slope and elevation values for each grid cell are derived from resampling the 20 × 20 m DTM. Population and employee data retrieved from the ISTAT census are associated with the grid system via the QGIS software. The authors evaluated the proximity to the main urban environment attributes by assessing the distance of each grid cell to the roads, railway stations, water streams, university, and buildings geometry. This analysis is performed through the tool “Distance to the nearest hub” from the QGIS Processing Toolbox. Giving the source point layer and the destination hub layer, which correspond to centroids of 30 × 30 m cells and the hub of destinations, respectively, this tool generates the lines to the hubs layer, indicating the nearest destination hub to each source point. The attribute linked to each line is the distance to the hub value. It represents the distance from the source point to the destination hub. The authors considered the centroid of every polygonal geometry for railway stations, higher education services, and built-up areas as destination hub layers. For main roads and water streams, the authors considered discretized hubs every 30 m along the line of these elements.
After assessing the driving factor values, the authors calibrated the BLRM for 2000 and 2022.
3.4. Binary Logistic Regression Model
The BLRM is used when the dependent variable is binary and the independent variables are continuous and categorical [
71,
72]. The main characteristic of the binary dependent variable in a logistic regression model is that it takes the value of 1 and 0 as the probability that a certain event happens or not, respectively [
73].
The authors applied a BLRM to assess the probability of urban growth in a specific spatial location. The urban growth probability (P) follows the logistic curve and can be estimated through the mathematical expression of Equation (1).
In Equation (1), P is the probability of the dependent variable; X represents the independent variable; is a constant; is the correlation coefficient of the explaining variable and explained variable; and is the odds ratio (OR), which represents the odds that the event will occur given a particular condition compared to the odds of the event occurring in the absence of that condition.
Considering the built-up condition as the dependent variable and the driving factors values as the independent variables, the authors analyzed the relationship between urban growth and the driving factors following Equation (2).
Transforming Equation (2), the authors obtained Equation (3).
In Equation (3), P is the probability of a cell to built-up; Xi is the i-th driving factor of the urban growth; β0 is a constant; and βi is the correlation coefficient between the i-th driving factor and urban growth.
The authors performed the BLRM for 2000 and 2022 using the grid system and changing the value of driving factors according to the environmental, built environment, and socioeconomic conditions for each reference year. The BLRM for each year and, therefore, the estimation of the coefficient values is performed in SPSS [
74].
3.5. Stratified Cross-Validation
In this research, the authors employed cross-validation for the accuracy evaluation of BLRMs calibrated for 2000 and 2022. Repeated stratified cross-validation is a valid technique recommended as a standard procedure for evaluating model accuracy by several studies [
75,
76,
77,
78,
79,
80,
81,
82,
83,
84,
85] because it provides a stable and reliable estimate with low bias. The main advantage of this method over regular cross-validation is that it can provide a less biased estimate of the model skill.
The employed cross-validation method is a 10-fold cross-validation, especially useful in studies where the dataset is imbalanced such as in the present study. The first step in 10-fold cross-validation is randomly dividing the dataset into 10 equal parts. If the dataset contains 1000 instances, each fold will have 100 instances. Stratification ensures that each fold represents the whole dataset well and maintains the same distribution of class labels in each fold as in the entire dataset. For each of the ten folds, a single fold is retained as the validation data, and the remaining folds are used as training data. The regression model is trained using the nine training folds and validated on the remaining fold. The process is repeated ten times, with each of the ten folds used precisely once as validation data. After running the cross-validation loop, three different performance scores are obtained, providing an insight into how well the model might perform on unseen data. As score values of 10-fold cross-validation, Kappa statistic, mean absolute error, and Root mean squared error are used.
3.6. Goodness of Fit of the Binary Logistic Regression Model
Estimating the coefficient values of the BLRM is followed by assessing the model’s appropriateness, adequacy, and usefulness. The importance of each explanatory variable is evaluated by carrying out statistical tests of the significance of the coefficients. After, the overall goodness of fit of the models is tested.
To test the significance of each coefficient, the Wald statistic expressed by Equation (4) is calculated.
In Equation (4), βi is the estimated coefficient value, and σ(βi) is the estimated coefficient value Standard Error (SE).
Each calculated Wald statistic is compared with a one-degree-of-freedom χ2 distribution. Although Wald statistics are easy to calculate, the standard error could be inflated for datasets that produce large coefficient estimates, resulting in a lower Wald statistic. This condition identifies an explanatory variable that could be incorrectly assumed to be unimportant in the model.
Another suitable indicator to determine the impact of each driving factor on urban growth is the OR. This indicator is the predicted change in odds for a unit increase in the predictor. When this value is less than one, an increase in variable values corresponds to decreased odds of the event’s occurrence. Instead, when this value is greater than one, an increase in variable values corresponds to the increased odds of the event’s occurrence.
Once the significance and the impact of coefficients for 2000 and 2022 are assessed, the authors determined the goodness of fit of the BLRMs performed for each time point to measure how well the coefficient values predicted by the BLRM for each year are close to the observed values.
The Cox and Snell and the Nagelkerke R2 statistics are frequently used for assessing a model’s goodness of fit. These statistics determine the explained variation of the model. The maximum value of Cox and Snell R2 is less than 1. The Nagelkerke R2, instead, assumes values inside the full range interval from 0 to 1 and represents an adjusted version of the Cox and Snell R2, and, for these reasons, it is often preferred.
3.7. Estimation of the Single Averaged Binary Logistic Regression Model Coefficients and Assessment of Their Transferability
Once the significance of coefficients and the goodness of fit of the performed BLRM for 2000 and 2022 are assessed, the obtained BLRM coefficients estimated for 2000 and 2022 have been averaged to obtain a single averaged BLRM.
Model averaging is a widespread technique in statistical applications to identify the best model from a set of models according to a defined criterion. It consents to have a good prediction of the future outcomes [
86], such as the future urban growth patterns, which is the purpose of this study.
In the proposed study, the coefficients of the single BLRM are estimated as the average of the obtained values from the application of the BLRM for 2000 and 2022, weighted to the inverse of the respective variances, according to Equation (5).
The variance of β
m (j) results to be expressed by Equation (6).
Then, the authors carried out a transferability test of the estimated coefficients of the single BLRM [
87] to assess if the single averaged BLRM can be applicable to predict future urban growth patterns. Employing this test, it can be verified whether the parameters of models estimated in 2000 and 2022 are systematically different or can be considered sample fluctuations of the single “true” averaged BLRM.
To test the null hypothesis E[β
i] = β
m for i = 1, …, k, where E[β
i] represents the mean of the coefficients β
i and K is the coefficient number corresponding to the i-th variable of BLRM, the likelihood ratio test expressed in Equation (7) is used.
The statistic of the TRANSF test, in the case of validity of the null hypothesis, results are distributed as a χ2 variable with a number of degrees of freedom equal to the number of periods minus one.
3.8. Validation of the Single Averaged Binary Logistic Regression Model
Before applying the BLRM to simulate future urban growth scenarios, the authors assessed the accuracy prediction of the single averaged BLRM. The built-up area distribution for 2022 is predicted by applying the single averaged BLRM considering the driving factors value for the 2022 reference period. The accuracy of the single averaged BLRM is evaluated through Cohen’s Kappa coefficient by comparing the 2022 built-up area map obtained from the two-step semi-automatic procedure described in
Section 3.2, with the built-up area simulated by the single averaged BLRM. The Kappa coefficient is selected to measure the prediction accuracy as it considers the possibility of the agreement due to chance [
88].
The Kappa coefficient is assessed employing Equation (8).
P
o is the proportion of the observed total agreement between the real and the predicted, and P
e is the proportion of the expected total agreement between the real and the predicted due to chance. The Kappa statistic puts the measure of agreement on a scale ranging from 0 to 1, where 0 indicates agreement being no better than chance, while 1 represents perfect agreement. A typical interpretation of a Cohen’s Kappa result is that values from 0 to 0.2 represent slight agreement, values from 0.2 to 0.4 correspond to fair agreement, values from 0.4 to 0.6 indicate moderate agreement, values from 0.6 to 0.8 coincide with a substantial agreement situation, and values from 0.8 up to 1 demonstrate almost perfect agreement [
89,
90].
The future urban growth scenarios are simulated once the prediction accuracy is assessed, and the model with the estimated coefficients is validated.
3.9. Definition of the Urban Growth Scenarios
This study’s reference period for predicting future urban growth patterns is 2040. The authors simulated urban growth for this time, considering the BAU, slow, and fast socio-economic growth scenarios [
91,
92]. These are defined by considering the different growth rates of the population and employees in the industrial and service sectors in 2022, which is considered the baseline period. The authors synthesized specifications for the different scenarios in
Table 2.
Analyzing the growth rate of the population and of the employees in the industrial and service sectors that occurred during the period 2000–2022, the BAU scenario (S0) foresees for 2040 a growth rate of the population and employees in the industry and services sectors equal to the one that characterizes the period from 2000 to 2022.
The slow socioeconomic growth condition consists of scenarios S1, S2, and S3. S1, S2, and S3 foresee an increase in the population and employees in the industry and services sectors equal to 50%, 75%, and 100% of the growth rate occurred during the 2000–2022 period.
The fast socioeconomic growth condition comprises scenarios S5, S6, and S7. These scenarios foresee a significant growth of the population and employees in the industry and services sectors. Specifically, the authors assumed an increase in the population and of the services and industrial employees equal to the 50%, 75%, and 100%, compared to 2022.
The authors assessed the urban growth prediction for each defined scenario by applying the single averaged BLRM.
3.10. Future Multi-Hazard Exposure Assessment
Once the urban growth patterns in 2040 for each scenario are identified, the authors assessed the future multi-hazard socio-economic exposure at the census level, considering defined socio-economic exposure indicators and the hazard maps that, in this study, are related to the landslide and hydraulic hazards.
The exposure levels of urban systems depend on the presence of material and immaterial urban features as they represent urban risk hotspots that can be potentially affected by hazards. The level of exposure of these elements can be analyzed through specific indicators, which, in most cases, are related to the distribution, density, and quantity of these exposed elements.
In the literature, different methodologies to assess socioeconomic exposure using standard measures exist [
93,
94,
95,
96,
97]. The measures widely used to assess natural hazard exposure levels are the built-up area [
98], the gross domestic product [
99], and the population density [
100,
101]. These essential variables assess natural hazard impact as they represent human activities and, in general, society [
102].
Moving from this background, the authors employed these three indicators (
Table 3) to assess the future exposure of urban areas to natural hazards. Specifically, the future socio-economic exposure is analyzed in this study by determining the built-up area density, the population density, and the number of employees exposed to hazards. The gross domestic product indicator, mainly used for economic exposure evaluation, is replaced with the number of employees exposed to hazards as this indicator can be retrieved more easily in public and official databases at the census level.
The future socioeconomic exposure is performed at the census level as they are the minor sampling units recognized at the national and international levels by employing GIS techniques within the QGIS environment.
The built-up area density exposed to natural hazards is evaluated by intersecting for each scenario the built-up area predicted by the single averaged BLRM with the map of the hazards. The ratio between the built-up area crossed by the hazard map and the total area of the census tract is evaluated.
The population density is assessed according to the population growth rate defined in
Section 3.9. The authors evaluated the population at the census level, then averaged the population data according to the built-up area intersected by the hazard and divided it by this latter. The same procedure is carried out for the number of employees affected by hazards.
After calculating the indicators in their unit of measurement, the authors normalized them using the minimum-maximum standardization method. This standardization method allows for the normalizing of raw values, obtaining a measurement scale between 0 and 1. The lowest values of this scale correspond to lower values of built-up area density, population, and number of employees exposed to hazards. The highest values correspond to higher levels of built-up area density, population, and employees exposed to hazards. The normalization step makes indicators comparable and facilitates aggregation operations. The aggregation phase is performed by summing the normalized values, obtaining a variation of the level of exposure ranging from 0 to 3.
The authors performed these steps within the QGIS software. The authors repeated these steps for all the defined scenarios. The result is an exposure map for each scenario and hazard in which the exposure value varies from 0 to 3. To provide a variation measure on the exposure level from one census tract to the other, the authors defined ten levels of value variation equal to 0.3.
4. Results
4.1. Built-Up Area Detection from Image Classification and Segmentation
From the application of the experimental AI software platform for the classification and segmentation of the Rende built-up area in 2000 and 2022, the authors assessed the accuracy of the results model by evaluating the F1-Score for 2022. The F1-Score for this timepoint equals 0.69, showing a good model level to classify and build up segment areas from orthoimages. As stated in the methodological section, the authors defined a square grid cell characterized by a dimension of 30 × 30 m to refine the segmentation of the built-up area retrieved from applying the DL model and to perform future urban growth predictions. By refining the results, the authors reached an F1-Score of 0.87.
The built-up area detected by the semiautomatic procedure equals 13.13 km2 in 2000 and 16.01 km2 in 2022. Therefore, the built-up area change detected for the study area during 2000–2022 revealed an increase of 2.88 km2. This built-up area change is related to the realization of residential and services buildings in Commenda, Roges, and Quattromiglia settlements, as well as the completion of supporting structures for the UNICAL campus, such as new residential buildings for students (San Gennaro, Chiodo, Monaci) and new services, such as the University Sports Center (CUS). The main driver of the increase in the built-up area during this period was the presence of the UNICAL campus, which determined the development of settlement areas next to it in terms of new buildings and urban services. On the contrary, the urban areas far from the university and the main center, such as the historical center of Rende and Nogiano, did not experience urban development during this period. Instead, they are characterized by many abandoned buildings and areas that need to be regenerated.
4.2. Binary Logistic Regression Models for 2000 and 2022
The authors calibrated the BLRM to assess the urban development prediction for 2000 and 2022, investigating the impact of the selected driving factors on the urban growth dynamics. For each cell in which the study area is divided, the dependent variable is binary and assumes values 1 and 0 in the case of built-up or not built-up cells. As stated in the methodological section, nine independent factors are examined in the BLRM to estimate the influence of each factor on urban growth. The coefficients of independent factors identified the nature of the correlation between urban growth and these driving factors for the municipality of Rende. If the coefficient value is positive, the probability of urban growth increases. Otherwise, the urban growth probability decreases if the coefficient value is negative.
Table 4 and
Table 5 show the values and the statistics of the nine independent factors for 2000 and 2022, respectively.
For 2000, considering the Wald statistic and the significance test, the coefficients related to the elevation and the distance to water are not significant for the model. The significance values, indeed, in both cases, are more than 0.05, while the Wald is low. Thus, except for these two variables, the other seven factors’ values have remarkable significance, indicating that these factors are statistically significant for urban growth. Looking at the significant variables, the authors found that three positively influence urban growth, while the other four negatively influence it. The driving factors that have a positive influence are population, employees, and distance to the university campus. Slope, distance to buildings, distance to main roads, and distance to railway stations have negative influence. Population and employees have the same impact on the urban growth dynamic. Their OR values indicate a greater probability of urban growth related to increased population and employees. Also, the probability of urban growth is higher for urban areas far from the university campus. Considering the slope, the distance to the main roads, and the distance to railway stations, the OR values indicate that the impact of the slope is more significant concerning the other factors on urban growth for the reference year. The probability of urban growth is higher for areas with lower slopes and areas near the main roads and railway stations. Moreover, urban areas tend to grow near existing buildings. Indeed, the OR value is 0.812, meaning that the probability of urban growth near already urbanized areas is estimated to be 1.23 times larger than the predicted expansion in an area far from these areas. The assessment of the model’s goodness of fit revealed that the Nagelkerke R2 statistic assumes a value of 0.83, meaning the BLRM explains about 83% of the total variance. To confirm that, Cox and Snell’s statistic also reaches a value of 0.51. These values indicate the high capability of the model to predict the observed values.
Regarding the results of the BLRM calibrated for 2022, the overall goodness of fit of the model indicates the high capability of the model to predict the observed values, with a Nagelkerke R2 value of 0.83 and a Cox and Snell statistic value of 0.55. Reading the significance test and Wald statistics of the driving factors, the elevation is not significant for urban growth. Contrary to what is determined for 2000, the distance to water is significant for the model. Also, for 2022, the other seven factors’ coefficients are significant. Population, employees, and distance to the university have a positive influence, while slope, distance to water, distance to buildings, distance to main roads, and distance to railway stations have a negative influence. As for 2000, the driving factor with the most significant influence on urban growth is the distance to buildings. The OR is 0.816 for this factor, meaning that the predicted expansion around existing urban areas is estimated to be 1.22 times more than the predicted expansion in an area far from urbanized areas. Similarly to what happened in 2000, the OR values indicate that the slope has a more significant impact concerning the distance to the main roads and the distance to railway stations, whose contribution to urban growth is similar. From OR values, it can be noted that the probability of urban growth is 1.02 times more likely in flat areas. Urban growth tended to occur near the main roads, the railway stations, and the water bodies. Also, for 2022, the probability of urban growth is higher far away from the university campus, although it has a small impact, as demonstrated by its OR values. The population and employment presence contribute almost the same to urban growth patterns. Specifically, the probability of urban growth increases slowly with the population and employees since the OR value for these variables is a little higher than 1.
4.3. Results of the Stratified Cross-Validation for 2000 and 2022 Models
The results of the 10-fold cross-validation analysis are reported in
Table 6. The results for 2000 and 2022 are promising, providing stable and reliable estimations with low bias. For 2000, the number of totally correctly classified instances exceeds 95%, reaching Kappa Statistic, Mean Absolute Error, and Root Mean Squared Error values, respectively, of 0.85, 0.073, and 0.19. Also, for 2022, the results are promising, with 94.22% of correctly classified instances and Kappa Statistic, Mean Absolute Error, and Root Mean Squared Error values of 0.84, 0.09, and 0.21. The employed cross-validation method provides a less biased estimate of the models’ skills.
4.4. Definition of the Single Averaged Binary Logistic Regression Model
As can be seen from the results obtained by calibrating the BLRMs for the years 2000 and 2022, the impact of the driving factors on the urban growth phenomenon is almost the same in both models, with excellent response in the levels of the goodness of fit between the predicted and observed values. For this reason, the authors verified whether the parameters of models estimated in different periods are systematically different or can be considered sample fluctuations of a single true model. If this condition is verified, it is possible to consider a single averaged predictive model valid and transferable to subsequent analysis periods.
The coefficient relating to elevation was excluded from the test as it is not significant in both reference years. However, the coefficient relating to the driving factor distance to water is included in the test as it is strongly significant in 2022.
By applying the methodology described in the methodological section, the results in
Table 7 are obtained.
The hypothesis of the statistical theory of transferability can be accepted at a level of significance of 0.05 for all the coefficients. Respecting this limitation, the value of TRANSF does not exceed the value of 3.841 to satisfy each coefficient’s transferability condition. As shown in
Table 4, the TRANSF value for all the significant driving factors is less than 3.841, demonstrating that all are transferable for 2000–2022. Therefore, the average coefficient value of each driving factor is considered to define the single averaged BLRM, which simulates the changes that occurred in a past timeline and represents the best model to predict future outcomes.
4.5. Model Validation Assessment
Before employing the single averaged BLRM to predict the 2040 urban growth dynamics under different socioeconomic growth scenarios, the authors validated the single averaged BLRM by comparing the urban growth dynamics obtained for 2022 employing the single averaged BLRM and the actual built-up area patterns.
First, the authors applied the single averaged BLRM to simulate the built-up area patterns for 2022. This result is represented in
Figure 3, where orange and grey represent the built-up and non-built-up areas, respectively.
The prediction is, then, compared to the actual built-up area pattern identified by the application of the AI platform and the refining operations. The accuracy level of the prediction is assessed by evaluating Cohen’s Kappa coefficient. For the 2022 time point, the Kappa coefficient is equal to 0.84, demonstrating a great level of accuracy of the single averaged BLRM. The real and the predicted built-up area patterns showed an observed agreement proportion of 0.94 and a proportion of probable agreement occurrences rate due to chance equal to 0.64. This result revealed that the calibrated logistic regression model predicts the built-up area changes well.
4.6. Built-Up Area Prediction for 2040 Based on Urban Growth Scenarios
After validating the single averaged BLRM, the authors predicted the future urban growth patterns in 2040 according to the scenarios defined in the methodological section. The predicted urban growth for the defined scenarios is represented in
Figure 4, where the grey color represents the non-built-up area, the orange color the built-up area in 2022, and the brown color the built-up area changes compared to 2022.
The spatial distribution and the changes in the built-up area in 2040 for the different scenarios are synthesized in
Table 8.
The BAU scenario determined slight modifications compared to 2022. Indeed, the change in built-up area is equal to 0.07 km2, which represents an increase rate of 0.44%. This increase in the built-up area is mainly located within the consolidated part of Rende, which corresponds to the Roges and Villaggio Europa settlements, by occupying the vacant land in this urban context. Other built-up area changes for this scenario are located in the industrial zone of Rende.
For S1 and S2, the built-up area changes are equal to 16.14 and 16.15 km2 for an increasing rate of 0.81% and 0.87%, respectively. Regarding the localization of built-up area changes, S1 and S2 follow the BAU scenario as the changes are mainly distributed in Rende’s urban consolidated and industrial area. The changes in the Arcavacata and Santo Stefano settlements represent some exceptions.
Under S3, the Rende municipality will experience an increase in the built-up area following a densification process of the already existing settlements. The main changes will interest Roges, Santo Stefano, and the industrial area. S3 will determine an increase in the built-up area of 0.18 km2 compared to 2022 for a rate of 1.12%.
Considering the fast growth scenarios, the authors found that S4 determines a widespread build-up area increase. Indeed, the built-up area changes compared to 2022 registered for this scenario is 0.51 km2, corresponding to a rate of 3.19%.
S5 determines an increase in the built-up area of 0.73 km2. It means an increased rate of 4.56% compared to 2022. These built-up area changes are mainly located in the Roges and Commenda settlements area. Moreover, these changes are in the industrial zone of Rende and in the development area of Quattromiglia. Another significant built-up area change occurs near the university campus UNICAL, within the Arcavacata and Santo Stefano settlement areas, while others are in the Saporito di Rende.
The built-up area change found for S6 follows the pattern changes registered for S5 with an increase in the built-up area compared to 2022 equal to 0.99 km2, corresponding to a rate of 6.18% of the built-up area more than the baseline year. Also, in this case, significant changes interest Quattromiglia, Commenda, Rogers, Saporito, Arcavacata, Santo Stefano, and Arcavacata settlement areas. For this scenario, significant changes also occur within the Rocchi area, the urban settlement behind the UNICAL campus bordering the Montalto municipality.
The analyzed scenarios revealed that significant changes will occur in the case of the fast growth scenarios. Despite this, looking at the built-up area changes that occurred over time for the analyzed case study, the authors consider the slow growth condition as the most probable urban growth forecast in 2040.
4.7. Future Multi-Hazard Exposure Prediction for 2040 based on Urban Growth Scenarios
After assessing the urban growth prediction, the future multi-hazard exposure is evaluated for the defined scenarios. The authors analyzed the indicators for the hydraulic and landslide hazards in the methodological section. Using the census tracts, the authors analyzed the main urban features exposed to the aforementioned hazards through QGIS software and assessed the exposure indicators defined in
Section 3.10.
The exposure to the hydraulic hazard over the different scenarios is represented in
Figure 5. Over the different scenarios, significant changes in the exposure can be observed.
As the main threats regarding the hydraulic hazard are represented by the rivers Campagano, Crati, Emoli, and Surdo, the main exposure changes occurred in the census tracts facing the attention areas of these two rivers. Specifically, over the analyzed scenario, the main changes in the exposure level are registered for the Saporito, Commenda, Roges, and Santo Stefano settlements.
In
Figure 6, the exposure level to landslide hazard of Rende’s census tracts is represented across the different scenarios. Also, in this case, the exposure level of Rende increases over the different scenarios.
Due to the geomorphological characteristics of the municipality of Rende and the differences in the urban elements exposed to landslide hazard across the analyzed scenario, the main changes in the exposure level are mainly noticed in the historical center of Rende, Malavitani, Dattoli, and Surdo.
5. Discussions
An analysis of the outcomes of this study reveals meaningful research insights.
As stated in reference [
103], DL algorithms applied to orthoimages and satellite images can obtain a detailed land cover classification and discriminate among classes of interest with promising results. These results demonstrate that new classification methods can improve the capabilities of performing land cover maps from remote sensing information. Despite the expanding amount of satellite images and land cover data provided by spatial agencies, these are affected by an excessive aggregation level and, often, the satellite images do not have the required spatial and temporal resolution. Therefore, creating an AI platform based on DL algorithms interfaced with GIS systems allows several advantages, including automated data analysis, improved accuracy, scalability, data integration and interoperability, and real-time processing and predictions. The applied methodology using satellite and aerial images from Unmanned Aircraft System (UAS) flights planned ad hoc can guarantee higher spatial resolution than most land cover maps provided by space agencies, often based on aggregated data from various sources.
The results revealed that the single averaged BLRM calibrated to determine the urban growth dynamics and patterns by considering the output of the semi-automatic procedure to classify and segment satellite and aerial imageries as the dependent variable, and nine driving factors as the independent variables have a great level of accuracy, as demonstrated by the Kappa statistic employed to perform the model validation assessment. Indeed, despite the accuracy of the semi-automatic procedure to determine the dependent variable, which, in this case, consisted of the application of an experimental AI platform that uses the U-Net model to classify and segment the built-up area from satellite and aerial imageries and the following refinement performed manually by the authors has a quite good level of accuracy in the identification of built-up features, the urban growth seems to be well predicted. The accuracy of the single averaged BLRM is also linked to the good calibration of the BLRM relating to 2000 and 2022. The goodness of fit of these two models was equal to 0.83, greater than that of the BLRMs calibrated to simulate urban growth dynamics in other research work [
104]. Moreover, the use of the single averaged BLRM, which is calibrated to analyze the changes that occurred between 2000 and 2022 and to predict the future urban growth dynamics, could help to overcome the limitation of considering only one specific BLRM calibrated for a single time point or a single period, as carried out by reference [
105]. The transferability test employed to assess the validity of each driving factor over time consented to define a single averaged BLRM for predicting better future urban growth dynamics. The benefit of calibrating a single averaged BLRM that considers the transferability of the impact of each driving factor is also demonstrated by the accuracy of the model prediction.
From the analysis of the impact of the selected driving factors on the urban growth dynamics for the 2000 and 2022 time points, the results showed that the proximity to existing buildings represents the dominant driving factor estimating a predicted expansion around existing urban centers greater than the predicted expansion in areas far from urban centers. This result is consistent with the other published studies about urban growth dynamics analyzed employing BLRMs in other case studies, even if they are mostly related to big cities, such as Atlanta [
106] and Bucharest [
107]. In these cases, the negative impact of the distance to existing buildings or existing built-up is major. The OR in these studies is equal to 0.38 and 0.14, while it is equal to 0.82 in the proposed study. The difference between the extension and the complexity of the study area can determine this difference. Calibrating the proposed single averaged BLRM to another case study could confirm this thesis. As a result of this study, urban growth probability is higher for areas with lower slopes. This result confirms that urban growth dynamics occurred in the analyzed territorial context over the years, tending to increase the built-up area in areas characterized by flat topography. Population, employees in the industry and services sectors, and the distance to territorial services have the same impact on the urban growth dynamic. A greater probability of urban growth is related to increased population and employees. The authors, moreover, found that urban growth probability is greater near the main roads, railway stations, and water. This result is in line with other scientific studies [
108,
109]. This latter scientific study also revealed that the influence of the proximity to higher education, such as a university, is not influential to the phenomenon as the obtained OR is equal to 1. Within the proposed study, the OR related to the distance to the university campus is slightly greater than 1, confirming that its impact could be more influential. Starting from these considerations, it can be highlighted how urban growth in the territory of Rende, even if less evident than in other large cities, is guided by the need to access better economic opportunities and services, mainly linked to infrastructures and services.
The prediction of urban growth dynamics in 2040, assessed considering the different socio-economic growth scenarios, revealed that a more substantial urban growth may occur in the fastest socio-economic growth conditions. Indeed, the cells that from non-built became built are more prevalent in these scenarios than the BAU and the slow socio-economic growth scenarios. The urban growth patterns in slow socio-economic growth conditions are areas closest to the existing buildings where the population, the economy, and the services are more concentrated. As the population and the number of employees in the services and industries sector grow, the diffusion of urban cells increases in correspondence with the already consolidated areas, such as Quattromiglia, the high-density settlements of Roges, and, lastly, the peripheral areas of Saporito, Surdo, Arcavacata, and Santo Stefano of Rende. The prediction model results applied in this study provide valuable insights for the future urban planning of Rende. Planning policies should promote a balance between the expansion phenomena along the valley axis, which connects the South (near Cosenza) to the North (near Montalto) part of the study area, and the transverse directions. As demonstrated in this study, the valley part of the study area has a higher urban growth probability than the other part of the city. These latter areas, such as Surdo, and Saporito, experience urban growth only if fast socio-economic growth conditions occur. It happens because they are marginal areas concerning the new urban center characterized by degraded and abandoned neighborhoods. These areas do not need to be expanded but require regeneration actions oriented to an improvement in existing settlement quality. These integrated regeneration interventions could involve the physical and functional recovery of buildings and the redevelopment of the adjacent spaces. These smaller-scale interventions should be combined with wider-scale interventions, enhancing infrastructure connections between the center and the periphery and strengthening the services offered to residents.
The results obtained from the future multi-hazard exposure assessment based on predicting future built-up areas, population density, and employees for industrial and services exposed to hazards showed the areas potentially affected by these hazards. These elements are the main variables that provide an initial measure and spatial pattern of the exposed urban features, population, and employees linked to the urban growth phenomenon. The exposure geography obtained by assessing the future exposure of built environments to natural hazards reveals that land use management and policies need to be optimized to minimize hazard damages. Strong and context-fitted disaster management tools can guide future urban growth localization outside current and, if defined, future hazard areas. The proposed method fits well with this necessity as it can be employed as a valuable support for urban and disaster management planners in evaluating planning scenarios considering their impact on risk. Indeed, urban planners can define robust strategies for the entire urban area and context-related specific interventions for neighborhoods [
110].
The proposed research addresses different novelties in this research area. First, the study presents an innovative quantitative and transferable methodology that guarantees a high level of precision in urban growth prediction. The calibrated BLRM is characterized by a high level of stability in the assessment of each driving factor determining the predicted urban growth. The level of detail contributes to determining slight differences in built-up area changes determined by the changes in driving factors. This contributes to evaluating the effects of these changes in hazard impacts. The assessment of changes in exposure level considering different scenarios is an effective means to define proper planning strategies aimed at enhancing the resilience of the urban environment and making urban settlements safer and more resilient. Furthermore, the main challenge of this research was the harmonization of data from different sources caused by the complexity of the proposed method, which involved the geospatial processing of different data formats, such as orthoimages, aerial imageries, shapefiles, raster, and alphanumerical data.
Despite its strength and novelty, the study has limitations as well. The accuracy in predicting the built-up area through the AI platform should be improved to avoid the two-step semiautomatic procedure. Also, the accuracy of predicting urban growth patterns through the single average BLRM could be further demonstrated by calibrating and employing the single averaged BLRM in another context. This would demonstrate if the single averaged BLRM works well for other areas. A useful test would be applying the single averaged BLRM to big cities or metropolises and comparing the results with existing research work. Moreover, the calibration and use of other models to predict future urban growth patterns and their comparison could help identify the best predictive model in terms of result and accuracy. In this study, the authors employed nine driving factors. The integration of other driving factors could be considered in future research to improve prediction accuracy. For the definition of scenarios, better results and research insights could be obtained if the land use planning scenarios defined in urban plans were analyzed to predict future urban growth patterns. Considering the future multi-hazard exposure, in this research, the authors assessed the future exposure to hazards by evaluating the current hazard maps available at the regional scale. Multisectoral and multidisciplinary studies would overcome this limitation by defining future hazard maps, which, combined with this study, could define the future overall risk of urban areas. Additionally, the authors are aware that only the built-up area density, population, and employees variables do not give a broad measure of exposure. For example, considering the built-up area, a more precise assessment of the future multi-hazard exposure would require information about its specific characteristics, such as the use destination, physical characteristics, and volume. For this kind of investigation, having precise information about this data could be addressed in the future development of this research.