1. Introduction
Floods are among the most frequent and threatening natural disasters that endanger human lives and built environments. According to the Emergency Events Database (EM-DAT) of the Centre for Research on the Epidemiology of Disasters (CRED), 147 major flood events over the last three decades (1990 to 2020) have affected 12,299,022 people in the United States [
1]. The data show 953 deaths and 480 severe injuries resulting from flooding in the country in that period. Additionally, 29,200 persons lost their homes in those flood events from 1990 to 2020 [
1]. Flood risk of damage from inundation is the product of three components: (1) hazard from the flood event, including frequency, magnitude, depth, duration, location, and timing, (2) exposure that includes the population and assets, such as buildings and infrastructure, prone to the hazard, and (3) vulnerability or susceptibility of the exposed elements to inundation [
2,
3,
4]. Studies related to flood risk assessment have developed experimental models for quantifying and mapping the above risk elements.
Many studies, mainly since the mid-2010s, have focused on flood hazard estimations and floodplain mapping [
5]. However, prepared floodplain maps, like the flood zones delineated for the United States by the Federal Emergency Management Agency (FEMA), have been the source of hazard data for researchers who concentrate on other aspects of flood risk assessment, including exposure and vulnerability as their primary study scope [
6,
7]. Flood exposure refers to valuable elements, specifically humans and physical or natural assets located in flood zones that can be subject to losses [
4,
8]. The exposure is usually excessive in urban areas due to higher population density, physical development intensity causing more flood-prone assets, and concentration of economic activities [
9,
10]. Studying flood exposure in large areas such as cities, metropolitan regions, and countries is generally classified as macro-scale exposure analysis [
11]. Land use/Land cover (LULC) data are often appropriate for flood exposure assessment on that scale [
9,
11]. Flood vulnerability, referring to a community’s circumstances making it more susceptible to inundation, is an influential factor determining the impacts [
12,
13]. It can be investigated under two categories. First, physical vulnerability includes the characteristics of physical assets, like buildings and infrastructure, for example, structure age, quality, floor elevation, and materials, resulting in more severe damage to them while flooding [
3,
9,
14]. Second, social vulnerability is a situation caused by socioeconomic or demographic characteristics making some groups of people more sensitive to natural hazards, like flooding, affecting their capacity to anticipate, respond to, and recover from them [
9,
13,
15,
16]. Based on the reviewed literature,
Table 1 summarizes important socioeconomic factors influencing social vulnerability and cites the authors who have identified these factors.
A study at the scale of the United States counties that has been a reference for many researchers indicated that African Americans and Hispanics were the most vulnerable racial and ethnic groups, respectively, to environmental hazards such as floods [
15]. There are other socioeconomic characteristics affecting vulnerability that can apply to some study areas. Examples are occupation type [
15] and industrial sectors [
15,
20] and their loss probability in the event of a disaster. Potentially, communities with rapidly growing populations can be more vulnerable to floods due to the deficiency of social services networks or lower housing quality [
15]. In addition, areas developed with higher density can be more susceptible to natural hazards, like floods [
15,
20]. Social vulnerability indices can be produced based on the influential factors, as indicators, to recognize the most vulnerable communities and define the priorities of mitigation plans [
17,
20]. Cutter et al. created the first Social Vulnerability Index (SoVI) at the national scale in the United States as a tool to compare the vulnerability among different counties using socioeconomic data from the 1990 U.S. census [
15]. Flanagan et al. built on the SoVI to develop a new Social Vulnerability Index (SVI) at the scale of census tracts for a project funded by the Centers for Disease Control and Prevention (CDC) in collaboration with the Agency for Toxic Substances and Disease Registry (ATSDR) [
17].
In recent decades, there has been a shift from conventional policies focused on flood hazard protection to risk mitigation plans [
11]. More comprehensive flood risk analyses are required for efficient mitigation, preparedness, and response to future floods [
9,
10]. Analyses of flood risk distribution identifying the significant clusters of exposure, vulnerability, or their combination can highlight the locations with higher potential of human losses or physical damages as the main focus of mitigation plans [
4,
21]. Although many researchers have discussed the concept of flood risk as a function of the above elements, a few recent experimental efforts, such as [
7,
8], included the spatial patterns of flood risk, covering its components. To fill this need, the goal of this study was to test this hypothesis for the study areas: Do high flood risk clusters exist in urban communities where socioeconomically disadvantaged people reside? We investigated the magnitude of the population and the developed land prone to flooding in addition to their level of vulnerability in the census tracts as communities. We employed geospatial technologies as the primary tools to spatially analyze the data. After quantifying the exposure and social vulnerability, deciding the most significant factors in each city, we studied the spatial patterns of flood risk in the study areas. We used appropriate spatial statistics techniques to identify and map the significant hot spots or clusters of high risk. Flood risk and vulnerability maps can display communities’ degree of susceptibility and risk more appropriately if combined with flood indices [
22]. An objective of this study included the production of flood indices.
Human exposure to floods has been increasing due to the hydrological alterations caused by climate and land use changes, but our knowledge and estimations about the exposed populations are still limited as most of the studies have investigated physical exposure [
8]. The Internal Displacement Monitoring Centre (IDMC) data imply the total number of people displaced due to flooding in the United States was 535,281 between 2008 and 2021 [
23]. However, a literature review indicated only a few experiments regarding flood-induced population displacement, mainly at the global scale. Therefore, an additional objective of this study was to analyze the short-term population relocation due to flooding in the study areas.
Overall, this study utilized a combination of geospatial data analytics and risk index development techniques to examine the distribution of flood exposure, social vulnerability, and risk in two distinct types of urban environments. These cities differ in size, natural and built environments, and demographics. The distinctions between these cities are detailed in the Study Areas and Scale section. This approach allowed for identifying specific areas of concern and potential flood risk reduction and mitigation strategies. The following methodology section describes the study areas and units, data collection, and data analysis processes. Next, the results of flood hazard, exposure, social vulnerability, and risk analyses are presented. The discussion section compares results between the two cities and identifies limitations and potential solutions. Finally, the conclusions section summarizes the outcomes, implications, contributions, and future research directions.
4. Discussion
The study results helped to highlight a higher proportion of the 100-year floodplains existed within Houston (23%) than in Charleston (7%) which can be related to the stream network, development pattern, land cover, topography, and other geomorphic characteristics in those urban regions. As mentioned in the study areas description, Houston developed in a flat, low-altitude region, while Charleston is located in a mountainous area. If streams spill over their banks in Houston, they can potentially inundate a larger area due to the region’s flatness. Additionally, the extent and intensity of built environment development are much higher in Houston compared to Charleston, resulting in significantly more areas covered by impervious surfaces in Houston. These factors, along with other detailed geomorphic characteristics, such as soil type, can potentially contribute to larger inundations and a higher floodplain area ratio in Houston. A larger percentage of the developed land area was located within the 100-year flood zones in Houston (13%) compared to Charleston (9%). Consequently, a larger percentage of the population was estimated to reside in the floodplains in Houston (13%) than in Charleston (9%).
We assumed a homogeneous distribution of people in each tract’s developed areas, including the low-, medium-, and high-intensity developed lands, which might be thought of as a limitation. However, it seemed reasonable for the purpose of the research concerning the size of the study areas and data availability. Additionally, we did not assign different weights to the above land cover classes to avoid human bias. However, the non-residential development was kept out of the population estimates model as the above data were combined with the population density of the census tracts. Therefore, low population density in the census tracts with the dominance of non-residential developed areas could control the above effect. Depending on the nature of the data, we created the vulnerability index investigating the demographic and socioeconomic data collected either at the individual or household levels. We preferred the 2019 American Community Survey (ACS) 5-year estimates over the 2020 Decennial Census to keep away the COVID-19 impact on the vulnerability factors. In accordance with the literature [
17], we assumed homogenous distribution of those factors within each census tract.
The PCA process removed 8 social vulnerability factors in Houston while taking off 10 in Charleston. The excluded factors were considered less important for the variability in each urban region due to collinearity with the other parameters or not being significantly different among the census tracts. Seven variables related to socioeconomic status, household composition, housing, development pattern, and essential facilities were among the selected principal components in both study areas. However, the other factors were only chosen in one city (
Table 2). For example, the percentage of Hispanics or Latinos was one of the principal components in Charleston, not Houston. The reason could be the large population of the above ethnicity distributed in numerous Houston census tracts making them less distinct in this regard. As a general limitation, correlations may still exist among the reduced social vulnerability factors. With more time and resources, future studies can address these relationships and propose solutions for improving the social vulnerability index. In addition, a future research direction could involve investigating more than two urban areas and studying the significant social vulnerability factors that are common across all of them.
While producing the indices, we assumed the same weight for all the indicators to avoid human subjectivity in the model. The decision was consistent with the previous experiments conducted by other authors, such as [
15,
17]. The final flood risk index combined the exposure and vulnerability indices, averaging their scores. Therefore, a socially vulnerable census tract with only a small area or low population in the floodplain did not get a high score on that index. The disparity between the city limits and census tracts was a limitation while defining the study areas. The tracts did not match the municipality boundaries completely. We decided to include those intersecting the boundaries where most of their developed areas were in the cities. As a result, the modified study areas did not precisely comply with the city limits. With better access to more detailed data, the aerial units can be defined at finer scales, such as census blocks, to make the study areas match the city boundaries better. We investigated the number of essential facilities located in each census tract as a vulnerability factor. However, the scope of services for some of those facilities, such as hospitals, is usually beyond the tract level, which can be a limitation of this approach. In addition, the available data for Houston were limited to the public schools. Therefore, other schools were not included in the model. Access to more comprehensive data on essential facilities and their range of services can result in more accurate vulnerability analyses.
As a general issue of spatial statistics, the study results may be affected by the Modifiable Areal Unit Problem (MAUP). This is defined as the sensitivity of geospatial analysis results to the scale and aggregation type of the data collection units. As a result, the spatial analysis outcomes of a variable depend on the selected scale and size of the study areas [
35]. Therefore, replicating the experiments of this study on a different scale than the census tracts may result in different outputs. This problem should be acknowledged and considered for extrapolating to other scales of analysis. While it is impossible to eliminate the MAUP effect, conducting comparable experiments at various scales within the same study area can assist in identifying correlations between the outcomes and allow for the quantification of bias [
35].
5. Conclusions
The research supported the defined hypothesis in the Houston study area. The average values of the selected socioeconomic parameters were significantly different, towards vulnerability, in the risk hot spot tracts compared to those in the entire urban area. Thus, with 95% confidence, we could say that there were clusters of high flood risk in Houston where socioeconomically disadvantaged people resided. However, the model could not map any significant flood risk clusters for Charleston. The reason could be the size of the city and its low number of census tracts. The goal of this study was to better understand the spatial pattern of flood risk in two differently sized urban communities and potential correlations to socioeconomic factors. Beyond race and ethnicity, we examined other factors related to social vulnerability to disasters based on the local patterns. The spatial analysis model showed more acceptable performance for mapping risk clusters in large urban areas, like Houston. A need for future studies is to design and test alternative methods for smaller study areas for the identification of significant spatial risk patterns. We applied the 100-year flood zones related to fluvial or riverine flooding published by FEMA as the hazard source. As further steps, more comprehensive hazard maps can be considered, including 500-year zones or different types of floods, such as pluvial or overland flooding. Additionally, the methodology can be combined with future climate change scenarios.
From our literature review searches, we have not been able to find a similar comprehensive study combining the risk elements of hazard, exposure, and social vulnerability in the United States’ urban areas, specifically in Houston, TX or Charleston, WV. Furthermore, no published articles about the spatial distribution of risk in the above regions could be found. Therefore, the study can be considered the first effort in this regard. Moreover, most current disaster vulnerability indices, such as SVI or SoVI, use nationally developed models that apply predefined factors to all study areas. Based on a place-based approach, this study recognized the most significant factors explaining the variations in each urban area and employed those to create the vulnerability and total risk indices for the census tracts in each city. In addition, this article can be considered to contribute toward short-term flood-induced population displacement at the scale of U.S. cities. The detailed result of the evacuation estimates in each census tract can assist city officials and community leaders in devising more appropriate pre-disaster preparation plans. Our methodology limited the population only to developed areas, which can be a more accurate assumption than the other approaches, such as FEMA’s population estimate models [
14], which presume homogenous population distribution in each entire census block.
The key parties, such as floodplain managers, urban planners, and policymakers, can use the outcomes of this study for more effective flood risk management. The index scores and other outputs can assist them with a more comprehensive understanding of the most at-risk communities. The risk hot spot areas in Houston should be the priority for flood mitigation actions along with the social justice efforts in this city. This study’s detailed exposure outcomes, including the development and population distribution in floodplains on the tract level, can provide the city, county, and region officials with data to allocate resources, regulations, and plans more efficiently for risk mitigation and resiliency in areas with higher exposure. In addition, more specific results from the study can highlight the areas for urgent mitigation action. For instance, the social vulnerability analysis showed an at-risk census tract in Houston with 99% of the population living in group quarters. The reason was the Harris County Jail located in the floodplain of that tract.
The findings of this study highlight important insights into the flood risk and potential impacts in the studied areas. Despite the discussed limitations, the geospatial analyses combined with the index development techniques enabled a comprehensive location-based assessment of flood exposure, vulnerability, and risk. The approach allowed for identifying significant areas of concern in a large urban area to assist in developing strategies to reduce flood risk and implement effective mitigation measures. Communicating the results of such studies effectively is crucial to ensure the information is understood and utilized by various stakeholders [
36]. Clear and concise visual representations, including maps and tables, were used to help with the above purpose. Future efforts to convey results could include public displays on an online interactive tool to be easily accessed by the above groups and the public who want to know more about the risk status at their places of living. Such an online tool can be designed and developed using the output data of this study.