Flood Risk Mapping Using GIS and Multi-Criteria Analysis: A Greater Toronto Area Case Study

Given the increase in flood events in recent years, accurate flood risk assessment is an important component of flood mitigation in urban areas. This research aims to develop updated and accurate flood risk maps in the Don River Watershed within the Great Toronto Area (GTA). The risk maps use geographical information systems (GIS) and multi-criteria analysis along with the application of Analytical Hierarchy Process methods to define and quantify the optimal selection of weights for the criteria that contribute to flood risk. The flood hazard maps were generated for four scenarios, each with different criteria (S1, S2, S3, and S4). The base case scenario (S1) is the most accurate, since it takes into account the floodplain map developed by the Toronto and Region Conservation Authority. It also considers distance to streams (DS), height above nearest drainage (HAND), slope (S), and the Curve Number (CN). S2 only considers DS, HAND, and CN, whereas S3 considers effective precipitation (EP), DS, HAND, and S. Lastly, S4 considers total precipitation (TP), DS, HAND, S, and CN. In addition to the flood hazard, the social and economic vulnerability was included to determine the total flood vulnerability in the watershed under three scenarios; the first one giving a higher importance to the social vulnerability, the second one giving equal importance to both social and economic vulnerability, and the third one giving more importance to the economic vulnerability. The results for each of the four flood scenarios show that the flood risk generated for S2 is the most similar to the base case (S1), followed by S3 and S4. The inclusion of social and economic vulnerability highlights the impacts of floods that are typically ignored in practice. It will allow watershed managers to make more informed decisions for flood mitigation and protection. The most important outcome of this research is that by only using the digital elevation model, the census data, the streams, land use, and soil type layers, it is possible to obtain a reliable flood risk map (S2) using a simplified method as compared to more complex flood risk methods that use hydraulic and hydrological models to generate flood hazard maps (as was the case for S1).


Introduction
River floods represent the most frequent and expensive natural disaster affecting most of the countries around the world [1]. They take place when the river exceeds its storage capacity and the water excess overflows the banks and fills the adjacent low-lying lands, producing significant social, economic, and environmental impacts [2], including: loss of human life and negative effects on population, damage to the infrastructure and essential services, damage to crops and animals, spread of diseases, and contamination of the water supply.

GIS-Based Multi-Criteria Approach
The combination of multi-criteria decision making (MCDM) with geographical information systems (GIS) proposed in the present research allows integrating the three components of risk assessment (hazard, exposure, and vulnerability), in which the social, economic, and/or environmental vulnerabilities can be considered. The outcome of this methodology is a flood risk map showing the spatial distribution of flood risk along with its intensity level, ranging from very high to very low risk. In MCDM, there are different methods for assessing criterion weights: entropy, ranking, rating, trade-off analysis, and pairwise comparison, among others [9]. The Analytic Hierarchy Process (AHP) proposed by Saaty [10] is one of the most common MCDM methods, and it has been widely applied to solve decision-making problems related to water resources [11]. The purpose of AHP is providing decision makers with the decision, among different alternatives and criteria, that best suits their goal [11,12]. This method compares two criteria at a time through a pairwise comparison matrix, in which values of relative importance from one criterion over another criterion are assigned. It provides a measure of a level of consistent judgment supported by a theoretical background. The scale of relative importance ranges between one and nine, where one is equal importance, and nine is extreme importance.
This GIS-based multi-criteria approach has been increasingly used for flood risk assessment, as it presents several advantages. Some more complex alternative methods involve hydraulic and hydrological models for flood hazard assessment, and require information of the monetary value of the assets exposed to evaluate the vulnerability [13]. In contrast, the proposed approach only requires the spatial layers of the parameters (examples of which are listed below) that contribute to the flood hazard, and land-use and demographic information to evaluate the vulnerability. Therefore, the main advantage of this methodology is the possibility of obtaining a reliable flood risk map with a relatively low monetary and time investment to support stakeholders to evaluate only the areas that need further detailed assessment. Additionally, it is easy to update, uses easy-to-get open source input data, and is flexible in terms of which criteria are included.
has suffered four severe flood events during the last 100 years, each of which has resulted in high economic and social impacts. Hurricane Hazel in 1945 left 81 people dead and also resulted in 1.2 billion dollars (in 2016 dollars) in damages [14]. The latest flood event, in 2013, was caused by a heavy storm in which the intensity of precipitation was 90 mm in 2 h, and some gauges reported up to 126 mm in 90 min (compared to an average of 70 mm per month [15]). There are many factors that make the GTA susceptible to floods. The impervious cover in the study area is estimated in 73% (based on our analyses of the land use in the region). Additionally, large portions of the six tributaries of the lower Don River have been filled or artificially channelized, and significant wetlands have been filled or drained [16]. Hence, this has resulted in conditions that contribute to an increase in flood susceptibility. Another factor is that the river valley is a non-confining valley; it is about 400 m wide, while the river itself is only 15 m wide [17]. The topography of the GTA also plays an important role, since it is relatively smooth. Flat slopes lead to a decrease in the surface runoff velocity, which results in a longer period of time for the runoff to drain. Only limited studies of flood risk assessment (which include hazard, social, and economic vulnerability) in the GTA have been developed. A previous study [18] (which serves as the basis of the present research) generated flood risk index maps using GIS through the spatial combination of the flood hazard, social, and economic vulnerability layers. The criteria used to generate the flood hazard map were slope, land depression, the floodplain map (provided by the Toronto and Region Conservation Authority, TRCA), and land cover. For the economic vulnerability, a spatial layer that identified large buildings was used. Finally, the social vulnerability was determined by identifying the vulnerable groups from the 2006 census data from Statistics Canada. The limitation of this study was the direct assignment of criteria weights, instead of using a weighting method in the context of MCDM. Assigning criteria weights in this way does not allow obtaining a suitable assessment, as there is no theoretical background that supports a consistent assessment. Another study by Abdalla [19] used a web-based three-dimensional (3D) GIS approach for flood risk assessment, in which the Only limited studies of flood risk assessment (which include hazard, social, and economic vulnerability) in the GTA have been developed. A previous study [18] (which serves as the basis of the present research) generated flood risk index maps using GIS through the spatial combination of the flood hazard, social, and economic vulnerability layers. The criteria used to generate the flood hazard map were slope, land depression, the floodplain map (provided by the Toronto and Region Conservation Authority, TRCA), and land cover. For the economic vulnerability, a spatial layer that identified large buildings was used. Finally, the social vulnerability was determined by identifying the vulnerable groups from the 2006 census data from Statistics Canada. The limitation of this study was the direct assignment of criteria weights, instead of using a weighting method in the context of MCDM. Assigning criteria weights in this way does not allow obtaining a suitable assessment, as there is no theoretical background that supports a consistent assessment. Another study by Abdalla [19] used a web-based three-dimensional (3D) GIS approach for flood risk assessment, in which the flood hazard was based on hydraulic models (using the software HEC-RAS) under hypothetical flow rates simulated using the Canadian Hydrographic Service. The footprint of buildings within the flood hazard area was used for analyzing the exposure. Although this approach provides a 3D view of the flood extent, it does not include the economic and social vulnerability. In this sense, this study is a flood hazard assessment instead of a flood risk assessment. Thus, there is a need to develop flood risk maps of the GTA to account for the social and economic vulnerability, with a consistent method to assign weights to the different criteria used.

Flood Risk Assessment Using GIS and MCDM
Flood risk assessment using the GIS-based multi-criteria methodology is a relatively new approach, and only a few studies have implemented it. A pilot study for the River Mulde in Saxony, Germany [8], was developed using the multi-attribute utility theory (MAUT) and a disjunctive approach was used to assign weights for four different criteria (economic risk, population, social hot spots, and environmental risk). The flood hazard was obtained through a quasi two-dimensional (2D) hydrodynamic model using HEC-RAS. For the economic risk, the land use and estimates of the value of assets were used along with depth-damage curves (using damage functions). The social risk was based on the population density map. Finally, the social hot spots used the surveyed locations of old people and children, schools, and hospitals [8]. Similar to AHP, MAUT is a very popular method in MCDM analysis. Nevertheless, MAUT is extremely data-intensive, as it requires a large amount of input data, and the weights are assumed by the decision maker instead of being calculated (as is the case in the proposed AHP process) [20].
A study from the Iskandar region in Malaysia [21] used a rating method for assessing criterion weights. The criteria selected for generating the flood susceptibility map included: distance from the main stream and river, elevation, slope, land use, cover type, distance to discharge channel, and population density. One of the limitations of this study is the use of the rating method, as it lacks the theoretical foundation, and thus is difficult to justify the meaning of the weights assigned to the criteria. A study for the Red River Delta, Vietnam [22] was more sophisticated in terms of the weighting method used and the criteria selected. Similar to in the present paper, the AHP was used as the weighting method. The criteria selected for evaluating the flood hazard, economic, social, and environmental vulnerability are more suitable than the previously listed studies; they are also more difficult to obtain. The flood hazard was based on flood depth, flood duration, and flood velocity. The economic vulnerability used the area of residential buildings, special-use buildings, public infrastructure, and agricultural area. As for the social vulnerability, the population density, risk perception, spiritual values, and income levels were selected. Lastly, the environmental vulnerability was based on pollution (from industries, waste matter, and stagnations of floodwaters), erosion, and open spaces. Hence, when assessing flood risk for large-scale areas or in regions that lack detailed information (as is often the case), these criteria are not possible to use or require more resources, which makes this methodology difficult to implement [22].
As mentioned above, some of the limitations of previous studies that use the GIS-based multi-criteria flood risk approach include criteria weighting methods that are not appropriate, or in some cases are not used at all. Moreover, the use of more complex criteria (which may be difficult to access or to generate), or conversely the use of very simple criteria can lead to unreliable flood risk maps. In addition, some studies do not consider the social component, which is very important for implementing mitigation measures that are aiming to reduce the number of people affected after a flood event.
In the present paper, several improvements of the previous study for the GTA [18] are proposed. First, we implement MCDM as part of the methodology for determining the criteria weights. Second, we propose the use of more appropriate flood hazard parameters, including: distance to streams (DS), height above the nearest drainage (HAND), Curve Number (CN) (which is a widely used empirical parameter developed by the United States Department of Agriculture-Soil Conservation Service (SCS), and is commonly used in hydrology for predicting direct runoff [23]), total precipitation (TP), and effective precipitation (EP). For economic vulnerability, we propose using land-use data instead of large buildings. Lastly, we use the most recent census data (2016) instead of using the previous census (2006) data for determining the social vulnerability. We then use the proposed methods to develop four different flood risk scenarios to demonstrate the utility of the proposed research.
Hence, the aim of the present study is to provide a reliable but simplified method for flood risk mapping based on GIS spatial analysis integrated with AHP, which assesses both the economic and social vulnerability. Furthermore, the current study aims to analyze the criteria that should be considered for each component (using pairwise comparison) in order to generate reliable flood risk maps using inputs that are just suitable and easy to access. The results of this research will provide enhanced flood risk maps for the GTA, compared with the previous studies, which do not include social or economic vulnerability, nor a methodology for calculating criteria weights. Additionally, this research introduces more suitable criteria for flood risk analysis. Note that while the method is demonstrated for a watershed in the GTA, the approach can be easily applied to other watersheds.

Study Area
The study area for this research was selected as the shared area between the GTA and the Don River watershed, which corresponds to an area of 202.6 km 2 ( Figure 1). Precipitation data from four rain gauges (labeled HY008, HY016, HY021, and HY027) in Toronto were provided by the TRCA. The locations of the rain gauges are shown in Figure 1. The data showed that between 2009-2016, the months with the highest precipitation are typically from May to October, with an average rainfall of 82 mm/month, while the average rainfall from November to April is 29 mm/month (as illustrated in Figure 2). The Don River is one of the main watercourses that drains the GTA, running from Oak Ridges Moraine into Lake Ontario with a total length of about 100 km. It is composed of three main branches (East Don, West Don, and lower Don) and other two tributaries (Taylor/Massey Creek and German Mills Creek). Hence, the aim of the present study is to provide a reliable but simplified method for flood risk mapping based on GIS spatial analysis integrated with AHP, which assesses both the economic and social vulnerability. Furthermore, the current study aims to analyze the criteria that should be considered for each component (using pairwise comparison) in order to generate reliable flood risk maps using inputs that are just suitable and easy to access. The results of this research will provide enhanced flood risk maps for the GTA, compared with the previous studies, which do not include social or economic vulnerability, nor a methodology for calculating criteria weights. Additionally, this research introduces more suitable criteria for flood risk analysis. Note that while the method is demonstrated for a watershed in the GTA, the approach can be easily applied to other watersheds.

Study Area
The study area for this research was selected as the shared area between the GTA and the Don River watershed, which corresponds to an area of 202.6 km 2 ( Figure 1). Precipitation data from four rain gauges (labeled HY008, HY016, HY021, and HY027) in Toronto were provided by the TRCA. The locations of the rain gauges are shown in Figure 1. The data showed that between 2009-2016, the months with the highest precipitation are typically from May to October, with an average rainfall of 82 mm/month, while the average rainfall from November to April is 29 mm/month (as illustrated in Figure 2). The Don River is one of the main watercourses that drains the GTA, running from Oak Ridges Moraine into Lake Ontario with a total length of about 100 km. It is composed of three main branches (East Don, West Don, and lower Don) and other two tributaries (Taylor/Massey Creek and German Mills Creek). The GTA, with a population of about 6.4 million people in 2016, is the most densely populated metropolitan area in Canada. Furthermore, Toronto has experienced one of the fastest periods of growth in the last decades compared to other large and high-income metropolitan areas in the world. Since 1931, the population has increased 700% [24]. This rapid and sustained growth has made this region and watershed highly urbanized, with more than 75% of the area developed. Over the last 100  The GTA, with a population of about 6.4 million people in 2016, is the most densely populated metropolitan area in Canada. Furthermore, Toronto has experienced one of the fastest periods of growth in the last decades compared to other large and high-income metropolitan areas in the world. Since 1931, the population has increased 700% [24]. This rapid and sustained growth has made this region and watershed highly urbanized, with more than 75% of the area developed. Over the last 100 years, four significant flood events have occurred, and have significantly affected the area with fatalities, considerable economical losses, and infrastructural damage. Table 1 summarizes some statistical information of these previous events, including the dates and intensity of the precipitation.

Overview of the Proposed Modeling Methodology
The applied methodology in this paper is a GIS-based multi-criteria approach for flood risk assessment based on the spatial intersection of three components: flood hazard, social vulnerability, and economic vulnerability. The AHP was selected as the criteria weighting method in the context of MCDM. The criteria used for determining the flood hazard are based on the parameters that most contribute to river floods (as identified in Section 1.3). The social vulnerability is based on the vulnerable groups from the 2016 Canadian census. Finally, the economic vulnerability is based on the land-use spatial layer.
The software used in this research is ArcMap (version 10.5.1, 2017), which is a popular GIS software that allows users to create, manipulate, and analyze geospatial data. The input data required for creating all of the maps were: the digital elevation model (DEM) map with a ground resolution of 5 m × 5 m (collected from [25]); soil type map [25]; the Don River watershed boundary map [26]; the GTA boundary map [27], land-use map [27], and streams map [27]; precipitation data from rainfall gauges provided by the TRCA; the floodplain map (provided by the TRCA); and the 2016 census data from Statistics Canada [28], along with the census tract boundaries.
Each of the criteria maps was transformed to the corresponding projected coordinate system (NAD_1983_UTM_Zone_17N) and converted to raster format. In addition, all of the maps were reclassified in a scale from 1 to 5, using the Reclassify Tool in ArcMap, where 1 refers to very low (e.g., low flood risk), and 5 is a very high level of flood risk. After reclassifying the maps, the Weighted Overlay Tool of ArcMap was used to carry out the spatial overlay of the maps. To illustrate how the Weighted Overlay Tool works, an example is shown in Figure 3 below. fatalities, considerable economical losses, and infrastructural damage. Table 1 summarizes some statistical information of these previous events, including the dates and intensity of the precipitation.

Overview of the Proposed Modeling Methodology
The applied methodology in this paper is a GIS-based multi-criteria approach for flood risk assessment based on the spatial intersection of three components: flood hazard, social vulnerability, and economic vulnerability. The AHP was selected as the criteria weighting method in the context of MCDM. The criteria used for determining the flood hazard are based on the parameters that most contribute to river floods (as identified in Section 1.3). The social vulnerability is based on the vulnerable groups from the 2016 Canadian census. Finally, the economic vulnerability is based on the land-use spatial layer.
The software used in this research is ArcMap (version 10.5.1, 2017), which is a popular GIS software that allows users to create, manipulate, and analyze geospatial data. The input data required for creating all of the maps were: the digital elevation model (DEM) map with a ground resolution of 5 m × 5 m (collected from [25]); soil type map [25]; the Don River watershed boundary map [26]; the GTA boundary map [27], land-use map [27], and streams map [27]; precipitation data from rainfall gauges provided by the TRCA; the floodplain map (provided by the TRCA); and the 2016 census data from Statistics Canada [28], along with the census tract boundaries.
Each of the criteria maps was transformed to the corresponding projected coordinate system (NAD_1983_UTM_Zone_17N) and converted to raster format. In addition, all of the maps were reclassified in a scale from 1 to 5, using the Reclassify Tool in ArcMap, where 1 refers to very low (e.g., low flood risk), and 5 is a very high level of flood risk. After reclassifying the maps, the Weighted Overlay Tool of ArcMap was used to carry out the spatial overlay of the maps. To illustrate how the Weighted Overlay Tool works, an example is shown in Figure 3 below. After obtaining the weights, the cell of each input raster are multiplied by its corresponding weight, and then they are summed to obtain the output raster. For example, looking at the upper left cell, the input values are (i) (2 × 0.6) = 1.2 and (ii) (1 × 0.4) = 0.4. The output value then is 1.2 + 0.4 = 1.6. The final value is rounded to 2, since the weighted overlay output is an integer. After obtaining the weights, the cell of each input raster are multiplied by its corresponding weight, and then they are summed to obtain the output raster. For example, looking at the upper left cell, the input values are (i) (2 × 0.6) = 1.2 and (ii) (1 × 0.4) = 0.4. The output value then is 1.2 + 0.4 = 1.6. The final value is rounded to 2, since the weighted overlay output is an integer.
In order to analyze the influence of the criteria that contribute to flood susceptibility, four flood hazard scenarios were generated with different criteria (these scenarios are labelled S1, S2, S3, and S4, the details of which are provided below). Additionally, three total vulnerability scenarios were created that varied in the importance of its two components, the social and economic vulnerability (labelled V1, V2, and V3, and summarized below). The process of generating these hazard and vulnerability maps and final risk maps are detailed in the following sections.

Flood Hazard Spatial Layer
For developing the flood hazard layer, there is no consensus on the criteria that should be included [29]. Nevertheless, some of the criteria that contribute to flood susceptibility that are used in several studies [30] are: slope, distance from rivers, land use, and altitude. In the current research, slope and distance from rivers were included. However, land use was replaced by the USDA (United States Department of Agriculture) runoff Curve Number [23], while the altitude was replaced with the height above the nearest drainage point, as they have a higher influence in determining floods. Very few studies have included precipitation for mapping flood susceptibility under the GIS-based multi-criteria analysis [31,32]. Therefore, in this study, the total precipitation and effective precipitation (defined below) was also included in order to analyze the influence of these criteria. The floodplain map, provided by the TRCA, was also included in this research. This floodplain map was generated using extensive hydrologic and hydraulic models. The current study includes new criteria, and it also evaluates different flood scenarios by varying the criteria used to improve flood maps compared to previous studies, and the floodplain map generated by the TRCA. The criteria used are defined in detail below.

Slope (S)
The slope affects the velocity in which the water is conveyed through the drainage channel and the watershed. Additionally, the steeper the slopes, the higher the runoff will be, and consequently, higher peak discharges will be generated. The slope layer was derived from the DEM, with a resolution of 5 m × 5 m, using the Slope tool in ArcMap. It was reclassified in a scale from 1 to 5, where a value of 5 was given to lower slopes and 1 was given to higher values (details are shown in Table 2). The height above the nearest drainage has influence in flood susceptibility, since low-lying land adjacent to streams is more susceptible to be flooded than higher land. The map was generated with a previously developed tool by Dilts [33], which requires the DEM and the Streams layer as inputs. When reclassifying, lower values of HAND were assigned to a higher class, and higher values were assigned to a lower class, as shown in Table 2. The measure of distance to streams plays an important role in defining areas susceptible to flooding. The zones closest to rivers are the most affected by floods. To obtain these distances, the Euclidean Distance tool was used in the Spatial Analyst Tools in ArcGIS. The reclassification was based on assigning a value of 5 to areas farthest from streams, and a value of 1 to areas near streams (as shown in Table 2).

Curve Number (CN)
The CN is an empirical parameter by the Soil Conservation Service (SCS) used in hydrology for predicting direct runoff or infiltration from rainfall excess. It considers the characteristics of land use and soil type. The values of CN ranged from 100 for impermeable surfaces to 30 for very permeable soils with low runoff potential. The CN map was obtained using both the land-use type data and the soil type. CN values are published by the SCS for different land uses and soil types [23]. As a high CN means high runoff and low infiltration and a low CN means low runoff and high infiltration, high values of CN where assigned to a scale of 5, and low values were assigned a CN value of 1 (details are provided in Table 2).

Total Precipitation (TP)
The precipitation is the major cause of river floods. Heavy rainfall can lead to flooding when the streams can no longer convey excess water. As runoff is related to the amount of precipitation, higher precipitations increases the amount of runoff. Therefore, higher precipitation leads to higher flood susceptibility. Daily precipitation data from four rain gauge stations in the Don River watershed, from 2009 to 2016, were used to create the precipitation-isohyet map generated by interpolating the monthly average precipitation data, for the wet season (May to October), using the inverse distance weighted (IDW) interpolation method. The TP map was reclassified on a scale from 1 (for low values of TP) to 5 (for high values of TP), as shown in Table 2.

Effective Precipitation (EP)
Effective precipitation represents the precipitation after losses (such as infiltration and evapotranspiration). It is calculated using the total precipitation and the CN at a given point using Equation (1) where EP is the effective precipitation in mm, TP is the precipitation of the storm in mm, and CN is the Curve Number. Table 2 shows that high values of EP were assigned to a scale of 5, and low values were assigned to a scale of 1.

Floodplain
The floodplain is the area adjacent to a stream that is prone to flooding during high discharge periods. This is the most important criteria for determining the flood hazard map. The floodplain for the Don River watershed was generated and provided by the TRCA using hydrological and hydraulic models. As this map only contains the area (boundary) of the floodplain, the reclassification consisted of assigning a value of 5 to all of the cells that are within the boundary, and a value of 2 (following the selection in [18]) to those that are outside the boundary.  Table 2 presents the classes for each criteria, showing the ranges for each class. These classes were based on values from the literature (some examples are listed in Section 1.3) and professional expertise. These classes may be adjusted, as required, for different case studies or by decision makers. The values presented here are used to demonstrate the applicability of the proposed approach, but can be varied as required. Figure 4 presents the reclassified maps and highlights the levels from very low to very high for each criteria. different combinations of the criteria (S1, S2, S3, and S4). Figure 5 shows the criteria selected for each scenario. S1 is selected as the base case, since it includes the base criteria (DS, HAND, S, CN) and the floodplain map, which is the most influential criteria in the flood hazard map, as it is based on hydrological and hydraulic models. S2 contains the same base criteria as S1, but not the floodplain map. S3 contains S, HAND, DS, and EP, but does not include the CN, as it is implicit in the calculation of effective precipitation. S4 contains the base criteria as well as TP. As mentioned above, these criteria are some of the most significant parameters to affect flood susceptibility. Additionally, TP and EP were included to determine whether or not the precipitation (which is the driver for flood hazard) is needed to generate food risk maps under the current methodology. The objective of developing scenarios S2, S3, and S4 is to determine if they can general flood risk maps that are comparable to the S1 map, since S1 used the detailed floodplain map from the TRCA. If these maps are indeed comparable, then it demonstrates that the proposed approach can create useful tools, with relatively low data requirements, as compared to S1.

Social Vulnerability Spatial Layer
The social exposure is based on the number of vulnerable groups in the area.  In order to determine which criteria are the most suitable, four scenarios were created with different combinations of the criteria (S1, S2, S3, and S4). Figure 5 shows the criteria selected for each scenario. S1 is selected as the base case, since it includes the base criteria (DS, HAND, S, CN) and the floodplain map, which is the most influential criteria in the flood hazard map, as it is based on hydrological and hydraulic models. S2 contains the same base criteria as S1, but not the floodplain map. S3 contains S, HAND, DS, and EP, but does not include the CN, as it is implicit in the calculation of effective precipitation. S4 contains the base criteria as well as TP. As mentioned above, these criteria are some of the most significant parameters to affect flood susceptibility. Additionally, TP and EP were included to determine whether or not the precipitation (which is the driver for flood hazard) is needed to generate food risk maps under the current methodology. The objective of developing scenarios S2, S3, and S4 is to determine if they can general flood risk maps that are comparable to the S1 map, since S1 used the detailed floodplain map from the TRCA. If these maps are indeed comparable, then it demonstrates that the proposed approach can create useful tools, with relatively low data requirements, as compared to S1. (g)

Social Vulnerability Spatial Layer
The social exposure is based on the number of vulnerable groups in the area. The vulnerable groups were determined by age, family structure, language proficiency, income, education level, land tenure (i.e., renters), and population density (the corresponding indicators are listed in Table 3). This information was obtained from the 2016 census tracts data [28]. These criteria for determining social vulnerability were selected based on a previous study [35], which reviewed 142 articles related to social vulnerability to flooding disasters. The study found that the category with the highest frequency of appearance was demographic characteristics (58%), followed by socioeconomic status (55%), health (45%), coping capacity (39%), risk perception (36%), neighborhood characteristics (30%), and land tenure (30%). Details of why the criteria were selected are described in the following sections.

Social Vulnerability Spatial Layer
The social exposure is based on the number of vulnerable groups in the area. The vulnerable groups were determined by age, family structure, language proficiency, income, education level, land tenure (i.e., renters), and population density (the corresponding indicators are listed in Table 3). This information was obtained from the 2016 census tracts data [28]. These criteria for determining social vulnerability were selected based on a previous study [35], which reviewed 142 articles related to social vulnerability to flooding disasters. The study found that the category with the highest frequency of appearance was demographic characteristics (58%), followed by socioeconomic status (55%), health (45%), coping capacity (39%), risk perception (36%), neighborhood characteristics (30%), and land tenure (30%). Details of why the criteria were selected are described in the following sections. Among all the demographic drivers of social vulnerability, age (very young and elderly people) has the highest number of citations within the literature review [35]. Their vulnerability is associated with their physical condition and dependency; therefore, their mobility is limited. The elderly are more affected due to their health condition, coping difficulties, and inability or reluctance to leave their place [36].
Family structure (such as lone parent families) is a useful measure of social vulnerability, since, e.g., families with lone parents have only one income, they have to deal with the financial costs by themselves. They also have to support their children, meet their basic needs, and deal with the stress without the support of a partner [37]. Lastly, language proficiency is a measure of social vulnerability that affects people in all the stages of the disaster, as they cannot be well-informed about the warnings, evacuations, and resources for recovery [36].

Socioeconomic Status: Income and Education
People with low income have less access to resources, suffer from more stress, and have less coping capacity [38]. They also suffer from more health impacts than people with higher incomes. Finding a temporary accommodation is more difficult and is often not possible [39]. Lower education is associated with poverty, unemployment, and overpopulation [35]. Therefore, this group is vulnerable to flooding due to their low coping capacity after a flood event.

Land Tenure and Neighborhood Characteristics
Renters (a type of land tenure) often suffer from more stress and health problems than homeowners [40]. Additionally, compared to owners, renters are less aware of flood risks, have a lower understanding of warnings, take less action to reduce damage, have less control over protective measures and repairs, and have higher inundation levels [41]. The population density of a neighborhood is often linked with lower income housing placed in floodplain areas [42]. Furthermore, overcrowded neighborhoods are more likely to have poor drainage systems and infrastructure [43]. People from neighborhoods with these characteristics face difficulties in evacuating quickly and have a higher risk of disease transmission [44].

Summary of Social Vulnerability
To sum up, the social vulnerability included was based on eight criteria identified in Table 3. The data obtained by Statistics Canada was the boundary of the census tract, in a shapefile format (polygons), and the census data was a dBase (dBF) file. After joining the shapefile with the dBase file, the resulting map was transformed into a raster file, which then was divided into individual rasters for each criterion (thus, eight rasters in total). Each raster was then reclassified on a scale from 1 to 5, assigning higher levels of vulnerability to a higher percentage of the population belonging to each criterion. The levels used for all of the social vulnerability criteria are shown in Table 4. Very low ≤20

Economic Vulnerability Spatial Layer
The physical exposure refers to man-made infrastructure elements that are exposed to damages due to flooding [18]. Generally, these physical damages are measured in terms of economic losses. For this, the required information includes: property value, structural type, and height of the buildings. As an approach to estimate the vulnerability of properties in terms of economic losses, the land-use layer was used. A previous study [45] developed for flood risk assessment in Canada, at a national scale, also used land use as an indicator of property value.
The land-use map was reclassified into a scale from 1 to 5 according to their economic value. In this study, the Resource and Industrial land-use type was established as the one with the highest economic value, while parks, recreational, and open areas were assigned the lowest. The classification system used for economic vulnerability is shown in Table 5. The total vulnerability was generated for three scenarios: V1 where the social vulnerability was given a higher importance than economic vulnerability (67% versus 33%), V2, which represented the opposite case (33% versus 67%), and V3, in which both were equally important (50% for both of them).

Determining Criteria Weights
AHP, developed by Saaty in 1990 [10], is one of the several criteria weighting methods of MCDM. This method allows comparing two criteria at a time through a pairwise comparison matrix in which the values of relative importance from one criterion over another criterion are assigned. It provides a measure of a level of consistent judgment supported by a theoretical background. The standardized scale of relative importance ranges between 1 and 9, where 1 is equal importance, and 9 is extreme importance (the details are provided in Table 6). AHP has been used worldwide in a wide range of fields such as industry, education, transportation, and healthcare, among others, especially due to its application in group decision making [11]. Table 6. Saaty's scale for weight assignment [10]. This method has been increasingly applied in flood management. In a case study in Eldoret Municipality (Kenya), AHP was used to determine the flood vulnerable areas, in which through a validation process, it was found that the AHP-GIS-based results were reliable with a 92% of accuracy when compared with the simulated results (HEC-RAS simulation) [46]. Similarly, in a case study in the Yasooj River (Iran) [23], the authors used AHP to identify the potential flood hazard zones; the method was validated against an inundation simulation (conducted using HEC-RAS), and it showed the reliability of the flood hazard map. A study conducted in Terengganu (Malaysia) used AHP to generate a map of flood susceptibility areas, and found a significant similarity when compared with an original flood map obtained from the Department of Irrigation and Drainage [47].

Intensity of Importance Definition
Besides the AHP, there are other methods for determining criteria weights such as the Fuzzy Multi-Criteria Decision Making Process (FDM), multi-attribute utility theory (MAUT), SMARTER, rank order, weighted summation, Electré, Regime, and Evamix, amongst others [48]. The random forest, which is an ensemble learning method of n tree-based classifiers, is an alternative method that was used in a study case in Germany to identify variables that influence damages and model flood damages [49]. The AHP method presents many advantages: it is appropriate for group decision making; it works with both quantitative and qualitative input data; the stakeholders can answer the pairwise question easily; it allows a hierarchical configuration of the criteria; it enables evaluating the consistency of preference; and it provides a measure of a level of consistent judgment supported by a theoretical background.
The first step for determining the criteria weights under the AHP method is to create a pairwise comparison matrix for the criteria selected, in which the values of relative importance from one criterion over another criterion are assigned on a scale from 1 to 9. The pairwise comparison matrix for flood hazard, social and economic vulnerability was based on the literature review and self-criteria from professional experience. The complete detailed pairwise matrices are included in Appendix A for reference.
In order to provide a better understanding of the methodology, the calculation procedure of criteria weights for scenario S2 is presented here as an example. The first step in calculating the criteria weights is to establish the order preference and assign the intensity of importance for the selected pair of criteria based on the scale presented in Table 6. Table 7 summarizes this for scenario S2; here, DS and HAND are given an important score of 2, which translates to equal to moderate importance using the scale in Table 6. The next step is to build the pairwise comparison matrix where in each cell, every criterion is assigned an intensity of importance over another. Each criterion are compared with the other criteria (see Table 8). The next step consists of a linear normalization of the pairwise comparison matrix: all of the values from each column are summed, and then each element of the matrix is divided by its total. Finally, the weights are obtained by computing the average of each row of the normalized values of the matrix (Table 8). Since the AHP may have inconsistencies in establishing the values for the pairwise comparison matrix, it is important to calculate this level of inconsistency using the consistency index (CI), which should be less than 0.1. To calculate the CI, the first step is to calculate the weighted sum vector by multiplying the weight of the criteria with the pairwise comparison values (shown in Appendix A for all four scenarios). Then, the consistency vector is calculated as the ratio between the weighted sum vector and the weights and finally lambda (λ), which is the average of the consistency vector, is also calculated. Table 9 shows a sample calculation for the values. The value of λ and the number of criteria (n) are used in Equation (2) to calculate the CI [10]: The average of the consistency vector (λ) is simply the arithmetic mean of the consistency vector from Table 9, and a sample calculation is included below. Then, Equation (2) is used where n = 4 (since there are four criteria) to determine CI, which in this example indicates a high level of consistency, since it is less than 0.

Flood Risk Spatial Layer
The flood risk map is generated by a spatial layer overlay operation between the flood hazard map and the total vulnerability (social vulnerability and economic vulnerability), i.e., the consequences. The flood risk map (R FSV ) based on flood hazard (H F ) and social vulnerability (V S ) is derived from Equation (3): The flood risk map (R FEV ) based on flood hazard (H F ) and economic vulnerability (V E ) is derived from Equation (4): The total vulnerability map (V T ) is derived from Equation (5): where W S is the weight of the social vulnerability, and W E is the weight of the economic vulnerability. Similarly, the flood risk map (R FTV ) based on flood hazard and total vulnerability is derived from Equation (6).

Comparison of Scenarios
In order to determine the scenario that best suits the base case, a comparison was made between each scenario and the base case using Equation (7).
where RS 1 is the risk map under the scenario S1, and RS n is the risk map for each scenario (being n = 2, 3, and 4), and RC is the risk change between S1 and the other scenarios. The results from Equation (7) can take values from −4 to +4, where negative values indicate a higher scale of RS n , meaning an overestimation of the risk compared to the base case (S1), and positive values indicate a higher scale of RS 1 , and thus an underestimation of the risk. Hence, values of RC that are equal or close to zero indicate a best approximation to the base case. The risk change maps were generated for each vulnerability scenario (V1, V2, and V3). Additionally, from the risk change maps, the percentage area of each RC value were obtained. Table 10 shows the weights of the criteria selected for each flood hazard scenario and their corresponding value of CI. These weights were used to generate the flood hazard maps for each scenario (Figure 6). The flood hazard for S1 (the base case) shows a higher susceptibility to floods in the surroundings of the streams (the closer to the stream, the higher the susceptibility). The flood hazard for S2 also presents areas of high susceptibility in the surroundings of the streams, but with a higher buffer distance from the streams, i.e., a larger area in yellow surrounding the streams. S3 is dominated with areas of medium susceptibility (yellow), some areas close to the river with high susceptibility, and just a few regions with very high susceptibility (red). Zones of higher effective precipitation are reflected in areas with higher susceptibility. S4 shows high and very high susceptibility in a much wider buffer distance from streams, compared to the other scenarios, and medium and low susceptibility are clearly marked by the level of precipitation. Generally speaking, these maps show that the map generated in S2 is closest to S1 (the base case map that uses detailed hydrologic and hydraulic models). This means that the simple criteria used for S2 (DS, HAND, S, and CN) can adequately reproduce the flood hazard maps generated in S1.

Generation of Vulnerability Maps
The weights of social vulnerability (shown in Table 11) were assigned to the corresponding criterion to generate the social vulnerability map (Figure 7a). As the economic vulnerability is based on only one criterion (land use), no pairwise comparison was needed to calculate the weights. Instead, the classes established in Table 5 were used directly (with the results presented in Figure 7b). The weights of total vulnerability for each scenario are shown in Table 12. Finally, the total vulnerability maps (shown in Figure 8) for each scenario, V1, V2, and V3, were derived from the combination of social vulnerability and economic vulnerability, by applying the weights determined for each scenario (Equation (5)). These figures show that the economic vulnerability has a higher impact on the total vulnerability: there are relatively more high total vulnerability zones when the economic vulnerability is given higher priority as compared to when social vulnerability is given higher priority.

Generation of Vulnerability Maps
The weights of social vulnerability (shown in Table 11) were assigned to the corresponding criterion to generate the social vulnerability map (Figure 7a). As the economic vulnerability is based on only one criterion (land use), no pairwise comparison was needed to calculate the weights. Instead, the classes established in Table 5 were used directly (with the results presented in Figure 7b). The weights of total vulnerability for each scenario are shown in Table 12. Finally, the total vulnerability maps (shown in Figure 8) for each scenario, V1, V2, and V3, were derived from the combination of social vulnerability and economic vulnerability, by applying the weights determined for each scenario (Equation (5)). These figures show that the economic vulnerability has a higher impact on the total vulnerability: there are relatively more high total vulnerability zones when the economic vulnerability is given higher priority as compared to when social vulnerability is given higher priority.   Children four years and younger 17 3 Lone parents 16 4 Non-English or French-speaking people 15 5 People with low income 11 6 High school education or less 9 7 Population density per km 2 8 8 Rented apartments 6 CI 0.046

Generation of Flood Risk Maps
Flood risk maps were generated as a spatial overlay (equal weights) between the total vulnerability maps (either V1, V2, or V3) and the flood hazard maps (S1, S2, S3, and S4), and these maps are shown in Figure 9. These maps were generated by combining the flood hazard maps with  Children four years and younger 17 3 Lone parents 16 4 Non-English or French-speaking people 15 5 People with low income 11 6 High school education or less 9 7 Population density per km 2 8 8 Rented apartments 6 CI 0.046

Generation of Flood Risk Maps
Flood risk maps were generated as a spatial overlay (equal weights) between the total vulnerability maps (either V1, V2, or V3) and the flood hazard maps (S1, S2, S3, and S4), and these maps are shown in Figure 9. These maps were generated by combining the flood hazard maps with the total vulnerability. Generally speaking, scenario S2 was the most similar to the base case scenario S1 (due to the similarity between the flood hazard maps). For each hazard scenario, a higher risk is seen when economic vulnerability is given higher priority (V2) than the other two cases: there are more regions with medium (yellow) or higher risks (orange and red) than the cases where social vulnerability is given priority (V1) or has equal priority (V3). This is expected, since there are more economically vulnerable regions in the study area (based on Figure 7b) than social vulnerable regions ( Figure 7a); in fact, there are no socially vulnerable areas with a very high (red) risk-a consequence of the selected criteria, pairwise comparison, and available data.

Comparison Flood Risk Maps
A process of validation is essential when modeling, as it confirms if the model is representing the reality or observations sufficiently. Since there are no previous flood risk maps in the study area to directly validate the flood risk maps generated in this research, the flood risk maps for scenarios S2, S3, and S4 were compared to the base case (S1). Scenario S1 is considered the most reliable scenario, since it includes the floodplain map (generated and provided by the TRCA as a criterion). This validation approach was made between the different flood scenarios and the different social vulnerability scenarios (V1, V2, and V3). Figure 10 shows the percentage of the total area that corresponds to each RC value, as obtained from Equation (7), for each flood and vulnerability scenario. An analysis of this figure indicates that overall, S2 is the most similar scenario to the base case (S1), followed by S3 and S4. The scenario S2 is

Comparison Flood Risk Maps
A process of validation is essential when modeling, as it confirms if the model is representing the reality or observations sufficiently. Since there are no previous flood risk maps in the study area to directly validate the flood risk maps generated in this research, the flood risk maps for scenarios S2, S3, and S4 were compared to the base case (S1). Scenario S1 is considered the most reliable scenario, since it includes the floodplain map (generated and provided by the TRCA as a criterion). This validation approach was made between the different flood scenarios and the different social vulnerability scenarios (V1, V2, and V3). Figure 10 shows the percentage of the total area that corresponds to each RC value, as obtained from Equation (7), for each flood and vulnerability scenario. An analysis of this figure indicates that overall, S2 is the most similar scenario to the base case (S1), followed by S3 and S4. The scenario S2 is where there is a minimum difference from S1. For the scenarios where more importance is given to the social vulnerability (scenario V1), S2 and S3 present similar results in terms of likeness to the base case, as the percentage of the areas with the same risk scale as the base case were around 80% for both scenarios. The difference between S2 and S3 relies on that the former tends to overestimate the risk scale, and the latter tends to underestimate it. In contrast, S4 presents 70% in the areas with the same risk scale as the base case, and overestimates much more of the risk scale in comparison with the other scenarios.
When giving more importance to the economic vulnerability (scenario V2), S2 is by far the best scenario, as it has the highest percentage of area with same risk scale as the base case, followed by S2 and S3 (around 98%, 70%, and 62%, respectively). When giving equal importance to the social and economic vulnerability (scenario V3), S2 also has the highest percentage of area with the same risk scale as the base case, followed by S3 and S4 (around 75%, 70%, and 61%, respectively). S3 and S4 underestimate almost the same percentage area (around 24% for both), and overestimate a minimum. Figure A1 (Appendix B) shows the flood risk change maps for the vulnerability scenarios V1, V2, and V3, respectively, where the grey colored regions represents equal values of risk between the base case and each flood scenario, blue represents an underestimation, and red represents an overestimation of the risk scale. From Figure A1, it is clear that S4 is the least similar scenario compared to the base case scenario, presenting larger areas of higher risk than the base case. Most of the areas where the risk is overestimated correspond to the areas of higher precipitation. the social vulnerability (scenario V1), S2 and S3 present similar results in terms of likeness to the base case, as the percentage of the areas with the same risk scale as the base case were around 80% for both scenarios. The difference between S2 and S3 relies on that the former tends to overestimate the risk scale, and the latter tends to underestimate it. In contrast, S4 presents 70% in the areas with the same risk scale as the base case, and overestimates much more of the risk scale in comparison with the other scenarios. When giving more importance to the economic vulnerability (scenario V2), S2 is by far the best scenario, as it has the highest percentage of area with same risk scale as the base case, followed by S2 and S3 (around 98%, 70%, and 62%, respectively). When giving equal importance to the social and economic vulnerability (scenario V3), S2 also has the highest percentage of area with the same risk scale as the base case, followed by S3 and S4 (around 75%, 70%, and 61%, respectively). S3 and S4 underestimate almost the same percentage area (around 24% for both), and overestimate a minimum. Figures A1 (Appendix B) shows the flood risk change maps for the vulnerability scenarios V1, V2, and V3, respectively, where the grey colored regions represents equal values of risk between the base case and each flood scenario, blue represents an underestimation, and red represents an overestimation of the risk scale. From Figure A1, it is clear that S4 is the least similar scenario compared to the base case scenario, presenting larger areas of higher risk than the base case. Most of the areas where the risk is overestimated correspond to the areas of higher precipitation.

Conclusions
The flood risk mapping based on GIS and multi-criteria analysis is a valuable tool for estimating areas prone to flood risk, and help water resources planners and decision makers to focus on specific areas in order to perform a further detailed assessment of flood risk (e.g., through the use of hydrological and hydraulic models). Consequently, this simplified but reliable methodology can help the reduce resource requirements for fairly accurate flood risk assessments.
Some of the advantages of this approach are its flexibility, ease of handling, and low cost, making it possible to apply it to areas where there is a lack of detailed information, when the aim is to obtain

Conclusions
The flood risk mapping based on GIS and multi-criteria analysis is a valuable tool for estimating areas prone to flood risk, and help water resources planners and decision makers to focus on specific areas in order to perform a further detailed assessment of flood risk (e.g., through the use of hydrological and hydraulic models). Consequently, this simplified but reliable methodology can help the reduce resource requirements for fairly accurate flood risk assessments.
Some of the advantages of this approach are its flexibility, ease of handling, and low cost, making it possible to apply it to areas where there is a lack of detailed information, when the aim is to obtain large-scale flood risk maps, or when the policy makers require a rapid flood risk assessment. The use of AHP as the weighting criterion method presents several advantages as well, such as emphasizing the structure of the preferences of the decision makers among a number of criteria. Thus, instead of selecting the "best" alternative, this method provides the basis and tools for intelligent decision-making, allowing decision makers to obtain the alternative that best suits their needs. It is also important to note that S2 is very similar to S1 (based on the comparisons presented above), but was generated with much less data (i.e., did not require the intensive hydrological and hydraulic models used to generate the floodplain map used in S1).
In addition, by changing the weights of vulnerability (both social and economic), it is possible to obtain different risk maps based on a higher, lesser, or equal importance of one vulnerability over another. This allows managers to take action on the main objective as they have deemed best, e.g., focus on either reducing the number of people affected (i.e., social vulnerability) or the economic losses.
The results of the study show that for flood hazard mapping, only the criteria of distance to streams, slope, height above the nearest drainage, and CN are enough to obtain reliable maps. However, if the precipitation data is included, it is better to use the effective precipitation rather than using the total precipitation. In this sense, the inputs required are the DEM, soil type, and the streams layer. The flood risk maps generated in this study can be very helpful for the City of Toronto or the TRCA to implement required mitigation measures for insurance purposes, disaster response, and land management. The proposed approach is easily extended to other jurisdictions to determine flood risk maps using indicators or criteria that are available. An improvement on the current methodology would be to use the fuzzy majority approach proposed by [50] for GIS-based multi-criteria analysis.
Author Contributions: D.R., U.T.K., and C.A. conceived and designed the research methodology; D.R. performed the analysis; D.R. wrote the paper; U.T.K. and C.A. reviewed the paper. Acknowledgments: The authors would like to thank the Toronto and Region Conservation Authority for providing the floodplain map for the Don River watershed used in this study. We would also like to acknowledge the National Sciences and Engineering Research Council of Canada and York University for funding this research. Lastly, we are grateful for the comments from three anonymous reviewers who helped improved the quality of this manuscript.

Conflicts of Interest:
The authors declare no conflict of interest.