Assessment of Deep Convective Systems in the Colombian Andean Region

: In tropical regions, deep convective systems are associated with extreme rainfall storms that usually detonate ﬂash ﬂoods and landslides in the Andean Colombian region. Several studies have used satellite data to address the structure and formation of tropical convective storms. However, there is a local gap in the characterization, which is essential for a better understanding of ﬂash ﬂoods and preparedness, ﬁlling a gap in a region with scarce information regarding extreme events. In this work, we assess the deep convective storms in a mountainous region of Colombia using meteorological radar observations between 2014 and 2017. We start by identifying convective and stratiform formations. We reﬁne the convective identiﬁcation by classifying convective systems into enveloped (contained in a stratiform system) and unenveloped (not contained). Then, we analyze the systems’ temporal and spatial distributions and contrast them with the watersheds’ features. According to our results, unenveloped convective systems have higher reﬂectivity and hence higher rainfall intensities. Moreover, they also have a well-deﬁned spatial and temporal distribution and are likely to occur in watersheds with elevation gradients of around 2000 m and an aspect contrary to the wind direction. Our assessment of the convective storms is of signiﬁcant value for the hydrologic community working on ﬂash ﬂoods. Moreover, the spatiotemporal description is highly relevant for stakeholders and future local analysis.


Introduction
Convective systems usually turn into intense storms that could develop flooding events. Moreover, in regions with a steep terrain, convective storms are linked to the occurrence of flash floods [1][2][3][4], which are likely to produce human and infrastructure losses [5,6]. At the mesoscale, convective systems cover areas of around 600 km 2 [7], but it can be smaller at a local scale. Moreover, convective-detonated flash floods are usually limited to small catchments (less than 1000 km 2 ) [8][9][10] and their effects have been described in different world regions [9,11,12]. Nevertheless, there is a lack of analysis in the tropical Andean region, where topography-driven convective storms detonate shallow landslides and flash floods [13,14].
The Colombian Andean is made up of three mountainous rages, with its weather being dominated by the Intertropical Convergence Zone (ITCZ), the El Niño Southern Oscillation (ENSO), and the Pacific and Atlantic oceans' oscillations [35][36][37]. The ITCZ the stratiform and convective systems observed. Then, we classify the convective systems into enveloped and unenveloped, allowing us to describe the most intense ones better. We analyze the convective systems' size, reflectivity, and spatial and temporal occurrences using radar data acquired by the Sistema de Alertas Tempranas Ambientales (SIATA). Additionally, we compare the localization of the identified convective systems with 18 watersheds. In the comparison, we explore how the aspect and elevation gradient of the watersheds are intertwined with the convective systems' formation. Our main goal is to develop a comprehensive regional analysis of the storms linked to the occurrence of rainfall-related catastrophes in the Colombian Andean region. Additionally, we seek to fill a knowledge gap in a relevant topic that could worsen in the coming years due to climate change and population growth.
We begin this paper by describing the radar and topographic data in Section 2. Then, in the methodology Section 3, we describe the algorithms to identify convective systems, extract their features, and the methodology followed to contrast them against the selected watersheds. In Section 4, we present the results of the convective systems' identification and the comparison with the watersheds. Finally, Section 5 offers our conclusions and remarks for future work. Figure 1. Region of analysis, radar localization is presented at the center of the image, and gray circles correspond to radar radii of 30 and 90 km, respectively. Yellow to dark blue colors represents the stream network Horton orders from 3 to 6. This work uses local meteorological radar data to assess convective systems in the central Colombian Andean region between 2014 and 2017. Some of these convective systems detonated flash floods and landslides during this period [13,14]. First, we identify the stratiform and convective systems observed. Then, we classify the convective systems into enveloped and unenveloped, allowing us to describe the most intense ones better. We analyze the convective systems' size, reflectivity, and spatial and temporal occurrences using radar data acquired by the Sistema de Alertas Tempranas Ambientales (SIATA). Additionally, we compare the localization of the identified convective systems with 18 watersheds. In the comparison, we explore how the aspect and elevation gradient of the watersheds are intertwined with the convective systems' formation. Our main goal is to develop a comprehensive regional analysis of the storms linked to the occurrence of rainfall-related catastrophes in the Colombian Andean region. Additionally, we seek to fill a knowledge gap in a relevant topic that could worsen in the coming years due to climate change and population growth.
We begin this paper by describing the radar and topographic data in Section 2. Then, in the methodology Section 3, we describe the algorithms to identify convective systems, extract their features, and the methodology followed to contrast them against the selected watersheds. In Section 4, we present the results of the convective systems' identification and the comparison with the watersheds. Finally, Section 5 offers our conclusions and remarks for future work.

Data and Information
We used radar reflectivity, digital elevation data, and wind data. Using radar reflectivity, we identified and analyzed the convective systems. Moreover, we delineated the watersheds and extracted their properties using a digital elevation model. Subsequently, we describe in detail the data used.

Radar Data
The polarimetric 350 Kw Doppler C-band radar manufactured by Enterprise Electronics Corporation is in the occidental central hill of the Aburra Valley watershed (Rio Medellin in Figure 1). The radar operation includes four plan position indicator sweeps (PPIs) at 0.5, 1.0, 2.0, and 4.0 • . We used radar reflectivity every 5 min at the PPI of 1.0 • with a Cartesian projected resolution of 128 m. The radar beam has a beam width of 1 •, and, depending on the PPI, it reaches a distance between 50 and 200 km. It has a wavelength of 5.3 cm and an antenna gain of 45 dBZ. However, to avoid issues due to the bright band interception, we limited the information to a radius of 90 km (second gray ring in Figure 1).
Radar data have been available since 2012. However, it has continuous quality data since 2014. The radar image quality control uses the co-polar correlation coefficient (CC). CC tends towards one when water particles have a uniform distribution. On the other hand, CC decreases when there is a heterogeneous distribution of the particles' shape and orientation. Moreover, CC values above one represents errors due to noise. Between 2014 and 2017, around 93% of the radar data has a good quality.

Topography and Wind Data
We used an ALOS-PALSAR digital elevation model (DEM) to describe the region's topography with a resolution of 12.7 m. Considering the area of the study domain, we resampled the DEM to a resolution of 40 m (background in Figure 1). We applied the A T algorithm [44] over the resampled DEM to estimate the direction map (DIR). Using the DEM and DIR maps, we extracted the watersheds shown in Figure 1 and their respective boundary sub-watersheds of orders 3, 4, and 5. Additionally, for each watershed, we estimated the total area, the elevation difference, and the predominant aspect. We used the watershed modeling framework (WMF) (https://github.com/nicolas998/WMF accessed on 12 May 2022) to perform the watershed delineation and analysis.
Additionally, we used ERA-5 observations at 750 hPa to determine the preferential wind direction. The selected pressure level was around 2500 m above sea level, a value close to the mean elevation of the region.

Methodology
The region's mountainous terrain induced extra challenges in our convective system assessment. Hillslopes with high elevation gradients promoted the formation of deep convective systems and, at the time, added noise to the radar images [45,46], increasing the uncertainty in the classification. Therefore, we divided the convective systems into two categories: enveloped and unenveloped. Enveloped systems are embedded into a stratiform formation, while unenveloped ones are not. With the convective discretization, we performed a more comprehensive assessment.
Additionally, we explored the relationship between convective systems' spatial distribution and the topography. To this end, we delineated 18 watersheds (see Figure 1) and their boundary sub-watersheds with Horton orders 3, 4, and 5. Boundary sub-watersheds share a boundary with their containing watersheds. We measured the overlap between the convective systems and the watersheds and contrasted it with their area, aspect, and elevation difference. In this analysis, we also included the preferential direction of the wind near an elevation of 2500 m.
Subsequently, we describe in detail each one of the methodology steps.

Systems' Classification and Analysis
We first identified the convective systems present in each radar image between 2014 and 2017. To perform the identification, we implemented in Fortran90 and Python the algorithm proposed by [47], and then modified by [48]. We also included functions to measure the features of the convective systems and classified them into enveloped and unenveloped. The code used can be found at the following GitHub repository: https://github.com/nicolas998/Radar (accessed on 12 May 2022).

Convective and Stratiform Systems' Classifications
Convective storms exhibit high reflectivity and are prone to develop extreme rainfall events. On the other hand, stratiform formations have lower reflectivity and are associated with low-intensity rainfall. Considering the described differences, we started by separating convective and stratiform systems using the algorithm proposed by [47], and then modified by [48]. In our implementation, we used a background radius of 11 km and a threshold for peaks of 40 dBZ. We validated the algorithm's accuracy by comparing the classified systems with vertical profiles obtained by the radar [13]. Nevertheless, we identified noise due to convective systems embedded in stratiform formations. Therefore, we created the enveloped and unenveloped convective categories.

Enveloped and Unenveloped Convective Systems
Depending on the meteorological conditions and the temporal evolution of the storm, convective systems may occur as individual systems or as part of a cluster [49]. Single storms usually cover more area and have higher reflectivity than clustered systems. Moreover, during the storm, clustered systems are generally enveloped by a stratiform formation [50,51]. Therefore, we categorized convective systems into enveloped and unenveloped. Enveloped systems are embedded into stratiform formations, while unenveloped ones are not. We implemented the following procedures to identify both:

1.
First, we separated convective and stratiform objects into two binary images (Bin c and Bin s , respectively). Bin s is equal to one where there are stratiform formations and zero elsewhere. Bin c is equal to two in regions with convective systems and zero elsewhere. Figure 2a presents a schematic of Bin s (left) and Bin c (right).

2.
Then, we eroded Bin s using a 3 × 3 kernel. In the erosion, each Bin s element with a value of one and at least one neighbor equal to zero in the kernel became zero. From the erosion, we obtained BinE s (light blue image in the left panel in Figure 2b).

3.
After the erosion, we filled the holes left in BinE s . From this procedure, we obtained the eroded and then filled stratiform binary BinEF s (Figure 2b, right).

4.
Then, we computed the superposition between Bin c and BinEF s as SupBin = Bin c + BinEF s (Figure 2c). In SupBin, values equal to 1 correspond to stratiform formations, 2 to unenveloped convective systems, and 3 to enveloped convective systems.

5.
Finally, we classified each convective system as enveloped or unenveloped using their corresponding modal values in SupBin. For example, if 90 pixels of a convective storm were 2 and 10 pixels were 3, the system was considered unenveloped (or 2).
The described methodology was applied to each radar image. In Figure 3, we present an example showing the results obtained. The process starts with the reflectivity information (Figure 3a), from where we performed the convective and stratiform classifications ( Figure 3b). Then, we identified each convective object (Figure 3c). An object is formed by all the adjacent pixels previously identified as convective. Finally, we obtained the enveloped and unenveloped convective systems' identification ( Figure 3d). values equal to 2 correspond to unenveloped convective systems, and values of 3 correspond to enveloped systems.
The described methodology was applied to each radar image. In Figure-

Reflectivity Statistics and Morphometrics
Using the described methodology, we processed the radar images between 2014 and 2017, obtaining a classified image every 5 min. We obtained a collection of convective objects (systems) from the classification. Then, we extracted the reflectivity statistics and morphometric features of each object. We used the equivalent reflectivity factor Z to compute the mean, µ re f , and deviation, σ re f , of each system, and transformed them to reflectivity by using their logarithm. Additionally, we computed the area, A km 2 , the Hydrology 2022, 9, 119 6 of 18 centroid coordinates, C [lat, lng], and the time of occurrence of each convective system. The area corresponded to the pixel count in a system multiplied by 16,684 m 2 (the square of 128 [m], the radar resolution), and the centroid was the system center of mass in X and Y. Moreover, we marked each object as enveloped (3) or unenveloped (2).
The described information allowed us to obtain an extensive collection of data and perform a comprehensive assessment of the convective systems in the region.

Watersheds Analysis
According to several studies, convective system formation is intertwined with topography [20,32]. The connection has also been reported in tropical regions [17,22] with links between rainfall rates and elevation [18,40]. However, most of the work used mesoscale information and did not consider the characteristics of the watersheds. Therefore, we compared the convective system localization with features of the watersheds in the region.
We analyzed the 18 watersheds delimited by the black divisor lines in Figure 1, covering most of the domain. The areas of the watersheds oscillated between 230 and 2000 km 2 , and the Horton stream orders at their outlets oscillated between 5 and 7. Additionally, we included the boundary sub-watersheds of the 18 watersheds with orders 3, 4, and 5 (see Figure 4a-c). A boundary sub-watershed has no upstream tributaries; in most cases, it shares its divisor line with its containing watershed. Then, we computed the following features of the watersheds: boundary elevation, maximum elevation gradient, predominant aspect (direction), and the upstream area [km 2 ].
We also measured the intersection between the convective systems and the watersheds using the overlap index, O i , as follows: where A o km 2 is the intersected area between the convective system and the watershed and A s km 2 is the system area. Figure 4d presents three different cases of overlap: There is no overlap when the convective system falls outside of a watershed and A o becomes equal to zero. There is a weak overlap when A o is less than 15% of the total area, and it is strong when A o is greater than 15%. Nevertheless, the described categories are subjective and will change depending on the watershed area and the resolution of the images used. Therefore, the given values are a reference that allows us to contrast the overlap between the convective systems and the watersheds. Finally, we compared O i with the described features of the watersheds and the direction of the wind at 750 hPa (ERA-5 data). Using the described procedure, we explored how the topographical features of the watersheds were intertwined with the occurrence of deep convective systems at different scales. Finally, we compared with the described features of the watersheds and the d rection of the wind at 750 hPa (ERA-5 data). Using the described procedure, we explore how the topographical features of the watersheds were intertwined with the occurrence deep convective systems at different scales.

Results and Discussion
The main goal of our work was to present a characterization of the tropical convectiv systems observed by a meteorological radar. We also contrasted the convective system with the region's orientation, elevation, and aspects of several watersheds. We presente and discussed the obtained results after applying the methods described in Section 3.

Results and Discussion
The main goal of our work was to present a characterization of the tropical convective systems observed by a meteorological radar. We also contrasted the convective systems with the region's orientation, elevation, and aspects of several watersheds. We presented and discussed the obtained results after applying the methods described in Section 3.

Convective Systems Analyses
We started by identifying the observed convective and stratiform systems following the method proposed by Steiner (1995). Figure 5 presents the area, A km 2 (X axis), mean reflectivity, µ re f [dBZ] (Y axis), and reflectivity deviation, σ re f [dBZ] (colors), of both kind of systems for a sample of 10,000 elements. According to it, the mean reflectivity distribution was different for the convective and stratiform systems. Convective µ re f distribution (green line in the vertical pdf panel) was less skewed and centered around 27 dBZ. On the other hand, the stratiform µ re f pdf (purple line) was skewed and centered around 12 dBZ. We also observed some differences in the distributions of their areas (top pdf panel). While stratiform areas oscillated around 0.01 and 13,000 km 2 with 80% below 40 km 2 , convective systems oscillated between 0.01 and 80 km 2 with 80% below 9 km 2 .
reached their maximum values of 48 dBZ and 50 dBZ, respectively. Moreover, th stratiform maximized for areas around 50 km . In both cases, the mean reflectivit decreased for systems with areas below or above the ones mentioned. The described resul may be related to the size of the watersheds where flash floods usually occur (areas below 100 km ). According to the convective vs. area scatter in Figure-5, extensive convec tive system usually had a lower .

Figure-5.
Reflectivity and area for 10,000 randomly selected systems. Circles correspond to convec tive systems, triangles to stratiform systems, and the color is associated with the variability of th reflectivity inside the system. Vertical histograms present the PDF for the mean reflectivity for bot types of systems, and the horizontal histograms correspond to the PDF of the area.
By comparing the area and distributions, we observed many relatively smal convective systems with reduced reflectivity. The described pattern is also similar in th case of the stratiform systems. Both cases are likely attributed to misclassification caused by noise in the radar images. Therefore, we improved our classification by dividing th convective systems into two categories.
Using the methodology described in sub-Section 3.1.2., we classified 80% of the sys tems as unenveloped and 20% as enveloped. Figure-6 compares the area and of th Figure 5. Reflectivity and area for 10,000 randomly selected systems. Circles correspond to convective systems, triangles to stratiform systems, and the color is associated with the variability of the reflectivity inside the system. Vertical histograms present the PDF for the mean reflectivity for both types of systems, and the horizontal histograms correspond to the PDF of the area.
Convective and stratiform systems' reflectivity statistics changed with the area of the system. For large areas, convective µ re f tended towards 30 dBZ while the stratiform µ re f towards 25 dBZ. Additionally, for areas around 20 km 2 , the convective µ re f and σ re f reached their maximum values of 48 dBZ and 50 dBZ, respectively. Moreover, the stratiform µ re f maximized for areas around 50 km 2 . In both cases, the mean reflectivity decreased for systems with areas below or above the ones mentioned. The described result may be related to the size of the watersheds where flash floods usually occur (areas below 100 km 2 ). According to the convective µ re f vs. area scatter in Figure 5, extensive convective system usually had a lower µ re f . By comparing the area and µ re f distributions, we observed many relatively small convective systems with reduced reflectivity. The described pattern is also similar in the case of the stratiform systems. Both cases are likely attributed to misclassification caused by noise in the radar images. Therefore, we improved our classification by dividing the convective systems into two categories.
Using the methodology described in sub-Section 3.1.2, we classified 80% of the systems as unenveloped and 20% as enveloped. Figure 6 compares the area and µ re f of the enveloped (triangles) and unenveloped (circles) systems. Compared with Figure 5, the enveloped and unenveloped systems' area and reflectivity are more similar. The mean reflectivity distributions of the enveloped and unenveloped categories were 28 and 32 dBZ, and the areas had mean values of 15 and 30 km 2 , respectively. Moreover, the unenveloped convective system's σ re f was higher and peaked along with µ re f for systems with areas between 20 and 40 km 2 . Furthermore, the expected σ re f value of the unenveloped was considerably higher than the enveloped system (see Figure 6). enveloped (triangles) and unenveloped (circles) systems. Compared with Figure-5, the enveloped and unenveloped systems' area and reflectivity are more similar. The mean reflectivity distributions of the enveloped and unenveloped categories were 28 and 32 dBZ, and the areas had mean values of 15 and 30 km , respectively. Moreover, the unenveloped convective system's was higher and peaked along with for systems with areas between 20 and 40 km . Furthermore, the expected value of the unenveloped was considerably higher than the enveloped system (see Figure-6).

Figure-6.
Reflectivity and area comparison for enveloped (triangles) and unenveloped (circles) connective systems. Colors represent the standard deviation. Vertical and horizontal histograms present the PDFs for the reflectivity and areas, respectively. Table 1 summarizes the statistics of the stratiform, convective, enveloped, and unenveloped systems. According to the results, stratiform systems have an area with greater mean and deviation values. However, they have a lower mean reflectivity. On the other hand, convective systems have an area that oscillates around 24 km . Enveloped systems are smaller and have a reflectivity lower than unenveloped systems. The described results mark the differences between both types of convective systems, highlighting unenveloped ones as good descriptors of deep convective systems. Moreover, our results also match the given description of convective clusters enveloped in stratiform formations [52].   Table 1 summarizes the statistics of the stratiform, convective, enveloped, and unenveloped systems. According to the results, stratiform systems have an area with greater mean and deviation values. However, they have a lower mean reflectivity. On the other hand, convective systems have an area that oscillates around 24 km 2 . Enveloped systems are smaller and have a reflectivity lower than unenveloped systems. The described results mark the differences between both types of convective systems, highlighting unenveloped ones as good descriptors of deep convective systems. Moreover, our results also match the given description of convective clusters enveloped in stratiform formations [52]. The defined convective categories help us explore the region's storm systems. Moreover, the studied features of both classes have relevant differences. Considering the described differences between the enveloped and unenveloped systems, we also explored their spatial and temporal differences.

Spatiotemporal Behavior
We first analyzed the convective systems' diurnal and spatial distributions. We counted the hour of the day and the spatial localization of the convective, enveloped, and unenveloped systems. Then, we divided each count by the total of observed systems obtaining their probability of occurrence. Figure 7 presents the daily distribution observed in the region, starting at midnight and ending at 23:00. According to the figure, convective and undeveloped systems develop around noon, reaching the peak probability between 15:00 and midnight. On the other hand, enveloped systems are more likely to occur late at night and early morning. We attributed the observed temporal oscillations to surface heating and the eventual deep convection. Moreover, the increase in enveloped systems during the morning might be explained by the late stage of the deep convective systems.
The defined convective categories help us explore the region's storm systems. Moreover, the studied features of both classes have relevant differences. Considering the described differences between the enveloped and unenveloped systems, we also explored their spatial and temporal differences.

Spatiotemporal Behavior
We first analyzed the convective systems' diurnal and spatial distributions. We counted the hour of the day and the spatial localization of the convective, enveloped, and unenveloped systems. Then, we divided each count by the total of observed systems obtaining their probability of occurrence. Figure-7 presents the daily distribution observed in the region, starting at midnight and ending at 23:00. According to the figure, convective and undeveloped systems develop around noon, reaching the peak probability between 15:00 and midnight. On the other hand, enveloped systems are more likely to occur late at night and early morning. We attributed the observed temporal oscillations to surface heating and the eventual deep convection. Moreover, the increase in enveloped systems during the morning might be explained by the late stage of the deep convective systems. In Figure-7, we closely observe the described temporal evolution of the convective system. In it, we colored the spatial domain using the total count of systems falling in pixels of 1 km . Considering the significant count differences, we used different ranges to color each column of the figure. Moreover, we divided the results using a time step of six hours starting at 1:00 A.M. Since the total count of cases for the columns was significantly different, we kept independent color bars for each.
Column a in Figure-7 presents the spatiotemporal results of all the convective systems and expands the description presented in Figure-6. According to Figure-7, the morning hours have some convective activity (1st row). This activity reaches its minimum values between 7 A.M. and noon (2nd row). Then, during the afternoon, convective systems occur over the West and center regions of the domain (3rd row). Finally, during the night, the storms occur over the East (4th row).
Columns b and c in Figure describe the results for the unenveloped and enveloped system cases. Unenveloped systems (column b) exhibit a predominant formation over the eastern region between the afternoon and midnight (third and fourth rows). Moreover, they decrease during the morning (first row) and mostly disappear during the day (second row). Enveloped systems dominate the East, with some occurrences over the West. Moreover, the lag described in Figure-6 between the enveloped and unenveloped systems is also present. During the afternoon, unenveloped systems intensify while some envelopes occur (third row). During the night (fourth row), both categories increase. Then, In Figure 7, we closely observe the described temporal evolution of the convective system. In it, we colored the spatial domain using the total count of systems falling in pixels of 1 km 2 . Considering the significant count differences, we used different ranges to color each column of the figure. Moreover, we divided the results using a time step of six hours starting at 1:00 A.M. Since the total count of cases for the columns was significantly different, we kept independent color bars for each.
Column a in Figure 7 presents the spatiotemporal results of all the convective systems and expands the description presented in Figure 6. According to Figure 7, the morning hours have some convective activity (1st row). This activity reaches its minimum values between 7 A.M. and noon (2nd row). Then, during the afternoon, convective systems occur over the West and center regions of the domain (3rd row). Finally, during the night, the storms occur over the East (4th row).
Columns b and c in Figure describe the results for the unenveloped and enveloped system cases. Unenveloped systems (column b) exhibit a predominant formation over the eastern region between the afternoon and midnight (third and fourth rows). Moreover, they decrease during the morning (first row) and mostly disappear during the day (second row). Enveloped systems dominate the East, with some occurrences over the West. Moreover, the lag described in Figure 6 between the enveloped and unenveloped systems is also present. During the afternoon, unenveloped systems intensify while some envelopes occur (third row). During the night (fourth row), both categories increase. Then, in the morning, enveloped systems become dominant. According to our results, enveloped and unenveloped systems categories' temporal distributions coincide with the described evolution of the convective system [40]. Moreover, their occurrence coincides with the region's descriptions of orographic rainfall formation [53,54].
According to Figure 7, the convective systems have well-defined spatiotemporal distributions. Most convective systems occurred in the East between noon and midnight. Their occurrence coincided with reported flash floods and shallow landslide events [13,14].
They also coincided with descriptions of the Colombian Andean climatology [54,55]. The results presented here are likely intertwined with several climatological variables. However, exploring that connection is out of our scope. Future work may explore that link and how to use the results presented in the vulnerability analysis and risk assessment. Figure 8 summarizes the spatial distribution regardless of the daytime. According to Figure 8a, most convective systems occur in the East, specifically in Rio Playas, Rio Calderas, and Rio Verde (see Figure 1). There are also minor convective accumulations in the West near Rio Penderisco and Rio San Juan watersheds, and in the north near the outlet of Rio Grande. Unenveloped systems follow a similar localization (Figure 8b), with a significant occurrence increase near the boundaries of the watersheds. On the other hand, enveloped systems are more sparse over the region, being dominant in the West and in the East in the watersheds of Rio Verde and Samana.
Hydrology 2022, 9, x FOR PEER REVIEW 11 of 18 in the morning, enveloped systems become dominant. According to our results, enveloped and unenveloped systems categories' temporal distributions coincide with the described evolution of the convective system [40]. Moreover, their occurrence coincides with the region's descriptions of orographic rainfall formation [53,54]. According to Figure-7, the convective systems have well-defined spatiotemporal distributions. Most convective systems occurred in the East between noon and midnight. Their occurrence coincided with reported flash floods and shallow landslide events [13,14]. They also coincided with descriptions of the Colombian Andean climatology [54,55]. The results presented here are likely intertwined with several climatological variables. However, exploring that connection is out of our scope. Future work may explore that link and how to use the results presented in the vulnerability analysis and risk assessment. Figure-8 summarizes the spatial distribution regardless of the daytime. According to The localization of the convective systems coincides with the region's topography. Unenveloped systems mainly occur over the watersheds' boundary lines, usually the steepest areas. Moreover, they occur more often in watersheds with a significant elevation difference, in this case, the Rio Caldera, Rio Verde, and Samana watersheds with elevation gradients of around 2000 m. On the other hand, enveloped systems behave like a remanent of unenveloped ones being distributed downstream of the described regions.
The given description is of significant value, and the localization of the unenveloped systems can help determine vulnerable regions.

Connections with the Topography
According to our results, several convective systems tend to occur near the divisor lines of watersheds with high elevation gradients (see Figure 9a,b). However, some watersheds had a high elevation gradient, but the count of convective systems was low. In this section, we explored further the connection between the convective systems' occurrence and the characteristics of the watersheds. Here, we compared the localization of the convective systems with the area, aspect, and elevation gradient of the region's watersheds.  The localization of the convective systems coincides with the region's topography. Unenveloped systems mainly occur over the watersheds' boundary lines, usually the steepest areas. Moreover, they occur more often in watersheds with a significant elevation difference, in this case, the Rio Caldera, Rio Verde, and Samana watersheds with elevation gradients of around 2000 m. On the other hand, enveloped systems behave like a remanent of unenveloped ones being distributed downstream of the described regions.
The given description is of significant value, and the localization of the unenveloped systems can help determine vulnerable regions.

Connections with the Topography
According to our results, several convective systems tend to occur near the divisor lines of watersheds with high elevation gradients (see Figure-9 a,b). However, some watersheds had a high elevation gradient, but the count of convective systems was low. In this section, we explored further the connection between the convective systems' occurrence and the characteristics of the watersheds. Here, we compared the localization of the convective systems with the area, aspect, and elevation gradient of the region's watersheds. Figure-10 presents the 18 watersheds, their aspects, the topography, and the prevailing direction of the wind at 750 hPa (around 2500 m above sea level). Moreover, in the figure, we colored the aspects of the watersheds with the overlapping index, . The index measures the area shared between a convective system and a watershed. In our case, oscillated between 0.12 and 0.22.
According to Figure-10, the Samana, Rio Verde, Guadalupe (East), and Rio San Juan (West) watersheds have the highest values. In the four cases, the aspect is opposite to the wind, and the elevation difference is above 2000 m, except for Guadalupe where it is 1843 m. Nevertheless, this analysis has been conducted using the preferential wind direction, and, in many cases, the results presented here (3) may not be fulfilled. Moreover, it seems that additional features are involved, since some watersheds exhibit similar topographical features, but lower values.  According to Figure 10, the Samana, Rio Verde, Guadalupe (East), and Rio San Juan (West) watersheds have the highest O i values. In the four cases, the aspect is opposite to the wind, and the elevation difference is above 2000 m, except for Guadalupe where it is 1843 m. Nevertheless, this analysis has been conducted using the preferential wind direction, and, in many cases, the results presented here (3) may not be fulfilled. Moreover, it seems that additional features are involved, since some watersheds exhibit similar topographical features, but lower O i values.
Additionally, we compared O i with the upstream area, aspect, and elevation gradient of the watersheds (Figure 11). In the figure, the position of the arrow coincides with the area and the overlap index, O i , computed for each watershed. The direction represents the aspect, and the color the elevation gradient. Moreover, the circles represent the index for the enveloped (green) and unenveloped (purple) systems.
According to Figure 11, the unenveloped and enveloped O i values change among watersheds. In thirteen out of the eighteen watersheds, unenveloped O i is dominant. Moreover, enveloped O i is foremost in watersheds with upstream areas of around 1000 km 2 or less. The difference between the enveloped and unenveloped O i decreases towards 0.16 for watersheds with areas around 500 km 2 . On the other hand, with some exceptions, watersheds with high elevation differences tend to exhibit high O i values, a condition also influenced by their aspect. Watersheds, such as Samana, Rio Verde, and Rio San Juan, have a high O i , areas greater than 500 km 2 , and elevation differences above 2,000. The given description presents more information regarding the required local conditions for the occurrence of convective systems. However, the analyzed watersheds are large compared to the mean area of the convective systems (24 km 2 , see Table 1). Therefore, we expanded our analysis using the boundary sub-watersheds of orders 3, 4, and 5. Additionally, we compared with the upstream area, aspect, and elevation gradient of the watersheds (Figure-11). In the figure, the position of the arrow coincides with the area and the overlap index, , computed for each watershed. The direction represents the aspect, and the color the elevation gradient. Moreover, the circles represent the index for the enveloped (green) and unenveloped (purple) systems.
According to Figure-11, the unenveloped and enveloped values change among watersheds. In thirteen out of the eighteen watersheds, unenveloped is dominant. Moreover, enveloped is foremost in watersheds with upstream areas of around 1000 km or less. The difference between the enveloped and unenveloped decreases towards 0.16 for watersheds with areas around 500 km . On the other hand, with some exceptions, watersheds with high elevation differences tend to exhibit high values, a condition also influenced by their aspect. Watersheds, such as Samana, Rio Verde, and Rio San Juan, have a high , areas greater than 500 km , and elevation differences above 2,000. The given description presents more information regarding the required local conditions for the occurrence of convective systems. However, the analyzed watersheds are large compared to the mean area of the convective systems (24 km , see Table 1). Therefore, we expanded our analysis using the boundary sub-watersheds of orders 3, 4, and 5. The boundary sub-watersheds had no upstream tributaries and, in most cases, shared the divisor line with its containing watershed. We used boundary sub-watersheds of orders 3, 4, and 5. In the case of order 5, we obtained between two and three sub-watersheds per watershed (blue borders in Figure-12a). Moreover, each sub-watershed had its preferential aspect. According to Figure-12a, watersheds with an aspect opposite to the Figure 11. Overlap index in function of the area, aspect, and elevation gradient of the watersheds. The arrows correspond to the analyzed watershed denoting its main aspect. The arrows represent the watershed aspect, and the color corresponds to the elevation gradient (H max -H min ). The purple and green circles represent the overlap index, I oc , for the enveloped and unenveloped systems.
The boundary sub-watersheds had no upstream tributaries and, in most cases, shared the divisor line with its containing watershed. We used boundary sub-watersheds of orders 3, 4, and 5. In the case of order 5, we obtained between two and three sub-watersheds per watershed (blue borders in Figure 12a). Moreover, each sub-watershed had its preferential aspect. According to Figure 12a, watersheds with an aspect opposite to the wind direction tended to have a larger O i . In addition, the relationship between the upstream area, elevation difference, and O i became more evident (Figure 12b). Larger areas usually had increased elevation gradients and O i values. Figure 11. Overlap index in function of the area, aspect, and elevation gradient of the watershe The arrows correspond to the analyzed watershed denoting its main aspect. The arrows repres the watershed aspect, and the color corresponds to the elevation gradient ( -). The pur and green circles represent the overlap index, , for the enveloped and unenveloped systems.
The boundary sub-watersheds had no upstream tributaries and, in most cas shared the divisor line with its containing watershed. We used boundary sub-watershe of orders 3, 4, and 5. In the case of order 5, we obtained between two and three sub-w tersheds per watershed (blue borders in Figure-12a). Moreover, each sub-watershed h its preferential aspect. According to Figure-12a, watersheds with an aspect opposite to wind direction tended to have a larger . In addition, the relationship between the u stream area, elevation difference, and became more evident (Figure-12b). Larger ar usually had increased elevation gradients and values. Boundary sub-watersheds of orders 3 and 4 ( Figure-13a,b) had a pattern where overlap index increased with the area and the elevation difference. However, this tre had noise. In order 3 sub-watersheds (Figure-13a), areas between 10 and 40 km 2 had si ilar maximum values. Nevertheless, most of the cases with a high index were su watersheds facing East with elevation differences above 1500 m. In the case of orde sub-watersheds (Figure-13b), there was an increase in the values and the trend w Boundary sub-watersheds of orders 3 and 4 (Figure 13a,b) had a pattern where the overlap index increased with the area and the elevation difference. However, this trend had noise. In order 3 sub-watersheds (Figure 13a), areas between 10 and 40 km 2 had similar maximum O i values. Nevertheless, most of the cases with a high index were subwatersheds facing East with elevation differences above 1500 m. In the case of order 4 sub-watersheds (Figure 13b), there was an increase in the O i values and the trend with the area was evident. Moreover, the aspect and the elevation difference keep playing a significant role.  The occurrence of convective systems was linked to the elevation difference and aspect of the watersheds at several scales. We analyzed this relationship by defining an overlap index between watersheds and the identified convective systems. According to our results, watersheds with a high elevation gradient and an aspect opposite to the mean wind direction are more likely to develop deep convective systems. Nevertheless, further work is required to establish a link with the topography.

Conclusions
This work assessed the convective systems observed in the Colombian Andean region using local meteorological radar data between 2014 and 2017. Furthermore, we expanded our identification by defining two categories: enveloped and unenveloped con- The occurrence of convective systems was linked to the elevation difference and aspect of the watersheds at several scales. We analyzed this relationship by defining an overlap index between watersheds and the identified convective systems. According to our results, watersheds with a high elevation gradient and an aspect opposite to the mean wind direction are more likely to develop deep convective systems. Nevertheless, further work is required to establish a link with the topography.

Conclusions
This work assessed the convective systems observed in the Colombian Andean region using local meteorological radar data between 2014 and 2017. Furthermore, we expanded our identification by defining two categories: enveloped and unenveloped convective systems. In addition, we compared the area, reflectivity, and localization of the systems. Finally, we proposed a novel method to explore possible interactions between the region's topography and the occurrence of convective systems. In the topography interaction analysis, we used 18 watersheds and their boundary sub-watersheds of orders 3, 4, and 5. We identified some features of the convective storms of the region along with the areas and times where they usually occurred.
The characteristics of the deep convective systems depended on the stage of their evolution and the mechanisms behind them. A convective storm usually starts as an individual or as a collection of small systems that form a deep system and finally evolves into a stratiform formation with small convective systems [56]. The proper identification of the described evolution requires a tracking algorithm, such as TITAN [57]. Moreover, additional identification algorithms may present more insights into their properties [58]. However, implementing such algorithms in the region is beyond the work presented here. Instead, we introduced a classification of enveloped and unenveloped convective systems.
We described the differences between the region's deep convective (unenveloped) and clustered (enveloped) systems. Unenveloped systems exhibited higher reflectivity, larger areas, and a higher connection with the topography. On the other hand, enveloped systems behave similar to a late stage of deep convection. This conclusion is also supported by the occurrence time of both types of systems. While unenveloped ones developed between the afternoon and midnight, enveloped systems occurred during the night and morning. The described oscillations may be linked to the energy and humidity availability during the day and the eventual cooling during the night. Nevertheless, more work is required in this direction to develop a more robust conclusion. Furthermore, our approach is a step forward in understanding the mechanisms behind the occurrence of intense storms in the region.
Our analysis also found that deep convective storms occur more often over topographies with specific characteristics. The elevation gradient of the terrain and its aspect relative to the local wind direction were two critical features. We explored the relationship with the topography by analyzing the overlap between the convective systems, the ERA-5 wind at 750 hPa, and hundreds of watersheds. According to our results, convective systems occur more often in areas with high topographical gradients, where the aspect is opposite to the wind direction. This result coincides with other studies performed in the region around rainfall and topography [55,59]. Moreover, similar results linking topography and convective rainfall have been reported in other regions [60][61][62][63][64]. However, our analysis does not fully describe the convective storms' spatial distributions. Future work may include additional meteorological variables and use different ways to assess the connection with topography.
Our results are significant for the hydrologic community and the region's stakeholders. The characterization presented here identified the areas prone to deep convective storms. Furthermore, joint with landscape information, it could help identify areas in which flash floods and landslides may occur. Moreover, the analysis presented here could be easily replicated and improved, helping to identify vulnerable areas in other regions.