1. Introduction
Weather monitoring and forecasting are as old as the beginning of civilization. Although the water circle was recognized in ancient times (Plato, Aristotle), the cause of natural disasters, and specifically extreme rainfall events associated with lightning, was often associated with the presence and act of the most powerful of Gods, e.g., Zeus in ancient Greece, Set in ancient Egypt, Illapa in Inca mythology (third in power), Thor in Norse mythology, while, in monotheistic religions, the act of inducing storms and lightning was attributed to God’s wrath, judgment, and punishment towards humanity. Even in recent times, extreme storm weather and flooding are the predominant natural hazards affecting humans, causing widespread devastation that can displace people, damage property and critical public infrastructure, and ultimately result in loss of life. This fact is not likely to change since the effects of climate change are projected to increase the intensity and frequency of such events [
1,
2,
3,
4]. Notably, the Mediterranean region has witnessed substantial economic losses and loss of life from recent flash floods [
5,
6,
7,
8,
9,
10].
To that end, precise weather monitoring and forecasting have become critical for prevention systems such as flood and storm early warning [
11]. Although many meteorological parameters affect weather, the presence and effect of rainfall are the most important, followed by intense wind. Extreme rainfall, in conjunction with human activity, causes land cover change and flooding. Flash floods, in particular, are presently among the most lethal weather-related hazards on a global scale. Their origin diverges from conventional river-driven floods, primarily from abrupt and intense precipitation over a brief duration. Usually confined to local settings, they are closely associated with convective storms, limiting their spatial impact [
12].
The instruments used to measure rainfall can be categorized into two groups: ones that detect and quantify rainfall at the surface, such as rain gauges and disdrometers, and remote-sensed instruments that measure rainfall well above the ground [
13]. Of the above, those offering the better and more advanced methods of storm tracking are the remote-sensed ones, especially weather radars, since they offer high spatial and temporal resolution, which is crucial for weather forecasting [
14,
15,
16]. For these reasons, more efficient ways involving remote-sensed precipitation measurements, i.e., weather radars and satellites, have increased attention [
17]. Storm tracking through weather radar monitoring dates back to the 1950s in the USA. Weather radars measure reflectivity, which can be transformed into rainfall intensity using a Z–R relationship [
15,
18,
19]. However, transformation into rainfall intensity is not necessary for storm tracking since reflectivity itself, after being filtered, provides all the information to determine a storm’s location and strength.
There are numerous storm-tracking algorithms in use, with the most frequently cited being the TITAN project [
20] and the SCIT system [
21]. The TITAN project brought to the forefront the use of reflectivity thresholds for identifying cells and a method to perform real-time nowcasting on the basis of evaluating a series of properties. Since then, numerous improvements or new algorithms have been discussed, mainly compared to the above [
22,
23,
24,
25,
26]. Two main categories provide cell tracking: centroid-based and cross-correlation algorithms [
26,
27]. The first is better at identifying and tracking individual convective cells, while the latter can produce more accurately the speed and direction of large moving storms [
27]. Another way of obtaining tracks is through sheer image analysis, specifically optical flow [
28,
29], which is usually used as a complete nowcasting tool [
30,
31]. Moreover, fuzzy logic and neural networks can be implemented in either the cell identification or the cell tracking process [
32,
33], showing promise in increasing the accuracy of generated results, although a large training dataset is required in such cases [
32].
In most of the above-cited cases, C-Band weather radar datasets are used, which compared to X-Band weather radars, have smaller spatial and temporal resolutions. Moreover, these algorithms are hard to find, as in an available computer code, and complex to apply when used with other weather radar datasets due to inconsistencies in data formats. Moreover, whereas a storm-tracking algorithm’s primary purpose is to track storm cells in real time, they can also be applied to past events to provide insights about rainfall pattern behavior over the observed area. This kind of analysis is significant, as identifying areas that are frequently affected by high rainfall intensity leads to enhanced Flood Early Warning Systems (FEWS) [
34], which include the installation and maintenance of rainfall monitoring equipment designed for flood protection [
35,
36], and thus further improve the planning of structural and non-structural flood mitigation measures. Although the use of intensity duration frequency curves assists in such cases, the high spatial coverage of weather radars, and especially the X-Band systems, which are inexpensive, feature better spatial resolution and are more versatile since they can also be mounted on a vehicle [
37], should be able to provide with added information in low-monitored areas found in rural and semi-urban regions.
In this research work, the main focus is on analyzing rainfall patterns in Athens, Greece. The NTUA X-Band weather radar, the “Rainscanner” system, has been used to analyze the storm patterns over Athens in recent years. This analysis provides two benefits, one being the development of an easy-to-use storm-tracking algorithm, able to analyze high spatial and temporal resolution datasets, and second, providing insights into the storm behavior in Athens, useful for identifying possible flood-prone areas and understanding the rainfall patterns over Athens for use in possible FEWS implementations.
3. Results
3.1. Cell Identification Analysis
The first section focuses on the performance of the storm-tracking algorithm. In
Figure 5, an analysis is performed to examine the thresholds used in the cell identification process. The images showcase a single timeframe where different thresholds and shapes are used to detect the cells. In each row, a different reflectivity threshold is used (25, 30, and 35 dBZ), while in each row, the difference between selecting the exact polygon to the fitted ellipse is shown.
First, we examine the used thresholds. It is evident that the higher the threshold used, the smaller the storm cells. For instance, when a 25 dBZ threshold is used, 11 unique storm cells are detected, whereas eight and only 4 for the 30 and 35 dBZ thresholds, respectively. By sheer observation, the lower threshold seems to over-perform, while the highest threshold seems to under-perform to the number of storm cells detected by the human eye. Since most storm events in Athens featured a mean 25–35 dBZ reflectivity, the optimal reflectivity threshold is the 30 dBZ level. Furthermore, when comparing the polygon-ellipse shapes used, it is evident that in the lower thresholds, the fitting ellipse tends to provide less quality results when trying to encapsulate a large cell, as seen in the number 4 cell, whereas cells with a smaller size are ideally identified.
This problem is eradicated when a higher threshold is used. In this case, the results show a good fit, which also tends to better represent the essence of a storm cell than the previously used polygon. For instance, in the high 35 dBZ threshold, a fitting ellipse is favored over the uneven shape of the small polygons, which feature sharp edges. A fitting ellipse, being a mathematical shape, is more suitable for correlation analysis as its eccentricity and rotation can be utilized in the cell matchmaking algorithm; therefore, it is generally the preferred shape.
Assessing all the above findings, the strategy used in the identification process is as follows. First, the 30 dBZ threshold is used to identify the storm cells that need to be tracked. A fitting ellipse is then formed, and the centroid, eccentricity, and the minor and major axes vectors are stored for each cell. The 25 dBZ threshold is then applied to figure the cell boundaries. The cell polygon boundaries that include the above 30 dBZ cells are stored in the database of each of the 30 dBZ cells. Finally, the 35 dBZ (or 40 dBZ) fitting ellipse is drawn and, along with its centroid, stored for each of the 30 dBZ cells. This is performed to identify areas with high reflectivity representing areas with high rainfall. All these datasets are used later in the tracking algorithm.
3.2. Cell Tracking Results
After the identification, the tracking process follows. The main goal is to apply the matchmaking algorithm and assess its result under different temporal scales.
Figure 6 displays six consecutive time frames at a 2-min interval, equal to the rainscanner’s temporal resolution. In the images shown, the 30 dBZ threshold and a fitted ellipse represent the storm cells, shown in red for the current time frame and with dashed black lines for the previous time frame. The same color scheme is applied to the centroids of each cell. In the first four frames of
Figure 6, the matchmaking process is deemed highly accurate; however, this is because the cells do not undergo significant changes within a two-minute period, making cell association straightforward. However, the last two frames show slight discrepancies in the matchmaking process. In the 15:48 and 15:50 time frames, two cells merge to form a larger one. While the merge is acceptable at 15:48, the resulting ellipse at 15:50 is less accurate, as the cell takes on an inverted “V” shape, and the algorithm produces a single storm cell with a wrongly positioned centroid.
To avoid such matchmaking issues and enhance the identification and tracking process, combining multiple frames to create a single image is recommended, which can be achieved by merging past 2-min frames and calculating the average reflectivity values. For example, if two frames with timestamps 15:42 and 15:44 are merged, a 4-min reflectivity field is created with the timestamp of 15:44, illustrating the average reflectivity of the two frames. This process produces more uniform and consistent images, making it easier to identify larger storm cells rather than splitting them into smaller cells. Using a coarser resolution also improves the accuracy of nowcasting algorithms since it aligns better with other data types, such as rain gauges, which operate on similar, coarser timescales. However, it is essential to note that when merging frames, the reflectivity values are normalized, reducing both minimum and maximum values; therefore, the reflectivity threshold might need to be adjusted to reflect this. Nevertheless, in this application, as shown in
Figure 7, and after several tests, the 30 dBZ threshold that was applied yielded satisfactory results. In
Figure 7, five timeframes are merged to create 10-min reflectivity fields, with the tracking being performed on the original, 2-min resolution, similar to the tracking performed in
Figure 6.
Comparing the same timestamp frames between
Figure 6 and
Figure 7, it is visible that the storm cells in
Figure 7 are more blurred, with the ellipse being of a similar but smaller and different shape, as a result of averaging five reflectivity fields. As mentioned before, the timeframe 15:44 in
Figure 6 is the 2-min reflectivity the rainscanner measures. In contrast, the timeframe 15:44 in
Figure 8 is the 10-min reflectivity resulting from averaging timeframes 15:36, 15:38, 15:40, 15:42, and 15:44. The comparison shows that although the first tracking steps feature similar results, the difference is highlighted in the last two frames, the 15:48 and 15:50, where using the 10-min reflectivity a merge is formed from two nearby storm cells. In contrast, in the 2-min fields, this is not exhibited. Furthermore, in the 15:50 frame, fewer, larger storm cells are identified in the 10-min resolution, whereas in the 2-min resolution, they are shown as different storm cells. In the time frame 15:50, the cells are formed more consistently, making a better cell identification. Specifically, compared with the 2-min resolution, the transition between each timeframe is more natural, yet not less realistic. This comparison shows that using more data is preferable to achieving better results, even in such small scales as 2 min.
Therefore, the next step in algorithm testing is to increase the temporal resolution of the consecutive images. Naturally, images observed with a two-minute delay should not exhibit significant changes. Therefore, coarser resolutions are tested to estimate better the algorithm’s ability to track storm cells in consecutive images correctly. This application is usually used for nowcasting and forecasting since the more extensive the temporal gap, the better the lead time provided. Typical resolutions are in either 5-, 10-, 15-, and 30-min resolutions, while more extended periods, such as 1-, 3-, and 6-h, typically require more data, such as advanced cross-correlation algorithms, dual-frequency Doppler radars, wind velocity, and direction data, and even NWP products. However, the algorithm is tested in this test to assess its tracking ability on observed images rather than figuring out a possible future position.
Finally,
Figure 8 shows the 10-min interval using merged 10-min resolution frames, similar to the previous tracking images. As was the previous results, the 10-min periods also showed good consistency. Specifically, the storm cells were found to be consistent with the used-to-be place and current place. For instance, by comparing the timeframe 15:50 shown in
Figure 7 and
Figure 8, it is shown that although in the past, cells were calculated using different data—specifically, the 10-min reflectivity at 15:48 in the former and the 10-min reflectivity at 15:40 in the latter—the consistency between the periods remains acceptable. This demonstrates that the algorithm can still be reliable for future predictions, even with larger time intervals.
3.3. Rainfall Patterns Analysis
After evaluating the ability of the storm-tracking algorithm to identify and track storm cells across multiple rainscanner images, it is used to construct the track of an entire event. In Athens, typical events have short durations, lasting 6- to 12-h, while summer convective events can have smaller durations of up to 4- to 9-h.
Table 1 features a sublist of the events used in the analysis, with the date and duration of the events illustrated in the following figures.
Although the 30 dBZ limit is used as the primary threshold to identify storm cells, in order to produce more readable images and focus on the areas where high rainfall intensity was observed, we plot the storm cells’ boundaries and centroids that featured a reflectivity over 35 dBZ and 40 dBZ. These thresholds are used to emphasize regions that were hit by intense rainfall. These maps assist in assessing areas affected by the highest rainfall intensities. While the storm cells might have a bigger footprint, the 35 dBZ can be considered a relatively high threshold. When transforming this threshold to rainfall intensity, through the use of a Z–R relationship, such as the typical Marshal and Palmer [
18], Z = 200 R
1.6, 35 dBZ translates into 5.62 mm/h, while when using the Z–R relationship that was derived by the authors [
19], who used the same radar datasets and correlation analysis with rain gauges, the Z = 321 R
1.25 relationship, the 35 dBZ translates into 4.46 mm/h. Similarly, the 40 dBZ threshold translates into 11.53 mm/h and 9.47 mm/h rainfall intensity for the above Z–R relationships.
Figure 9 shows the regions affected mainly by two convective events, 31 and 50, using the 35 dBZ and 40 dBZ thresholds. The difference between utilizing a 35 and 40 dBZ threshold is shown in each column. In event 31, the main trajectory of the storm is more visible when observing the 40 dBZ threshold map. The storm’s movement had a northwest-to-east direction, first observed on the top of Mount Pateras, and followed a direction towards Aigaleo Mountain to the northern suburbs of Athens when comparing the maps to
Figure 1. The triangle showcases the location of the rainscanner, which, as mentioned, is only 5 km from the city of Athens. Finally, the tracking is also visible in the 40 dBZ map for the E50 event with dashed black lines. Similar maps are made for the dataset, consisting of 50 storm events from 2018 to 2023.
By analyzing the similar maps of the entire event database, it is evident that most storm cells are located in the southeast region, over the islands of Aegina and Salamina. This result is attributed to the seawater feeding the storm cell more moisture. Considering the mainland areas, the areas most affected depend on the storm’s trajectory, with the coastal areas in the southwest, where Piraeus port is located, being the majority, while also the area west of the city of Athens, where the Aigaleo mountain is found (see
Figure 1), also features a high number of impacts. The center of Athens, near the triangle that illustrates the rainscanner location, does not have that many impacts. Generally, rainfall events tend to have a western direction, towards the city center, then following a northern direction, due to the presence of Hymettus Mountain (found east of the rainscanner location).
Since these maps might be overwhelming when trying to understand the trajectory of the primary storm cell, in
Figure 10, only the centroids of the 35 dBZ cells are illustrated with red dots. Moreover, four more events are added to the specific figure: events 6, 21, 61, and 62.
On each frame, of
Figure 10, an entire event is illustrated, and the main trajectory of the storm cells is plotted. This trajectory is extracted by evaluating all the results generated by the storm-tracking algorithm, focusing on the larger storm cells. Generally, the trajectory of the storm cell matches the location of the storm cell centroids. Event 21 shows a change of trajectory, where, first, the main trajectory had a south-to-north direction, only to change to an eastern direction.
The featured events are a subsection of all the events evaluated. In
Figure 11, the main directions of the storms featured in Attica after analyzing the storm direction of each event, along with the percentage of each direction, which is also illustrated with the use of increasing line weights of each direction arrow. Most storms (40%) had a southeast-to-northwest direction, passing over Athens. Following this, a west-to-east and northwest-to-east direction is also featured, 20% and 12%, respectively, while some cases of a south-to-northeast direction, 6%. Finally, north-to-south directions are also featured, approximately 16% of the entire datasets. Another critical factor to consider is the event’s storm characteristics: the season they occurred, their velocity, and convective/stratiform classification. Events that featured stratiform characteristics, as identified by their large footprints, medium reflectivity, and slower velocity, usually occurred in the winter months, and mainly had a north-to-south direction. On the other hand, most of the events studied were convective events featuring a south-western-to-east direction, and an average velocity of about 45 km/h. Finally, even though no measurements were feasible to the southeastern part of the area due to radar beam blockage, no events featured a southeast-to-west direction where data were available.
It is important to stress that during the studied period, Greece experienced reasonably hot winters with a small number of rainy days, with a negligible impact on the region of Athens due to the topography of Greece. Therefore, most events occurring, or instead significantly impacting Athens infrastructures through flooding, have convective characteristics even in the winter months. Greece’s most recent events occurred from September (the 2020 Ianos and 2023 Daniel storms) to November (the 2017 Mandra event). While the first two had significant rainfall, these events did not affect the city of Athens, while the latter had a minimal footprint, fixed over the mountain Pateras in the western area of the Attica region.
4. Conclusions
The storm-tracking algorithm developed in this research concentrated on cell identification and tracking methods. The algorithm was evaluated on the basis of its ability to track storm cells through the use of various reflectivity thresholds and shapes. It was found that employing multiple thresholds is advantageous, as it allows for better identification of both the cell’s centroid and its actual boundaries. Specifically, the 25 dBZ threshold effectively defines polygon boundaries, while the 30 dBZ threshold is better suited for identifying primary cells. Given these findings, the 30 dBZ threshold and a fitting ellipse with a minimum area of 4 km2 were utilized in the matching algorithm, as it yielded the best results for detecting primary cells. A fitting ellipse is also preferred as a mathematical representation, as it works more effectively with the matching algorithms. Following this, the matchmaking algorithm is applied and tested for cell tracking across different temporal scales. While the rainscanner system records reflectivity products with a 2-min temporal resolution, such tracking is impractical, and, therefore, reflectivity fields with a 10-min resolution were generated by merging five consecutive images. The algorithm performed better with this extended tracking period, as more data per interval were used. Further runs of the algorithm over a 10-min tracking period showed effective performance, considering the number of tracked, merged, disappeared, or newly formed cells.
The tracking algorithm was then applied to over 80 events recorded with the rainscanner system. As mentioned earlier, of the 80 events, only 54 featured high enough reflectivity records to form well-identified tracking. The results featured two main findings. First, the areas predominantly affected by high rainfall intensity were identified by analyzing the storm cell’s borderlines in conjunction with a high reflectivity threshold. These areas are firstly the northern part of the Saronic Gulf, i.e., at the sea, while on the mainland, the areas most affected are the southwest coastal front of Athens, near Piraeus port, and regions surrounding the Aegaleo mountain. These areas tend to be affected due to topography since, in the first case, the coastal front is subjectable to high rainfall due to the proximity of the sea, which feeds the storm cells with added moisture, while in the latter case, it is the increase in elevation that is caused by the Aigaleo mountain. The second finding focuses on the primary storm directions in Attica. As expected, most storms (40% of events) moved from the southwest to east, followed by northwest-to-east and south-to-north directions.
Additionally, storms moving from the north and northeast to the west were common, though these events were associated with stratiform rainfall, characterized by slower-moving storms with light-to-moderate rainfall rates. In contrast, most convective events moved from west to east and exhibited higher velocities. Considering the above, this analysis highlights the importance of proper rainfall monitoring in early warning systems. As shown, since most storm systems had a west-to-east direction, it emphasizes the need to expand the monitoring system to the west further to detect these systems earlier and add to possible lead times when producing forecasts.
The obtained results showcase the algorithm’s ability to assess the rainscanner datasets and provide valuable information concerning the rainfall patterns and intense rainfall-prone areas over Athens. The algorithm’s applicability makes it a valuable tool for any weather radar system, ensuring better weather monitoring and warning systems in the respective areas of weather radar coverage. Ongoing research focuses on adding a nowcasting algorithm to predict real-time storm movement. Such nowcasting tools are essential for real-time monitoring and forecasting in a flood early warning system frame, we suggest that further analysis on a larger scale, i.e., radar scans covering further distances or using synoptic and mesoscale datasets, should have their merit since the 50 km range of the rainscanner can be considered small to provide adequate lead times. Nonetheless, this analysis brings to fruition the necessity of adopting high-quality measurements to deal with storm issues, which are expected to increase in frequency due to the effects of climate change.