1. Introduction
Nowcasting refers to the weather forecast for the next 0–6 h (especially for 0–2 h) kilometer and minute level, which is a social service to prevent sudden catastrophic weather in the area [
1]. Its forecast objects mainly include thunderstorms, heavy precipitation, snowstorms, and other strong convective weather with rapid development, complex changes, and strong destructive power. How to accurately predict future weather changes by nowcasting is an important research topic to mitigate or even prevent strong convective disasters.
The weather forecasting methods are mainly divided into two categories: numerical weather forecasting (NWP) and extrapolation methods based on observation data from radar or satellite. NWP is a method based on solving a series of partial differential equations governing atmospheric motion [
2]. It quantitatively describes the physical state of the atmosphere. However, the mass computation makes it difficult to finish fast real-time forecasts by NWP. Additionally, it is hard to assimilate observations at the convective scale, which makes it difficult to describe sub-grid-scale physical processes. Small perturbations in the initial boundary conditions can seriously affect the final forecast accuracy [
3]. These characteristics determine that NWP is suitable for long-term large-scale forecasts but not for short-term small- and medium-scale forecasts.
Since the radar echo data has high spatio–temporal resolution and intuitional information, extrapolation based on real-time weather radar data become the mainstream method for nowcasting [
4]. Radar extrapolation technology refers to predicting radar echo images in the future based on the sequence of radar echo images in the past. High-quality radar extrapolation results are the basis for a variety of subsequent weather system identification and forecasting algorithms. The traditional radar extrapolation methods mainly include the storm cell identification and tracking algorithm (SCIT); the thunderstorm identification, tracking, analysis, and nowcasting (TITAN); tracking radar echoes with the correlation algorithm (TREC); and the optical flow method. The SCIT and TITAN algorithms focus on the identification and tracking of storm cells. They calculate the motion vectors of the cell by determining and tracking the position of the cell’s centroid [
5,
6]. However, they selectively ignore weak echo information in the forecasting process because it only focuses on the motion trajectory of a single storm cell. Tracking radar echoes with a correlation algorithm (TREC) calculates the optimal correlation between pairs of local regions. By comparing the echo images of adjacent moments, TREC obtains the motion vector of each region in the echo image. This motion vector then predicts the echo position in the future. The methodology described in [
7] is based on this approach. It will generate many disordered motion vectors when tracking echoes generate and change very fast, leading to poor performance. The optical flow method is used to obtain the optical flow field by pixel-level matching to obtain the motion vector of each pixel in the image, and then extrapolate the forecast of the echo image based on this optical flow field [
8,
9,
10]. The optical flow method is based on change rather than motion trajectory tracking based on invariant features. It can only predict the movement of storm cells by optical flow motion, not the change in storm cell size and intensity. Therefore, the performance decreases drastically with the increase in lead time [
11].
In recent years, many deep learning models have been proposed to improve forecasting performance. Neural networks can learn and model complex weather motion patterns from large historical data sets with their powerful feature extraction and nonlinear mapping capabilities. A series of neural networks for radar echo prediction have emerged [
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22]. The first end-to-end extrapolation model for nowcasting was proposed by Shi et al. [
12]. They improved the traditional long short-term memory (LSTM) unit by replacing fully connected layers with convolutional layers. This modification proposes the ConvLSTM network, in which the extraction of spatial features is implemented with convolutional layers and the LSTM unit is responsible for capturing the correlation between echoes at different moments. The extrapolation performance is significantly improved compared to the conventional optical flow method. Furthermore, Shi et al. [
13] also proposed the TrajGRU network to further capture scaling and rotating objects. Additionally, Wang et al. [
14,
15,
16] proposed a series of recurrent neural network (RNN) structures to refine the traditional ConvLSTM. In addition to RNN, Agrawal et al. [
17] treated radar echo extrapolation as an image-to-image conversion problem, using the data-driven approach without considering the actual physical mechanism, and used U-Net for direct prediction. Kim et al. [
18] used 3D convolution to extract the spatio–temporal information of the video sequence frames. Different from the end-to-end models above, in order to generate two prediction probability vectors to predict future radar echoes, Klein et al. [
19] proposed adding a dynamic convolutional layer to the traditional convolutional neural network. However, the above approach using the traditional mean square error loss leads to blurred image output from the model. Therefore, Wang et al. [
20] employed conditional generative adversarial networks (CGAN) and 3D convolutional networks to improve the extrapolation performance.
Despite their better performance, however, literature [
4] points out that most of the deep learning methods proposed at present only use historically recorded radar echo sequences as input, omitting the fact that the development of convection is highly related to the information rooting in the atmospheric environment. The dynamical structure of the storm and its potential development depend heavily on three physical elements: the thermal instability of the environment, the vertical wind shear, and the vertical distribution of water vapor [
23]. It is apparently true that radar echoes do not contain the above physical information, which can lead to different evolving processes of echoes even for similarly observed input echoes [
24,
25]. In other words, similar echo sequence inputs may have different directions under different environmental field constraints. In particular, it is difficult to predict generation, extinction, splitting, merging, and other phenomena only based on radar echoes. Some experiments have been made to integrate the radar data into the NWP [
26,
27] or use some extra data to improve the extrapolation, such as dual-polarization radar data [
28] and the high-resolution rapid refresh model (HRRR) [
29]. However, so far, no attempt has been made to incorporate environmental field information into the extrapolation process.
In order to provide intuitive and effective macroscopic physical constraints on the extrapolation process and improve the performance of the extrapolation, this paper proposes the idea of building a radar echo extrapolation model with the participation of real-time environmental field data. The environment field data can give useful information that is consistent with the evolution trend of radar echoes. The combination of radar data and environmental field data describes both the convective system intensity distribution as well as the external conditions that constrain the evolution of the convective system. Thus, it improves the model’s prediction capability of radar echo evolution comprehensively and allows for a more reasonable interpretation of the prediction results from a meteorological perspective. The design idea for the extrapolation model is to build a multi-channel 3D convolutional network consisting of three functional modules: encoding, inference, and decoding. There are two kinds of encoders: the echo encoder and the environment field encoder. The feature layers from the two types of encoders are stacked and sent to the inferer for fusion, which is then processed by the decoder to obtain the extrapolation result corrected by the environment field.
The paper is organized as follows:
Section 2 describes the proposed method and the data used in this experiment;
Section 3 and
Section 4 show the test results and the analysis of different cases.
Section 5 concludes with a discussion.
4. Case Analysis
In order to visually evaluate the improvement of the environment field on the radar echo extrapolation, we chose two cases to illustrate it.
Figure 6 shows a visual comparison of the radar echo sequence observed at Bingzhou station on 13 June 2016 at 8:06–9:00 and the results obtained from the three extrapolation models. The model inputs are all the observed sequences on 13 June 2016 at 7:00–8:00.
As can be seen from the top row of
Figure 6, the lower left region at the beginning of the observed image sequence contains a larger convective cell as well as several scattered smaller convective cells. As the lead time increases, the large convective cell tends to be stable, while numerous small convective cells converge. A new large convective cell and a strong small convective cell are formed on the left and lower left sides of the stable cell at about 54 min. In addition, the reflectivity in the upper left area is gradually increasing. The optical flow method has excellent prediction in the first few moments, but the prediction of the large convective cell shows significant deformation and scale decay. It fails to predict the new large cell’s generation by the merging of the small cells. In contrast, for the 3D-CNN model without the inclusion of the environment field, it is able to predict the motion trend of the cell and maintain its shape roughly, but the model tends to predict more and more ambiguous images. The model predicts that the scattered small cells around the large cell will gradually die out in the future, thus failing to successfully predict the generation of new large cells on the left. ConvLSTM performs well in predicting the first three images but soon tends to predict ambiguous results. It fails to predict the trend of echo enhancement in the upper left of the image, and the cells in the middle lack detail. After introducing the divergence field to the 3D model, it successfully predicts the generation of new cells on the left and lower left sides and the enhancement of reflectivity in the upper left region with the assistance of the divergence field, although it still outputs an ambiguity prediction.
The second case shows a visual comparison of the radar echo sequence observed in Yantai station on 30 July 2016 at 14:06–15:00 and the results obtained from the three extrapolation models. The model inputs are all the observed sequences on 30 July 2016 at 13:00–14:00.
The ground truth of
Figure 7 shows that the upper left region at the beginning of the observed image sequence contains multiple strong convective cells of varying sizes. As the lead time increases, the left cells gradually die out, and the right cells gradually evolve into a cluster and a band (24 min). Then, the upper side of the band echo merges with its left cell, while its lower side splits from the main body to form a separate cell (36 min). The two merged and split convective regions then enter into a weakening process. Finally, most cells are dead, leaving a few small, scattered cells. Since the optical flow method cannot predict the variation of the reflectivity, the radar reflectivity in row two remains essentially constant throughout the predicted sequence, which leads to a false prediction of the evolutionary processes of extinction, merger, and splitting of the cell. For the 3D-CNN model without environmental field and the ConvLSTM model, the radar echo starts to fade at about 36 min, but there is still a strong convective cell at the center of the image until 60 min. The information provided by the divergence field helps the model predict the weakening of the strong convective region at 24 min and correctly predict the extinction of the left cell, the merging and splitting of the band echo, and the clump echo.
In this article, a visualization method similar to
Figure 5 and
Figure 6 was used to analyze and compare all 1589 test samples. Approximately 10% of samples had generation, extinction, merging, and splitting in the 1-h echo evolution. Most of them gave similar optimistic predictions, similar to the examples in
Figure 5 and
Figure 6, fully illustrating that the inclusion of environmental fields has a positive constraint and guiding effect on the radar echo prediction.
5. Conclusions
In this paper, we propose a 3D convolutional neural network combing both radar and environmental field information to improve extrapolation performance. Our main contributions are: (1) By interpolation in the temporal dimension and down-sampling in the spatial dimension, the adaptation of the environmental field and radar echo sequence in spatial and temporal resolution is obtained. (2) The structure of the two kinds of encoders realizes the fusion of the two inputs in the channel dimension. They jointly participate in the extrapolation proceeding. (3) We explored the influence of different environmental fields and their combinations on predicting radar echoes. Finally, we conclude that the divergence field performs best in improving radar echo extrapolation quality.
Several experiments demonstrate the advantages of the proposed model in this paper: On the pixel level, the CSI of the 3D CNN model after introducing the divergence field is improved by 10% compared with the conventional optical flow method and by 2%–3% compared with the 3D CNN model without the environment field. On the convective cell level, the CSI is improved by 7.6% compared with the optical flow method and by 4.8% compared with the 3D CNN model without the environment field. Two cases show the capability of predicting the generation, extinction, merger, and splitting of convective cells. Therefore, we can conclude that using environment field information can improve extrapolation performance.
The experiments demonstrated that multi-source data provide more reliable information for radar extrapolation. Since the environment field provides information on the evolution of convective systems, we expect to further extend the effective time of extrapolation to two hours or even longer. Our future work will focus on building more complex models considering the structure of typical weather systems based on environmental information and meteorological knowledge, which fuse the environmental fields and radar data more effectively for better extrapolation performance.