Article

Neural Network-Based Identification of Cloud Types from Ground-Based Images of Cloud Layers

Zijun Li, Hoiio Kong and Chan-Seng Wong

1 Faculty of Data Science, City University of Macau, Macau 999078, China
2 Macao Meteorological Society, Macau 999078, China
* Author to whom correspondence should be addressed.
Appl. Sci. 2023, 13(7), 4470; https://doi.org/10.3390/app13074470
Submission received: 13 March 2023 / Revised: 26 March 2023 / Accepted: 30 March 2023 / Published: 31 March 2023
(This article belongs to the Special Issue AI-Based Image Processing)

Abstract

Clouds are a significant factor in regional climates and play a crucial role in regulating the Earth's water cycle through the interaction of sunlight and wind. Meteorological agencies around the world must regularly observe and record cloud data, but current methods of collecting cloud data rely mainly on manual observation. This paper presents a novel approach to identifying ground-based cloud images to aid in the collection of cloud data. There is currently no publicly available dataset suitable for this research, so we built a dataset of surface-shot images of clouds called the SSC, constructed under the supervision of the Macao Meteorological Society. Compared to previous datasets, the SSC offers a more balanced distribution of data samples across cloud genera and a more precise classification of cloud genera. We propose a method for identifying cloud genera based on cloud texture, using convolutional neural networks. To extract cloud texture effectively, we apply Gamma Correction to the images. The experiments were conducted on the SSC dataset. The results show that the proposed model performs well in identifying 10 cloud genera, achieving a top-three accuracy of 80%.

1. Introduction

Clouds are a significant element affecting regional climate: different types of clouds reflect varying amounts of sunlight depending on their composition, altitude, and other factors, which can alter the Earth's climate [1,2]. As shown in Figure 1, under the influence of sunlight and wind, clouds play a significant role in the Earth's water cycle [3]. Local weather is closely linked to the type of cloud coverage [4], and, to some extent, future weather can be forecast from the types of clouds present.
Cloud types and structures are vital sources of meteorological information, and it is crucial for regional meteorological bureaus to consider the types and structures of clouds present in the area when compiling meteorological data and generating regional weather maps [5]. Accurate cloud data on barometric charts can help a weather bureau provide more precise forecasts. However, unlike other meteorological data that can be collected automatically (e.g., temperature, humidity, rainfall), cloud data still rely on manual observation and collection by observers. Meteorological offices require hourly observations, or half-hourly observations for facilities significantly affected by weather and cloud conditions (e.g., airports), which makes the manual collection of cloud data costly. Automatic classification of cloud genera would therefore greatly reduce the cost of collecting meteorological data. Currently, cloud classification is performed manually by experts in the field. This method is time-consuming and labor-intensive, and its accuracy depends solely on the expertise of the observer, introducing uncertainty into the classification results. An automated and precise classification method would significantly enhance the consistency of the cloud genera data collected for researchers. However, there is currently a lack of research on the automatic identification of cloud types from ground-based images.
Current research on cloud classification has primarily categorized satellite images [6], which provide a direct view of cloud cover over the Earth's surface. However, this method is not well suited to real-time use, as these satellites do not continuously monitor the same area and data transmission takes significant time. In contrast, some research has used cloud images captured from the ground to identify cloud genera. Researchers have proposed classifying clouds using the textural structure of infrared images [7], focusing on features such as mean grey level, edge sharpness, and cloud gaps. While these methods perform well overall, they may not be accurate enough to distinguish similar clouds, which is why some studies add the spectral information of clouds as features [8]. Although fewer studies have used ground-based images for classification, the approach is feasible: some studies have employed support vector machines (SVM) [9] to classify clouds based on their texture and structural features. Nevertheless, this approach demands high-quality images, and classifier performance may suffer if images are captured at night or under rain or snow. Improvements in camera technology, including whole-sky imagers [10] and infrared cloud analysis instruments [11], now provide the basis for collecting high-quality ground-based cloud image data.
The progress of machine learning has led to the widespread use of artificial neural networks (ANNs) in computer vision (CV), with significant achievements [12]. Numerous scholars are attempting to incorporate ANNs into meteorology, with convolutional neural networks (CNNs) garnering particular attention and producing remarkable results [13,14,15]. Some researchers have also started to employ CNNs for cloud image identification. A traditional CNN operates as a hierarchical framework for feature extraction: the lower layers extract the cloud's texture (e.g., structure, fringes), whereas the higher layers capture more complex semantic information. Although ground-based images are limited in representing spectral information and cloud composition, the texture features of clouds remain the primary basis for their classification, and ground-based images are well suited to depicting this feature [9]. Since the textural variations between cloud genera can be subtle, the ability of CNNs to efficiently extract feature information through convolutional kernels offers a means of achieving the automatic and precise classification of ground-based cloud images. The DeepCloud network was the first attempt to use neural networks to identify ground-based cloud images [16]. The researchers classified clouds into eight categories, supplemented with clear-sky images, and employed deep neural networks to extract features from the images, achieving an accuracy of 80% and demonstrating the efficacy of neural network classification. Following DeepCloud, subsequent research also applied neural networks to ground-based cloud images, known as CloudNet [17]. In the CCSN database, clouds were classified into 11 categories, and the model was enhanced by normalizing the RGB three-channel values of the images and increasing image contrast, ultimately reaching a high level of accuracy. Subsequently, Huertas-Tato et al. proposed combining neural networks and random forests to further improve classification performance [18]. Liu et al. proposed MMFN, which can learn extended cloud information by fusing heterogeneous features in a unified framework [19]. TGCN is a model that learns features in a supervised manner and incorporates graph computation [20]. However, these studies have not addressed the problem of accurately classifying similar cloud formations. In summary, few studies in meteorology or image recognition use neural networks to identify ground-based cloud images, and all of them face the challenge of inadequate data samples or poor data quality. In response to these problems, we built a ground-based cloud image dataset and propose a CNN-based method for identifying ground-based cloud images. The main contributions of this research are as follows:
  • Constructing a ground-based cloud image dataset, known as the SSC, which is the first dataset to ensure roughly equal sample sizes for each cloud genus. The dataset comprises 1092 data samples, categorized into 10 groups.
  • Employing various processing methods to adjust the contrast and brightness of the images, thereby enhancing the texture of the clouds. This approach makes the model's extraction of cloud texture more effective and improves its performance.
  • Employing a new evaluation method for assessing the performance of our model, which categorizes clouds into three types based on their texture (cumulus, stratiformis, and undulatus). While this evaluation method is relatively uncommon in machine learning, it is crucial in the field of meteorology: these three types of clouds have varying degrees of impact on the climate, which is an important consideration when collecting cloud data [21,22].
The remainder of this paper is structured as follows: Section 2 provides a brief introduction to our dataset; Section 3 details a method for improving the recognition of ground-based cloud images by the model through image enhancement; Section 4 presents the experimental results and provides a discussion on the feasibility and effectiveness of the proposed approach; Section 5 is where a summary of the research is presented, along with a discussion on future research directions to explore the practical applications of the results.

2. Data

At present, the International Cloud Atlas (WMO) [23] classifies clouds into three families and ten genera according to their height and texture. In addition, according to the mechanism of cloud formation, clouds can be divided into cumulus, stratiformis, and undulatus, as shown in Table 1 and Figure 2, which list the cloud genera and a sample of each in the dataset. Clouds generated by convective motion are cumulus; clouds formed by systematic vertical motion are stratiformis; and clouds formed by atmospheric fluctuations or atmospheric turbulence are undulatus. However, no meteorological office or research institute yet maintains a dedicated database of ground-based cloud images. Therefore, this study uses CCSN [17] and the dataset from the Cloud Watching Contest [24] as a foundation to establish a better dataset, improved under the supervision of the Macao Meteorological Society. We removed poor-quality images from the dataset (e.g., images containing a certain proportion of buildings or trees); specifically, we eliminated data samples in which clouds occupied less than a certain proportion of the 480 × 480-pixel image. Both the CCSN and the Cloud Watching Contest datasets were obtained from a single institution, meaning all images within each dataset were captured at the same location. Since cloud formation is influenced by geographic location and by cold and warm currents [25], data captured from a single location over-represent certain cloud genera. To avoid one category of data samples significantly outnumbering the others, which would cause the model to over-focus on that category during feature extraction and degrade overall performance, we additionally collected and labeled images of the cloud genera with smaller sample sizes and added them to the SSC. This ensures that the data volume of each cloud genus in the dataset is roughly the same. The cloud genera in our dataset and the number of data samples are shown in Table 1.

3. Method

3.1. Image Processing

The classification of ground-based cloud images is primarily based on the textural characteristics of the clouds. However, some cloud genera have less distinctive textural characteristics than others, and the differences between them rest mainly on their height and composition. For example, Altocumulus (Ac) and Cirrocumulus (Cc) share the appearance of a patch, sheet, or layer of cloud; the distinguishing factor is that Cc has a white cloud base with a ripple-like texture, while Ac has a greyish cloud base with smaller and well-defined cloud patches. It may be challenging to accurately differentiate cloud genera with similar textural characteristics from the original images alone. To emphasize these textural characteristics for subsequent feature extraction, we pre-process all data samples using two families of methods: edge detection (the Sobel and Canny operators) and contrast and brightness adjustment (the Clahe algorithm and Gamma Correction).
Edge detection is typically carried out using the Sobel operator [26], which approximates the image gradient with discrete grey-level differences while smoothing the image. This algorithm employs two 3 × 3 convolution kernels, convolved with the image in the x and y directions, respectively:
$$ G_x = \begin{bmatrix} -1 & 0 & 1 \\ -1 & 0 & 1 \\ -1 & 0 & 1 \end{bmatrix}, \qquad G_y = \begin{bmatrix} -1 & -1 & -1 \\ 0 & 0 & 0 \\ 1 & 1 & 1 \end{bmatrix} \tag{1} $$
The image is convolved with these kernels, and an approximate gradient value g is calculated for each pixel:
$$ G = |G_x| + |G_y|, \tag{2} $$

$$ g = \begin{cases} G, & G \ge \rho \\ 0, & G < \rho \end{cases} \tag{3} $$
In practice, implementations aiming for efficiency do not compute the exact Euclidean magnitude of Gx and Gy (the square root of the sum of their squares, as in Equation (5)) but approximate it by the sum of absolute values in Equation (2). Here, ρ is a hyperparameter: when the grey gradient value g is greater than ρ, the pixel is retained as a graphical edge; otherwise, it is suppressed. The Sobel operator detects edges by assigning weights to the grey-value differences of the 8 neighboring pixels surrounding each pixel point; this weighted difference reaches an extreme value at an edge. The operator also has a smoothing effect on noise and provides relatively quick and accurate information on the direction of edges, but it is not very precise in locating them.
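As a concrete illustration, Equations (1)-(3) can be sketched with OpenCV and NumPy as below. Note that cv2.Sobel applies the standard center-weighted 3 × 3 kernels rather than the unweighted ones in Equation (1), and the threshold ρ here is an arbitrary illustrative value, not a tuned setting from our experiments.

```python
import cv2
import numpy as np

def sobel_edges(path: str, rho: float = 50.0) -> np.ndarray:
    """Approximate gradient magnitude |Gx| + |Gy| thresholded at rho."""
    gray = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    # Directional derivatives via 3x3 kernels (Equation (1))
    gx = cv2.Sobel(gray, cv2.CV_64F, 1, 0, ksize=3)
    gy = cv2.Sobel(gray, cv2.CV_64F, 0, 1, ksize=3)
    # Approximate magnitude G = |Gx| + |Gy| (Equation (2))
    g = np.abs(gx) + np.abs(gy)
    # Retain pixels whose gradient exceeds rho, suppress the rest (Equation (3))
    return np.where(g >= rho, np.clip(g, 0, 255), 0).astype(np.uint8)
```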
Edge detection is also performed using the Canny operator [27], which first smooths the image through a Gaussian filter with a Gaussian kernel G:
$$ G(x, y) = \frac{1}{2\pi\sigma^2}\, e^{-\frac{x^2 + y^2}{2\sigma^2}}, \tag{4} $$
where σ is a hyperparameter that determines the width of the Gaussian function. The magnitude and direction of the gradient are then calculated. The Canny operator uses the same pair of convolution kernels as the Sobel operator, shown in Equation (1), to compute edge responses in the x and y directions. Here, Gx performs edge detection in the x-direction: when the kernel passes over boundary pixels perpendicular to the x-direction, it amplifies the pixel values on the boundary. Gy performs edge detection in the y-direction and operates in the same way. The gradient amplitude is then calculated from these kernels:
$$ \mathrm{EdgeGradient}\,(G) = \sqrt{G_x^2 + G_y^2}, \tag{5} $$
and gradient direction calculation:
$$ \mathrm{Angle}\,(\theta) = \tan^{-1}\!\left(\frac{G_y}{G_x}\right) \tag{6} $$
Gaussian filtering may thicken the edge pixels of the image. Therefore, after the filtering is complete, non-edge pixels must be filtered out using non-maximum suppression, which keeps the edge width to a minimum of one pixel. If a pixel lies on an edge of the image, its gradient value will be the maximum along the gradient direction; otherwise, its grey value is set to zero, which refines the image edges. The Canny algorithm performs non-maximum suppression in four directions:
  • θ = 0°: compare the left and right neighbors of the pixel.
  • θ = 45°: compare the lower-left and upper-right neighbors of the pixel.
  • θ = 90°: compare the top and bottom neighbors of the pixel.
  • θ = 135°: compare the upper-left and lower-right neighbors of the pixel.
Finally, hysteresis thresholding is applied, which requires two thresholds (a high and a low threshold). As shown in Figure 3, an edge pixel with a gradient value above the high threshold is labeled a strong edge pixel; an edge pixel with a gradient value between the low and high thresholds is labeled a weak edge pixel; and an edge pixel is suppressed if its gradient value falls below the low threshold. The Canny operator is not susceptible to noise interference and is able to detect true weak edges. Moreover, it excels at localizing edges, accurately identifying the pixels with the greatest grayscale variation; however, its computational complexity is significantly higher, which reduces its efficiency.
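The entire pipeline (Gaussian smoothing, gradient computation, non-maximum suppression, and hysteresis thresholding) is available as a single OpenCV call. The minimal sketch below is illustrative, with an assumed kernel size, σ, and hysteresis thresholds rather than the exact settings used in our experiments.

```python
import cv2

def canny_edges(path: str, low: int = 50, high: int = 150):
    """Canny edge detection with explicit Gaussian pre-smoothing."""
    gray = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    # Gaussian smoothing (Equation (4)); kernel size and sigma are illustrative
    blurred = cv2.GaussianBlur(gray, (5, 5), 1.4)
    # low/high are the hysteresis thresholds (minVal/maxVal in Figure 3);
    # cv2.Canny performs gradient computation, non-maximum suppression,
    # and hysteresis thresholding internally
    return cv2.Canny(blurred, low, high)
```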
Contrast enhancement uses the Clahe algorithm [28], an improvement on histogram equalization (HE). This method enhances the contrast of an image by performing histogram equalization locally, which preserves more of the image's detail and facilitates feature extraction. The Clahe algorithm adjusts the luminance distribution of pixels by equalizing the histogram, while avoiding the over-amplification of particular grey levels by limiting the number of pixels assigned to each level. Comparing the histogram of the original image in part (a) of Figure 4 with the histogram processed by the Clahe algorithm in part (b), it is evident that Clahe redistributes the grey values of pixels that exceed a certain threshold to lower values. By reducing the brightness of bright areas and increasing the brightness of dark areas, the Clahe algorithm enhances the contrast of the image.
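As a sketch, Clahe can be applied with OpenCV as below. Equalizing only the lightness channel of the LAB color space is a design choice we assume here to avoid distorting hue; the clip limit and tile grid size are illustrative values.

```python
import cv2

def clahe_enhance(path: str, clip_limit: float = 2.0, tile: int = 8):
    """Contrast-limited adaptive histogram equalization on the lightness channel."""
    bgr = cv2.imread(path)
    lab = cv2.cvtColor(bgr, cv2.COLOR_BGR2LAB)
    l, a, b = cv2.split(lab)
    # clipLimit caps the per-bin pixel count before equalization, preventing
    # over-amplification; tileGridSize controls the size of the local regions
    clahe = cv2.createCLAHE(clipLimit=clip_limit, tileGridSize=(tile, tile))
    l_eq = clahe.apply(l)
    return cv2.cvtColor(cv2.merge((l_eq, a, b)), cv2.COLOR_LAB2BGR)
```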
The final image enhancement method is Gamma Correction [29], a non-linear operation that adjusts the brightness of the image. The output pixel value is determined by the transformation
$$ s = c\, r^{\gamma} \tag{7} $$
where r is the input pixel grey value, c is a constant (usually 1), and γ is the Gamma Correction value, which determines whether the output grey value s is transformed to a larger or smaller value. As shown in Figure 5, when γ > 1 the transformation is called decoding gamma, and the resulting image is darker overall; when γ < 1, it is called encoding gamma, and the resulting image is brighter overall.
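Equation (7) can be implemented efficiently with a 256-entry lookup table. This is a minimal sketch assuming OpenCV/NumPy, with c = 1 and an illustrative γ; grey values are normalized to [0, 1] before the power law is applied.

```python
import cv2
import numpy as np

def gamma_correct(path: str, gamma: float = 0.7):
    """Apply s = c * r^gamma (Equation (7), c = 1) via a lookup table."""
    img = cv2.imread(path)
    # Normalize grey values to [0, 1], apply the power law, rescale to [0, 255]
    table = ((np.arange(256) / 255.0) ** gamma * 255).astype(np.uint8)
    # gamma < 1 (encoding gamma) brightens; gamma > 1 (decoding gamma) darkens
    return cv2.LUT(img, table)
```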
We apply the above four methods to the original data to strengthen the texture information in the images. The results of pre-processing are presented in Figure 6. We then train models on the four datasets obtained by processing the original dataset and compare their performance.

3.2. Model

Neural networks (NNs) are a classic field within machine learning. In NNs, problems such as classification are solved by computing non-linear functions in neurons, as illustrated in Figure 7. A classical NN contains an input layer, one or more hidden layers, and an output layer; this is the underlying multilayer perceptron (MLP) structure [30,31]. However, NNs face a challenge due to their fully connected structure, which requires a large number of operations between hidden layers. In image recognition, an NN treats the data as a matrix vector and extracts feature values from it; the feature vector is then passed to all the neurons in the next layer, which repeat the previous step until the data reach the output layer. This means that if we increase the number of image channels or pixels, or add new neurons to the hidden layers, the amount of computation required grows significantly, and the resulting increase in time and memory consumption becomes unacceptable. Compared to traditional NNs, the CNN proposed by LeCun et al. addresses this problem to some extent [32,33,34]. In a CNN, neurons in adjacent layers are not all directly connected; instead, they are partially connected through 'convolutional kernels' added to the hidden layers. At the same time, a convolutional kernel is shared between neurons in the same layer, which preserves the location information of features in the image. As data pass down through the convolutional layers, the model extracts increasingly complex texture features. CNNs are particularly effective at extracting local features from an image through convolution kernels, which also limits the number of parameters in the model, making CNNs well suited to image recognition. In addition, CNNs include pooling layers (also known as subsampling or downsampling layers), which reduce the dimensionality of the image by performing a streamlining operation (e.g., maximization or averaging) on the data within a region, retaining only the key information.
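The parameter savings from weight sharing can be made concrete with a short PyTorch sketch (layer sizes here are illustrative, not those of our model):

```python
import torch
import torch.nn as nn

# A 480 x 480 RGB input processed by a fully connected layer vs. a small
# convolutional layer followed by max pooling (illustrative sizes).
x = torch.randn(1, 3, 480, 480)

fc = nn.Linear(3 * 480 * 480, 256)                 # every pixel to every unit
conv = nn.Conv2d(3, 16, kernel_size=3, padding=1)  # 16 shared 3x3 kernels
pool = nn.MaxPool2d(2)                             # halves each spatial dim

print(sum(p.numel() for p in fc.parameters()))     # ~177 million parameters
print(sum(p.numel() for p in conv.parameters()))   # 448 parameters
print(pool(conv(x)).shape)                         # torch.Size([1, 16, 240, 240])
```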
There are numerous classical network structures among CNNs (e.g., Inception [35], proposed by Google). The CNN employed in this paper is the classical VGG16 structure [36], which belongs to a family of models built on enhancements of AlexNet [38]. Compared to AlexNet, the VGG structure contains a greater number of convolution and pooling layers and uses smaller convolutional kernels, such as 3 × 3 or 1 × 1. This enables the model to learn subtle differences in texture features during training, making it well suited to extracting and distinguishing features among cloud types. As shown in Figure 8, VGG16 consists of 13 convolutional layers and 5 pooling layers, after which the data are fed into fully connected layers. Dropout layers [37] were added between the fully connected layers and the softmax layer as one way of preventing overfitting. The output of the final softmax layer is the probability distribution over the 10 classification categories. During training, the Adam optimizer [39] was used, which limits the update step to an approximate range, adjusts the learning rate automatically, and has well-interpretable hyperparameters. In summary, the network was fed 480 × 480 images as RGB three-channel values, and its output is the probability distribution over the 10 cloud genera labels.
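The pipeline can be sketched as follows; this assumes PyTorch/torchvision (the framework, learning rate, and classifier-head dimensions are illustrative assumptions, not a specification of our exact training setup):

```python
import torch
import torch.nn as nn
from torchvision import models

NUM_GENERA = 10  # Ci, Cc, Cs, As, Ac, Ns, St, Sc, Cu, Cb

# VGG16 backbone with a dropout-regularized 10-way classification head
model = models.vgg16(weights=None)
model.classifier = nn.Sequential(
    nn.Linear(512 * 7 * 7, 4096), nn.ReLU(inplace=True), nn.Dropout(0.5),
    nn.Linear(4096, 4096), nn.ReLU(inplace=True), nn.Dropout(0.5),
    nn.Linear(4096, NUM_GENERA),
)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
criterion = nn.CrossEntropyLoss()  # log-softmax + negative log-likelihood

def train_step(images: torch.Tensor, labels: torch.Tensor) -> float:
    """One Adam update on a batch of 480 x 480 RGB images."""
    optimizer.zero_grad()
    logits = model(images)            # (batch, 10) scores over cloud genera
    loss = criterion(logits, labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```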

4. Results and Discussion

To verify the validity of the proposed method in this paper, we evaluated the performance of the model on the 10 cloud genera using various methods:
  • Whether the top three items of the classification result include the correct label.
  • Classification by cloud texture (divided into Stratiformis, Undulatus, and Cumulus).
Refer to Table 1 for the assignment of each cloud genus to a texture class. In addition, we assessed the precision, recall, and F-measure for each evaluation method. Precision is concerned with whether the objects the model assigns to a class actually belong to it, irrespective of whether every member of the class is found; it is typically associated with a high classification threshold. Recall is concerned with whether the model labels all the objects that belong to a certain class, regardless of whether other objects are misclassified into it; it is typically associated with a low classification threshold. The evaluation metrics are calculated from the True Positives (tp), True Negatives (tn), False Positives (fp), and False Negatives (fn) of the model's classification results, as shown in Equations (8) and (9).
$$ \mathrm{Precision} = \frac{tp}{tp + fp}, \qquad \mathrm{Recall} = \frac{tp}{tp + fn} \tag{8} $$
$$ F_{\beta} = \frac{(1+\beta^{2})\,\mathrm{Precision} \cdot \mathrm{Recall}}{\beta^{2}\,\mathrm{Precision} + \mathrm{Recall}} = \frac{(1+\beta^{2})\,tp}{(1+\beta^{2})\,tp + \beta^{2}\,fn + fp} \tag{9} $$
where β is a hyperparameter, usually set to 1. An F-score of 1 is considered the optimal result; 0 is considered the worst.
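A small sketch of these metrics, together with the top-three evaluation described above, might look as follows (NumPy assumed; zero denominators return 0):

```python
import numpy as np

def precision_recall_fbeta(tp: int, fp: int, fn: int, beta: float = 1.0):
    """Precision, recall, and F-score from Equations (8) and (9)."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    b2 = beta ** 2
    f = ((1 + b2) * precision * recall / (b2 * precision + recall)
         if precision + recall else 0.0)
    return precision, recall, f

def top3_accuracy(probs: np.ndarray, labels: np.ndarray) -> float:
    """Fraction of samples whose true label is among the three most probable
    of the 10 predicted genera; probs has shape (n_samples, 10)."""
    top3 = np.argsort(probs, axis=1)[:, -3:]  # indices of the 3 highest scores
    return float(np.mean([labels[i] in top3[i] for i in range(len(labels))]))
```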
In this paper, we used the Sobel, Canny, Clahe, and Gamma methods, respectively, to process the SSC and generate four new datasets. These datasets were applied to the VGG16 model (Figure 8), and the classification results were evaluated using the precision, recall, F-measure, and accuracy described above. The results indicate that the Gamma Correction dataset performed best, as shown in Table 2. The majority of cloud genera displayed high recall (around 80%) and precision. However, the classification results for Cirrostratus (Cs) were relatively poor. Examining the model's classifications over 20 runs, the predictions for Cs-labeled samples are shown in Figure 9; the majority of misclassified samples were identified as Altostratus (As). This is because both cloud genera belong to Stratiformis and share highly similar texture features, and the International Cloud Atlas distinguishes between them primarily by the height of the cloud base and differences in composition. Under the classification by cloud texture, all three categories reached roughly 70% or better in both recall and precision, indicating that the model has a strong discriminatory ability for cloud texture features.
As shown in Table 3, the Clahe dataset gives the second-best performance after the Gamma dataset. While Clahe's overall accuracy in identifying cloud genera is slightly better than Gamma's, precision analysis shows that some cloud genera had high precision while others had low precision, with a significant performance gap between the two groups. We believe that the Clahe dataset does not showcase cloud textures as effectively as Gamma, which is supported by its inferior performance in texture-feature classification. Comparing the recall of the two, Stratiformis and Cumulus have significantly higher recall in Gamma than in Clahe. Additionally, several cloud genera had high recall but low precision in Clahe. These findings further support the conclusion that Clahe is less effective than Gamma at processing cloud texture features. From Table 4 and Table 5, it can be observed that the overall performance of the Sobel dataset is similar to that of the Canny dataset, with both performing worse than Clahe. After processing with Canny, the model struggled to identify Stratus (St) and tended to misclassify cloud genera as Nimbostratus (Ns). Furthermore, Canny performed worse in classifying cloud genera by texture, with an accuracy of only 54%, and struggled to differentiate Cumulus from the other two categories.

5. Conclusions

In this paper, we propose a CNN-based method for identifying 10 cloud genera. Because identification relies on the texture of the clouds, the images undergo pre-processing to enhance these features, and the classical VGG16 CNN architecture is then employed for training. We have also constructed a dataset of high-quality cloud images, known as the SSC; it is the first dataset to ensure an approximately equal number of data samples for all 10 cloud genera.
Following multiple rounds of experimentation, the results demonstrate that the model presented in this paper can accurately identify cloud genera from ground-based cloud images and performs well on our newly established database. The model still makes some misclassification errors, concentrated mainly on As and Cs. This is due to the similarity of their texture features, as well as the phenomenon of one transforming into the other, which makes them challenging to discriminate from image texture alone; cloud height is currently a critical indicator for distinguishing between these two genera. We have also demonstrated that the model's performance can be optimized by adjusting the brightness or darkness of the images.
At present, our research on cloud identification relies solely on cloud texture. However, there are numerous other features that can be used to identify clouds, such as cloud height and composition. In the future, we are considering incorporating hardware equipment (e.g., a ceilometer) to collect cloud-height data while capturing cloud images. We hope that adding the cloud-height feature will resolve the difficulty of distinguishing between As and Cs and improve the performance of the model. In addition, multiple cloud genera can often appear in the same sky simultaneously; in the next stage, we plan to distinguish between different cloud genera within the same image. Our results have practical applications. For example, meteorological organizations can use our method to collect cloud data in unpopulated areas (e.g., plateaus, oceans). Moreover, our method operates in real time, allowing meteorological bureaus to monitor clouds with camera equipment and forecast local weather changes. Furthermore, it enables the public, without prior meteorological knowledge, to distinguish between different cloud genera.

Author Contributions

Conceptualization, Z.L. and H.K.; methodology, Z.L.; software, Z.L. and H.K.; investigation, H.K., Z.L. and C.-S.W.; resources, C.-S.W.; writing—original draft preparation, Z.L.; writing—review and editing, H.K. and Z.L.; visualization, Z.L.; supervision, H.K. and C.-S.W.; project administration, H.K.; funding acquisition, H.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Macau Foundation under its Research Fund (Grant No. MF2102), Macau.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

We thank the Macao Meteorological Society for their support in the data checking part.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Duda, D.P.; Minnis, P.; Khlopenkov, K.; Chee, T.L.; Boeke, R. Estimation of 2006 Northern Hemisphere contrail coverage using MODIS data. Geophys. Res. Lett. 2013, 40, 612–661.
  2. Li, Z.; Rosenfeld, D.; Fan, J. Aerosols and their impact on radiation, clouds, precipitation, and severe weather events. In Oxford Research Encyclopedia of Environmental Science; Oxford University Press: Oxford, UK, 2017.
  3. Stephens, G.L. Cloud Feedbacks in the Climate System: A Critical Review. J. Clim. 2005, 18, 237–273.
  4. Minnis, P.; Ayers, J.K.; Palikonda, R.; Phan, D. Contrails, cirrus trends, and climate. J. Clim. 2004, 17, 1671–1685.
  5. Inness, P.M.; Dorling, S. Operational Weather Forecasting; John Wiley & Sons: Hoboken, NJ, USA, 2012.
  6. Nespoli, A.; Niccolai, A.; Ogliari, E.; Perego, G.; Collino, E.; Ronzio, D. Machine Learning techniques for solar irradiation nowcasting: Cloud type classification forecast through satellite data and imagery. Appl. Energy 2022, 305, 117834.
  7. Liu, L.; Sun, X.; Chen, F.; Zhao, S.; Gao, T. Cloud Classification Based on Structure Features of Infrared Images. J. Atmos. Ocean. Technol. 2011, 28, 410–417.
  8. Magurno, D.; Cossich, W.; Maestri, T.; Bantges, R.; Brindley, H.; Fox, S.; Harlow, C.; Murray, J.; Pickering, J.; Warwick, L.; et al. Cirrus Cloud Identification from Airborne Far-Infrared and Mid-Infrared Spectra. Remote Sens. 2020, 12, 2097.
  9. Zhuo, W.; Cao, Z.; Xiao, Y. Cloud Classification of Ground-Based Images Using Texture–Structure Features. J. Atmos. Ocean. Technol. 2014, 31, 79–92.
  10. Long, C.N.; DeLuisi, J.J. Development of an Automated Hemispheric Sky Imager for Cloud Fraction Retrievals. In Proceedings of the 10th Symposium on Meteorological Observations and Instrumentation, Phoenix, AZ, USA, 11–16 January 1998.
  11. Genkova, I.; Long, C.; Besnard, T.; Gillotay, D. Assessing cloud spatial and vertical distribution with cloud infrared radiometer CIR-7. In Remote Sensing of Clouds and the Atmosphere IX; SPIE: Bellingham, WA, USA, 2004; Volume 5571, pp. 1–10.
  12. Yu, C.; Chang, Y.; Li, Y.; Zhao, X.; Yan, L. Unsupervised image deraining: Optimization model driven deep CNN. In Proceedings of the 29th ACM International Conference on Multimedia, Virtual Event, China, 20–24 October 2021; pp. 2634–2642.
  13. Tebaldi, C.; Knutti, R. The use of the multi-model ensemble in probabilistic climate projections. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2007, 365, 2053–2075.
  14. Li, Q.; Jia, H.; Qiu, Q.; Lu, Y.; Zhang, J.; Mao, J.; Fan, W.; Huang, M. Typhoon-Induced Fragility Analysis of Transmission Tower in Ningbo Area Considering the Effect of Long-Term Corrosion. Appl. Sci. 2022, 12, 4774.
  15. Li, Q.; Jia, H.; Zhang, J.; Mao, J.; Fan, W.; Huang, M.; Zheng, B. Typhoon Loss Assessment in Rural Housing in Ningbo Based on Township-Level Resolution. Appl. Sci. 2022, 12, 3463.
  16. Ye, L.; Cao, Z.; Xiao, Y. DeepCloud: Ground-Based Cloud Image Categorization Using Deep Convolutional Features. IEEE Trans. Geosci. Remote Sens. 2017, 55, 5729–5740.
  17. Zhang, J.; Liu, P.; Zhang, F.; Song, Q. CloudNet: Ground-based cloud classification with deep convolutional neural network. Geophys. Res. Lett. 2018, 45, 8665–8672.
  18. Huertas-Tato, J.; Martín, A.; Camacho, D. Cloud type identification using data fusion and ensemble learning. In Proceedings of Intelligent Data Engineering and Automated Learning–IDEAL 2020: 21st International Conference, Guimaraes, Portugal, 4–6 November 2020; Springer: Berlin/Heidelberg, Germany, 2020; pp. 137–147.
  19. Liu, S.; Li, M.; Zhang, Z.; Xiao, B.; Durrani, T.S. Multi-Evidence and Multi-Modal Fusion Network for Ground-Based Cloud Recognition. Remote Sens. 2020, 12, 464.
  20. Liu, S.; Li, M.; Zhang, Z.; Cao, X.; Durrani, T.S. Ground-based cloud classification using task-based graph convolutional network. Geophys. Res. Lett. 2020, 47, e2020GL087338.
  21. Benner, T.C.; Curry, J.A. Characteristics of small tropical cumulus clouds and their impact on the environment. J. Geophys. Res. Atmos. 1998, 103, 28753–28767.
  22. Gray, W.M.; Jacobson, R.W., Jr. Diurnal variation of deep cumulus convection. Mon. Weather Rev. 1977, 105, 1171–1188.
  23. World Meteorological Organization. International Cloud Atlas: Manual on the Observation of Clouds and Other Meteors (WMO-No. 407). Available online: https://cloudatlas.wmo.int/en/home.html (accessed on 23 February 2023).
  24. DataFountain Machine Image Algorithm Race Track—Cloud Identification. Available online: https://www.datafountain.cn/competitions/357/datasets (accessed on 21 September 2022).
  25. Warren, S.G.; Hahn, C.J.; London, J. Simultaneous Occurrence of Different Cloud Types. J. Clim. Appl. Meteorol. 1985, 24, 658–667.
  26. Duda, R.O.; Hart, P.E.; Stork, D.G. Pattern Classification and Scene Analysis; Wiley: New York, NY, USA, 1973; Volume 3, pp. 731–739.
  27. Canny, J. A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 1986, 6, 679–698.
  28. Zuiderveld, K. Contrast limited adaptive histogram equalization. Graph. Gems 1994, 474–485.
  29. Reinhard, E.; Heidrich, W.; Debevec, P.; Pattanaik, S.; Ward, G.; Myszkowski, K. High Dynamic Range Imaging: Acquisition, Display, and Image-Based Lighting; Morgan Kaufmann: Burlington, MA, USA, 2010.
  30. Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. Nature 1986, 323, 533–536.
  31. Segal-Rozenhaimer, M.; Li, A.; Das, K.; Chirayath, V. Cloud detection algorithm for multi-modal satellite imagery using convolutional neural-networks (CNN). Remote Sens. Environ. 2020, 237, 111446.
  32. LeCun, Y.; Touresky, D.; Hinton, G.; Sejnowski, T. A theoretical framework for back-propagation. In Proceedings of the 1988 Connectionist Models Summer School; Morgan Kaufmann: Burlington, MA, USA, 1988; Volume 1, pp. 21–28.
  33. LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444.
  34. Simard, P.Y.; LeCun, Y.A.; Denker, J.S.; Victorri, B. Transformation invariance in pattern recognition—Tangent distance and tangent propagation. In Neural Networks: Tricks of the Trade; Springer: Berlin/Heidelberg, Germany, 2002; pp. 239–274.
  35. Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1–9.
  36. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556.
  37. Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958.
  38. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90.
  39. Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980.
Figure 1. Diagram of the water cycle.
Figure 2. Display of the samples of each category in the SSC dataset. (a) Ci; (b) Cc; (c) Cs; (d) As; (e) Ac; (f) Ns; (g) St; (h) Sc; (i) Cu; (j) Cb.
Figure 3. Illustration of hysteresis thresholding: pixels with gradient values greater than maxVal are flagged as strong edges; values between minVal and maxVal are flagged as weak edges; values below minVal are suppressed.
Figure 4. (a) The original image and its grayscale histogram. (b) The image processed by Clahe and its grayscale histogram.
Figure 5. Gamma Correction for different values of γ.
Figure 6. Original image and processed images.
Figure 7. In the traditional neural network (NN) architecture, a node represents a neuron, θ represents the weight parameters of a layer, and f(θ) represents the activation function used by the neuron. The NN is fully connected: each neuron is connected to all neurons in the next layer.
Figure 8. VGG16 structure. The network contains 13 convolutional layers, 5 pooling layers, and fully connected layers.
Figure 9. Classification of Cirrostratus (Cs)-labeled samples over 20 runs of the model on the Gamma dataset.
Table 1. Classification and quantity of cloud genera.

| Height       | Cloud Genera       | Texture      | Quantity |
|--------------|--------------------|--------------|----------|
| High-level   | Cirrus (Ci)        | Stratiformis | 103      |
|              | Cirrocumulus (Cc)  | Undulatus    | 103      |
|              | Cirrostratus (Cs)  | Stratiformis | 110      |
| Middle-level | Altostratus (As)   | Stratiformis | 100      |
|              | Altocumulus (Ac)   | Undulatus    | 106      |
| Low-level    | Nimbostratus (Ns)  | Stratiformis | 133      |
|              | Stratus (St)       | Undulatus    | 102      |
|              | Stratocumulus (Sc) | Undulatus    | 101      |
|              | Cumulus (Cu)       | Cumulus      | 107      |
|              | Cumulonimbus (Cb)  | Cumulus      | 127      |
| Total        |                    |              | 1092     |
Table 2. Performance evaluation of the Gamma dataset with multiple classification methods.

| Category     | Recall | Precision | F1   | Accuracy |
|--------------|--------|-----------|------|----------|
| AC           | 0.75   | 0.83      | 0.79 | 0.78     |
| AS           | 0.8    | 0.64      | 0.71 |          |
| CB           | 0.85   | 0.81      | 0.83 |          |
| CC           | 1      | 0.77      | 0.87 |          |
| CI           | 0.6    | 1         | 0.75 |          |
| CS           | 0.5    | 0.67      | 0.57 |          |
| CU           | 0.9    | 0.67      | 0.77 |          |
| NS           | 0.8    | 0.76      | 0.78 |          |
| SC           | 0.75   | 0.94      | 0.83 |          |
| ST           | 0.85   | 0.89      | 0.87 |          |
| Stratiformis | 0.675  | 0.74      | 0.71 | 0.71     |
| Undulatus    | 0.675  | 0.68      | 0.68 |          |
| Cumulus      | 0.85   | 0.71      | 0.77 |          |
Table 3. Performance evaluation of the Clahe dataset with multiple classification methods.

| Category     | Recall | Precision | F1   | Accuracy |
|--------------|--------|-----------|------|----------|
| AC           | 0.9    | 0.6       | 0.72 | 0.8      |
| AS           | 0.7    | 0.74      | 0.72 |          |
| CB           | 0.8    | 0.9       | 0.85 |          |
| CC           | 0.95   | 0.9       | 0.93 |          |
| CI           | 0.55   | 0.85      | 0.67 |          |
| CS           | 0.95   | 1         | 0.97 |          |
| CU           | 0.9    | 0.67      | 0.77 |          |
| NS           | 0.65   | 0.8       | 0.72 |          |
| SC           | 0.85   | 0.65      | 0.74 |          |
| ST           | 0.95   | 0.70      | 0.83 |          |
| Stratiformis | 0.54   | 0.77      | 0.63 | 0.65     |
| Undulatus    | 0.77   | 0.63      | 0.69 |          |
| Cumulus      | 0.78   | 0.76      | 0.77 |          |
Table 4. Performance evaluation of the Sobel dataset with multiple classification methods.

| Category     | Recall | Precision | F1   | Accuracy |
|--------------|--------|-----------|------|----------|
| AC           | 0.65   | 1         | 0.79 | 0.72     |
| AS           | 0.8    | 0.5       | 0.62 |          |
| CB           | 0.85   | 0.77      | 0.81 |          |
| CC           | 0.7    | 1         | 0.82 |          |
| CI           | 0.75   | 0.56      | 0.64 |          |
| CS           | 0.6    | 0.92      | 0.73 |          |
| CU           | 0.8    | 0.1       | 0.86 |          |
| NS           | 0.95   | 0.95      | 0.89 |          |
| SC           | 0.75   | 1         | 0.88 |          |
| ST           | 0.85   | 0.57      | 0.68 |          |
| Stratiformis | 0.72   | 0.63      | 0.67 | 0.64     |
| Undulatus    | 0.54   | 0.61      | 0.57 |          |
| Cumulus      | 0.65   | 0.68      | 0.67 |          |
Table 5. Performance evaluation of the Canny dataset with multiple classification methods.

| Category     | Recall | Precision | F1   | Accuracy |
|--------------|--------|-----------|------|----------|
| AC           | 0.9    | 0.9       | 0.9  | 0.71     |
| AS           | 0.95   | 0.34      | 0.5  |          |
| CB           | 0.8    | 1         | 0.89 |          |
| CC           | 0.6    | 1         | 0.75 |          |
| CI           | 0.85   | 1         | 0.92 |          |
| CS           | 0.7    | 1         | 0.82 |          |
| CU           | 0.65   | 0.76      | 0.7  |          |
| NS           | 0.55   | 0.67      | 0.6  |          |
| SC           | 1      | 0.29      | 0.45 |          |
| ST           | 0      | 0         | 0    |          |
| Stratiformis | 0.56   | 0.56      | 0.56 | 0.54     |
| Undulatus    | 0.66   | 0.54      | 0.59 |          |
| Cumulus      | 0.25   | 0.5       | 0.33 |          |

