An Enhanced IDBO-CNN-BiLSTM Model for Sentiment Analysis of Natural Disaster Tweets

Mu, Guangyu; Li, Jiaxue; Li, Xiurong; Chen, Chuanzhi; Ju, Xiaoqing; Dai, Jiaxiu

doi:10.3390/biomimetics9090533

Open AccessArticle

An Enhanced IDBO-CNN-BiLSTM Model for Sentiment Analysis of Natural Disaster Tweets

by

Guangyu Mu

^1,2,†

,

Jiaxue Li

^1,†

,

Xiurong Li

^3,*

,

Chuanzhi Chen

¹,

Xiaoqing Ju

¹ and

Jiaxiu Dai

¹

School of Management Science and Information Engineering, Jilin University of Finance and Economics, Changchun 130117, China

²

Key Laboratory of Financial Technology of Jilin Province, Changchun 130117, China

³

Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Biomimetics 2024, 9(9), 533; https://doi.org/10.3390/biomimetics9090533

Submission received: 30 June 2024 / Revised: 26 August 2024 / Accepted: 2 September 2024 / Published: 4 September 2024

(This article belongs to the Special Issue Nature-Inspired Metaheuristic Optimization Algorithms 2024)

Download

Browse Figures

Versions Notes

Abstract

:

The Internet’s development has prompted social media to become an essential channel for disseminating disaster-related information. Increasing the accuracy of emotional polarity recognition in tweets is conducive to the government or rescue organizations understanding the public’s demands and responding appropriately. Existing sentiment analysis models have some limitations of applicability. Therefore, this research proposes an IDBO-CNN-BiLSTM model combining the swarm intelligence optimization algorithm and deep learning methods. First, the Dung Beetle Optimization (DBO) algorithm is improved by adopting the Latin hypercube sampling, integrating the Osprey Optimization Algorithm (OOA), and introducing an adaptive Gaussian–Cauchy mixture mutation disturbance. The improved DBO (IDBO) algorithm is then utilized to optimize the Convolutional Neural Network—Bidirectional Long Short-Term Memory (CNN-BiLSTM) model’s hyperparameters. Finally, the IDBO-CNN-BiLSTM model is constructed to classify the emotional tendencies of tweets associated with the Hurricane Harvey event. The empirical analysis indicates that the proposed model achieves an accuracy of 0.8033, outperforming other single and hybrid models. In contrast with the GWO, WOA, and DBO algorithms, the accuracy is enhanced by 2.89%, 2.82%, and 2.72%, respectively. This study proves that the IDBO-CNN-BiLSTM model can be applied to assist emergency decision-making in natural disasters.

Keywords:

DBO algorithm; deep learning; social media; sentiment analysis; natural disaster tweets; emergency management

1. Introduction

Natural disasters are phenomena triggered by the forces of nature, such as sandstorms, hurricanes, or forest fires [1,2]. The constant change of climate makes calamities more frequent [3,4]. Because of the unpredictability, suddenness, and destructiveness, natural disasters have caused significant damage to infrastructure, economy, and society [5,6]. During calamities, Twitter, with its powerful real-time interactivity, makes it convenient for people in affected areas to communicate with the outside world and seek assistance [7,8]. Nevertheless, the public simultaneously has fears, worries, and even resistance, leading to many negative online public opinions [9]. Social stability may be at risk if the government fails to steer and manage these viewpoints effectively [10]. Sentiment analysis of tweets helps decision-makers and researchers infer the possible polarity changes to some extent [11]. Then, some targeted disaster-related information and the progress of emergency management can be released in time [12,13]. This is conducive to guiding public opinions in a positive and benign direction. The disaster prevention and mitigation work will also proceed smoothly. Therefore, this research has crucial practical significance.

Sentiment analysis of tweets identifies whether the polarities are positive or negative [14,15], viewed as a binary classification issue [16]. Relevant research methods are categorized into three types: sentiment dictionary, machine learning, and deep learning. The lexicon-based approach utilizes words annotated with emotional scores to match the content to be analyzed [17]. The final polarity is obtained by accumulating the scores for each word [18]. Positive and negative numbers usually represent positive and negative sentiments, respectively. Researchers commonly use the NRC and VADER lexicons. The NRC lexicon lists the associations between several English words with eight basic emotions and two polarities [19,20]. VADER is another rule-based dictionary [21,22,23]. The sentiment lexicon-based approach is simple to understand and accurately reflects the textual structural features [24]. Nevertheless, identical sentiment words may express different meanings in diverse contexts or domains. Some web neologisms and special terms must also be continuously supplemented to meet the demands [25]. Consequently, this lexicon-based method still has issues with accuracy and applicability owing to limitations in size and coverage [26]. Supervised learning-based approaches [27,28] are more prevalent in sentiment analysis research, including machine and deep learning techniques.

The machine learning-based method trains models to learn features from extensive textual data with sentiment labels [29,30]. The trained models are then used to classify and predict polarity for new test text. Naive Bayes and Support Vector Machine (SVM) are representative machine learning approaches. Naive Bayes is based on probabilistic statistics, which assumes that each feature is independent [31]. Predictions are obtained by learning the conditional probability relationship between textual features and sentiment polarities [32,33]. The advantage of Naive Bayes is that it is computationally simple and performs well on small-scale data. However, the results are not satisfactory when the feature attributes are correlated with each other. SVM is a classification technique that operates on the principle of minimizing the structural risk. It separates diverse categories of textual data by finding a maximally spaced hyperplane [34,35]. SVM is beneficial for handling high-dimensional feature spaces and nonlinear issues [36,37], but choosing the kernel function and regularization parameters is crucial. Despite many advantages, the machine learning-based method usually requires manual feature selection and does not fully utilize the semantic information of the context [38,39].

The deep learning-based method has become mainstream due to complicated textual features’ automatic learning capability [40]. Relevant models go through a process from single to hybrid. Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) are the most essential approaches. CNN ignores sequential information while focusing on textual local features [41]. RNN can model sequential problems but suffers from the limitation of long-term dependency [42]. As a variation, the Long Short-Term Memory (LSTM) solves the gradient vanishing and explosion issues [43]. The Bidirectional Long Short-Term Memory (BiLSTM) comprises forward and backward LSTM that can recognize preceding or following words to access more contextual information [44]. Considering the advantages of combining various approaches, some hybrid models are proposed, such as CNN-LSTM [45] or CNN-BiLSTM [46]. Experiments demonstrate that hybrid models outperform single models on sentiment polarity classification [47]. Nevertheless, the performance of deep learning models rests on the hyperparameter settings, including the learning rate or the number of neurons [48]. There is no uniform standard. Manual optimization is time-consuming and requires professional knowledge [49].

Swarm intelligence optimization algorithms solve complicated optimization issues by simulating biological behavior in nature [50,51]. To further improve the accuracy of textual sentiment polarity classification, researchers have applied the Grey Wolf Optimization (GWO) algorithm [52,53,54] and Whale Optimization Algorithm (WOA) [55,56,57] to optimize the hyperparameters of deep learning models. The “No Free Lunch” theorem posits that no single algorithm can universally excel at solving every optimization issue [58,59]. That is to say, the algorithm performs well in the current task, but other situations may differ. The Dung Beetle Optimizer (DBO) is a novel algorithm proposed in 2022 [60]. It is characterized by high accuracy and fast convergence [61,62]. Compared with previous methods, the DBO algorithm produces superior results [63,64]. However, obtaining the ideal optimal solution is still challenging. Specifically, the global prospecting and local searching capabilities are imbalanced. There are problems with weak global exploration ability and falling into local optimal solutions easily [65,66]. Therefore, it is necessary to seek improvement strategies for the DBO algorithm.

Here, we can present two motivations for this study:

(1): Hybrid deep learning models are more suitable for tweets’ sentiment polarity classification than single models.
(2): Improved swarm intelligence algorithms can optimize the hybrid deep learning models’ hyperparameters to increase classification accuracy further.

This research’s principal contributions are as below:

We utilize three strategies to improve the DBO algorithm. First, we adopt the Latin hypercube sampling to update the population initialization process. Second, we integrate the OOA’s global prospecting strategy in the ball-rolling dung beetles’ position update equation. Third, we introduce an adaptive Gaussian–Cauchy mixture mutation disturbance for optimal individuals.
We construct a CNN-BiLSTM model based on local feature extraction and contextual information understanding abilities. We then use the improved DBO algorithm to obtain the CNN-BiLSTM model’s optimal hyperparameters. These hyperparameters include the 1D convolutional layer’s filter number, the convolutional kernel sizes, and the unit number in BiLSTM’s each LSTM layer.
We conduct extensive comparative experiments with other single and hybrid deep learning models on natural disaster tweets. The empirical analysis proves the IDBO-CNN-BiLSTM model’s superiority in sentiment polarity classification of natural disaster tweets.

2. Literature Review

2.1. Natural Disasters

Natural disasters are usually divided into three stages: precursor, occurrence, and recovery. According to the characteristics, the focus of scholars is diverse, corresponding to monitoring and early warning, emergency response, and recovery and reconstruction.

Early monitoring and warning aims at adopting technological means to predict impending natural disasters and notify potential affected areas [67]. Early warning systems are vital in mitigating risks by guiding people to respond appropriately and timely [68]. The rapid development of technology has facilitated the emergence of various early warning systems, such as Earthquake Early Warning (EEW) [69], Drought Early Warning (DEW) [70], Landslide Early Warning (LEW) [71,72], and Flood Early Warning (FEW) [73]. Nevertheless, these systems are only sometimes completely applicable due to the disasters’ complexity. Researchers consider integrating monitoring data and social media to improve understanding of calamity events [74]. Some advanced models have also been constructed to forecast disasters accurately [75].

As a pivotal indicator of emergency response [76], resource allocation is a continuous multi-cycle process that minimizes damage and saves lives by transporting essentials such as food, water, medicine, and tents to disaster areas [77]. From various practices worldwide, many casualties result from material delays and shortages [78]. When natural disasters occur, it is challenging to meet the enormous demands in the affected areas, relying only on the unilateral reserve of the government or rescue organizations [79,80]. By building a game model, each participant will seek a balanced condition of maximizing interests to fully invoke the relief supplies [81,82]. Furthermore, some information technology or neural network algorithms have been utilized in the supply chain to establish a complete system and improve efficiency [83,84].

Post-disaster recovery and reconstruction are complex, dynamic, and multifaceted [85]. The effectiveness and speed depend on the socio-economic characteristics, adaptive capacity, and the response of policymakers [86,87]. During this time, the government needs reliable information to understand the damage extent and formulate recovery strategies [88]. As a powerful tool, remote sensing observation can be combined with machine learning methods to save time and labor [89]. However, the vast impact caused by natural disasters manifests not only in terms of economy and infrastructure but also on the public’s mental health issues [90]. Along with the psychological trauma of experiencing a disaster, people in the affected areas will lose their sense of identity, belonging, and happiness to a certain extent. Even maintaining the basic aspects of daily life is challenging [91]. The support provided by society and the community can help disaster-affected people alleviate mental stress, reduce anxiety, enhance their sense of security, and eventually adapt to the changes in post-disaster life [92,93].

Overall, the work on natural disasters emphasizes the focus of emergency management at diverse stages. These studies prompt us to explore how we can propose specific measures for the government and related departments based on calamities’ evolution, considering the public’s changing sentiments and demands.

2.2. Social Media Analysis of Natural Disasters

Compared to expensive and time-consuming traditional survey methods, social media provides convenient ways to obtain public opinions and instrumental disaster information [94]. Social media analysis of natural disasters can be broadly categorized into the following three areas: rumor detection, topic modeling, and sentiment analysis.

Social media has indicated significant advantages in disseminating urgent information [95]. Because of the difficulty of supervision, content on platforms is ambiguous and complex to recognize [96]. In natural disaster events, people are more likely to believe and share unconfirmed information due to intense anxiety and emotional vulnerability [97]. In addition, misinformation spreads faster than actual news, resulting in a specific time lag in dispelling rumors [98]. The prevalence of rumors becomes a critical limiting factor for managers in decision-making. Social stability will be affected, potentially causing delays in implementing disaster response measures. There are currently two main methods for detecting misinformation on social media platforms. One relies on expert manual fact-checking, which is an effective means to combat rumors. Nevertheless, the time cost and human resources required must be considered [99]. The other uses advanced techniques to extract user or textual features [100,101,102], enabling real-time automatic rumor detection.

Investigating the topics of interest on social media can assist managers in understanding the public’s demands and deploying response resources [103]. It is a challenge to mine substantial data for valuable information. Topic modeling helps researchers identify significant themes from textual data [104]. One of the most frequently utilized means is Latent Dirichlet Allocation (LDA), proposed in 2003. LDA is an unsupervised machine learning method for text mining that gives the topic of each document as a probability distribution [105,106]. The performance of LDA in topic modeling of natural disaster data has been proven [107,108]. However, conventional models are limited to short texts, and some words are frequently shared between topics. Accordingly, improved approaches based on domain knowledge have been proposed in recent studies to enhance the accuracy of topic recognition further [109,110]. Moreover, the novel Biterm Topic Model (BTM) is also becoming popular because of its excellent applicability [111].

Sentiment in natural disaster data is equally valuable. Temporal and spatial sentiment analyses help deepen the exhaustive understanding of social responses and provide some essential information for emergency management. By calculating sentiment scores for tweets, quantitative data can reflect emotional evolutionary trends and distribution [112]. From a temporal perspective, public sentiment is constantly in flux during diverse phases of a natural disaster [113]. As the catastrophe subsides and relief efforts are in full swing, sentiment will gradually change from negative to positive [114]. From a spatial perspective, some tweets have geolocation attributes that indicate the user’s location [115,116]. Based on the latitude and longitude, counting the average sentiment value in each region can be studied for visualization and correlation analysis. Especially in some specific domains, social media is more suitable for obtaining sentiment analysis data than Geographic Information System (GIS) techniques [117].

Analyzing social media data during natural disasters is of great research value and practical significance. Nevertheless, few scholars have focused on the sentiment polarity categorization of disaster tweets, and adopting advanced models is even less common. We effectively enhance the performance of hybrid deep learning models by utilizing the improved swarm intelligence algorithm to optimize hyperparameters.

3. Method

This study constructs an enhanced IDBO-CNN-BiLSTM model for recognizing sentiment polarity in natural disaster tweets. First, the DBO algorithm is improved by adopting three strategies. Then, the IDBO algorithm is utilized to optimize the hybrid CNN-BiLSTM model’s hyperparameters. Finally, the IDBO-CNN-BiLSTM model classifies the sentiment polarity as positive or negative. The proposed model’s architecture is displayed in Figure 1.

3.1. The DBO Algorithm

The DBO algorithm simulates the ball-rolling, dancing, breeding, foraging, and stealing behaviors of dung beetles. The algorithm’s population is divided into four parts: ball-rolling dung beetles, brood balls, small dung beetles, and stealing dung beetles. The detailed description is as below.

3.1.1. The Ball-Rolling Dung Beetles

Without obstacles, dung beetles utilize the sun to locate and keep the dung ball rolling in a straight line. The algorithm assumes that the light strength affects the dung beetles’ route. At this point, the location update of the ball-rolling dung beetles is expressed as Equation (1). The parameters are described as shown in Table 1.

x_{i} (t + 1) = x_{i} (t) + α \times k \times x_{i} (t - 1) + b \times ∆ x

(1)

∆ x = |x_{i} (t) - X^{W}|

(2)

α

is determined through a probabilistic approach to emulate the intricate conditions in the natural environment. A greater value of

∆ x

signifies a less intense light source.

When dung beetles encounter obstacles preventing them from rolling forward, they must dance to reposition. A tangent function simulates this behavior. The location update at this time is calculated by Equation (3).

x_{i} (t + 1) = x_{i} (t) + \tan (θ) |x_{i} (t) - x_{i} (t - 1)|

(3)

θ

denotes the deflection angle belonging to [0, π]. The location will change if

θ

equals 0, π/2, or π.

3.1.2. The Brood Balls

Choosing spawning sites is crucial. Dung balls are concealed after being rolled to a safe place. A boundary selection strategy is adopted to model the spawning area of female dung beetles. This region is restricted by Equations (4) and (5).

L b^{*} = m a x (X^{*} \times (1 - R), L b)

(4)

U b^{*} = m i n (X^{*} \times (1 + R), U b)

(5)

R = 1 - t / T_{m a x}

(6)

After defining the zone, the female dung beetles select the brood balls to spawn. The DBO algorithm assumes that each female dung beetle only reproduces once in each iteration. In addition, the boundary range is dynamically changing, primarily dictated by the

R

-value. Therefore, the brood balls’ locations are also changeable during iteration. The position update can be calculated by Equation (7). The parameters are described as shown in Table 2.

B_{i} (t + 1) = X^{*} + b_{1} \times (B_{i} (t) - L b^{*}) + b_{2} \times (B_{i} (t) - U b^{*})

(7)

3.1.3. The Small Dung Beetles

Small dung beetles will drill out of the ground to search for food. The DBO algorithm establishes an optimal foraging area. The region boundaries are restricted by Equations (8) and (9). The small dung beetles’ location update is indicated by Equation (10). The parameters are described as shown in Table 3.

L b^{b} = m a x (X^{b} \times (1 - R), L b)

(8)

U b^{b} = m i n (X^{b} \times (1 + R), U b)

(9)

x_{i} (t + 1) = x_{i} (t) + C_{1} \times (x_{i} (t) - L b^{b}) + C_{2} \times (x_{i} (t) - U b^{b})

(10)

3.1.4. The Stealing Dung Beetles

Swiping dung balls from other dung beetles is called stealing behavior. The DBO algorithm assumes that the vicinity of

X^{b}

is the optimal location to scramble for food. The stealing dung beetles’ position update can be calculated by Equation (11). The parameters are described as shown in Table 4.

x_{i} (t + 1) = X^{b} + S \times g \times (|x_{i} (t) - X^{*}| + |x_{i} (t) - X^{b}|)

(11)

3.2. The Proposed IDBO Algorithm

The DBO algorithm’s advantages are rapid convergence and excellent optimization accuracy. Nevertheless, the global prospecting and local searching capabilities are imbalanced. That is to say, the DBO algorithm suffers from weak global exploration ability and easily falls into local optimization. Consequently, we adopt three improvement strategies to solve the above issues.

3.2.1. Utilize the Latin Hypercube Sampling for Population Initialization

Swarm intelligence optimization algorithms’ convergence speed and accuracy are usually closely related to the initial population’s quality and structure [118]. The random initialization in the traditional DBO algorithm leads to an uneven sample distribution. If the initial population’s quality and diversity cannot be ensured, the algorithm’s searching effectiveness will be significantly affected. Latin Hypercube Sampling (LHS) [119] realizes non-overlapping sampling based on the principle of stratified sampling, which can make the samples evenly distributed in the search space. The updated steps for initializing the population are below:

(1): Determine the number of hyperparameters $D$ representing the optimization problem’s dimension.
(2): Set the range $[L b, U b]$ for each hyperparameter, where $L b$ is the lower boundary, and $U b$ is the upper boundary.
(3): The range $[L b, U b]$ of each hyperparameter is divided into $N$ equal subintervals. $N$ is the population size of the DBO algorithm.
(4): Create a matrix of size $N \times D$ . Each column randomly orders the numbers $1, 2, \dots, N$ . Then, a sample is randomly generated in the corresponding subinterval based on the rows’ number. The final resultant forms the initial population.

In LHS, the sample values are usually in the range of

[0, 1]

. However, they must be converted to the range set by the corresponding hyperparameters in the optimization problem. The

i

th sample value of the

j

th hyperparameter is denoted as:

X_{i j} = L b_{j} + {L H S}_{i j} \times (U b_{j} - L b_{j})

(12)

Assuming a sample size of 30 and a search range of

[0, 1]

, the sample distribution in two dimensions is shown in Figure 2. The abscissa and ordinate represent the search scope. The LHS samples are more uniform and have a more extensive coverage than random initialization. Thus, it is proved that using LHS to initialize the population can improve the DBO algorithm’s performance.

3.2.2. Integrate the OOA’s Global Prospecting Strategy

In the traditional DBO algorithm, the ball-rolling dung beetles’ location update strategy depends on the global worst position and has many parameters. Inspired by the OOA [120] proposed in 2023, Equation (1) is improved. The first stage of the OOA is global exploration. Ospreys can detect fish with their powerful vision. After determining the position, the ospreys dive underwater to attack and feed on the fish. The position update in this phase can be expressed by Equation (13). The parameters are described as shown in Table 5.

x_{i j}^{P 1} = x_{i j} + r_{i j} \times ({S F}_{i j} - I_{i j} \times x_{i j})

(13)

During the OOA’s fishing process, the ospreys’ location in the search space changes prominently. If this position update strategy is incorporated into the DBO algorithm, identifying the global optimal region and escaping from the local optimum can be significantly enhanced. Specifically, a more optimal dung ball is randomly selected for rolling during the ball-rolling dung beetles’ position update. The aim is to improve the randomness of the route selection. Equation (14) can calculate the updated location. The parameters are described as shown in Table 6.

X_{i} (t + 1) = X_{i} (t) + r a n d (X^{'} - F X_{i} (t))

(14)

3.2.3. Introduce an Adaptive Gaussian–Cauchy Mixture Mutation Disturbance

In the traditional DBO algorithm’s later iterations, the dung beetle population will gather and search near the current best location. The algorithm will fail to discover the actual optimal solution if this position is not the global optimum. Performing a mutation perturbation increases the population’s diversity and enlarges the search scope by disturbing the algorithm’s individuals, thus escaping from the local optimum [121]. In other words, the algorithm can enter the solution space’s other regions and continue to explore until it eventually finds the global optimum. Gaussian and Cauchy mutations are two effective disturbance methods. Gaussian mutation is usually based on a normal distribution and explores the solution space by adding small random perturbations in the current solution’s neighborhood [122]. These mutations are symmetrically distributed and form peaks around the mean. The Cauchy variation is based on the Cauchy distribution. This distribution has a sharp peak and a long tail, which can generate more significant perturbations far from the current solution [123]. To combine the advantages, an adaptive Gaussian–Cauchy mixture mutation disturbance is introduced.

The result of the mutation disturbance is randomized. The algorithm’s complexity will increase if all dung beetle individuals are perturbed. Therefore, only the optimal individuals are selected in this study. By comparing the positions before and after the mutation, the better location is chosen for the next iteration. The position after Gaussian–Cauchy mixture mutation disturbance can be expressed by Equation (15). The parameters are described as shown in Table 7.

H_{b} (t) = X^{b} (t) * (1 + μ_{1} * G a u s s (σ) + μ_{2} * C a u c h y (σ))

(15)

μ_{1} = t / T_{m a x}

(16)

μ_{2} = 1 - t / T_{m a x}

(17)

Adjusting the weights of Gaussian and Cauchy mutation operators adaptively according to the iterations makes the mixture disturbance more flexible at the algorithm’s diverse stages. Due to the relatively decentralized population distribution, the individuals are mainly perturbed with a more considerable variance by the Cauchy distribution function at the iterations’ beginning. The resulting individuals fully utilize the current location information and increase the random disturbance. As the iteration continues, most individual positions will not change much. At this time, more minor perturbations are applied to the individuals through the Gaussian distribution function. In conclusion, the adaptive Gaussian–Cauchy mixture mutation disturbance can enhance the DBO algorithm’s convergence velocity and even up the local exploitation and global exploration ability.

3.2.4. The IDBO Algorithm’s Time Complexity

Time complexity is an essential metric to measure the algorithm’s efficiency. It describes the performance when the input data may result in the longest running time. A commonly used calculation method is the Big O notation [124]. Define the maximum iterations

T

, the population size

N

, and the issue dimension

D

. The traditional DBO algorithm’s complexity can be expressed as

O (N \times D \times T)

. The IDBO algorithm is optimized and extended within the original framework and does not change the basic execution order or introduce new loops. Accordingly, the time complexity of the IDBO algorithm remains

O (N \times D \times T)

. Although the operating efficiency may be affected, the growth rate of the algorithm’s execution time will not vary with an increase in input size.

3.2.5. The Steps of the IDBO Algorithm

The IDBO algorithm’s steps are as below:

Step 1: Define the objective function and set the IDBO algorithm’s hyperparameters.

Step 2: Initialize the population according to the Latin hypercube sampling. Calculate the fitness values of individuals.

Step 3: Set a random number

δ = r a n d (1)

if the current individual is a ball-rolling dung beetle. When

δ < 0.9

, Equation (14) is used to update the position, incorporating the Osprey Optimization Algorithm; otherwise Equation (3) is utilized. If the current individual is a brood ball, a small dung beetle, or a stealing dung beetle, the location is renewed by Equations (7), (10) and (11), respectively. Boundary detection is performed after each position update.

Step 4: Update the current optimal solution and fitness value.

Step 5: The current optimum is perturbed by adopting an adaptive Gaussian–Cauchy mixture mutation disturbance to produce a novel optimal solution.

Step 6: Repeat Steps 3 to 5. After reaching the maximum iterations, the global optimal solution and fitness value are output.

3.3. The CNN-BiLSTM Model

In sentiment analysis, CNN effectively extracts textual local features. BiLSTM is adept at capturing long-distance dependencies and understanding contextual information. The hybrid model can fully utilize the advantages of the two network structures to improve accuracy and efficiency. The CNN-BiLSTM model consists of an embedding layer, a 1D convolutional layer, a 1D max pooling layer, a BiLSTM layer, and a Dense layer. These structures play different roles. The upper layer’s output is the following layer’s input. The 1D convolutional and max pooling layers are used because they apply to sequential data. The details are described below.

3.3.1. Embedding Layer

Before the CNN-BiLSTM model is trained, a vocabulary is usually constructed using the Tokenizer. Each word is mapped to a unique index. Input data is a sequence of word indexes. As the first layer in which the model receives text, the embedding layer’s pivotal role is converting the indexes into a continuous representation of word vectors. These vectors capture and express the semantic information. As a result, the CNN-BiLSTM model can utilize continuous numerical features instead of original textual data for more efficient information extraction and analysis.

3.3.2. 1D Convolutional Layer

The CNN-BiLSTM model’s core is a convolutional layer consisting of multiple convolutional kernels. Each convolutional kernel corresponds to a feature mapping. Specifically, the convolution kernels slid over the input text. The feature mapping is generated by calculating the dot product between the convolution kernel and the textual local region. This process can be expressed by Equation (18). The parameters are described as shown in Table 8.

C = f (X * K + b)

(18)

3.3.3. 1D Max Pooling Layer

The pooling layer’s primary function is to decrease the feature mapping’s spatial dimensionality. Then, the number of parameters and computations is also reduced. The commonly used operations are max and average pooling operations. The former chooses the maximum as the output. The latter computes all values’ averages and aims to smooth the feature mapping. This study adopts max pooling to retain the most salient features in the textual data and ignore trivial information. The output feature mapping

P

can be described as follows:

P = m a x (C)

(19)

3.3.4. BiLSTM Layer

The BiLSTM comprises two LSTM layers, one for forward processing and the other for reverse processing. The parameters are updated independently in both directions. The network structure is displayed in Figure 3. When handling the input sequence, the BiLSTM layer combines the extracted local features with contextual information. This particular structure simultaneously considers the words before and after each word in the text, leading to a more comprehensive understanding of the textual meaning. An input gate, a forgetting gate, and an output gate control the info flow of each LSTM unit. The three gates work together in the memory unit. Significant information is learned and memorized, while trivial information is ignored or forgotten. Accordingly, the BiLSTM layer can extract critical features for determining sentiment polarity, such as word order, syntactic structure, and semantic information. The implicit state of the output is expressed as below:

h_{t} = \vec{h_{t}} + \overset{\leftarrow}{h_{t}}

(20)

3.3.5. Dense Layer

The dense layer is located after the CNN-BiLSTM model’s sequence processing section, which integrates the extracted features. The dense layer consists of several neurons. The received input values are multiplied by the corresponding weights. Then, bias is added to obtain a linear combination. An activation function, such as the softmax function, usually follows the dense layer.

4. Empirical Analysis

4.1. Data Collection and Preprocessing

Twitter contains loads of active users and disaster information, so it can be a data source for sentiment analysis. Meanwhile, Twitter has provided an Application Programming Interface (API) for researchers to access tweets. Hurricane Harvey landed on 25 August 2017, along the southern coast of Texas, USA. This catastrophic event brought extreme rainfall and flooding, causing significant damage and loss of life [125]. Internet users expressed more distinct sentiments than regular events. Therefore, this study selects Hurricane Harvey as the research object. The details of data acquisition are shown below.

First, this paper utilizes TwitterScraper in Python to obtain data with Hurricane Harvey as the keyword. Second, for a more comprehensive analysis, the data range is extended by one week based on the disaster’s duration. This is because the government often issues disaster warnings in advance, and citizens’ information awareness usually lags. Third, this study collects only English tweets, considering English is a global language. Tweets expressing people’s attitudes towards the relief organizations’ response or their demands are further selected. The aim is to demonstrate that analyzing social media tweets is helpful for more effective disaster management. In the end, a total of 5000 pieces of data are retained. Tweets are manually annotated as positive or negative sentiments. The proportions of the two labels are shown in Figure 4, which are 2262 and 2738, respectively. In this study, positive sentiment is denoted by 0 and negative sentiment by 1. Table 9 cites an instance of the correspondence between tweets and sentiment labels.

Raw data must be converted into a suitable format for analysis or modeling before being fed into the model. The purpose is to eliminate some invalid noise information. The tweet data preprocessing in this study consists of the following tasks:

(1): Remove Twitter handles (@user).
(2): Remove special characters, numbers, and punctuation.
(3): Remove short words with lengths of less than three.
(4): Utilize Tokenizer to segment the text and convert it into a sequence.
(5): Fill the sequence to the same length.

4.2. Experimental Details

This experiment is performed on a computer with Python 3.8 and RTX 4090. The training and testing sets are stochastically chosen at a ratio of 7 to 3. L2 regularization is added to the BiLSTM layer to control the model complexity and reduce overfitting. In the IDBO algorithm, the population proportions of ball-rolling dung beetles, brood balls, small dung beetles, and stealing dung beetles are set to 0.2, 0.4, 0.2, and 0.2, respectively. The other hyperparameter settings are shown in Table 10.

4.3. Evaluation Metrics

This study adopts four evaluation metrics to compare several models comprehensively, including accuracy, precision, recall, and F1. Higher values represent better classification results. The calculations are expressed by Equations (21)–(24). The standard binary confusion matrix is shown in Table 11.

A c c u r a c y = \frac{T P + T N}{T P + F P + F N + T N}

(21)

P r e c i s i o n = \frac{T P}{T P + F P}

(22)

R e c a l l = \frac{T P}{T P + F N}

(23)

F 1 = \frac{2 * R e c a l l * P r e c i s i o n}{P r e c i s i o n + R e c a l l}

(24)

4.4. Experimental Results

4.4.1. The Contrast of Evaluation Metrics

Several single and hybrid models are compared to prove the proposed IDBO-CNN-BiLSTM model’s superiority in the sentiment polarity classification. All the experiments are conducted under a consistent operating environment and parameter settings to ensure the results’ reliability. The single models include CNN, RNN, GRU, LSTM, and BiLSTM. The hybrid models are CNN-BiLSTM, GWO-CNN-BiLSTM, WOA-CNN-BiLSTM, and DBO-CNN-BiLSTM. The contrast of evaluation metrics is indicated in Table 12. The comparison of accuracy is displayed in Figure 5.

The results of the evaluation metrics reveal the following findings:

Among the selected single models, CNN is the only one that can extract textual local features. It achieves an accuracy of 0.7247. The other models are suitable for processing sequential information. Nevertheless, RNN is susceptible to gradient vanishing and explosion. As two variants of RNN, LSTM performs better in capturing long-term dependencies than GRU due to its complex gating mechanism. BiLSTM has a bidirectional LSTM layer that simultaneously considers words before and after each word in the text. The accuracy of BiLSTM reaches 0.7667. Compared to RNN, GRU, and LSTM, BiLSTM improves the accuracy by 12.87%, 1.51%, and 0.45%, respectively.
The CNN-BiLSTM model, which combines the local feature extraction capability of CNN with the contextual understanding ability of BiLSTM, outperforms both individual methods. The hybrid model achieves an accuracy of 0.7700, increasing by 6.25% and 0.43%, respectively.
After optimizing the 1D convolutional layer’s filter number, the convolutional kernel sizes, and the unit number in BiLSTM’s each LSTM layer, the performance is better than that of the basic CNN-BiLSTM model. The IDBO algorithm shows the most significant enhancement. The accuracy is 0.8033, improved by 2.89%, 2.82%, and 2.72% compared to GWO, WOA, and DBO algorithms.

4.4.2. The Comparison of Confusion Matrices

The confusion matrix provides an intuitive perspective for comparing the classification of positive (labeled as 0) or negative sentiments (labeled as 1) by diverse models. The contrast results on the test set are shown in Figure 6 and Figure 7. Raw counts indicate the match between the predicted and actual labels. The normalized probabilities are obtained by dividing each cell’s raw counts by the sum of that row or column. Normalized probability makes comparisons between categories fairer because it eliminates the effect of sample size. A darker color means a higher probability.

The IDBO-CNN-BiLSTM model’s accuracy for categorizing positive and negative sentiments is 78% and 82%, respectively. More negative than positive sentiments are expressed in obtained natural disaster tweets. The IDBO-CNN-BiLSTM model, compared with other models, not only maintains a stable accuracy for negative sentiment classification but effectively enhances the categorization performance for positive sentiment. The experimental results demonstrate the proposed model’s superiority in analyzing natural disaster tweets.

4.4.3. The Performance Comparison of Four Optimization Algorithms

The accuracy is noteworthy when utilizing swarm intelligence algorithms to solve optimization problems. Furthermore, time costs also need to be considered. Table 13 shows the optimal hyperparameters and runtime of four models. These algorithms are set with a consistent population and maximum iterations to ensure the experimental results’ comparability.

Table 13 reveals that the IDBO-CNN-BiLSTM model acquires minimal optimal hyperparameter values. As a result, the model will be highly efficient in handling the sentiment classification task. Compared to the WOA and DBO algorithms, the IDBO algorithm takes slightly more time to acquire the optimal hyperparameters. Nevertheless, it is acceptable considering the increase in accuracy.

5. Conclusions and Prospect

5.1. Conclusions

This study proposes an enhanced IDBO-CNN-BiLSTM model for classifying the sentiment polarity of natural disaster tweets. The hybrid model fully considers the advantages of swarm intelligence optimization algorithms and deep learning methods.

In single models, CNN can extract textual local features. BiLSTM has the most robust ability to process sequence information. This research combines these two network structures to construct a CNN-BiLSTM model. The deep learning models’ performance mainly depends on the hyperparameters. Manual settings increase the difficulty and randomness. Swarm intelligence algorithms are effective in solving complicated optimization issues. The DBO algorithm’s advantages are rapid convergence and excellent optimization accuracy. Accordingly, the DBO algorithm is selected to obtain the CNN-BiLSTM model’s optimal hyperparameters. These hyperparameters include the 1D convolutional layer’s filter number, the convolutional kernel sizes, and the unit number in BiLSTM’s each LSTM layer. Nevertheless, the DBO algorithm suffers from weak global exploration and falls into local optimum easily. There is still room for performance improvement.

Three improvement strategies have been proposed to address the DBO algorithm’s shortcomings. First, Latin hypercube sampling population initialization is employed to avoid samples’ uneven distribution in the search space. Second, the OOA’s global prospecting strategy is fused into the ball-rolling dung beetles’ position update equation to solve the problem of more parameters. Third, an adaptive Gaussian–Cauchy mixture mutation disturbance is introduced to enhance the algorithm’s performance by disturbing the optimal individuals.

Experimental results of sentiment classification on natural disaster tweets show that the accuracy of BiLSTM is improved by 12.87%, 1.51%, and 0.45% compared to RNN, GRU, and LSTM, respectively. The CNN-BiLSTM model outperforms the separate models, with an accuracy enhancement of 6.25% and 0.43%, respectively. The IDBO algorithm has the most remarkable optimization effect among several swarm intelligence algorithms. In contrast with the GWO, WOA, and DBO algorithms, the accuracy is increased by 2.89%, 2.82%, and 2.72%, respectively. Furthermore, the proposed model’s optimal hyperparameters are minimal. Consequently, the IDBO-CNN-BiLSTM model will save more computing resources in sentiment analysis.

In general, this study has momentous practical implications. The proportion of sentiment polarity in actual natural disaster tweets is usually unbalanced. The IDBO-CNN-BiLSTM model’s classification performance is more stable than other algorithms. Comparative experiments prove the proposed model’s superiority in coping with natural disaster tweets.

5.2. Suggestion

In the natural disaster tweets obtained, the public’s needs are primarily in the following areas: food, water, housing, transportation, and medical care. If these demands are responded to and met promptly, more positive sentiment will be collected on social media platforms. But overall, the proportion of negative emotion exceeds positive sentiment, indicating that governments or relief organizations still need to enhance their emergency management capabilities. According to the evolution of calamities and changes in people’s requirements, diverse stages should have corresponding focuses. Internet users and social media platforms should also work closely together to minimize the damage caused by natural disasters.

Several potential crisis factors emerge when natural disasters are in the precursor phase. It is a critical period for prevention and preparation. Netizens need to raise their self-protection awareness and prepare emergency supplies immediately after receiving official notifications. Social media platforms should carry out educational activities and push disaster information to users rapidly and accurately. The government and related agencies must establish monitoring systems and formulate detailed contingency plans, including evacuation routes, stockpiling and distribution of relief materials, and training and drills for rescue teams.

Natural disasters in the occurrence phase cause direct damage to human society and the environment. This period is characterized by colossal destructiveness and wide-ranging impact. Internet users need to remain calm and follow official instructions for evacuation or sheltering. They should also avoid spreading unconfirmed information to reduce panic and confusion. Social media platforms can utilize technical means to monitor and manage inaccurate news’ spread, such as keyword filtering or user reporting systems. The government and humanitarian organizations must immediately activate the emergency response plans and concentrate rescue forces to assist the affected areas.

When natural disasters are in recovery process, emergency management efforts include disaster assessment, infrastructure reconstruction, and psychological rehabilitation. Netizens can participate in community work or organize online fundraising and material donation activities. Social media platforms should provide data support for assessing the persistent impact of calamities. The government and relevant departments must prioritize restoring basic facilities and public services. Psychological counseling for the affected population is also necessary. Post-disaster recovery may last a long time and require collaborative efforts.

5.3. Limitation and Future Prospect

This paper still has some limitations. The current contents of social media platforms are no longer restricted to plain text. Internet users tend to utilize images to express their viewpoints. Meanwhile, there are some complicated implicit emotions, such as sarcasm. The text or images are opposite to the actual emotional tendencies. To a certain extent, it affects the recognition results. Future research can consider proposing more advanced classification models or performing multimodal sentiment analysis.

Author Contributions

Conceptualization, G.M. and J.L.; methodology, J.L.; software, J.L.; validation, G.M. and J.L.; formal analysis, J.L.; investigation, C.C., X.J. and J.D.; resources, X.L.; data curation, J.L.; writing—original draft preparation, J.L.; writing—review and editing, G.M. and J.L.; visualization, J.L.; supervision, G.M.; project administration, G.M.; funding acquisition, G.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Social Science Fund of China under Grant No. 19BJY246, the Natural Science Fund Project of the Science and Technology Department of Jilin Province under Grant No. 20240101361JC, and the Think Tank Fund Project of the Jilin Science and Technology Association.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

The authors are grateful for the financial support from the National Social Science Fund of China under Grant No. 19BJY246, the Natural Science Fund Project of the Science and Technology Department of Jilin Province under Grant No. 20240101361JC, and the Think Tank Fund Project of the Jilin Science and Technology Association.

Conflicts of Interest

The authors declare no conflicts of interest.

Correction Statement

This article has been republished with a minor correction to the image quality of figure (Figures 1, 2, 6 and 7). This change does not affect the scientific content of the article.

References

Speck, O.; Speck, T. Is a Forest Fire a Natural Disaster? Investigating the Fire Tolerance of Various Tree Species—An Educational Module. Biomimetics 2024, 9, 114. [Google Scholar] [CrossRef]
Zhou, B.; Zou, L.; Mostafavi, A.; Lin, B.; Yang, M.; Gharaibeh, N.; Cai, H.; Abedin, J.; Mandal, D. VictimFinder: Harvesting Rescue Requests in Disaster Response from Social Media with BERT. Comput. Environ. Urban Syst. 2022, 95, 101824. [Google Scholar] [CrossRef]
Zander, K.K.; Garnett, S.T.; Ogie, R.; Alazab, M.; Nguyen, D. Trends in Bushfire Related Tweets during the Australian ‘Black Summer’ of 2019/20. For. Ecol. Manag. 2023, 545, 121274. [Google Scholar] [CrossRef]
Olynk Widmar, N.; Rash, K.; Bir, C.; Bir, B.; Jung, J. The Anatomy of Natural Disasters on Online Media: Hurricanes and Wildfires. Nat. Hazards 2021, 110, 961–998. [Google Scholar] [CrossRef] [PubMed]
Sufi, F.K.; Khalil, I. Automated Disaster Monitoring From Social Media Posts Using AI-Based Location Intelligence and Sentiment Analysis. IEEE Trans. Comput. Soc. Syst. 2024, 11, 4614–4624. [Google Scholar] [CrossRef]
Platania, F.; Hernandez, C.T.; Arreola, F. Social Media Communication during Natural Disasters and the Impact on the Agricultural Market. Technol. Forecast. Soc. Chang. 2022, 179, 121594. [Google Scholar] [CrossRef]
Karimiziarani, M.; Jafarzadegan, K.; Abbaszadeh, P.; Shao, W.; Moradkhani, H. Hazard Risk Awareness and Disaster Management: Extracting the Information Content of Twitter Data. Sust. Cities Soc. 2022, 77, 103577. [Google Scholar] [CrossRef]
Lam, N.S.N.; Meyer, M.; Reams, M.; Yang, S.; Lee, K.; Zou, L.; Mihunov, V.; Wang, K.; Kirby, R.; Cai, H. Improving Social Media Use for Disaster Resilience: Challenges and Strategies. Int. J. Digit. Earth 2023, 16, 3023–3044. [Google Scholar] [CrossRef]
Yuan, Q.; Wang, S.; Li, N. Research on Emotional Tendency of Earthquake Disaster Based on E-Trans Model: Take the Topic of “Sichuan Earthquake” on Microblog as an Example. Nat. Hazards 2024, 120, 5057–5074. [Google Scholar] [CrossRef]
Wan, B.; Wu, P.; Yeo, C.K.; Li, G. Emotion-Cognitive Reasoning Integrated BERT for Sentiment Analysis of Online Public Opinions on Emergencies. Inf. Process. Manag. 2024, 61, 103609. [Google Scholar] [CrossRef]
Win Myint, P.Y.; Lo, S.L.; Zhang, Y. Unveiling the Dynamics of Crisis Events: Sentiment and Emotion Analysis via Multi-Task Learning with Attention Mechanism and Subject-Based Intent Prediction. Inf. Process. Manag. 2024, 61, 103695. [Google Scholar] [CrossRef]
Tounsi, A.; Temimi, M. A Systematic Review of Natural Language Processing Applications for Hydrometeorological Hazards Assessment. Nat. Hazards 2023, 116, 2819–2870. [Google Scholar] [CrossRef] [PubMed]
Fauzi, M.A. Social Media in Disaster Management: Review of the Literature and Future Trends through Bibliometric Analysis. Nat. Hazards 2023, 118, 953–975. [Google Scholar] [CrossRef]
Taborda, B.; Maria de Almeida, A.; Carlos Dias, J.; Batista, F.; Ribeiro, R. SA-MAIS: Hybrid Automatic Sentiment Analyser for Stock Market. J. Inf. Sci. 2023, 016555152311713. [Google Scholar] [CrossRef]
Senbeto, D.L.; Mamo, Y.; Seyfi, S. Light in the Middle of the Tunnel? A Sentimental Analysis of Tourist Responses to Ongoing Crisis. Curr. Issues Tour. 2023, 27, 838–846. [Google Scholar] [CrossRef]
Bigne, E.; Ruiz, C.; Perez-Cabañero, C.; Cuenca, A. Are Customer Star Ratings and Sentiments Aligned? A Deep Learning Study of the Customer Service Experience in Tourism Destinations. Serv. Bus. 2023, 17, 281–314. [Google Scholar] [CrossRef]
Xavier, T.; Lambert, J. Sentiment and Emotion Trends in Nurses’ Tweets about the COVID-19 Pandemic. J. Nurs. Scholarsh. 2022, 54, 613–622. [Google Scholar] [CrossRef]
Tamer, M.; Khamis, M.A.; Yahia, A.; Khaled, S.; Ashraf, A.; Gomaa, W. Arab Reactions towards Russo-Ukrainian War. EPJ Data Sci. 2023, 12, 36. [Google Scholar] [CrossRef]
Sarsam, S.M.; Al-Samarraie, H.; Alzahrani, A.I.; Mon, C.S.; Shibghatullah, A.S. Characterizing Suicide Ideation by Using Mental Disorder Features on Microblogs: A Machine Learning Perspective. Int. J. Ment. Health Addict. 2022, 1–14. [Google Scholar] [CrossRef]
Turón, A.; Altuzarra, A.; Moreno-Jiménez, J.M.; Navarro, J. Evolution of Social Mood in Spain throughout the COVID-19 Vaccination Process: A Machine Learning Approach to Tweets Analysis. Public Health 2023, 215, 83–90. [Google Scholar] [CrossRef]
Hussain, Z.; Sheikh, Z.; Tahir, A.; Dashtipour, K.; Gogate, M.; Sheikh, A.; Hussain, A. Artificial Intelligence–Enabled Social Media Analysis for Pharmacovigilance of COVID-19 Vaccinations in the United Kingdom: Observational Study. JMIR Public Health Surveill. 2022, 8, e32543. [Google Scholar] [CrossRef] [PubMed]
Weerasinghe, S.; Oyebode, O.; Orji, R.; Matwin, S. Dynamics of Emotion Trends in Canadian Twitter Users during COVID-19 Confinement in Relation to Caseloads: Artificial Intelligence-Based Emotion Detection Approach. Digit. Health 2023, 9, 205520762311714. [Google Scholar] [CrossRef] [PubMed]
Duan, H.K.; Vasarhelyi, M.A.; Codesso, M.; Alzamil, Z. Enhancing the Government Accounting Information Systems Using Social Media Information: An Application of Text Mining and Machine Learning. Int. J. Account. Inf. Syst. 2023, 48, 100600. [Google Scholar] [CrossRef]
Li, T.; Chen, H.; Liu, W.; Yu, G.; Yu, Y. Understanding the Role of Social Media Sentiment in Identifying Irrational Herding Behavior in the Stock Market. Int. Rev. Econ. Financ. 2023, 87, 163–179. [Google Scholar] [CrossRef]
Polignano, M.; Basile, V.; Basile, P.; Gabrieli, G.; Vassallo, M.; Bosco, C. A Hybrid Lexicon-Based and Neural Approach for Explainable Polarity Detection. Inf. Process. Manag. 2022, 59, 103058. [Google Scholar] [CrossRef]
Karami, B.; Bakouie, F.; Gharibzadeh, S. A Transformer-Based Deep Learning Model for Persian Moral Sentiment Analysis. J. Inf. Sci. 2023, 01655515231188344. [Google Scholar] [CrossRef]
Mohd, M.; Javeed, S.; Nowsheena; Wani, M.A.; Khanday, H.A. Sentiment Analysis Using Lexico-Semantic Features. J. Inf. Sci. 2022, 016555152211240. [Google Scholar] [CrossRef]
Qin, J.; Zeng, M.; Wei, X.; Pedrycz, W. Ranking Products through Online Reviews: A Novel Data-Driven Method Based on Interval Type-2 Fuzzy Sets and Sentiment Analysis. J. Oper. Res. Soc. 2023, 75, 860–873. [Google Scholar] [CrossRef]
Fonseca, M.; Delbianco, F.; Maguitman, A.; Soto, A.J. Assessing Causality among Topics and Sentiments: The Case of the G20 Discussion on Twitter. J. Inf. Sci. 2023, 016555152311600. [Google Scholar] [CrossRef]
Hartmann, J.; Heitmann, M.; Siebert, C.; Schamp, C. More than a Feeling: Accuracy and Application of Sentiment Analysis. Int. J. Res. Mark. 2023, 40, 75–87. [Google Scholar] [CrossRef]
Laifa, M.; Mohdeb, D. Sentiment Analysis of the Algerian Social Movement Inception. Data Technol. Appl. 2023, 57, 734–755. [Google Scholar] [CrossRef]
Rizk, R.; Rizk, D.; Rizk, F.; Hsu, S. 280 Characters to the White House: Predicting 2020 U.S. Presidential Elections from Twitter Data. Comput. Math. Organ. Theory 2023, 29, 542–569. [Google Scholar] [CrossRef] [PubMed]
Zahoor, K.; Bawany, N.Z. Explainable Artificial Intelligence Approach towards Classifying Educational Android App Reviews Using Deep Learning. Interact. Learn. Environ. 2023, 1–26. [Google Scholar] [CrossRef]
Nguyen, T.T.; Merchant, J.S.; Yue, X.; Mane, H.; Wei, H.; Huang, D.; Gowda, K.N.; Makres, K.; Najib, C.; Nghiem, H.T.; et al. A Decade of Tweets: Visualizing Racial Sentiments Towards Minoritized Groups in the United States Between 2011 and 2021. Epidemiology 2023, 35, 51–59. [Google Scholar] [CrossRef] [PubMed]
Ramzy, M.; Ibrahim, B. User Satisfaction with Arabic COVID-19 Apps: Sentiment Analysis of Users’ Reviews Using Machine Learning Techniques. Inf. Process. Manag. 2024, 61, 103644. [Google Scholar] [CrossRef]
Overbeck, M.; Baden, C.; Aharoni, T.; Amit-Danhi, E.; Tenenboim-Weinblatt, K. Beyond Sentiment: An Algorithmic Strategy for Identifying Evaluations within Large Text Corpora. Commun. Methods Meas. 2023, 1–22. [Google Scholar] [CrossRef]
Senadhira, K.I.; Rupasingha, R.A.H.M.; Kumara, B.T.G.S. A Deep Learning Based Approach for Classifying Tweets Related to Online Learning during the Covid-19 Pandemic. Educ. Inf. Technol. 2023, 29, 7707–7736. [Google Scholar] [CrossRef]
Liu, J.; Hu, S.; Mehraliyev, F.; Zhou, H.; Yu, Y.; Yang, L. Recognizing Emotions in Restaurant Online Reviews: A Hybrid Model Integrating Deep Learning and a Sentiment Lexicon. Int. J. Contemp. Hosp. Manag. 2023, 36, 2955–2976. [Google Scholar] [CrossRef]
Bochkay, K.; Brown, S.V.; Leone, A.J.; Tucker, J.W. Textual Analysis in Accounting: What’s Next? Contemp. Account. Res. 2022, 40, 765–805. [Google Scholar] [CrossRef]
Alslaity, A.; Orji, R. Machine Learning Techniques for Emotion Detection and Sentiment Analysis: Current State, Challenges, and Future Directions. Behav. Inf. Technol. 2022, 43, 139–164. [Google Scholar] [CrossRef]
Khan, J.; Ahmad, N.; Khalid, S.; Ali, F.; Lee, Y. Sentiment and Context-Aware Hybrid DNN With Attention for Text Sentiment Classification. IEEE Access 2023, 11, 28162–28179. [Google Scholar] [CrossRef]
Jain, S.; Roy, P.K. E-Commerce Review Sentiment Score Prediction Considering Misspelled Words: A Deep Learning Approach. Electron. Commer. Res. 2022, 1–25. [Google Scholar] [CrossRef]
Wei, Z.; Zhang, M.; Ming, Y. Understanding the Effect of Tourists’ Attribute-Level Experiences on Satisfaction—A Cross-Cultural Study Leveraging Deep Learning. Curr. Issues Tour. 2022, 26, 105–121. [Google Scholar] [CrossRef]
Olusegun, R.; Oladunni, T.; Audu, H.; Houkpati, Y.; Bengesi, S. Text Mining and Emotion Classification on Monkeypox Twitter Dataset: A Deep Learning-Natural Language Processing (NLP) Approach. IEEE Access 2023, 11, 49882–49894. [Google Scholar] [CrossRef]
Mohbey, K.K.; Meena, G.; Kumar, S.; Lokesh, K. A CNN-LSTM-Based Hybrid Deep Learning Approach for Sentiment Analysis on Monkeypox Tweets. New Gener. Comput. 2023, 42, 89–107. [Google Scholar] [CrossRef]
Kodati, D.; Dasari, C.M. Negative Emotion Detection on Social Media during the Peak Time of COVID-19 through Deep Learning with an Auto-Regressive Transformer. Eng. Appl. Artif. Intell. 2024, 127, 107361. [Google Scholar] [CrossRef]
Philip Thekkekara, J.; Yongchareon, S.; Liesaputra, V. An Attention-Based CNN-BiLSTM Model for Depression Detection on Social Media Text. Expert Syst. Appl. 2024, 249, 123834. [Google Scholar] [CrossRef]
Mu, G.; Liao, Z.; Li, J.; Qin, N.; Yang, Z. IPSO-LSTM Hybrid Model for Predicting Online Public Opinion Trends in Emergencies. PLoS ONE 2023, 18, e0292677. [Google Scholar] [CrossRef]
Mu, G.; Li, J.; Liao, Z.; Yang, Z. An Enhanced IHHO-LSTM Model for Predicting Online Public Opinion Trends in Public Health Emergencies. SAGE Open 2024, 14, 21582440241257681. [Google Scholar] [CrossRef]
Hosseinalipour, A.; Ghanbarzadeh, R. A Novel Metaheuristic Optimisation Approach for Text Sentiment Analysis. Int. J. Mach. Learn. Cybern. 2022, 14, 889–909. [Google Scholar] [CrossRef]
Suddle, M.K.; Bashir, M. Metaheuristics Based Long Short Term Memory Optimization for Sentiment Analysis. Appl. Soft. Comput. 2022, 131, 109794. [Google Scholar] [CrossRef]
Yildirim, G. A Novel Grid-Based Many-Objective Swarm Intelligence Approach for Sentiment Analysis in Social Media. Neurocomputing 2022, 503, 173–188. [Google Scholar] [CrossRef]
Mardjo, A.; Choksuchat, C. HyVADRF: Hybrid VADER–Random Forest and GWO for Bitcoin Tweet Sentiment Analysis. IEEE Access 2022, 10, 101889–101897. [Google Scholar] [CrossRef]
Rasappan, P.; Premkumar, M.; Sinha, G.; Chandrasekaran, K. Transforming Sentiment Analysis for E-Commerce Product Reviews: Hybrid Deep Learning Model with an Innovative Term Weighting and Feature Selection. Inf. Process. Manag. 2024, 61, 103654. [Google Scholar] [CrossRef]
Mehbodniya, A.; Rao, M.V.; David, L.G.; Joe Nigel, K.G.; Vennam, P. Online Product Sentiment Analysis Using Random Evolutionary Whale Optimization Algorithm and Deep Belief Network. Pattern Recognit. Lett. 2022, 159, 1–8. [Google Scholar] [CrossRef]
Krosuri, L.R.; Aravapalli, R.S. Feature Level Fine Grained Sentiment Analysis Using Boosted Long Short-Term Memory with Improvised Local Search Whale Optimization. PeerJ Comput. Sci. 2023, 9, e1336. [Google Scholar] [CrossRef]
Seilsepour, A.; Ravanmehr, R.; Nassiri, R. Topic Sentiment Analysis Based on Deep Neural Network Using Document Embedding Technique. J. Supercomput. 2023, 79, 19809–19847. [Google Scholar] [CrossRef]
Rushing, B. No Free Theory Choice from Machine Learning. Synthese 2022, 200, 414. [Google Scholar] [CrossRef]
Wolpert, D.H.; Macready, W.G. No Free Lunch Theorems for Optimization. IEEE Trans. Evol. Comput. 1997, 1, 67–82. [Google Scholar] [CrossRef]
Xue, J.; Shen, B. Dung Beetle Optimizer: A New Meta-Heuristic Algorithm for Global Optimization. J. Supercomput. 2022, 79, 7305–7336. [Google Scholar] [CrossRef]
Bai, X.; Ma, Z.; Chen, W.; Wang, S.; Fu, Y. Fault Diagnosis Research of Laser Gyroscope Based on Optimized-Kernel Extreme Learning Machine. Comput. Electr. Eng. 2023, 111, 108956. [Google Scholar] [CrossRef]
Kong, Q. NLOS Identification for UWB Positioning Based on IDBO and Convolutional Neural Networks. IEEE Access 2023, 11, 144705–144721. [Google Scholar] [CrossRef]
Wang, Z.; Huang, L.; Yang, S.; Li, D.; He, D.; Chan, S. A Quasi-Oppositional Learning of Updating Quantum State and Q-Learning Based on the Dung Beetle Algorithm for Global Optimization. Alex. Eng. J. 2023, 81, 469–488. [Google Scholar] [CrossRef]
Cheng, Y.; Qiao, K.; Jin, S.; Zhou, S.; Xue, J. Research on Electric Spindle Thermal Error Prediction Model Based on DBO-SVM. Int. J. Adv. Manuf. Technol. 2024, 132, 3333–3347. [Google Scholar] [CrossRef]
Sun, H.; He, D.; Ma, H.; Wen, Z.; Deng, J. The Parameter Identification of Metro Rail Corrugation Based on Effective Signal Extraction and Inertial Reference Method. Eng. Fail. Anal. 2024, 158, 108043. [Google Scholar] [CrossRef]
Jiachen, H.; Li-hui, F. Robot Path Planning Based on Improved Dung Beetle Optimizer Algorithm. J. Braz. Soc. Mech. Sci. Eng. 2024, 46, 235. [Google Scholar] [CrossRef]
Kuller, M.; Schoenholzer, K.; Lienert, J. Creating Effective Flood Warnings: A Framework from a Critical Review. J. Hydrol. 2021, 602, 126708. [Google Scholar] [CrossRef]
Hermans, T.D.G.; Šakić Trogrlić, R.; van den Homberg, M.J.C.; Bailon, H.; Sarku, R.; Mosurska, A. Exploring the Integration of Local and Scientific Knowledge in Early Warning Systems for Disaster Risk Reduction: A Review. Nat. Hazards 2022, 114, 1125–1152. [Google Scholar] [CrossRef]
Kumar, R.; Mittal, H.; Sandeep; Sharma, B. Earthquake Genesis and Earthquake Early Warning Systems: Challenges and a Way Forward. Surv. Geophys. 2022, 43, 1143–1168. [Google Scholar] [CrossRef]
Sharafi, L.; Zarafshani, K.; Keshavarz, M.; Azadi, H.; Van Passel, S. Farmers’ Decision to Use Drought Early Warning System in Developing Countries. Sci. Total Environ. 2021, 758, 142761. [Google Scholar] [CrossRef]
Hong, B.; Shao, B.; Wang, B.; Zhao, J.; Qian, J.; Guo, J.; Xu, Y.; Li, C.; Zhu, B. Using the Meteorological Early Warning Model to Improve the Prediction Accuracy of Water Damage Geological Disasters around Pipelines in Mountainous Areas. Sci. Total Environ. 2023, 889, 164334. [Google Scholar] [CrossRef]
Sharma, A.; Mohana, R.; Kukkar, A.; Chodha, V.; Bansal, P. An Ensemble Learning–Based Experimental Framework for Smart Landslide Detection, Monitoring, Prediction, and Warning in IoT-Cloud Environment. Environ. Sci. Pollut. Res. 2023, 30, 122677–122699. [Google Scholar] [CrossRef]
Hussain, F.; Wu, R.-S.; Wang, J.-X. Comparative Study of Very Short-Term Flood Forecasting Using Physics-Based Numerical Model and Data-Driven Prediction Model. Nat. Hazards 2021, 107, 249–284. [Google Scholar] [CrossRef]
Shoyama, K.; Cui, Q.; Hanashima, M.; Sano, H.; Usuda, Y. Emergency Flood Detection Using Multiple Information Sources: Integrated Analysis of Natural Hazard Monitoring and Social Media Data. Sci. Total Environ. 2021, 767, 144371. [Google Scholar] [CrossRef] [PubMed]
Moishin, M.; Deo, R.C.; Prasad, R.; Raj, N.; Abdulla, S. Designing Deep-Based Learning Flood Forecast Model With ConvLSTM Hybrid Algorithm. IEEE Access 2021, 9, 50982–50993. [Google Scholar] [CrossRef]
Zhang, T.; Shen, S.; Cheng, C.; Su, K.; Zhang, X. A Topic Model Based Framework for Identifying the Distribution of Demand for Relief Supplies Using Social Media Data. Int. J. Geogr. Inf. Sci. 2021, 35, 2216–2237. [Google Scholar] [CrossRef]
Wang, S.L.; Sun, B.Q. Model of Multi-Period Emergency Material Allocation for Large-Scale Sudden Natural Disasters in Humanitarian Logistics: Efficiency, Effectiveness and Equity. Int. J. Disaster Risk Reduct. 2023, 85, 103530. [Google Scholar] [CrossRef]
Fei, L.; Wang, Y. Demand Prediction of Emergency Materials Using Case-Based Reasoning Extended by the Dempster-Shafer Theory. Socio-Econ. Plan. Sci. 2022, 84, 101386. [Google Scholar] [CrossRef]
Liu, Y.; Tian, J.; Feng, G. Pre-Positioning Strategies for Relief Supplies in a Relief Supply Chain. J. Oper. Res. Soc. 2021, 73, 1457–1473. [Google Scholar] [CrossRef]
Toland, J.C.; Wein, A.M.; Wu, A.-M.; Spearing, L.A. A Conceptual Framework for Estimation of Initial Emergency Food and Water Resource Requirements in Disasters. Int. J. Disaster Risk Reduct. 2023, 90, 103661. [Google Scholar] [CrossRef]
Zhang, M.; Kong, Z. A Tripartite Evolutionary Game Model of Emergency Supplies Joint Reserve among the Government, Enterprise and Society. Comput. Ind. Eng. 2022, 169, 108132. [Google Scholar] [CrossRef]
Yang, X.; Yao, Y.; Tian, K.; Jiang, W.; Xing, Q.; Yang, J.; Liu, C. Disaster Response Strategies of Governments and Social Organizations: From the Perspective of Infrastructure Damage and Asymmetric Resource Dependence. Heliyon 2023, 9, e20432. [Google Scholar] [CrossRef]
Sentia, P.D.; Abdul Shukor, S.; Wahab, A.N.A.; Mukhtar, M. Logistic Distribution in Humanitarian Supply Chain Management: A Thematic Literature Review and Future Research. Ann. Oper. Res. 2023, 323, 175–201. [Google Scholar] [CrossRef]
Chen, M. Optimal Path Planning and Data Simulation of Emergency Material Distribution Based on Improved Neural Network Algorithm. Soft Comput. 2023, 27, 5995–6005. [Google Scholar] [CrossRef]
Akter, S. Australia’s Black Summer Wildfires Recovery: A Difference-in-Differences Analysis Using Nightlights. Glob. Environ. Change Hum. Policy Dimens. 2023, 83, 102743. [Google Scholar] [CrossRef]
Akbulut-Yuksel, M.; Rahman, M.H.; Ulubaşoğlu, M.A. Silver Lining of the Water: The Role of Government Relief Assistance in Disaster Recovery. Eur. J. Polit. Econ. 2023, 79, 102436. [Google Scholar] [CrossRef]
Lu, Y.; Li, R.; Mao, X.; Wang, S. Towards Comprehensive Regional Resilience Evaluation, Resistance, Recovery, and Creativity: From the Perspective of the 2008 Wenchuan Earthquake. Int. J. Disaster Risk Reduct. 2022, 82, 103313. [Google Scholar] [CrossRef]
Marlier, M.E.; Resetar, S.A.; Lachman, B.E.; Anania, K.; Adams, K. Remote Sensing for Natural Disaster Recovery: Lessons Learned from Hurricanes Irma and Maria in Puerto Rico. Environ. Sci. Policy 2022, 132, 153–159. [Google Scholar] [CrossRef]
Bahmani, H.; Zhang, W. A Conceptual Framework for Integrated Management of Disasters Recovery Projects. Nat. Hazards 2022, 113, 859–885. [Google Scholar] [CrossRef]
Newman, G.; Li, D.; Park, Y. The Relationships between Neighbourhood Vacancy, Probable PTSD, and Health-Related Quality of Life in Flood-Disaster-Impacted Communities. Urban Stud. 2022, 59, 3077–3097. [Google Scholar] [CrossRef]
Witt, A.; Sachser, C.; Fegert, J.M. Scoping Review on Trauma and Recovery in Youth after Natural Disasters: What Europe Can Learn from Natural Disasters around the World. Eur. Child Adolesc. Psych. 2022, 33, 651–665. [Google Scholar] [CrossRef] [PubMed]
Pham, N.K.; Do, M.; Diep, J. Social Support and Community Embeddedness Protect against Post-Disaster Depression among Immigrants: A Vietnamese American Case Study. Front. Psychiatry 2023, 14, 1075678. [Google Scholar] [CrossRef]
Wang, D.; Liu, J. Resource Allocation, Individual Social Network, Community Trust and Recovery from Depression among Rural Survivors in the Wenchuan Earthquake. Curr. Psychol. 2023, 43, 328–339. [Google Scholar] [CrossRef] [PubMed]
Li, L.; Ma, Z.; Cao, T. Data-Driven Investigations of Using Social Media to Aid Evacuations amid Western United States Wildfire Season. Fire Saf. J. 2021, 126, 103480. [Google Scholar] [CrossRef]
Hunt, K.; Wang, B.; Zhuang, J. Misinformation Debunking and Cross-Platform Information Sharing through Twitter during Hurricanes Harvey and Irma: A Case Study on Shelters and ID Checks. Nat. Hazards 2020, 103, 861–883. [Google Scholar] [CrossRef]
Chen, X. Monitoring of Public Opinion on Typhoon Disaster Using Improved Clustering Model Based on Single-Pass Approach. SAGE Open 2023, 13, 21582440231200098. [Google Scholar] [CrossRef]
Lian, Y.; Liu, Y.; Dong, X. Strategies for Controlling False Online Information during Natural Disasters: The Case of Typhoon Mangkhut in China. Technol. Soc. 2020, 62, 101265. [Google Scholar] [CrossRef]
Cheong, S.-M.; Babcock, M. Attention to Misleading and Contentious Tweets in the Case of Hurricane Harvey. Nat. Hazards 2020, 105, 2883–2906. [Google Scholar] [CrossRef]
Hunt, K.; Agarwal, P.; Zhuang, J. Monitoring Misinformation on Twitter During Crisis Events: A Machine Learning Approach. Risk Anal. 2020, 42, 1728–1748. [Google Scholar] [CrossRef]
Vicari, R.; Komendatova, N. Systematic Meta-Analysis of Research on AI Tools to Deal with Misinformation on Social Media during Natural and Anthropogenic Hazards and Disasters. Hum. Soc. Sci. Commun. 2023, 10, 332. [Google Scholar] [CrossRef]
Byrd, K.; John, R.S. Lies, Damned Lies, and Social Media Following Extreme Events. Risk Anal. 2021, 42, 1704–1727. [Google Scholar] [CrossRef] [PubMed]
Li, S.; Wang, Y.; Huang, H.; Zhou, Y. Study on the Rumor Detection of Social Media in Disaster Based on Multi-Feature Fusion Method. Nat. Hazards 2023, 120, 4011–4030. [Google Scholar] [CrossRef]
Yuan, F.; Li, M.; Liu, R.; Zhai, W.; Qi, B. Social Media for Enhanced Understanding of Disaster Resilience during Hurricane Florence. Int. J. Inf. Manag. 2021, 57, 102289. [Google Scholar] [CrossRef]
Jin, X.; Spence, P.R. Understanding Crisis Communication on Social Media with CERC: Topic Model Analysis of Tweets about Hurricane Maria. J. Risk Res. 2020, 24, 1266–1287. [Google Scholar] [CrossRef]
Boon-Itt, S.; Skunkan, Y. Public Perception of the COVID-19 Pandemic on Twitter: Sentiment Analysis and Topic Modeling Study. JMIR Public Health Surveill. 2020, 6, e21978. [Google Scholar] [CrossRef] [PubMed]
Yuan, F.; Li, M.; Liu, R. Understanding the Evolutions of Public Responses Using Social Media: Hurricane Matthew Case Study. Int. J. Disaster Risk Reduct. 2020, 51, 101798. [Google Scholar] [CrossRef]
Guo, D.; Zhao, Q.; Chen, Q.; Wu, J.; Li, L.; Gao, H. Comparison between Sentiments of People from Affected and Non-Affected Regions after the Flood. Geomat. Nat. Hazards Risk 2021, 12, 3346–3357. [Google Scholar] [CrossRef]
Mendon, S.; Dutta, P.; Behl, A.; Lessmann, S. A Hybrid Approach of Machine Learning and Lexicons to Sentiment Analysis: Enhanced Insights from Twitter Data of Natural Disasters. Inf. Syst. Front. 2021, 23, 1145–1168. [Google Scholar] [CrossRef]
Zhou, S.; Kan, P.; Huang, Q.; Silbernagel, J. A Guided Latent Dirichlet Allocation Approach to Investigate Real-Time Latent Topics of Twitter Data during Hurricane Laura. J. Inf. Sci. 2021, 49, 465–479. [Google Scholar] [CrossRef]
Chen, Y.; Ji, W. Enhancing Situational Assessment of Critical Infrastructure Following Disasters Using Social Media. J. Manag. Eng. 2021, 37, 04021058. [Google Scholar] [CrossRef]
Sugino, H.; Sekiguchi, T.; Terada, Y.; Hayashi, N. “Future Compass”, a Tool That Allows Us to See the Right Horizon—Integration of Topic Modeling and Multiple-Factor Analysis. Sustainability 2023, 15, 10175. [Google Scholar] [CrossRef]
Zhang, T.; Cheng, C. Temporal and Spatial Evolution and Influencing Factors of Public Sentiment in Natural Disasters—A Case Study of Typhoon Haiyan. ISPRS Int. J. Geo Inf. 2021, 10, 299. [Google Scholar] [CrossRef]
Xu, Z. How Emergency Managers Engage Twitter Users during Disasters. Online Inf. Rev. 2020, 44, 933–950. [Google Scholar] [CrossRef]
Karimiziarani, M.; Moradkhani, H. Social Response and Disaster Management: Insights from Twitter Data Assimilation on Hurricane Ian. Int. J. Disaster Risk Reduct. 2023, 95, 103865. [Google Scholar] [CrossRef]
Karimiziarani, M.; Shao, W.; Mirzaei, M.; Moradkhani, H. Toward Reduction of Detrimental Effects of Hurricanes Using a Social Media Data Analytic Approach: How Climate Change Is Perceived? Clim. Risk manag. 2023, 39, 100480. [Google Scholar] [CrossRef]
Kumar, V.V.; Sahoo, A.; Balasubramanian, S.K.; Gholston, S. Mitigating Healthcare Supply Chain Challenges under Disaster Conditions: A Holistic AI-Based Analysis of Social Media Data. Int. J. Prod. Res. 2024, 1–19. [Google Scholar] [CrossRef]
Ma, M.; Gao, Q.; Xiao, Z.; Hou, X.; Hu, B.; Jia, L.; Song, W. Analysis of Public Emotion on Flood Disasters in Southern China in 2020 Based on Social Media Data. Nat. Hazards 2023, 118, 1013–1033. [Google Scholar] [CrossRef]
Wang, H.; Mo, Y. Adaptive Hybrid Optimization Algorithm for Numerical Computing in Engineering Applications. Eng. Optimiz. 2024, 1–39. [Google Scholar] [CrossRef]
Mckay, M.D.; Beckman, R.J.; Conover, W.J. A Comparison of Three Methods for Selecting Values of Input Variables in the Analysis of Output From a Computer Code. Technometrics 2000, 42, 55–61. [Google Scholar] [CrossRef]
Dehghani, M.; Trojovský, P. Osprey Optimization Algorithm: A New Bio-Inspired Metaheuristic Algorithm for Solving Engineering Optimization Problems. Front. Mech. Eng. 2023, 8, 1126450. [Google Scholar] [CrossRef]
Chen, Y.; Dong, W.; Hu, X. IMATSA—An Improved and Adaptive Intelligent Optimization Algorithm Based on Tunicate Swarm Algorithm. AI Commun. 2024, 37, 1–22. [Google Scholar] [CrossRef]
Wu, A.; Gong, R.; Mao, J.; Yu, X.; He, J.; Li, E. Voltage Feed-Forward Control of Photovoltaic- Battery DC Microgrid Based on Improved Seeker Optimization Algorithm. IEEE Access 2024, 12, 46067–46080. [Google Scholar] [CrossRef]
Fan, F.; Cheng, X.; Yan, X.; Wu, Y.; Luo, Z. Multi-objective Firefly Algorithm Combining Logistic Mapping and Cauchy Mutation. Concurr. Comput. Pract. Exp. 2023, 36, e7974. [Google Scholar] [CrossRef]
Bansal, S. Performance Comparison of Five Metaheuristic Nature-Inspired Algorithms to Find near-OGRs for WDM Systems. Artif. Intell. Rev. 2020, 53, 5589–5635. [Google Scholar] [CrossRef]
King, K.K.; Wang, B. Diffusion of Real versus Misinformation during a Crisis Event: A Big Data-Driven Approach. Int. J. Inf. Manag. 2023, 71, 102390. [Google Scholar] [CrossRef]

Figure 1. The architecture of the IDBO-CNN-BiLSTM model.

Figure 2. Comparison of two initialization methods.

Figure 3. The structure of the BiLSTM network.

Figure 4. The number of two sentiment labels.

Figure 5. The comparison of accuracy.

Figure 6. The comparison of confusion matrices for single models: (a) CNN; (b) RNN; (c) GRU; (d) LSTM; (e) BiLSTM.

Figure 7. The comparison of confusion matrices for hybrid models: (a) CNN-BiLSTM; (b) GWO-CNN-BiLSTM; (c) WOA-CNN-BiLSTM; (d) DBO-CNN-BiLSTM; (e) IDBO-CNN-BiLSTM.

Table 1. The description of the parameters in Equations (1) and (2).

Parameters	Description
$t$	The current iteration number
$x_{i} (t)$	The $i$ th dung beetle’s position information at the $t$ th iteration
$α$	A natural coefficient assigned as −1 (deviation) or 1 (no deviation)
$k$	A constant value representing the deflection coefficient in the interval (0, 0.2]
$b$	A constant value belonging to (0, 1)
$X^{W}$	The worst global position
$∆ x$	The simulation of light intensity change

Table 2. The description of the parameters in Equations (4)–(7).

Parameters	Description
$X^{*}$	The current local optimal position
$L b^{*}$	The spawning zone’s lower boundary
$U b^{*}$	The spawning zone’s upper boundary
$T_{m a x}$	The maximum iterations
$L b$	The optimization issue’s lower boundary
$U b$	The optimization issue’s upper boundary
$B_{i} (t)$	The $i$ th brood ball’s location information at the $t$ th iteration
$b_{1}$ , $b_{2}$	The stochastic vectors by size 1 × $D$
$D$	The optimization problem’s dimension

Table 3. The description of the parameters in Equations (8)–(10).

Parameters	Description
$X^{b}$	The global optimal position
$L b^{b}$	The lower boundary of the optimal foraging zone
$U b^{b}$	The upper boundary of the optimal foraging zone
$x_{i} (t)$	The $i$ th small dung beetle’s position information at the $t$ th iteration
$C_{1}$	A stochastic value following the normal distribution
$C_{2}$	A stochastic vector belonging to (0, 1)

Table 4. The description of the parameters in Equation (11).

Parameters	Description
$x_{i} (t)$	The $i$ th thief’s position information at the $t$ th iteration
$g$	A stochastic vector following the normal distribution by size 1 × $D$
$S$	A constant value

Table 5. The description of the parameters in Equation (13).

Parameters	Description
$x_{i j}$	The $i$ th osprey’s position information at the $j$ th dimension
$r_{i j}$	A stochastic value within the scope [0, 1]
${S F}_{i j}$	The location information of the fish chosen by the $i$ th osprey at the $j$ th dimension
$I_{i j}$	A stochastic value from {1, 2}

Table 6. The description of the parameters in Equation (14).

Parameters	Description
$X_{i} (t)$	The $i$ th dung beetle’s location information at the $t$ th iteration
$r a n d$	A stochastic value in the interval [0, 1]
$X^{'}$	The selected better position of the dung ball
$F$	A stochastic value from {1, 2}

Table 7. The description of the parameters in Equation (15).

Parameters	Description
$X^{b} (t)$	The individual’s optimal position at the $t$ th iteration
$μ_{1}$ , $μ_{2}$	The weight coefficient of the mutation operator
$G a u s s (σ)$	The Gaussian mutation operator
$C a u c h y (σ)$	The Cauchy mutation operator

Table 8. The description of the parameters in Equation (18).

Parameters	Description
$f$	The ReLU activation function
$X$	The input word embedding matrix
$K$	The convolutional kernel matrix
$b$	The bias term

Table 9. The examples of tweets with sentiment labels.

Tweets	Sentiment Labels
Thank you to the many volunteers & farmers from North Dakota who harvested sweet corn & delivered it to the Food Bank for hurricane victims!	0
I need food and water. This freaking hurricane ruins everything!	1

Table 10. The hyperparameter settings of this experiment.

Hyperparameters	Value
Optimizer	Adam
Learning rate	0.0001
L2	0.01
Epochs	20
$S$	0.5
$N$	10
$D$	4
Maximum iteration	10
$L b$	[3, 32, 64]
$U b$	[8, 128, 256]

Table 11. The standard binary confusion matrix.

	Predicted Positive Instance	Predicted Negative Instance
Actual Positive Instance	True Positive (TP)	False Negative (FN)
Actual Negative Instance	False Positive (FP)	True Negative (TN)

Table 12. The contrast of evaluation metrics.

Types	Models	Sentiment Labels	Precision	Recall	F1
Single models	CNN	0	0.6829	0.7145	0.6983
	CNN	1	0.7612	0.7329	0.7468
	RNN	0	0.6794	0.5321	0.5968
	RNN	1	0.6793	0.7978	0.7338
	GRU	0	0.7961	0.6069	0.6887
	GRU	1	0.7343	0.8748	0.7985
	LSTM	0	0.7343	0.7354	0.7349
	LSTM	1	0.7867	0.7858	0.7863
	BiLSTM	0	0.7085	0.8102	0.7559
	BiLSTM	1	0.8272	0.7316	0.7765
Hybrid models	CNN-BiLSTM	0	0.7231	0.7848	0.7527
	CNN-BiLSTM	1	0.8140	0.7581	0.7850
	GWO-CNN-BiLSTM	0	0.7185	0.8356	0.7726
	GWO-CNN-BiLSTM	1	0.8476	0.7365	0.7882
	WOA-CNN-BiLSTM	0	0.7518	0.7608	0.7563
	WOA-CNN-BiLSTM	1	0.8056	0.7978	0.8017
	DBO-CNN-BiLSTM	0	0.7436	0.7803	0.7615
	DBO-CNN-BiLSTM	1	0.8158	0.7834	0.7993
Proposed model	IDBO-CNN-BiLSTM	0	0.7783	0.7818	0.7800
Proposed model	IDBO-CNN-BiLSTM	1	0.8237	0.8207	0.8222

Table 13. The optimal hyperparameters and runtime of four models.

Models	Convolutional Filters	Convolutional Kernel Sizes	LSTM Units 1	LSTM Units 2	Runtime (Seconds)
GWO-CNN-BiLSTM	76	4	197	120	3432.0266
WOA-CNN-BiLSTM	128	4	203	131	1711.1468
DBO-CNN-BiLSTM	82	6	177	128	1778.9641
IDBO-CNN-BiLSTM	32	3	64	87	1936.3141

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mu, G.; Li, J.; Li, X.; Chen, C.; Ju, X.; Dai, J. An Enhanced IDBO-CNN-BiLSTM Model for Sentiment Analysis of Natural Disaster Tweets. Biomimetics 2024, 9, 533. https://doi.org/10.3390/biomimetics9090533

AMA Style

Mu G, Li J, Li X, Chen C, Ju X, Dai J. An Enhanced IDBO-CNN-BiLSTM Model for Sentiment Analysis of Natural Disaster Tweets. Biomimetics. 2024; 9(9):533. https://doi.org/10.3390/biomimetics9090533

Chicago/Turabian Style

Mu, Guangyu, Jiaxue Li, Xiurong Li, Chuanzhi Chen, Xiaoqing Ju, and Jiaxiu Dai. 2024. "An Enhanced IDBO-CNN-BiLSTM Model for Sentiment Analysis of Natural Disaster Tweets" Biomimetics 9, no. 9: 533. https://doi.org/10.3390/biomimetics9090533

APA Style

Mu, G., Li, J., Li, X., Chen, C., Ju, X., & Dai, J. (2024). An Enhanced IDBO-CNN-BiLSTM Model for Sentiment Analysis of Natural Disaster Tweets. Biomimetics, 9(9), 533. https://doi.org/10.3390/biomimetics9090533

Article Menu

An Enhanced IDBO-CNN-BiLSTM Model for Sentiment Analysis of Natural Disaster Tweets

Abstract

1. Introduction

2. Literature Review

2.1. Natural Disasters

2.2. Social Media Analysis of Natural Disasters

3. Method

3.1. The DBO Algorithm

3.1.1. The Ball-Rolling Dung Beetles

3.1.2. The Brood Balls

3.1.3. The Small Dung Beetles

3.1.4. The Stealing Dung Beetles

3.2. The Proposed IDBO Algorithm

3.2.1. Utilize the Latin Hypercube Sampling for Population Initialization

3.2.2. Integrate the OOA’s Global Prospecting Strategy

3.2.3. Introduce an Adaptive Gaussian–Cauchy Mixture Mutation Disturbance

3.2.4. The IDBO Algorithm’s Time Complexity

3.2.5. The Steps of the IDBO Algorithm

3.3. The CNN-BiLSTM Model

3.3.1. Embedding Layer

3.3.2. 1D Convolutional Layer

3.3.3. 1D Max Pooling Layer

3.3.4. BiLSTM Layer

3.3.5. Dense Layer

4. Empirical Analysis

4.1. Data Collection and Preprocessing

4.2. Experimental Details

4.3. Evaluation Metrics

4.4. Experimental Results

4.4.1. The Contrast of Evaluation Metrics

4.4.2. The Comparison of Confusion Matrices

4.4.3. The Performance Comparison of Four Optimization Algorithms

5. Conclusions and Prospect

5.1. Conclusions

5.2. Suggestion

5.3. Limitation and Future Prospect

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Correction Statement

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI