Triggers and Tweets: Implicit Aspect-Based Sentiment and Emotion Analysis of Community Chatter Relevant to Education Post-COVID-19

: This research proposes a well-being analytical framework using social media chatter data. The proposed framework infers analytics and provides insights into the public’s well-being relevant to education throughout and post the COVID-19 pandemic through a comprehensive Emotion and Aspect-based Sentiment Analysis (ABSA). Moreover, this research aims to examine the variability in emotions of students, parents, and faculty toward the e-learning process over time and across different locations. The proposed framework curates Twitter chatter data relevant to the education sector, identiﬁes tweets with the sentiment, and then identiﬁes the exact emotion and emotional triggers associated with those feelings through implicit ABSA. The produced analytics are then factored by location and time to provide more comprehensive insights that aim to assist the decision-makers and personnel in the educational sector enhance and adapt the educational process during and following the pandemic and looking toward the future. The experimental results for emotion classiﬁcation show that the Linear Support Vector Classiﬁer (SVC) outperformed other classiﬁers in terms of overall accuracy, precision, recall, and F-measure of 91%. Moreover, the Logistic Regression classiﬁer outperformed all other classiﬁers in terms of overall accuracy, recall, an F-measure of 81%, and precision of 83% for aspect classiﬁcation. In online experiments using UAE COVID-19 education-related data, the analytics show high relevance with the public concerns around the education process that were reported during the experiment’s timeframe.


Introduction
The COVID-19 outbreak started in November 2019 and has been declared a global pandemic by WHO since March 2020 [1]. Several precautions have been recommended to fight its spread. As a result, all modes of public life shifted to online platforms to the full extent possible. Education around the world experienced a major interruption as educational institutions were forced to shift to distance learning. Some institutes were ready for the transition; however, the overwhelming majority were not [2]. Subsequently, many challenges have been reported concerning the new pedagogy, the online examination environment, and the lack of interaction among students as well as with their teachers [3,4]. In connection to these concerns, increased levels of anxiety, stress, and fear have been reported [5].
Several survey-based studies have attempted to investigate the impact of these changes on students [6] and teachers [7], as well as the efficacy of the educational process in its current state. These surveys were limited by time, sample size, and target group, making them less informative [8,9]. A few surveys addressed mental health-related factors affecting students and teachers post-lockdown, yet those surveys suffer from the same limitations [10][11][12]. Moreover, in addition to being limited by time, sample size, location, and focus, existing questionnaires

Literature Review
In this section, we review the literature related to the impact of the COVID-19 outbreak on mental health in the education sector. In addition, we survey some research works addressing ABSA in the field of education.

COVID-19 Outbreak Impact on the Education Sector Using Surveying Techniques
Several studies have investigated the impact of the COVID-19 outbreak on the educational sector through surveys and questionnaires. These research works relied on online surveys, interviews, or questionnaires with a variety of sampling and analysis techniques either through specific platforms, social media, journals, and literature, or a combination of these sources. Based on review and comparison, it has been observed that the surveying techniques are limited in terms of the study focus, targeted population and sample size, location, and time interval.
The reviewed literature was mainly focused on the effect of the pandemic on the education sector [25], including learning efficacy [26][27][28] and evaluating the efficiency of distance learning as an alternative solution during pandemics [29], as well as the impact on students. Different groups of students were targeted in different studies; for example, the majority of the studies primarily targeted higher education students, with some focused on specific colleges [30] or limited-income students [31]. Such studies targeting narrow populations may provide inaccurate insights on a greater scale. Therefore, in an attempt to Big Data Cogn. Comput. 2022, 6, 99 3 of 23 overcome this limitation and provide better insight, some studies were focused on assessing the impact on students and their parents to provide a more wholesome analysis [32,33].
Several studies were focused on assessing the mental health of students, parents, and educational personnel post-outbreak. Even though some studies targeted relatively large geographical areas, surveying responders from four countries [25], most of the studies were limited to specific locations. For instance, the studies focused on the mental health assessment of college students were limited to specific countries such as the United States [34,35], Saudi Arabia [36], and Bangladesh [37]. Similarly, the studies focused on assessing the impact on parents' mental health were limited in geographical coverage, targeting parents only in the United States [38] and China [39]. Moreover, some studies were focused on the evaluation of mental health services during the pandemic [40]; however, these were more limited in geographical coverage as they only evaluated mental health services in New York City.
A couple of studies targeted a wider range of respondents in the educational sector, including staff, educators, and students [25,41]. However, such studies were conducted in a limited time interval, for example, during the early phase of the pandemic [33]. This disregards the importance of the preceding intervals on the comprehension of the studies. However, Alqahtani et al. [27] addressed this limitation by allowing the user to adjust the timeframe as appropriate.
While most of these studies revealed negative impacts, some have revealed some positive impacts, reporting agility, innovation, and increased technological skills in some cases. While these results are not general, they are valuable. The overall conclusion inferred from these studies is that distance learning can be a temporary alternative but not a complete substitute as it still lacks efficiency. Table 1 provides a comparison of these studies. All the studies were compared across the following criteria: surveying method used, respondents' sample, the focus of the study, as well as country and language of the survey.

Aspect-Based Sentiment Analysis in the Field of Education
Aspect-based Sentiment Analysis (ABSA) has been deployed for education evaluation purposes since before the emergence of COVID-19. Multiple researchers have investigated the practicality of using the Natural Language Processing (NLP) technique to evaluate reviews and opinions on different topics and aspects. For instance, in 2019, Nikola et al. [21] proposed an ABSA system for the automated mining of free text reviews. This proposal aimed to monitor Serbian university students' satisfaction through two sets of data sources: official student opinion surveys and review websites such as "Rate my professors" [42]. While this research proved successful in sentiment polarity detection for both data sets, it concluded that the source of the reviews highly affects the quality of the ABSA. Separately, Kastrati et al. [23] proposed a weakly supervised ABSA framework to provide educators and course designers with insight into the main factors affecting the educational process through students' opinions. This framework was tested on two real-world datasets collected from both online and traditional classroom setups. The results of this research provide insight into the effectiveness of online courses in general when applied to students' feedback. In another work, Sindhu et al. [24] proposed the use of a two-layered Long Short-term Memory (LSTM) model for aspect extraction and sentiment polarity detection on two manually tagged datasets. The presence of multiple aspects within a sentence without connectives affected the accuracy of the system, introducing a drawback to this implementation.
Other ABSA systems have used social media networks as the data source. Balachandran et al. [22] proposed an online review system dedicated to higher education institutions. This proposed system aims to assist students in the institution selection process through the feedback retrieved from social network sites such as Facebook and Twitter. Data collection was through the Application Programming Interface (API) of Facebook and Twitter, which provide large quantities of relatively clean data. The end product is a system that categorizes reviews based on sentiment polarity, generates statistical summaries, and provides an aspect-based evaluation of the institutions and recommendations. However, this implementation is still limited to certain data sources. In addition, Alassaf et al. [43] implemented ABSA using a hybrid selection method on Arabic tweets. This implementation extracts the aspects from education-related categories, including quality of teaching, electronic services, activities, etc. However, this was only limited to Arabic tweets related to Qassim University.
To the best of the authors' knowledge, none of the extant research that was aimed at evaluating the educational process during the COVID-19 pandemic proposed the use of ABSA. However, the closest proposal implemented the Ensemble Learning-based Sentiment Analysis (ELSA) algorithm to assess the adoption rate of e-learning during COVID-19 in the educational sector [44]. Table 2 provides a comparison of the different literature reviews that were summarized above. The comparison is across the method used, the focus of the study, the data source, and the language used for the natural language input.

Contribution
After the critical analysis of the reviewed literature, several limitations were identified. In addition to the limitations of traditional surveying methods, these studies were limited in terms of the targeted population, source and method of data collection, as well as geographical coverage and time interval; hence, the inferred insights. These limitations accentuated a more adaptive approach to infer more representative insights relevant to variable locations, timeframes, and events. In contrast to surveying techniques, social media platforms offer real-time access to public opinions and emotions across the world, relevant to several life events [22,43,44].
To address the limitations inherent to traditional surveys and other techniques presented in the previous research studies, this research proposes a real-time automatic analytics framework to provide representative and timely insights into the varying impacts of several life events, such as COVID-19, on the public relevant to the educational sector. This proposed framework considers feedback from all groups of society, especially the main stakeholders in the educational sector, such as students, parents, and educational personnel using social media chatter data. We focus specifically on Twitter, yet the proposed methods can be applied to other social media platforms. The proposed framework is not limited to a specific time or location; rather, it monitors public emotion continuously and allows several temporal and spatial filters for more fine-grained analysis. Furthermore, this study is the first to propose this mode of analysis for the educational sector. Previous studies proposing aspects-based sentiment analysis in the field of education addressed different types of concerns and targeted different populations. However, this study is more holistic in its analysis, target group, and coverage.

Overall System Description
The proposed framework is composed of three main modules. The first module is responsible for social media data curation. The second module is focused on data preprocessing and analysis. It generates several analytics using machine learning models. The last module is the web interface providing access to different stakeholders to realtime analytics based on several filtering parameters such as location, time, and aspects. The proposed framework is illustrated in Figure 1. In subsequent sections, we further describe the detailed components of the second module, which is responsible for producing well-being analytics.

Analytics Module
The second module, depicted in Figure 2, is responsible for generating several insights into the public's emotions relevant to the education sector. It performs three main tasks. These are: (i) Preprocessing, (ii) Sentiment analysis, and (iii) Emotion and Aspectbased classification. In subsequent sections, we provide a detailed explanation of each of these tasks.

Data Preprocessing
For the purpose of experiments, we use several Twitter Chatter training datasets and produce a UAE-specific COVID-19 dataset to test the proposed analytics framework on UAE-specific data. The training dataset for emotion classification and ABSA are explained in the respective sections: Sections 3.2.3 and 3.2.4.
The experimental UAE COVID-19 Twitter Chatter Dataset [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: For the purpose of experiments, we use several Twitter Chatter training datasets and produce a UAE-specific COVID-19 dataset to test the proposed analytics framework on UAE-specific data. The training dataset for emotion classification and ABSA are explained in the respective sections: Section 3.2.3 and Section 3.2.4.
The experimental UAE COVID-19 Twitter Chatter Dataset [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value. For instance, the following cleaning tasks were conducted: For the purpose of experiments, we use several Twitter Chatter training datasets and produce a UAE-specific COVID-19 dataset to test the proposed analytics framework on UAE-specific data. The training dataset for emotion classification and ABSA are explained in the respective sections: Section 3.2.3 and Section 3.2.4.
The experimental UAE COVID-19 Twitter Chatter Dataset [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value.  [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were  [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets. The experimental UAE COVID-19 Twitter Chatter Dataset [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this The experimental UAE COVID-19 Twitter Chatter Dataset [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data Retweets: number of times the Tweet was retweeted;

Data Preprocessing
For the purpose of experiments, we use several Twitter Chatter training datasets and produce a UAE-specific COVID-19 dataset to test the proposed analytics framework on UAE-specific data. The training dataset for emotion classification and ABSA are explained in the respective sections: Section 3.2.3 and Section 3.2.4.
The experimental UAE COVID-19 Twitter Chatter Dataset [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into Place: includes the full geotag provided by Twitter in JSON format.
Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data Big Data Cogn. Comput. 2022, 6, 99 8 of 23 used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value. For instance, the following cleaning tasks were conducted:

Data Preprocessing
For the purpose of experiments, we use several Twitter Chatter training datasets and produce a UAE-specific COVID-19 dataset to test the proposed analytics framework on UAE-specific data. The training dataset for emotion classification and ABSA are explained in the respective sections: Section 3.2.3 and Section 3.2.4.
The experimental UAE COVID-19 Twitter Chatter Dataset [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value. For instance, the following cleaning tasks were conducted:


Stopwords and common word removal [47]: commonly repeated words that do not bear relevant emotional or sentimental orientation were extracted, such as "covid", "covid19", and "corona", "a", "the", "in", etc.  Single and double character, and punctuation and special character removal  Conversion to lowercase so as to eliminate redundancy caused by letters' capitalization. For instance: "DANGEROUS," "Dangerous," and "dangerous" can be considered two different features if not converted to lowercase.  Stemming and lemmatization reduce feature space dimensionality and reduce redundancy [48].  Filter the dataset to create an education-related chatter dataset by extracting the tweets that contain any morphological derivation of the stem of a set of keywords related to the educational context such as education, school, university, teacher, professor, exam, learning, etc.
Some examples of education-related tweets extracted from the UAE Twitter Chatter Dataset are illustrated in Table 3. Table 3. Sample education-related UAE Twitter Chatter Data.

Education-Related Tweets
"Great meeting with UAE Min. Educ HE Hussain Al Hammadi-focused on e-learning especially under COVID, training, curriculum review to match 21st century learning expectations, etc. These will be fleshed out in MOU to be signed between UAE and Sierra Leone." "From vital COVID-19 Management in a particular challenging time for schools to emergency First Aid management, Health checks, Health education and general day to day support whilst running busy health care centers in our schools, we are grateful for our Healthcare teams." "Teachers have been on the frontline every day during Covid. So why scapegoat them What an eye opener." Stopwords and common word removal [47]: commonly repeated words that do not bear relevant emotional or sentimental orientation were extracted, such as "covid", "covid19", and "corona", "a", "the", "in", etc. we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value.
For instance, the following cleaning tasks were conducted:  Stopwords and common word removal [47]: commonly repeated words that do not bear relevant emotional or sentimental orientation were extracted, such as "covid", "covid19", and "corona", "a", "the", "in", etc.  Single and double character, and punctuation and special character removal  Conversion to lowercase so as to eliminate redundancy caused by letters' capitalization. For instance: "DANGEROUS," "Dangerous," and "dangerous" can be considered two different features if not converted to lowercase.  Stemming and lemmatization reduce feature space dimensionality and reduce redundancy [48].  Filter the dataset to create an education-related chatter dataset by extracting the tweets that contain any morphological derivation of the stem of a set of keywords related to the educational context such as education, school, university, teacher, professor, exam, learning, etc.

Some examples of education-related tweets extracted from the UAE Twitter Chatter
Dataset are illustrated in Table 3. Table 3. Sample education-related UAE Twitter Chatter Data.

Education-Related Tweets
"Great meeting with UAE Min. Educ HE Hussain Al Hammadi-focused on e-learning especially under COVID, training, curriculum review to match 21st century learning expectations, etc. These will be fleshed out in MOU to be signed between UAE and Sierra Leone." "From vital COVID-19 Management in a particular challenging time for schools to emergency First Aid management, Health checks, Health education and general day to day support whilst running busy health care centers in our schools, we are grateful for our Healthcare teams." "Teachers have been on the frontline every day during Covid. So why scapegoat them What an eye opener." Single and double character, and punctuation and special character removal we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value.
For instance, the following cleaning tasks were conducted:  Stopwords and common word removal [47]: commonly repeated words that do not bear relevant emotional or sentimental orientation were extracted, such as "covid", "covid19", and "corona", "a", "the", "in", etc.  Single and double character, and punctuation and special character removal  Conversion to lowercase so as to eliminate redundancy caused by letters' capitalization. For instance: "DANGEROUS," "Dangerous," and "dangerous" can be considered two different features if not converted to lowercase.  Stemming and lemmatization reduce feature space dimensionality and reduce redundancy [48].  Table 3. Table 3. Sample education-related UAE Twitter Chatter Data.

Education-Related Tweets
"Great meeting with UAE Min. Educ HE Hussain Al Hammadi-focused on e-learning especially under COVID, training, curriculum review to match 21st century learning expectations, etc. These will be fleshed out in MOU to be signed between UAE and Sierra Leone." "From vital COVID-19 Management in a particular challenging time for schools to emergency First Aid management, Health checks, Health education and general day to day support whilst running busy health care centers in our schools, we are grateful for our Healthcare teams." "Teachers have been on the frontline every day during Covid. So why scapegoat them What an eye opener." Conversion to lowercase so as to eliminate redundancy caused by letters' capitalization. For instance: "DANGEROUS," "Dangerous," and "dangerous" can be considered two different features if not converted to lowercase. we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value.
For instance, the following cleaning tasks were conducted:  Stopwords and common word removal [47]: commonly repeated words that do not bear relevant emotional or sentimental orientation were extracted, such as "covid", "covid19", and "corona", "a", "the", "in", etc.  Single and double character, and punctuation and special character removal  Conversion to lowercase so as to eliminate redundancy caused by letters' capitalization. For instance: "DANGEROUS," "Dangerous," and "dangerous" can be considered two different features if not converted to lowercase.  Stemming and lemmatization reduce feature space dimensionality and reduce redundancy [48].  Table 3. Table 3. Sample education-related UAE Twitter Chatter Data. Stemming and lemmatization reduce feature space dimensionality and reduce redundancy [48]. we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.

Education-Related Tweets
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value.
For instance, the following cleaning tasks were conducted:  Stopwords and common word removal [47]: commonly repeated words that do not bear relevant emotional or sentimental orientation were extracted, such as "covid", "covid19", and "corona", "a", "the", "in", etc.  Single and double character, and punctuation and special character removal  Conversion to lowercase so as to eliminate redundancy caused by letters' capitalization. For instance: "DANGEROUS," "Dangerous," and "dangerous" can be considered two different features if not converted to lowercase.  Stemming and lemmatization reduce feature space dimensionality and reduce redundancy [48].  Table 3. Table 3. Sample education-related UAE Twitter Chatter Data.

Education-Related Tweets
"Great meeting with UAE Min. Educ HE Hussain Al Hammadi-focused on e-learning especially under COVID, training, curriculum review to match 21st century learning expectations, etc. These will be fleshed out in MOU to be signed between UAE and Sierra Leone." "From vital COVID-19 Management in a particular challenging time for schools to emergency First Aid management, Health checks, Health education and general day to day support whilst running busy health care centers in our schools, we are grateful for our Healthcare teams." "Teachers have been on the frontline every day during Covid. So why scapegoat them What an eye opener." Filter the dataset to create an education-related chatter dataset by extracting the tweets that contain any morphological derivation of the stem of a set of keywords related to the educational context such as education, school, university, teacher, professor, exam, learning, etc.
Some examples of education-related tweets extracted from the UAE Twitter Chatter Dataset are illustrated in Table 3. Table 3. Sample education-related UAE Twitter Chatter Data.

Education-Related Tweets
"Great meeting with UAE Min. Educ HE Hussain Al Hammadi-focused on e-learning especially under COVID, training, curriculum review to match 21st century learning expectations, etc. These will be fleshed out in MOU to be signed between UAE and Sierra Leone." "From vital COVID-19 Management in a particular challenging time for schools to emergency First Aid management, Health checks, Health education and general day to day support whilst running busy health care centers in our schools, we are grateful for our Healthcare teams." "Teachers have been on the frontline every day during Covid. So why scapegoat them What an eye opener." "Discussing the procedures and challenges of accepting students for the fall semester of the academic year 2020/2021 during the COVID-19 pandemic. The Admissions Department at the University of Sharjah held the meeting with the Dean of Academic Support Services." "What topics are trending in the workplace with the shift to remote work amid the coronavirus pandemic, online learning related to mindfulness, cybersecurity, and hybrid tech capabilities surged."

Sentiment Analysis
At this stage, the education-related tweets were analyzed to identify the overall sentiment score of the tweet. This task was carried out to identify tweets that have emotional orientation compared to neutral tweets that only contain information, instructions, or announcements that do not reflect the special emotional state of the public.
For the sentiment analysis task, a lexicon-based approach was adopted. Using a lexicon-based sentiment annotation tool [49], a lexicon containing a set of sentiment words, each with an associated sentiment score, was used to assign a polarity score for each word in the tweet. After weighing the individual words in the tweet, an overall normalized sentiment score of the tweet was calculated in the range [−1.0, 1.0]. The score '0 was considered "neutral," a positive score (i.e., greater than zero) was considered "positive," and a negative score (i.e., less than zero) was considered "negative." These generated labels were validated manually by two human experts, and disagreements were resolved. Some examples of labeled tweets are provided in Table 4. From this module, only tweets with explicit polarity and sentiment are fed to the emotion analysis module. Neutral tweets that do not have significant emotional orientation are dropped.

Emotion Analysis
To perform emotion analysis, several machine learning classifiers were trained on an annotated training dataset available in [50], labeled with the six most common human emotions, i.e., Happiness, Sadness, Surprise, Fear, Love, and Anger. The dataset has 21,586 tweets and was collected through the Twitter API using hashtags associated with the six explicit emotions. For example, 'Peace' is associated with 'Happiness'. The statistics of the emotion training dataset are illustrated in Table 5. Sample tweets, along with the associated keywords and the associated emotion labels, are illustrated in Table 6. The emotional analysis was conducted to identify the public's specific emotional state relevant to particular aspects (i.e., concerns) in the field of education. For instance, parents might feel "Sad", "Fearful", or "Angry" about their kid's "Safety" in the case of safety breaches and disease outbreaks in a particular school. On the other hand, parents might be very "Happy" about the school and the government taking all necessary precautions to protect their kids' "Safety" while attending the school. Therefore, to have a comprehensive understanding of the emotional triggers in society relevant to the education sector and the education process, it is important to not only identify emotional triggers but also understand what type of emotions are triggered by these triggers.

Tweets Hashtags Emotion Class
"I feel peaceful and unafraid certain that my god has my best interests at heart" Peace Happy "I watched his face contort in sadness I began to feel regretful of my actions" Regret Sad "I feel very stunned that people got it in a big way" Stun Surprise "I started to feel uncomfortable buzzy short of breath and very mildly panicky" Panic Fear "I was still looking out for good causes that I feel passionate about to volunteer and again last year when a friend introduced me to an organization that packs food rations for needy families" Passion Love "I feel irritated to have missed out direct instruction from master lee is never to be passed up casually I have to admit my body just feels like it needs the rest" Irritate Angry

Aspect-Based Sentiment Analysis
To detect the societal emotional triggers associated with different emotions relevant to the educational process, an education-related Twitter chatter training dataset [51] was used to manually label the tweets with the most representative emotional triggers (i.e., aspects). Three human annotators conducted the annotation and resolved the disagreement. As a result, three distinct aspects were defined to represent the most significant emotional triggers impacting societal emotions and opinions relevant to the educational process. These aspects are "Education Quality and Educational Rights", "Financial Security", and "Safety". The total number of samples in the training dataset is 1179. The statistics of the emotional trigger dataset are summarized in Table 7. These three aspects were identified as the most prevalent emotional triggers around the educational process resulting from the transformation that took place during and post lockdown. Students, parents, educators, and officials shared many concerns about the quality of the education process after the shift to online education. Mixed feelings ranging from an appreciation for the quick resolutions and fast transformation that prevented education's interruption, and fear and worry associated with the efficacy of online education were common on social media chatter. This variability in emotions and the emotional trigger was not only present on social media but was also reported in the news.
According to the United Nations COVID-19 Socio-Economic analysis in the UAE, a decline of 13% in the employment rate is expected in the Arabian Gulf region [52]. The unemployment rate in the UAE has increased to 5% in 2020 alone [53], raising concerns about job losses due to the pandemic among a large segment of the public. The increase in job insecurity has led to financial security concerns [54]. Job and financial insecurities have influenced different segments of society, including teachers, educational staff, and parents. Therefore, they were considered among the key aspects. Moreover, in Australia, for example, studies report that 40,000 tertiary education staff have lost their jobs, 60% of which were held by women [55]. Working parents were among the most impacted segments; for instance, 51.7 million parents have lost their jobs due to COVID-19 in the US alone [56], leaving them unable to pay for their children's tuition. Furthermore, approximately 56% of undergraduate students reported being unable to afford college tuition due to COVID-19 and its consequent job insecurities [57]. Hence, "Education Quality and Educational Rights" were among the key concerns that had to be represented in the aspects. Finally, over 397,000 cases and at least 90 deaths were reported due to COVID-19 in the US higher educational institutions alone [58]. Consequently, more educational staff, parents, and students were concerned about their safety and expressed fear of death; hence, these two factors were considered key aspects. Table 8 illustrates a sample of the tweets with associated aspects.

Tweets Aspect
"Due to the current situation and health issues college students are given chance to complete final exams online. I doubt the efficacy of these exams!!" Education Quality and Educational Rights "After one year I've learned from the pandemic that the world is fragile. It breaks easily and disease is affecting economy. We need cure developed faster to retain jobs and stop attending classes at home." Financial Security "Let's teach our kids preventive measures to reduce the risk and getting stop spread virus." Safety

Experimental Work and Discussion of Results
The offline experiments were conducted to evaluate and identify the most accurate classification model for detecting societal emotions and emotional triggers. Further, the selected prediction models were deployed in a responsive web application to facilitate inference-making regarding public sentiment and the associated emotions and emotional triggers using real-time data. Online experiments on real-time data are demonstrated subsequent section to show the efficacy of the proposed framework in producing relevant analytics using several test scenarios.

Offline Experiments
A comparative evaluation of the four predictive models trained on the normalized frequency bag of words (TF-IDF-BoW) feature vectors is summarized in Tables 7-10. Classification models that reported successful classification results in the literature are used [59][60][61]. These are Logistic Regression, Linear SVC (Support Vector Classifier), Multinomial Naïve Bayes, and Random Forests. For the logistic regression model, the random state was set to 42, and the inverse regularization strength was set to one to control the penalty strength, which can also be effective in multiple classes. Furthermore, the weights of the classes were set to balanced mode, which automatically modifies the weights such that they are inversely proportional to the frequencies of the classes in the input data. For MultinomialNB, the prior class is assigned to none, so priors are adjusted according to the data. The random state was set to zero for the Random state to control the unpredictability of the sample's bootstrapping, and a value of 100 trees was selected. Further, a value of 100 was set for iterations to run, and the penalization's norm for LinearSVC was set to l2. Equations (1)-(4) show the mathematical formulations of the four classification models.
Logistics regression is based on using a logit function (e.g., Sigmoid) to estimate the probability, p, that a binary event, y, will occur. It determines each predictor, X i , independent contribution to the variance in the dependent variable, y, as shown in Equation (1) (1) y = β + β 1 X 1 + β 2 X 2 + . . . + β n X n , where X 1 , X 2 . . . and X n are explanatory variables and y is the dependent variable Linear SVC (Support Vector Classifier) defines a hyperplane that optimizes the separation of the data points to their prospective classes in an n-dimensional space. These data points are thus near the border. It was calculated based on a mathematical model presented in Equation (2) to enable linear domain division [62]. y = WX + γ , where W is a weighted vector , X is input vector , and γ is bias (2) Multinomial Naïve Bayes is based on calculating the conditional probability of each aspect, k, given a predictor, p, as shown in Equation (3): , for class k and predictor p Random Forests classification result is the average of the feature importance over all the trees as shown in Equation (4) [63]: where norm f i ij is the normalized feature importance for i in tree j, which is expressed as : where f i is the feature importance expressed as : node j splits on f eature i ni j ∑ k ∈ all nodes ni k The evaluation was carried out using a hold-out approach where 85% of the dataset was used for training the predictive models, and 15% of the dataset was used to test the models. Accuracy, Precision, Recall, and F-Measure are the performance evaluation metrics used throughout the evaluation of the experimental results. For each of the emotions or emotional trigger classes, such as 'happiness', the classification results could be any of the following:  [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value. For instance, the following cleaning tasks were conducted:


Stopwords and common word removal [47]: commonly repeated words that do not bear relevant emotional or sentimental orientation were extracted, such as "covid", "covid19", and "corona", "a", "the", "in", etc.  Single and double character, and punctuation and special character removal  Conversion to lowercase so as to eliminate redundancy caused by letters' capitalization. For instance: "DANGEROUS," "Dangerous," and "dangerous" can be considered two different features if not converted to lowercase.  Stemming and lemmatization reduce feature space dimensionality and reduce redundancy [48].  Filter the dataset to create an education-related chatter dataset by extracting the tweets that contain any morphological derivation of the stem of a set of keywords related to the educational context such as education, school, university, teacher, professor, exam, learning, etc.
Some examples of education-related tweets extracted from the UAE Twitter Chatter Dataset are illustrated in Table 3. Table 3. Sample education-related UAE Twitter Chatter Data.

Education-Related Tweets
"Great meeting with UAE Min. Educ HE Hussain Al Hammadi-focused on e-learning especially under COVID, training, curriculum review to match 21st century learning expectations, etc. These will be fleshed out in MOU to be signed between UAE and Sierra Leone." "From vital COVID-19 Management in a particular challenging time for schools to emergency First Aid management, Health checks, Health education and general day to day support whilst running busy health care centers in our schools, we are grateful for our Healthcare teams." "Teachers have been on the frontline every day during Covid. So why scapegoat them What an eye opener." True Positive (TP): Predicted emotion is happiness, and the ground truth is happiness; Big Data Cogn. Comput. 2022, 6, x FOR PEER REVIEW 8 of 23

Data Preprocessing
For the purpose of experiments, we use several Twitter Chatter training datasets and produce a UAE-specific COVID-19 dataset to test the proposed analytics framework on UAE-specific data. The training dataset for emotion classification and ABSA are explained in the respective sections: Section 3.2.3 and Section 3.2.4.
The experimental UAE COVID-19 Twitter Chatter Dataset [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value. For instance, the following cleaning tasks were conducted:


Stopwords and common word removal [47]: commonly repeated words that do not bear relevant emotional or sentimental orientation were extracted, such as "covid", "covid19", and "corona", "a", "the", "in", etc.  Single and double character, and punctuation and special character removal  Conversion to lowercase so as to eliminate redundancy caused by letters' capitalization. For instance: "DANGEROUS," "Dangerous," and "dangerous" can be considered two different features if not converted to lowercase.  Stemming and lemmatization reduce feature space dimensionality and reduce redundancy [48].  Filter the dataset to create an education-related chatter dataset by extracting the tweets that contain any morphological derivation of the stem of a set of keywords related to the educational context such as education, school, university, teacher, professor, exam, learning, etc.
Some examples of education-related tweets extracted from the UAE Twitter Chatter Dataset are illustrated in Table 3. Table 3. Sample education-related UAE Twitter Chatter Data.

Education-Related Tweets
"Great meeting with UAE Min. Educ HE Hussain Al Hammadi-focused on e-learning especially under COVID, training, curriculum review to match 21st century learning expectations, etc. These will be fleshed out in MOU to be signed between UAE and Sierra Leone." "From vital COVID-19 Management in a particular challenging time for schools to emergency First Aid management, Health checks, Health education and general day to day support whilst running busy health care centers in our schools, we are grateful for our Healthcare teams." "Teachers have been on the frontline every day during Covid. So why scapegoat them What an eye opener." True Negative (TN): Predicted emotion is any non-happiness emotion, and the ground truth is any non-happiness emotion; Big Data Cogn. Comput. 2022, 6, x FOR PEER REVIEW 8 of 23

Data Preprocessing
For the purpose of experiments, we use several Twitter Chatter training datasets and produce a UAE-specific COVID-19 dataset to test the proposed analytics framework on UAE-specific data. The training dataset for emotion classification and ABSA are explained in the respective sections: Section 3.2.3 and Section 3.2.4.
The experimental UAE COVID-19 Twitter Chatter Dataset [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value. For instance, the following cleaning tasks were conducted:


Stopwords and common word removal [47]: commonly repeated words that do not bear relevant emotional or sentimental orientation were extracted, such as "covid", "covid19", and "corona", "a", "the", "in", etc.  Single and double character, and punctuation and special character removal  Conversion to lowercase so as to eliminate redundancy caused by letters' capitalization. For instance: "DANGEROUS," "Dangerous," and "dangerous" can be considered two different features if not converted to lowercase.  Stemming and lemmatization reduce feature space dimensionality and reduce redundancy [48].  Filter the dataset to create an education-related chatter dataset by extracting the tweets that contain any morphological derivation of the stem of a set of keywords related to the educational context such as education, school, university, teacher, professor, exam, learning, etc.
Some examples of education-related tweets extracted from the UAE Twitter Chatter Dataset are illustrated in Table 3. Table 3. Sample education-related UAE Twitter Chatter Data.

Education-Related Tweets
"Great meeting with UAE Min. Educ HE Hussain Al Hammadi-focused on e-learning especially under COVID, training, curriculum review to match 21st century learning expectations, etc. These will be fleshed out in MOU to be signed between UAE and Sierra Leone." "From vital COVID-19 Management in a particular challenging time for schools to emergency First Aid management, Health checks, Health education and general day to day support whilst running busy health care centers in our schools, we are grateful for our Healthcare teams." "Teachers have been on the frontline every day during Covid. So why scapegoat them What an eye opener." False Positive (FP): Predicted emotion is happiness emotion, while the ground truth is any non-happiness emotion;

Data Preprocessing
For the purpose of experiments, we use several Twitter Chatter training datasets and produce a UAE-specific COVID-19 dataset to test the proposed analytics framework on UAE-specific data. The training dataset for emotion classification and ABSA are explained in the respective sections: Section 3.2.3 and Section 3.2.4.
The experimental UAE COVID-19 Twitter Chatter Dataset [45] is a bilingual UAE GeoTagged dataset available in both English and Arabic languages. In our experiments, we use the English dataset. The raw dataset was collected through Twitter API using filtering keywords and hashtags relevant to COVID-19, such as: "corona", "coronavirus", "COVID", "Covid", and "Corona". The filtering keywords and hashtags were specified in the parsing function of the Twitter API so as to collect UAE, COVID-19-specific chatter data. The structure of the raw dataset is as follows: Since the dataset is collected on hourly bases and chatter data for each hour is stored in a separate CSV file, the first processing task was to integrate all of the distinct files into one main dataset and remove redundancy and repetitions caused by retweets. The data used for these experiments are collected from 11 October 2020 and up to the date of this study, with a total number of 171,873 tweets.
Next, several text cleaning and natural language processing tasks [46] were conducted to clean the chatter data from peculiarities that do not bear any semantic value. For instance, the following cleaning tasks were conducted:


Stopwords and common word removal [47]: commonly repeated words that do not bear relevant emotional or sentimental orientation were extracted, such as "covid", "covid19", and "corona", "a", "the", "in", etc.  Single and double character, and punctuation and special character removal  Conversion to lowercase so as to eliminate redundancy caused by letters' capitalization. For instance: "DANGEROUS," "Dangerous," and "dangerous" can be considered two different features if not converted to lowercase.  Stemming and lemmatization reduce feature space dimensionality and reduce redundancy [48].  Filter the dataset to create an education-related chatter dataset by extracting the tweets that contain any morphological derivation of the stem of a set of keywords related to the educational context such as education, school, university, teacher, professor, exam, learning, etc.
Some examples of education-related tweets extracted from the UAE Twitter Chatter Dataset are illustrated in Table 3. Table 3. Sample education-related UAE Twitter Chatter Data.

Education-Related Tweets
"Great meeting with UAE Min. Educ HE Hussain Al Hammadi-focused on e-learning especially under COVID, training, curriculum review to match 21st century learning expectations, etc. These will be fleshed out in MOU to be signed between UAE and Sierra Leone." "From vital COVID-19 Management in a particular challenging time for schools to emergency First Aid management, Health checks, Health education and general day to day support whilst running busy health care centers in our schools, we are grateful for our Healthcare teams." "Teachers have been on the frontline every day during Covid. So why scapegoat them What an eye opener." False Negative (FN): Predicted emotion is any non-happiness emotion, while the ground truth is happiness.
Accuracy is the percentage of correctly labeled tweets and can be calculated using Equation (5).
Precision is the percentage of correctly predicted positive tweets relative to all positive predictions of each class. Precision represents the sensitivity of the prediction model and can be calculated using Equation (6).
Recall is the percentage of correctly predicted positive tweets relative to all positive tweets in the dataset. Recall represents the completeness of the prediction and can be calculated using Equation (7).
Finally, the F-Measure is the harmonic mean of the recall and precision and is the most representative measure as it is not affected by the class imbalance, it can be calculated using Equation (8). Table 9 illustrates the model-level average Accuracy, Precision, Recall, and F-Measure of Logistic Regression, Linear SVC (Support Vector Classifier), Multinomial Naïve Bayes, and Random Forests for emotion classification. The experimental results show that the Linear SVC classifier outperformed other classifiers in terms of overall accuracy, precision, recall, and had an F-measure of 91%. Hence, Linear SVC was selected to build the prediction model for emotions in the educational tweets in the real-time system. Detailed analysis of prediction quality per class, i.e., per emotion, is presented in Table 10. By analyzing class-wise prediction quality and focusing on the SVC, it is clear that the predictive model is reliable in detecting the correct target emotion with high accuracy and precision, as well as retrieving all related samples to a particular emotion with high recall. The values of precision for the majority class emotions: "Happiness", "Sadness", "Fear, and "Anger" range between 90% and 93%, which indicate that the selected predictive model is sensitive and very precise in detecting the exact emotion in the retrieved chatter data. Moreover, by considering the recall value for the same model, SVC, and for the same majority class emotions, we can see that the recall value ranges between 87% and 95%, which further indicates the completeness of the selected predictive model. It is able to retrieve most of the samples related to a particular emotion. In contrast, "Surprise" and "Love" achieved slightly lower accuracy compared to "Happiness", "Anger", "Fear", and "Sadness". This is due to the fact that the percentage of samples representing "Happiness", "Fear", "Sadness", and "Anger" surpass the percentage of samples belonging to "Surprise" and "Love" in the experimental chatter data by 88%. Furthermore, Figures 3-6 represent the Receiver Operating Characteristic (ROC) curves for the four classifiers, which show excellent performance, with the area under the curve ranging between 80 and 99 for the three most prominent emotions, i.e., Anger, Fear, and Happiness.      Table 11 illustrates the model-based average Accuracy, Precision, Recall, and F-measure of the Logistic Regression, Linear SVC, Multinomial Naïve Bayes, and Random Forests predictions models for emotional trigger (i.e., aspect) classification. The Logistic Regression classifier outperformed all other classifiers in terms of overall accuracy, recall, and F-measure of 81%, and 83% precision. Table 12 demonstrates the class-wise experimental prediction results, i.e., prediction accuracy per emotional trigger. Due to the data imbalance, the class-wise comparison shows significantly improved performance of the four classification models toward the aspect of "Safety", which was most represented in the chatter data constituting 54% of the samples. This is expected as "Safety" would be the most prominent concern of students, parents, and educators. However, despite the data imbalance toward the class "Safety", by considering the performance quality of the Logistic Regression prediction model that produced the best model-based prediction results, we can see that the prediction accuracy for the minority classes (i.e., "Educational Quality and Educational Rights" and "Financial Security"), are within an acceptable range for this type of data. Given the mixed nature of human feelings and concerns, some tweets may be related to multiple aspects, especially when it comes to "Educational Rights" and "Financial Security" this explains the slightly reduced prediction accuracy of these two aspects. Furthermore, Figures 7-10 represent the ROC curves for the four classifiers, which show very good performance with an area under the curve ranging between 88 and 93 for the three aspects using Logistic Regression.

Online Experiments
In this section, we demonstrate several real-time test scenarios conducted through a responsive website to validate the accuracy of the proposed framework in inferring insights about societal emotions and emotional triggers relevant to the educational sector post-COVID-19. These scenarios are based on various parameters to validate whether the proposed analytics framework is capable of inferring accurate emotional insight. Table 13 summarizes real-time test scenarios. In this testing scenario, we demonstrate the analytics of the Twitter chatter data across the UAE from 03 October 2020 to 26 March 2021. Figure 11 shows the resulting sentiment and emotion analytics. The word cloud and the most frequent words bar charts show that "school" and "students" were the most frequent words in the retrieved chatter data confirming the relevance of the retrieved chatter data to the education sector. In addition, the emotion pie chart shows that the emotions of "sadness" and "happiness" dominated the majority of the UAE chatter in the specified timeframe with percentages of 43.3% and 38%, respectively.

Testing Scenario 2
To obtain more fine-grained analytics, in the second testing scenario, we demonstrate the analytics for a specific location (i.e., Abu Dhabi), a smaller timeframe (18 October 2020 to 01 March 2021), and we focus the analysis on a specific emotional trigger/aspect (Safety). The results are shown in Figure 12. The emotional analytics show that the "Happy" emotion is the most prominent societal emotion relevant to "Safety" in the emirate of Abu Dhabi, surpassing all other emotions detected in the public chatter related to education. By looking at the word cloud and word frequency charts, we can see "school", "Learn", "student", and "teacher" among the most frequent words indicating the relevance of the chatter data to the educational domain. We can infer that the public felt happy about the Safety measure taken locally to protect the school students. Prior to the selected timeframe, the Department of Education and Knowledge in Abu Dhabi (ADEK) has released the reopening policies, guidelines, and protocols with a detailed framework, including the preventive measures to be followed and space management, the timeline for resumption of operation, entry requirements for both staff and students, as well as the criteria for reopening; covering all expected and thought-through situations [64]. Therefore, it is evident that the public had a positive attitude and opinions toward the safety measures followed at Abu Dhabi schools.

Testing Scenario 2
To obtain more fine-grained analytics, in the second testing scenario, we demonstrate the analytics for a specific location (i.e., Abu Dhabi), a smaller timeframe (18 October 2020 to 1 March 2021), and we focus the analysis on a specific emotional trigger/aspect (Safety). The results are shown in Figure 12. The emotional analytics show that the "Happy" emotion is the most prominent societal emotion relevant to "Safety" in the emirate of Abu Dhabi, surpassing all other emotions detected in the public chatter related to education. By looking at the word cloud and word frequency charts, we can see "school", "Learn", "student", and "teacher" among the most frequent words indicating the relevance of the chatter data to the educational domain. We can infer that the public felt happy about the Safety measure taken locally to protect the school students. Prior to the selected timeframe, the Department of Education and Knowledge in Abu Dhabi (ADEK) has released the reopening policies, guidelines, and protocols with a detailed framework, including the preventive measures to be followed and space management, the timeline for resumption of operation, entry requirements for both staff and students, as well as the criteria for reopening; covering all expected and thought-through situations [64]. Therefore, it is evident that the public had a positive attitude and opinions toward the safety measures followed at Abu Dhabi schools.
the selected timeframe, the Department of Education and Knowledge in Abu Dhabi (ADEK) has released the reopening policies, guidelines, and protocols with a detailed framework, including the preventive measures to be followed and space management, the timeline for resumption of operation, entry requirements for both staff and students, as well as the criteria for reopening; covering all expected and thought-through situations [64]. Therefore, it is evident that the public had a positive attitude and opinions toward the safety measures followed at Abu Dhabi schools.

Testing Scenario 3
The third testing scenario focuses on the same timeframe used in scenario#1 but targets a different emirate (Dubai) and focuses on a specific emotional trigger (Financial Security). The results are illustrated in Figure 13. Interestingly, despite the fact that the analytics for the first test scenario over the same timeframe revealed mixed societal emotions across all emirates, this test scenario shows that happiness was prominent in Dubai relevant to "Financial Security". We can infer that the public of the Emirate of Dubai felt positive and happy about financial security and were not worried about losing their jobs nor worried about school tuition and fees. This can be further supported by the overall increase in the employment rate of the non-oil private sector in Dubai during the specified timeframe [65].

Testing Scenario 4
To construct a comparison with test scenario#3, scenario#4 focused on the same aspect and timeframe (Financial Security, 03 October 2020 to 26 March 2021) yet targeted a different emirate (Abu Dhabi). Figure 14 shows the emotion analytics results. By looking at the word cloud, we can see "develop" and "Help" are the most frequent in the selected timeframe and in the emirate of Abu Dhabi. In addition, the analytics show that "Anger" was the most prominent emotion in Abu Dhabi relevant to the selected aspect "Financial Security". These analytics are quite representative of the educational sector situation in the selected timeframe when it comes to financial security, as many teachers lost their jobs after the COVID-19 outbreak. Many private educational institutes released teaching and academic staff, which resulted in a great state of anger among the public. As a result, the residents of the emirate of Abu Dhabi felt negative about job security and were concerned about their jobs.

Conclusions
The COVID-19 pandemic has affected every sector worldwide since 2020, and education is no exception. COVID-19 has impacted the mode of delivery as educational institutions around the globe were forced to switch to online learning. This sudden change had an evident influence on the mental health of the public, including educational staff, students, and parents. Consequently, the public expressed their concerns on different social media platforms, mainly Twitter. Although multiple research works have explored the impact of distance learning on the public's well-being through social media, the literature was still limited in the data collection method, study focus, target audience, geographical coverage, and timeframe. To address these limitations in existing research works, his paper proposed an implicit ABSA framework that identifies the emotions and classifies the associated aspects of emotional triggers from education-related Twitter chatter data in the UAE. The proposed framework aims to provide in-depth insights into the sources of the public's concerns relevant to the educational sector through representative spatiotemporal analytics. These comprehensive insights support decision-makers in the educational sector. The context of COVID-19 was a showcase of an application where this framework's results and contribution are most evident; however, the framework is applicable to various contexts and sectors. To the best of the authors' knowledge, this is the first study implementing ABSA for education-related data collected from Twitter during the COVID-19 pandemic and focusing on well-being factors. The experimental results for emotion classification show that the Linear SVC classifier outperformed other classifiers in terms of overall accuracy, precision, recall, and had an F-measure of 91%. Moreover, the Logistic Regression classifier outperformed all other classifiers with overall accuracy, recall, and had an Fmeasure of 81%, and a precision of 83% for aspect classification. In online experiments, using UAE COVID-19 education-related data, the analytics show high relevance with the public concerns around the education process that were reported during the timeframe of the experiment. These results confirm that the proposed analytical framework can produce reliable insights into public emotions and emotional triggers to aid decision-makers in making informed decisions in different real-life events.
This study was focused on English tweets, whereas more representative insights can be inferred by analyzing Arabic, Hindi, and other languages that are spoken in multilingual countries. In the future, this framework will be extended to cover multilingual analytics and offer continuous insights on several societal events relevant to other sectors, not only the educational sector.
Author Contributions: H.I. conceived the idea, proposed the analytics framework, designed the system, and contributed to writing, results analysis, and supervision. N.H. contributed to writing, results analysis, and system design and modeling. A.K. contributed to writing and system design. R.E. contributed to implementation, data curation, and results analysis. All authors have read and agreed to the published version of the manuscript.
Funding: This research project was funded by the Office of Research & Sponsored Programs, Abu Dhabi University.
Institutional Review Board Statement: The paper does not use human and animal subjects as part of the conducted experiments. Experiments were conducted on Twitter chatter data, and the dataset is anonymized in compliance with the Twitter privacy policy.

Informed Consent Statement:
The experiments do not involve any individual details. Experiments were conducted to research Twitter chatter data. All the used datasets are anonymized in compliance with the Twitter privacy policy.
Data Availability Statement: Emotional Triggers annotated data can be requested from the corresponding author: heba.ismail@adu.ac.ae.