Sentimental Knowledge Graph Analysis of the COVID-19 Pandemic Based on the Official Account of Chinese Universities

Li, Xiaolin; Li, Zhiyi; Tian, Yahe

doi:10.3390/electronics10232921

Open AccessArticle

Sentimental Knowledge Graph Analysis of the COVID-19 Pandemic Based on the Official Account of Chinese Universities

by

Xiaolin Li

¹

,

Zhiyi Li

^1,2,*

and

Yahe Tian

³

¹

School of Economics and Management, South China Normal University, Guangzhou 510006, China

²

Faculty of Information Technology, Macau University of Science and Technology, Macau 999078, China

³

School of Cyber Science and Engineering, Southeast University, Nanjing 210096, China

^*

Author to whom correspondence should be addressed.

Electronics 2021, 10(23), 2921; https://doi.org/10.3390/electronics10232921

Submission received: 26 September 2021 / Revised: 16 November 2021 / Accepted: 18 November 2021 / Published: 25 November 2021

(This article belongs to the Section Computer Science & Engineering)

Download

Browse Figures

Versions Notes

Abstract

:

With the advent of the new media mobile Internet era, the network public opinion in colleges and universities, as an extension of social network public opinion, is also facing a crisis in the prevention, control, and governance system. In this paper, the Fiddler was used to collect the comments and other relevant data of the COVID-19 topic articles on the WeChat Official Accounts of China’s top ten universities in 2020. The BILSTM_LSTM sentiment analysis model was used to analyze the sentiment tendency of the comments, and the LDA topic model was used to mine the topics of the comments with different emotional attributes at different stages of COVID-19. Based on sentiment analysis and text mining, entities and relationships in the theme graph of public opinion events in colleges and universities were identified, and the Neo4j graph database was established to construct the sentimental knowledge graph of the pandemic theme of university public accounts. People’s attitudes in university public opinion are easily influenced by a variety of factors, and the degree of emotional disposition changes over time, with the stage the pandemic is in, and with different commentators; official account opinion topics change with the development of the time stage of the pandemic, and students’ positive and negative comment topics show a diverse trend. By incorporating topic mining into the sentimental knowledge graph, the graph can realize functions such as the emotion retrieval of comments on university public numbers, a source search of security threats in university social networks, and monitoring of comments on public opinion under the theme of the pandemic, which provides new ideas for further exploring the research and governance system of university network public opinion and is conducive to preventing and resolving campus public opinion crises.

Keywords:

knowledge graph; sentimental analysis; public opinion analysis; university official account; COVID-19; opinion mining

1. Introduction

WeChat Official Accounts [1] is a social media for WeChat to provide users with information and services. Since its official launch on 23 August 2013, it has quickly become the most popular Internet speech platform, with its advantages of high communication efficiency and strong interactivity. Additionally, as an important position in campus new media, the influence of university official accounts cannot be underestimated. In recent years, the number of unexpected events of university network public opinion has increased, and the occurrence and spread of some events are closely related to the new media public opinion represented by WeChat public numbers, which brings serious challenges to the construction of university public opinion [2,3]. In the face of the sudden global pandemic of COVID-19, social media paid great attention to it, while the student community was greatly affected during this period, which, to a certain extent, affected students’ lives and physical and mental health [4].

The content of comments on official university accounts articles during the pandemic can reflect the attitudes, tendencies, and micro-behaviors of university students during the pandemic, providing a highly present and time-sensitive text corpus for campus opinion research [5]. On the other hand, as the COVID-19 continues to spread, it makes it an important research topic for data analysts and scholars, in general, to use sentiment analysis algorithms [6] and knowledge graph techniques [7] to accurately identify hot comments and pandemic themes, analyze the emotional dynamics of the masses, uncover topics of public concern, and show the spatio-temporal evolution of network public opinion during this pandemic event.

Scholars have conducted extensive research on university student populations in the context of COVID-19, focusing on “education” and “psychology”, such as studying student satisfaction with the quality of videoconferencing during the pandemic [8] and exploring the impact of the pandemic on college students’ academic performance and social mood [9]. Fewer studies have been conducted from the perspective of public opinion analysis. Secondly, in China, scholars have mostly conducted qualitative analysis on public opinion in universities during the pandemic [10], and there are few quantitative research results. Again, analysis of university public opinion through sentimental knowledge graphs, which is an innovative study in the current period of the pandemic explosion, is studied even less by scholars.

In this paper, starting from the time stage of pandemic events, a topic–sentiment knowledge graph model of university public opinion events is constructed, and the topic text information with time attribute and massive public opinion data are analyzed. The main contributions of the university pandemic sentimental knowledge graph proposed in this paper are as follows.

(1): In this paper, the sentiment analysis model BiLSTM_LSTM (bi-directional LSTM (Long Short-Term Memory)) is constructed to improve the correct rate of sentiment tendency recognition, accurately identify the positive and negative sentiment of public comments in universities, and quantitatively determine the degree of sentiment tendency presented by students at a specific time period.
(2): In this paper, the LDA (Latent Dirichlet Allocation) topic mining experimental corpus is segmented according to positive and negative sentiment as well as the development stage of the pandemic and achieves a fine-grained division of the pandemic topic comments on university official accounts to identify the hidden topic information in the positive and negative comments made by students during the pandemic and to achieve the comment topic posture measurement.
(3): In this paper, we use Sankey diagrams, which can show the flow of data, to visualize the themes of the pandemic generated by the LDA topic model, to show the evolution of the themes of comment texts at different stages of COVID-19 in a time-series image, and to observe the overall dynamics of university public opinion.
(4): In this paper, the flexible and efficient Neo4j graph database to achieve the knowledge storage of the official university account COVID-19 sentimental knowledge graph, organizing knowledge in the form of a graph structure.
(5): The Neo4j knowledge graph built in this article uses the Cypher query language, which allows the graph databases to be efficiently queried.

The remainder of this paper is as follows: “Related Works” briefly introduces the research work related to the sentimental knowledge graph of COVID-19 in official accounts of colleges and universities; “Models and Methods” constructs a theoretical model of the sentimental knowledge graph of COVID-19 in official accounts of colleges and universities, introduces the research framework of this paper and conducts experiments and analysis on the techniques and methods involved in a sentimental knowledge graph; “Graphs and Discussion” builds a Neo4j-based knowledge graph of university public opinion; and “Conclusions” gives a summary and the prospects of this paper.

2. Related Works

2.1. Sentimental Knowledge Graph

As smart technology and mobile Internet continue to develop, making the interconnection of everything possible, the data generated by the interconnection is exploding. Finding a way to make use of the data and the connections between it has become an urgent need for all industries. In the past, intelligent analysis tended to focus on each individual, but in the mobile Internet era, in addition to the individual, the relationship between individuals is also an important part of the analysis. Knowledge graphs are dedicated to dealing with relational analysis. A knowledge graph is a multi-relational graph consisting of entities and relations. Each edge is represented as a triple form (head entity, relation, tail entity), also called a fact, indicating that two entities are connected by a specific relation [11]. As a fusion of a knowledge graph and sentiment analysis, a sentimental knowledge graph not only visualizes the research content but also portrays the subjective sentiment of the research subject. Emotion-based knowledge graphs identify user intent using retrieval, matching, or inference within the constructed structured graphs and enable efficient query and retrieval with the information stored in the contextual information. In terms of research on lexicon-based sentimental knowledge graphs, You et al. [12] establish a fine-grained sentimental classification lexicon and semantic role analysis model based on frame semantic theory, construct a sentimental knowledge graph through the semantic annotation of emotions in online reviews, and apply ontology query tools to realize personalized and intelligent queries of emotional knowledge. Xu et al. [13] used movie reviews and emotion words to build a knowledge extraction lexicon using the Stanford Natural Language Processing (NLP) toolkit to annotate the movie review data, clustered the text to form structured knowledge, and finally fused the movie ontology knowledge to generate a knowledge graph of movie reviews containing user sentiment factors. Using a machine learning model, Li [14] used the text of user comments on the topic of “anti-academic misconduct” on Weibo as data and used the Naive Bayesian Model to classify the data to obtain the users’ sentiment tendency, and combined this with the influence of user nodes to build a sentiment map of university opinion on social media. Again, using Weibo as the research platform but using a deep learning model, Xie et al. [15] used a Skip-gram model to obtain a word embedding matrix, a BiLSTM model for sentiment classification, and Neo4j to construct a knowledge graph of user comments by identifying the relationships between different users.

A sentimental knowledge graph presents complex knowledge domains through data mining, information processing, knowledge measurement, sentiment value calculation, and graphical drawing to reveal the dynamic development of emotion in a specific knowledge domain. Theoretical research and practical applications are expanding and achieving good results, and scholars are constantly optimizing the results of sentiment analysis to improve the accuracy of the sentimental knowledge graph. However, there are still few studies focusing on the ontology of the sentimental knowledge graph. Most scholars take sentiment analysis as an experimental method and only apply the knowledge graph to the visualization of experimental results or apply part of the technology extraction in the knowledge graph to the sentimental analysis model [16,17], without innovatively integrating the two.

2.2. COVID-19 Analysis in Colleges and Universities

With COVID-19 spreading fiercely around the world in the first half of 2020, universities faced unprecedented challenges as public places where people gather. Scholars have also paid attention to the impact of the pandemic on colleges and universities and published relevant research papers. In general, the research on the topic of the pandemic in colleges and universities mainly focuses on the psychological and mental health of students, online education, and public opinion in colleges and universities. Tasso et al. [9] noted a positive correlation between students’ academic frustration and mental health symptoms as a result of the significant educational and psychological impact received by university students in the pandemic. Han et al. [18] collected questionnaires from 17,876 university students and used chi-square tests and multivariate logistic tests to determine the association between pandemic experience and anxiety detection, showing a detection rate of 18.2% for university students with anxiety disorders. On the other hand, students’ irrational cognition and legal awareness are weak, while the network society is full of false rumors and the lack of an all-round monitoring mechanism for network public opinion makes it easy for students to immerse themselves in the network world and be influenced by various negative comments and ideas [19]. Online education is particularly important as a link between schools and students, and Mukhtar et al. [20] conducted a study with a sample of staff and students in the form of group interviews to explore the advantages and disadvantages of online learning for both teachers and students. Sorour et al. [21] investigated the effectiveness of a hybrid modelling approach using cloud computing services (CCS) and virtual reality (VR) in a virtual cloud learning environment (VCLE) system in a closed educational context to try to help and address the key challenges and factors affecting e-learning systems for computer maintenance courses during the pandemic. In addition, the development and response to public opinion in universities during the pandemic has also become a focus of attention [22]. Yao et al. [23] conducted social network and semantic network analysis and sentiment analysis for online teaching topics by selecting educational online opinion data from social media sites such as WeChat and Weibo.

In summary, scholars have conducted many studies and achieved many results in the fields of the sentimental knowledge graph and topic analysis of university pandemics, but few scholars have applied the above two analysis methods to the study of university pandemic public opinion. In addition, the analysis of public opinion in universities lacks a comprehensive consideration of the characteristics of various aspects of university public opinion groups. In this paper, we use the sentiment analysis model BiLSTM_LSTM and the LDA topic model to experiment with public comment data from representative, official university WeChat accounts and import the experimental results into the Neo4j graph database to build a topic–sentiment knowledge graph. The integration application of university public opinion and knowledge graph is realized, making the sentimental knowledge graph an effective means of data mining and knowledge discovery in university knowledge management. It provides certain guidelines for the public opinion supervisory department to understand the control points, evolutionary trend analysis, emotion guidance, and monitoring tools of public opinion in colleges and universities events; it has certain theoretical and practical guidance significance to promote the construction of a university network public opinion monitoring platform.

3. Models and Methods

Through the theoretical elaboration above, this chapter first constructs a theoretical model of a topic–sentiment knowledge graph of public opinion events in colleges and universities, and then elaborates on the data collection, sentiment analysis, and topic mining methods and techniques in the model; through the methods and technical experiments, it provides data sources for the construction of sentimental knowledge graph in the next chapter and also provides multiple perspectives for thinking about university public opinion analysis.

3.1. Theoretical Model Construction

To analyze the overall tendency of university students and the topic and sentiment posture of public opinion in colleges and universities during COVID-19, a public opinion in colleges and universities topic–sentiment knowledge graph model was built. The structure of the model is shown in Figure 1, which includes the following four parts: data collection and pre-processing, BiLSTM sentiment analysis, LDA topic text mining, and Neo4j knowledge graph construction.

The steps are as follows: (1) Obtain the article information and students’ comment data through the packet-crawling tool Fiddler and clean the duplicate and wrong data. (2) Use the “Chinese-word-vectors” [24] pre-trained by researchers from the Institute of Chinese Information Processing of Beijing Normal University and the DBIIR (Database and Intelligence Information Retrieval Lab) Laboratory of Renmin University of China, and the GitHub Chinese sentiment analysis corpus “online_shopping_10_cats” [25]. The BILSTM_LSTM deep learning model was selected to train the sentiment analysis model based on the sentiment corpus, and the sentiment value of the comment content was calculated by using the model. (3) Use the word list extended by the deactivated thesaurus of the Machine Intelligence Laboratory of Sichuan University and the thesaurus of Baidu [26], use the jab word splitting tool to split all comments, remove deactivated words, build a topic matrix by LDA model, determine the number of topics according to the confusion curve to extract topic content, and calculate the topic relevance, extract topic labels according to the topic word weights and pleasing visualization results. (4) Identify entities based on the article and student comment data, further identify inter-entity relationships based on entities, import entity nodes and relationship edges to the Neo4j graphical database, store and map the knowledge graph of public opinion in colleges and universities, and query the experimental results through the cypher language.

To achieve the refined event evolution analysis described in this paper, the study phase was divided into four key periods based on Fighting covid-19 China in action [27] (T1: “Initial containment period of the pandemic” from 21 January to 20 February 2020; T2: “Important effectiveness period for pandemic prevention and control” from 21 February to 17 March; T3: “Consolidate the effective period of the pandemic” from 18 March to 28 April; T4: “Normalization period for pandemic prevention and control” from 29 April to 31 December), as shown in Figure 2.

3.2. Data Collection and Pre-Processing Based on Fiddler

Fiddler [28] is an HTTP protocol debugging proxy tool that inspects HyperText Transfer Protocol over SecureSocket Layer (HTTP/HTTPS) traffic to and from browsers and desktop applications. As a commonly used man-in-the-middle packet capture tool, it can access everything needed to create a new request, communicating with the web server precisely so that only the required resources are invoked. Its working schematic is shown in Figure 3.

In this paper, according to the watermelon data “2020 first half-year Chinese universities public number influence ranking”, the Chinese universities ranked in the top 10 were selected as the subject of the study, the search was conducted in the public number history message with “pandemic” as the keyword, the articles with the time interval from 1 January 2020 to 31 December 2020 were selected, and the list of pandemic-themed articles was obtained, and the data were collected by using Fiddler to grab the HTML pages. First, the attributes of the original HTML data set, such as title, publication time, and comment content, were collected; then, we screened and eliminated user blank comment data; finally, through browsing, screening and repeated checking, 1163 articles were published on the official account of 10 universities with the pandemic as the theme and 22,134 comments were obtained. The format of the experimental data is shown in Appendix A: Table A1.

3.3. Sentiment Analysis of Official Account Commentary in Time Context Based on BiLSTM_LSTM

3.3.1. BiLSTM_LSTM

RNN has achieved remarkable results in processing continuous data for sentiment analysis, but its gradient disappearance problem makes it extremely problematic in several aspects similar to efficient learning and tuning hyperparameters. The LSTM, on the other hand, is a special recurrent neural network designed to address the long-term dependence of RNNs by adding memory units to each unit of the hidden layer making the memory information in the time series controllable and better able to predict sentiment through the neural network [29]. On the other hand, BiLSTM is useful in scenarios where input context is required, especially for tasks such as sentiment classification. In BiLSTM, information flows not only from back to front, but also from front to back using two hidden states. This allows the learning capability of the LSTM to be further improved to better understand contextual information [30,31]. Considering that the advantage of LSTM is that long-term memory functions can be achieved by connecting previous and present contexts, the advantage of BiLSTM is that past and future contexts can be extracted by considering both directions of the information flow [32]. Based on this, this paper proposes to add a bidirectional LSTM network to the LSTM network and use the BiLSTM_LSTM model for training and the prediction of sentiment analysis. The parameter information of the model is shown in Table 1 and the architecture is shown in Figure 4: It consists of a word embedding layer, a BiLSTM layer, an LSTM layer, a fully connected layer, and an output layer. The following is described in detail.

Word embedding layer. The embedding layer converts each unique index into n-dimensional real-valued vectors, which are stacked together to form the embedding matrix, namely the word vector matrix [33]. The word vector matrix representation of input comment information is as follows:

R = [\begin{matrix} \begin{matrix} r_{11} & r_{12} & \dots \\ r_{21} & r_{22} & \dots \\ ⋮ & ⋮ & ⋮ \end{matrix} & \begin{matrix} r_{1 m} \\ r_{2 m} \\ ⋮ \end{matrix} \\ \begin{matrix} r_{n 1} & r_{n 2} & \dots \end{matrix} & r_{n m} \end{matrix}]

(1)

BiLSTM layer. The BiLSTM model consists of two LSTM networks capable of reading input comments from both forward and backward directions [34]. The forward LSTM processes information from left to right, while the backward LSTM processes information from right to left. The model takes into account both previous and future information, thus processing contextual information efficiently.
LSTM layer. Unlike the BiLSTM layer, the LSTM layer does not return sequences, only the final result.
Fully connected layer and output layer. The feature information extracted by the above process is used in the fully connected layer to generate feature values for sentiment classification. The fully connected layer acts as a classifier in the overall neural network model. The results are output in the output layer using a sigmoid activation function, which translates the output of the fully connected layer into approximate probability values for each sentiment polarity category.

3.3.2. Analysis of Students’ Emotional Evolution in Time Context

Through repeated experiments and tests, the BiLSTM_LSTM model constructed in this paper uses the “Chinese-word-vectors” Baidu Encyclopedia Word and Ngram pre-trained word vector [24], and the GitHub sentiment corpus collection “online_shopping_10_cats” with 62,774 entries to train the sentiment model [25]. Some of the core code for sentence feature extraction and model construction is given as Algorithms 1:

Algorithms 1. Sentence feature extraction and model construction.

num_tokens = [len(tokens) for tokens in train_tokens]
num_tokens = np.array(num_tokens)
max_tokens = np.mean(num_tokens) + 2 * np.std(num_tokens)
max_tokens = int(max_tokens)
np.sum(num_tokens < max_tokens)/len(num_tokens)
def reverse_tokens(tokens):
text = ‘‘
for i in tokens:
if i != 0:
text = text + cn_model. index2word[i]
else:
text = text + ‘ ’
return text
num_words = 50,000
embedding_matrix = np.zeros((num_words, embedding_dim))
for i in range(num_words):
embedding_matrix[i,:] = cn_model[cn_model.index2word[i]]
embedding_matrix = embedding_matrix.astype(‘float32′)
np.sum(cn_model[cn_model. index2word [333] == embedding_matrix [333])
embedding_matrix.shape
train_pad = pad_sequences(train_tokens, maxlen = max_tokens, padding = ‘pre’, truncating = ‘pre’)
train_pad[train_pad >= num_words] = 0
train_target = np.array(train_target)
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(train_pad, train_target, test_size = 0.1, random_state = 12)
model = Sequential ()
model.add(Embedding(num_words, embedding_dim, weights = [embedding_matrix], input_length = max_tokens, trainable = False))
model.add(Bidirectional(LSTM(units = 64, return_sequences = True)))
model.add(LSTM(units = 16, return_sequences = False))

To ensure the accuracy of sentiment recognition, the model divided the data set into 90% training set and 10% test set and then extracted 10% of the data from the 90% training set as cross-validation samples. After training the model for 10 iterations, the final training set accuracy value was 98.3% and the val_accuracy value was 90.8%; the test set accuracy was 89.6%. Therefore, the sentiment classification model constructed using the BiLSTM network in this paper works well. Some of the results are shown in Table 2:

The distribution of sentiment tendency based on sentiment values is shown in Figure 5. The vertical coordinate is the sentiment tendency of each comment, with larger values closer to 1 indicating positive sentiment and closer to positive sentiment, and smaller values closer to 0 indicating negative sentiment, where a sentiment tendency of <0.5 is judged as negative by the model. The horizontal coordinate indicates the month when the comment was published. The density of data points indicates the number of comments.

According to Figure 5, the degree of emotional tendency changes with time, with the stage of the pandemic and with different commentators, indicating that people’s attitudes in university public opinion are easily influenced by various factors, and their emotional tendency is variable. It is also evident that during the early months of the pandemic, January–March, there was a major outbreak of commentary, a sharp divergence of emotional tendencies as people were restricted from travelling and isolated at home during the Chinese New Year period, there was a high level of interest in news about the pandemic-related reports, and students made their comments widely available under public university articles. The end of February/beginning of March is the start of the academic year for students, and due to the pandemic, the opening of many universities had been postponed. In March–April, a period after the Chinese New Year but before the start of the academic year, when universities and colleges were unable to start their regular classes in the form of online classes, the attention of public articles on the topic of the pandemic remained high. This was followed by the start of the academic year in late April and early May, and with the guidance of social media, the level of concern and worry about the outbreak among students was less than before. However, during the July college graduation season, graduates were worried about employment, with extremely negative sentiment outweighing extreme positive sentiment. Additionally, then under the influence of relevant national policies and social regulation, as well as after the new school season in September, graduates had mostly embarked on their normal work or study paths, while new students were more concerned about school news articles, and overall, the attention to pandemic-themed articles had declined, with a balance of positive and negative comments. However, after November, the number of confirmed indigenous cases was on the rise again with the re-emergence of indigenous outbreaks in Tianjin, Shanghai, and Anhui Province in China; on the other hand, the global outbreak had seen wave after wave of climaxes, with the total number of cases reaching record highs. On 18 December, the world saw the biggest spike in confirmed cases thus far in this year, with 731,264 new cases in a single day. This also heightened concern among university students about the outbreak, but the overall sentiment was steady, with slightly more negative than positive sentiment among students during the global outbreak in December.

3.4. Analysis of Topic Content Evolution in Time Context Based on the LDA Topic Model

3.4.1. LDA Topic Model

The LDA topic model is a representative model in text mining, which can identify topics in documents and mine the hidden information in the corpus to extract potential topics and has been widely used in text corpus topic discovery [35,36]. Applying the topic model method to the topic mining of the university network opinion text corpus can help to understand the main ideas among the huge number of opinions at the text level according to the idea of topics [37,38]. Based on this, this paper uses the LDA topic model for topic mining of university public comment data.

The LDA model’s topic generation process: selecting a topic with a certain probability and selecting a word from a topic with a certain probability:

p (w o r d | d o c) = \sum_{t o p i c} p (t o p i c | d o c) * p (w o r d | t o p i c)

(2)

The structure of the LDA model is shown in Figure 6.

In Figure 6,

M

represents the number of documents,

N

represents the number of

N_{M}

words per document, and

k

represents the number of potential topics. There are

V

non-repeatable terms in the vocabulary; there are

M

documents

d_{1}

,

d_{2}

, …,

d_{M}

in the corpus, which is composed of

N_{i}

repeatable words; there are

k

topics

T_{1}

,

T_{2}

,…,

T_{k}

in the corpus.

α

and

η

are the parameters of the prior distribution;

θ

is the topic distribution for each document; for the

i

th document

d_{i}

, under the topic distribution

θ_{i}

, the topic

z_{i j} = k

can be determined;

β

is the word distribution for each topic; the selection of

β_{z i j}

from

z_{i j}

indicates that the term is determined from the word distribution

β_{z i j}

, resulting in the observation

w_{i j}

.

3.4.2. Analysis of the Evolution of Topic Content in Time Context

In general, the smaller the study granularity setting is, the more accurate the research results are. Firstly, all the review data were divided into positive comments and negative comments according to the results of sentiment analysis; secondly, the paper further divided the positive and negative comment data into four periods, respectively. Eight datasets divided by pandemic stage and positive and negative sentiment were imported, and a thesaurus and a deactivated thesaurus [26] were introduced for Chinese word separation using the jieba word separation tool. As the merit of the topic representation result is affected by the number of topics, to determine the optimal number of topics, this paper introduced a confusion index. The lower the degree of confusion is, the better the clustering results are. The perplexity is calculated as follows:

p e r p l e x i t y = e^{\frac{- \sum \log (p (w))}{N}}

(3)

where

p (w)

is the frequency of each word that appears in the text and

N

refers to all words that appear in the text.

The LDA topic model perplexity was calculated via python and a perplexity trend graph was generated as shown in Figure 7.

After determining the optimal number of topics through the perplexity indicator, the weights of the topic terms and subject terms were output and the results of some of the runs are shown in Table 3.

To more accurately and objectively determine the names of topic categories, topics were visualized using the LDA topic model visual pyLDAvis library, as shown in Figure 8. The combination of topic word weights and the visualization of interactive topics was tested on eight datasets to determine the final topic labels for positive and negative comments at different stages of the pandemic.

After acquiring the topics discussed at different stages of COVID-19 with positive and negative tendencies, further clarifying the information flow of public opinion topics at different time stages helped university staff to grasp the dynamic situation of public opinion development. Sankey diagrams can visualize the flow of information between subjects, and the amount of information flow in the diagram is positively related to the frequency of co-occurrence and similarity of different topics. A Sankey diagram showing the evolution of topics is drawn using PowerBI, where the width of the line represents the similarity value and the size of the word block represents the closeness of association with other topics, as shown in Figure 9 and Figure 10.

Figure 9 shows the topic evolution diagram of positive comments on pandemic topic articles on public accounts of universities. During the T1 period, students paid attention to the following topics: first, cheer for Wuhan, Hubei, and Cheer for China; second, paying high tribute to the brave medical staff on the front line of COVID-19 and hoping for their safe and healthy return; third, COVID-19 is coming, and the people are united. During the T2 period, students were proud of their Alma mater for volunteering to be on the front line of the pandemic. For example, the medical team of Peking University rushed to Wuhan to help, and the research institute of Tsinghua University helped to defeat the pandemic with science and technology. Students taking online classes at home also received warm comfort from the school and tips on pandemic prevention and control. During the cherry blossom season, students looked forward to the revival of Wuhan and the blooming cherry blossoms at Wuhan University. During the T3 period, China’s remarkable achievements in fighting the pandemic made the people proud. The medical workers who helped Hubei returned home one after another, and the students were very excited about the hero’s safe return. Some colleges and universities announced the opening of the semester in April, and students expressed their anticipation. Those who were still taking online classes also enjoyed the fun of online classes. The graduation season had also become a hot issue for students amid the pandemic. As the T4 period was normalized for a long time, many topics were discussed. Thanks and respect were extended to the heroes who fought against the pandemic. Wishes were also expressed on the warm Mother’s Day and for the graduation season to send their own blessings. The new term also saw a keen interest in their own studies, the dedication of their teachers, and the development of the school. At the end of the year, they expressed a rosy vision of leaving the past behind and embracing a brighter future.

The evolution of topics for negative comments on pandemic-themed articles on university official accounts is shown in Figure 10. The T1 period was frightened by online rumors, and students were both concerned but clear about their attitude of “do not spread rumors or believe them”. In the early stages of the COVID-19, there were reports of frontline workers infected with the virus and students prayed and hoped for the triumphant return of the health care workers fighting the pandemic. Many students also volunteered for work related to the pandemic, showing their responsibility as college students. On the other hand, college students come from all over the world, and the opening of the semester inevitably affected the development of the pandemic; therefore, the overall planning of the school was particularly important. During the T2 period, some students contributed to the pandemic on the front line, while others actively responded to the country’s call to stay at home and not attend parties. The words about cheering for teachers and the school were mostly neutral comments, expressing their close concern for the Alma mater. On the other hand, China made progress in fighting the pandemic, but students were informed to postpone the start of school. Students facing graduation also worried about employment and further studies. During the T3 period, there was an active response to the arrangements of schools, and active cooperation with the pandemic prevention work of schools. Despite the anxiety of opening and graduation, students firmly said their Alma mater stood with Wuhan, Hubei. At the same time, students expressed their condolences for those who lost their lives in the fight against the pandemic and hoped that the country would be well. During the T4 period, students expressed their strong concern for their classmates and school and maintained a neutral attitude toward returning to school and graduation. They had worries about the future but also dared to meet the challenges in the future.

4. Graphs and Discussion

4.1. Neo4j Graph Database Design

Neo4j [39] is a local graphical database that stores structured data on the web rather than in tables, with a flexible structure for storing data. The data model of the Neo4j graph database consists of the following four main components: nodes, attributes, labels, and relationships. Relationships are used to describe node-to-node relationships and are what sets graph databases apart from traditional databases, allowing them to describe net-like relationships that are difficult to represent in a normal row database. Neo4j provides Cypher, a graph database query language, which is a powerful, graph-optimized query language that understands and makes use of the data stored in Neo4j. This also allows graph database nodes to be added, deleted, and changed at any time with great flexibility and scalability, and is several orders of magnitude faster at performing complex join queries than other traditional databases.

Neo4j graph databases represent the connections between entities as they would in the real world, without the logical transformations that relational databases require. How the database structure is built depends on business requirements and performance testing, and there is no standard schema; therefore, its data modelling approach is very flexible. The graph database structure diagram used in this paper is shown in Figure 11. The graph database design consists of five entities and seven types of relationships between the entities. A table of specific node and relationship definitions can be found in Appendix A: Table A2 and Table A3.

4.2. The Construction of Neo4j Knowledge Graph of Public Opinion in Colleges and Universities

This article imports nodes and relationships via Cypher LOAD CSV. After importing the data into Neo4j, we can obtain the knowledge graph of university public opinion as shown in Figure 12 (Neo4j memory limitation, only some nodes, and relationships are shown). The green circles represent universities, the blue circles represent public articles published by universities, and the yellow circles represent comment content.

4.3. Application of neo4j Topic-Affective Knowledge Graph

4.3.1. Sentiment Retrieval of University Official Account Comments

By analyzing students’ attitudes toward a particular article, that is, by obtaining the distribution of comments on the article, it is possible to understand the thoughts and attitudes of students in higher education and to analyze the reasons for the positive or negative comments, which will help universities to implement timely strategies to maintain a stable online environment.

For example, to query the opening notices issued by universities to check the status of student comments, the query code is as follows:

MATCH (a: article{title:“今天, 开学了! 北航第一批返校的是他们……”})-[]-(b: comment) RETURN a, b

The results of the query are shown in Figure 13, where the blue circles are article titles, the yellow circles are article comments, the red arrows indicate positive comments, and the blue arrows indicate negative comments. It can be seen that students had more positive than negative attitudes toward the start of back-to-school. Through emotion retrieval of the knowledge graph, students’ tendency toward articles on a specific topic can be retrieved to understand students’ attitude towards school decision making, social public opinion, and national policy, which plays an important role in campus supervision and the grasp of public opinion in colleges and universities.

4.3.2. The Source Search of Security Threats in University Social Network

Social networks have a complicated and messy structure and a huge group of users, and the massive information circulating in the network at high speed is difficult to trace and track. In the supervision of public opinion on social networks in colleges and universities, the scope of public opinion management can be greatly reduced by reasoning with the information collected from a large number of user comments, clarifying the information dissemination process of comments–articles–colleges and universities, and finally implementing the source to colleges and universities. This can play a significant role in the regulation of network public opinion.

For example, to find out which universities have the word “dead” in the negative comments of their articles, the search code is as follows:

MATCH (a:school)-[:Publish]-(b:article)-[:Negative_expression]-(c:comment) WHERE c. content =~’.*死.*’ RETURN a, b, c

The results of the query are shown in Figure 14. In Figure 14a, the green nodes are colleges, blue nodes are article titles, and yellow nodes are comments. There were nine university public numbers that included the word “dead” in the negative comments on the article, but the number of comments was small. Further, to analyze a particular comment node, such as the following comment published by the students from Dalian University of Technology: “courage to heal the wounded and rescue the dying, heavier than mount Tai, white angels” the sentiment analysis model can be divided into negative comments, but these can be partially neutral, as the word “death” is not a dangerous word, it is, in fact, in patients with medical staff rescue behavior, praise. Figure 14b expands the purple “comment topic” node and the red “outbreak stage” node by deepening the query on this comment node. By expanding the nodes, the information behind the comments can be better understood and the source of the security threat can be identified.

4.3.3. Public Opinion Monitoring of University Reviews in the COVID-19 Stage

The COVID-19 pandemic broke out at the beginning of 2020, and relevant departments were very alert to the resulting information pandemic in cyberspace, that is, the spread of false, panic, and unstable hazardous information in response to the pandemic. Even though China has entered the post-pandemic period, it still needs to pay close attention to the development of the international pandemic and the potential public opinion risks and do a good job in response and governance. The network public opinion in colleges and universities has the characteristics of fast dissemination and randomness of information dissemination, and the value of college students as the main force of new media is not yet mature. College students are often easily influenced by wrong opinions, which manifests in a large number of negative emotional comments. Therefore, it is particularly necessary to control and monitor public opinion in colleges and universities. Starting from the different stages of the pandemic, this paper explores the attitude of students to the current massive information during the pandemic from the perspective of time, which is conducive to public opinion decision-making in colleges and universities.

For example, to query the commentary on understanding the propensity of students during the T4 period, the query code is as follows:

MATCH p=(:stage{time:’Normalization period for epidemic prevention and control’})-[]-(:comment) RETURN p LIMIT 15000

The results of the query are shown in Figure 15, with the blue areas being negative sentiment comments and the red areas being positive sentiment comments. It can be clearly seen that students’ positive attitudes toward information about the pandemic outweighed the negative attitudes during this period, and the information pandemic in cyberspace was in good condition.

5. Conclusions

In this paper, data were collected through Fiddler on comments and other relevant information on pandemic-themed articles from ten universities’ official accounts. We calculated sentiment values and mapped sentiment tendency distribution for article comments using the built BiLSTM_LSTM sentiment analysis model. According to the interval of emotion value, the positivity or negativity of the comments was determined. The binary comment data set was subdivided according to the four stages of the pandemic, and the eight subdivided data sets were imported into the LDA topic model for topic clustering. According to the confusion index, the best number of topic categories was obtained, and then the topic name was determined by combining the subject word weight and pyldavis visualization results. The Sankey diagrams of positive and negative comment topics were drawn, respectively. Finally, the sentiment analysis results and the results of LDA text mining were integrated, and the nodes and relationships of the Neo4j graph database were determined by identifying entities and relationships, and the CSV data table was imported into Neo4j for data storage and visual presentation, which completed the construction of a public opinion topic–sentiment knowledge graph in colleges and universities. The Cypher statement was used to query the graph database and summarize the application scenarios of the knowledge graph.

Theoretical contribution

Firstly, this study helps to study the influence of new social media on public opinion in universities from the perspective of university public numbers. Using university WeChat public number platforms as the experimental platform and pandemic-themed articles as the experimental data source, it provides a new model for the construction method of topic mapping of public opinion events in universities by introducing the topic–sentiment knowledge graph into the research field of university public opinion analysis. Traditionally, most of the data for research on public opinion in universities come from microblogs, but with the development of social network media and the arrival of the 5G era, students are using WeChat more frequently. At the same time, WeChat Official Accounts in colleges and universities are growing rapidly, and the campus ecology in the “Internet+” era has led to the rapid development of public numbers in colleges and universities, which have also become one of the main platforms for students to express their opinions and are an important platform for researching public opinion in colleges and universities. The data obtained in this paper based on this platform are more representative.

Secondly, this study attempts to adapt and improve on the traditional LSTM network, and the obtained BiLSTM_LSTM model for sentiment analysis is good in terms of accuracy, which also lays the foundation for the subsequent construction of a sentimental knowledge graph.

Once again, this study’s dataset, which combines pandemic stages and sentiment tendencies for segmentation, also plays a significant role in the LDA mining of this paper. The LDA topic model finally yields a trend evolution diagram of pandemic article comment topics, which deeply explores the intrinsic relationship between comment contents and also provides material for the prediction and research of university public opinion in the new media era.

Finally, this study uses an efficient and convenient graph database to build a knowledge graph, circumventing the usual problem of using large amounts of structured data to generate a static knowledge graph for opinion analysis, which makes subsequent changes costly. By querying the edges (relationships) of an entity to quickly obtain another entity linked to it, the complex association operations of various tables are eliminated, and the relationship query is more convenient and significantly more efficient. It can also be used in subsequent research to build opinion monitoring systems.

Regulatory and policy implications

For campus administrators, this study can timely track and analyze the development of this pandemic topic in colleges and universities, which is conducive to improving the knowledge management ability of colleges and universities regarding network public opinion in the new media era, fully exploring the explicit and implicit values of students’ comments, and providing college staff with methods to better organize, manage, and understand students’ opinions, which, in turn, is conducive to promoting sound relevant campus network public opinion guidance mechanisms. Campus policymakers can make use of the Neo4j topic–sentiment knowledge graph to promote the formation of an informative and professional management team, rationalize the planning and construction of campus network platforms, correctly and effectively lead students to use the Internet, and cultivate their media literacy.

Limitations and future prospects

At the same time, this paper still has certain limitations in the research process: first, the use of open-source annotated sentiment quotes in the training of the sentiment analysis model has biases in matching with university students’ comments, and the goodness of the annotation affects the final classification results; second, the optimization of the Neo4j graph database is not sufficient, which could easily result in memory overflow in subsequent query operations; third, the expression of the evolutionary dynamics of the topic of the Sankey diagram visualization is not displayed in the dynamic sentimental knowledge graph.

Based on the topic–sentiment knowledge graph of university public opinion events constructed in this paper, the relationship between the degree of emotional tendency and universities, regions, and the number of likes can be explored through the method of correlation analysis; it can be combined with other machine learning techniques to make a portrait of the users’ public opinion on unexpected events based on university students’ information and to provide theoretical support for accurate public opinion guidance; the knowledge map function can also be improved to realize an early warning system for public opinion Q&A.

In this paper, the focus of research is limited to university public opinion, but without limits, the interesting directions for future research may be (1) the use of topic–sentiment knowledge graphs by FMCG companies to analyze consumers’ hidden needs and develop personalized push services; (2) the realization of accurate health information demand for users of the “Internet + Health” platform; (3) the exploration of potential characteristics of public service personnel and the promotion of matching job requirements with personnel characteristics; and (4) profiling the behavior of users in multiple social environments. To do this, we need a multi-disciplinary and diverse dataset to test our approach and explore empirical insights from multiple disciplines and research areas.

Author Contributions

Methodology, visualization, and writing—original draft preparation, X.L.; writing—review and editing and supervision, Z.L.; data curation, Y.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Data from the “pandemic” themed WeChat official accounts.

University Name	Article Title	Article Publication Time	Username	Comment Post Time	Comments	Likes
Peking University	General secretary Xi Jinping sent five letters to Peking University.	18 March 2020	The suit.	18 March 2020 12:18	Go China! Go Peking University! [Fist] [Fist]	24
Peking University	General secretary Xi Jinping sent five letters to Peking University.	18 March 2020	Big North Shirakawa	18 March 2020 12:26	Come on [Love] You can definitely [love] The power of youth can hold up China	34
Peking University	Urgent help from the Middle East! Peking University and Qatar University shared their experience in combating the pandemic through a video link.	21 March 2020	Chen	21 March 2020 8:12	“Snow melts on the Liaohe River, flowers bloom on the Rich Mountain; With the same breath and branches, looking forward to the coming of spring.” We hope to work together to stop the pandemic Looking forward to the end of the pandemic	8
Peking University	Urgent help from the Middle East! Peking University and Qatar University shared their experience in combating the pandemic through a video link	21 March 2020	EASYMONEY	21 March 2020 9:34	Hope everyone is safe	11
Peking University	Peking University academicians, professors, outstanding students on TV together, come to watch this immortal class meeting!	24 March 2020	Wang xi	24 March 2020 20:41	Peking University is really full of talents. I hope I can meet with Peking University in 2022	20
Peking University	Peking University academicians, professors, outstanding students on TV together, come to watch this immortal class meeting!	24 March 2020	ASe thé vert	24 March 2020 20:45	Every flower of Peking University blooms and dedicates in its proper place	14

Table A2. Knowledge graph entity types.

Entity Type	Meaning	Number of Entities	Example
school	University name	10	Tsinghua University
article	Article title	1163	The Peking University research team has for the first time confirmed that human respiration is an important emission transmission mode of novel coronavirus
comment	Article comments	22,134	We 60 million people in Hubei sincerely thank you!
topic	Comment topic	40	Hero triumph
stage	Pandemic stage	4	Initial containment period of the pandemic

Table A3. Knowledge graph relationship types.

Entity Relationship Type	Meaning	Number of Relationships	Example
Publish	Universities publish articles	2326	Wuhan University\|Publish\|Official announcement! Announcement of the campus policy during the opening of Wuhan University
Belong_to	The publishing time of the article belongs to the stage	2326	Zhejiang University held the 36th Teacher’s Day and Advanced Commendation Meeting\|Belong_to\|Normalization period for pandemic prevention and control
article_P_comment	Positive comments in the article	22,828	Peking University, on the front line!\|Positive_expression\|Come on, Wuhan, Come on, Peking University
article_N_comment	Negative comments in the article	21,414	A message from Peking University Senior three students: Adolescence has a period; we will not see each other!\|Negative_expression\|Guard against arrogance and rashness, do not forget your original intention, only 123 days left, coming to you!
stage_P_comment	Comments on positive guidance during the pandemic stage	22,828	Initial containment period of the pandemic\|Positive_guidance\|I hope that the pandemic will end soon and every Chinese will be safe and happy
stage_N_comment	Comments on the negative impact of the pandemic stage	21,440	Initial containment period of the pandemic\|Negative_impact\|A tribute to the retrograde heroes of Peking University, the responsibility of the country and the nation.
Comment_topic	Topics mined from comments	225,868	Resolutely implement!\|Mining\|Schools obey uniform arrangements

References

WeChat Official Accounts. Available online: https://mp.weixin.qq.com (accessed on 20 November 2021).
Chen, X.; He, Y. Research on negative network public opinion communication in universities based on the motivation theory under the new media environment. J. High. Educ. Manag. 2020, 14, 105–116. [Google Scholar] [CrossRef]
Huang, S.; Si, W.; Mu, T. Innovation and Research of Online Public Opinion Control and Guidance Mechanism in Universities in the Self-media Era. Inf. Sci. 2021, 39, 62–67+91. [Google Scholar]
Kuru Alici, N.; Ozturk Copur, E. Anxiety and fear of COVID-19 among nursing students during the COVID-19 pandemic: A descriptive correlation study. Perspect. Psychiatr. Care 2021. Advance online publication. [Google Scholar] [CrossRef]
Zhang, C.; Ma, X.; Zhou, Y.; Guo, R. Analysis of Public Opinion Evolution in COVID-19 Pandemic from a Perspective of Sentiment Variation. J. Geo-Inf. Sci. 2021, 23, 341–350. [Google Scholar]
Imamah; Rachman, F. Twitter Sentiment Analysis of COVID-19 Using Term Weighting TF-IDF And Logistic Regresion. In Proceedings of the 2020 6th Information Technology International Seminar (ITIS), Surabaya, Indonesia, 14–16 October 2020; pp. 238–242. [Google Scholar]
Wu, J. Construct a Knowledge Graph for China Coronavirus (COVID-19) Patient Information Tracking. Risk Manag. Healthc. Policy 2021, 14, 4321–4337. [Google Scholar] [CrossRef] [PubMed]
Fatani, T.H. Student satisfaction with videoconferencing teaching quality during the COVID-19 pandemic. BMC Med. Educ. 2020, 20, 396. [Google Scholar] [CrossRef]
Tasso, A.F.; Sahin, N.H.; San Roman, G.J. COVID-19 Disruption on College Students: Academic and Socioemotional Implications. Psychol. Trauma-Theory Res. Pract. Policy 2021, 13, 9–15. [Google Scholar] [CrossRef]
Ma, T. The Influence of Emergency Network Public Opinion on Ideological and Political Education in Colleges and Universities and Its Countermeasures—Taking the“COVID-19”Network Public Opinion as an Example. Educ. Explor. 2020, 53–57. [Google Scholar]
Wang, Q.; Mao, Z.; Wang, B.; Guo, L. Knowledge Graph Embedding: A Survey of Approaches and Applications. IEEE Trans. Knowl. Data Eng. 2017, 29, 2724–2743. [Google Scholar] [CrossRef]
You, L.; Lang, Y. Construction and Query Application of Emotion Knowledge Graph Based on Semantic Analysis of Product Reviews. Inf. Stud. Theory Appl. 2018, 41, 132–136. [Google Scholar] [CrossRef]
Xu, Z.; Yu, Z.; Dong, Y.; Yan, W. Research on Constructing the Knowledge Graph Based on Emotional Analysis of Film Review. Comput. Simul. 2020, 37, 424–430. [Google Scholar]
Li, J. The Study of Emotional Map of University Public Opinion Users in social media: A Case of “Anti-academic Misconduct” in Sina Weibo. Inf. Sci. 2020, 38, 100–104. [Google Scholar] [CrossRef]
Xie, T.; Yang, Y.; Li, Q.; Liu, X.; Wang, H. Knowledge Graph Construction for Intelligent Analysis of Social Networking User Opinion. In Proceedings of the International Conference on e-Business Engineering, Cham, Germany, 19–21 October 2020; pp. 236–247. [Google Scholar]
Kumar, A.; Kawahara, D.; Kurohashi, S. Knowledge-enriched Two-layered Attention Network for Sentiment Analysis. arXiv 2018, arXiv:1805.07819v4. [Google Scholar]
Lin, S.; Lin, S.; Wei, J.; Wu, Y.; Liao, X. Emotional classification of combining knowledge graph. J. Fuzhou Univ. (Nat. Sci. Ed.) 2020, 48, 269–275. [Google Scholar]
Han, W.; Xu, L.; Niu, A.; Jing, Y.; Qin, W.; Zhang, J.; Jing, X.; Wang, Y. Online-Based Survey on College Students’ Anxiety During COVID-19 Outbreak. Psychol. Res. Behav. Manag. 2021, 14, 385–392. [Google Scholar] [CrossRef] [PubMed]
Liu, Q. Challenges and Responses of Online Public Opinion Guidance in Universities during Epidemic Prevention and Control. Cuanbo Yu Banq. 2020, 185–188. [Google Scholar] [CrossRef]
Mukhtar, K.; Javed, K.; Arooj, M.; Sethi, A. Advantages, Limitations and Recommendations for online learning during COVID-19 pandemic era. Pak. J. Med. Sci. 2020, 36, S27–S31. [Google Scholar] [CrossRef]
Sorour, S.E.; Kamel, T.M.; Abdelkader, H.E. A Hybrid Virtual Cloud Learning Model during the COVID-19 Pandemic. CMC-Comput. Mater. Contin. 2021, 66, 2671–2689. [Google Scholar] [CrossRef]
Lu, K.; Zhao, L. Analysis and Enlightenment of Public Opinion on Education in Public Emergencies —Taking the Public Opinion on Education in Beijing During the COVID-19 Epidemic for Example. Educ. Meas. Eval. 2020, 15–22. [Google Scholar] [CrossRef]
Yao, L.; Yang, Q.; Miao, L. Public Opinion Events of University Online Teaching: An Analysis with Text Mining from New Media. J. Hulunbeier Univ. 2020, 28, 114–119. [Google Scholar]
Li, S.; Zhao, Z.; Hu, R.; Li, W.; Liu, T.; Du, X. Analogical Reasoning on Chinese Morphological and Semantic Relations. In Proceedings of the Meeting of the Association for Computational Linguistics, Melbourne, Australia, 15–20 July 2018; pp. 138–143. [Google Scholar]
ChineseNlpCorpus. Available online: https://github.com/SophonPlus/ChineseNlpCorpus/ (accessed on 20 November 2021).
Chinese Data Preprocessing Materials. Available online: https://github.com/procectcollection/Chinese (accessed on 20 November 2021).
The State Council Information Office of the People’s Republic of China. Fighting COVID-19 China in Action; Foreign Languages Press: Beijing, China, 2020. [Google Scholar]
Fiddler. Available online: https://www.telerik.com/fiddler (accessed on 20 November 2021).
Sharfuddin, A.A.; Tihami, M.N.; Islam, M.S. A Deep Recurrent Neural Network with BiLSTM model for Sentiment Classification. In Proceedings of the 2018 International Conference on Bangla Speech and Language Processing (ICBSLP), Sylhet, Bangladesh, 21–22 September 2018; pp. 1–4. [Google Scholar]
Xu, G.; Meng, Y.; Qiu, X.; Yu, Z.; Wu, X. Sentiment Analysis of Comment Texts Based on BiLSTM. IEEE Access 2019, 7, 51522–51532. [Google Scholar] [CrossRef]
Kumar, V.; Reforgiato Recupero, D.; Riboni, D.; Helaoui, R. Ensembling Classical Machine Learning and Deep Learning Approaches for Morbidity Identification From Clinical Notes. IEEE Access 2020, 9, 7107–7126. [Google Scholar] [CrossRef]
Elfaik, H.; Nfaoui, E.H. Combining Context-Aware Embeddings and an Attentional Deep Learning Model for Arabic Affect Analysis on Twitter. IEEE Access 2021, 9, 111214–111230. [Google Scholar] [CrossRef]
Hameed, Z.; Zapirain, B.G. Sentiment classification using a single-layered BiLSTM model. IEEE Access 2020, 8, 73992–74001. [Google Scholar] [CrossRef]
Hameed, Z.; Garcia-Zapirain, B.; Ruiz, I.O. A computationally efficient BiLSTM based approach for the binary sentiment classification. In Proceedings of the 2019 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), Ajman, United Arab Emirates, 10–12 December 2019. [Google Scholar]
Du, H.S.; Zhan, B.; Xu, J.; Yang, X. The influencing mechanism of multi-factors on green investments: A hybrid analysis. J. Clean. Prod. 2019, 239, 117977. [Google Scholar] [CrossRef]
Fei, X. An LDA based model for semantic annotation of Web English educational resources. J. Intell. Fuzzy Syst. 2021, 40, 3445–3454. [Google Scholar] [CrossRef]
Song, C.; Guo, C.; Hunt, K.; Zhuang, J. An Analysis of Public Opinions Regarding Take-Away Food Safety: A 2015–2018 Case Study on Sina Weibo. Foods 2020, 9, 511. [Google Scholar] [CrossRef] [Green Version]
Tan, X.; Zhuang, M.; Lu, X.; Mao, T. An Analysis of the Emotional Evolution of Large-Scale Internet Public Opinion Events Based on the BERT-LDA Hybrid Model. IEEE Access 2021, 9, 15860–15871. [Google Scholar] [CrossRef]
Neo4j. Available online: https://neo4j.com/ (accessed on 20 November 2021).

Figure 1. Topic-–sentiment knowledge graph model of public opinion in colleges and universities.

Figure 2. The main line of event development followed in the study of this paper.

Figure 3. Fiddler working schematic.

Figure 4. Block diagram of the proposed model.

Figure 5. Emotional Tendency Distribution Map.

Figure 6. LDA topic model structure diagram.

Figure 7. LDA topic confusion trend chart of the positive sentiment data set during the T4 period.

Figure 8. Visualization results of pyldavis Library.

Figure 9. Topic content evolution map of positive comment.

Figure 10. Topic content evolution map of negative comment.

Figure 11. Neo4j database structure diagram.

Figure 12. A Neo4j-based Emotional Knowledge Graph of the Pandemic Theme of University Official Accounts.

Figure 13. Query the results of the comments on the article “Today, school is starting! The first batch of Beihang back to school are they…”.

Figure 14. (a) Search results for negative comments on university official accounts articles containing “dead”. (b) Deeper search results based on figure (a).

Figure 15. Comment on emotional tendencies during the period of normalization of pandemic prevention and control.

Table 1. Model architecture parameters.

Layer (Type)	Output Shape	Param #
embedding (Embedding)	(None, 113, 300)	15,000,000
bidirectional (Bidirectional)	(None, 113, 128)	186,880
lstm_1 (LSTM)	(None, 16)	9280
dense (Dense)	(None, 1)	17
Total params: 15,196,177
Trainable params: 196,177
Non-trainable params: 15,000,000

Table 2. BiLSTM_LSTM model prediction results.

Comments	Sentiment Value	Sentiment Classification
Tsinghua University! How many people’s faith!	0.020796925	0
The little brothers and sisters who have studied well are all so good-looking [eat melon]	0.9867792	1
Different departure, the same belief: unremitting self-improvement	0.001008	0
When I was an intern in ICU, I had a close relationship with Brother Ding. I am a gentle and considerate doctor. I pay tribute to my seniors and learn from them.	0.704895	1
Return is still a boy [strong] [strong] [strong] [strong] [strong]	0.860122	1
I envy	0.000978112	0

Table 3. Partial running results of LDA topic model.

TOPIC1		TOPIC2		TOPIC3
Give a like	127.3196	South China Normal University	66.1488	Tsinghua	74.43001
Beihang University	94.66321	Pandemic	64.99213	See	36.84994
Love	85.02743	Student	47.41485	Miss	36.34968
Teacher	85.02378	Take on	45.90197	Salute	35.44682
Wuhan University	83.02523	Together	30.76216	Safety	34.50235
Zhejiang University	70.51457	Country	27.04597	Hope	30.35487
Great	56.31093	Good job	22.38388	School opens	28.34165
School	55.97728	Awesome	21.37601	Learning	28.14177
Alma mater	49.29518	Society	20.35642	Return	26.83747
Alumni	36.29545	Big love	19.51961	Graduation	25.70739

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, X.; Li, Z.; Tian, Y. Sentimental Knowledge Graph Analysis of the COVID-19 Pandemic Based on the Official Account of Chinese Universities. Electronics 2021, 10, 2921. https://doi.org/10.3390/electronics10232921

AMA Style

Li X, Li Z, Tian Y. Sentimental Knowledge Graph Analysis of the COVID-19 Pandemic Based on the Official Account of Chinese Universities. Electronics. 2021; 10(23):2921. https://doi.org/10.3390/electronics10232921

Chicago/Turabian Style

Li, Xiaolin, Zhiyi Li, and Yahe Tian. 2021. "Sentimental Knowledge Graph Analysis of the COVID-19 Pandemic Based on the Official Account of Chinese Universities" Electronics 10, no. 23: 2921. https://doi.org/10.3390/electronics10232921

APA Style

Li, X., Li, Z., & Tian, Y. (2021). Sentimental Knowledge Graph Analysis of the COVID-19 Pandemic Based on the Official Account of Chinese Universities. Electronics, 10(23), 2921. https://doi.org/10.3390/electronics10232921

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Sentimental Knowledge Graph Analysis of the COVID-19 Pandemic Based on the Official Account of Chinese Universities

Abstract

1. Introduction

2. Related Works

2.1. Sentimental Knowledge Graph

2.2. COVID-19 Analysis in Colleges and Universities

3. Models and Methods

3.1. Theoretical Model Construction

3.2. Data Collection and Pre-Processing Based on Fiddler

3.3. Sentiment Analysis of Official Account Commentary in Time Context Based on BiLSTM_LSTM

3.3.1. BiLSTM_LSTM

3.3.2. Analysis of Students’ Emotional Evolution in Time Context

3.4. Analysis of Topic Content Evolution in Time Context Based on the LDA Topic Model

3.4.1. LDA Topic Model

3.4.2. Analysis of the Evolution of Topic Content in Time Context

4. Graphs and Discussion

4.1. Neo4j Graph Database Design

4.2. The Construction of Neo4j Knowledge Graph of Public Opinion in Colleges and Universities

4.3. Application of neo4j Topic-Affective Knowledge Graph

4.3.1. Sentiment Retrieval of University Official Account Comments

4.3.2. The Source Search of Security Threats in University Social Network

4.3.3. Public Opinion Monitoring of University Reviews in the COVID-19 Stage

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI