Next Article in Journal
Favourable Short- to Mid-Term Outcome after PDA-Stenting in Duct-Dependent Pulmonary Circulation
Previous Article in Journal
The Role of Social Capital in Predicting Tourists’ Waste Sorting Intentions in Rural Destinations: Extending the Theory of Planned Behavior
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Consumers’ Opinions towards Public Health Effects of Online Games: An Empirical Study Based on Social Media Comments in China

1
School of Computing and Artificial Intelligence, Southwestern University of Finance and Economics, Chengdu 611130, China
2
School of Management Science and Engineering, Southwestern University of Finance and Economics, Chengdu 611130, China
3
Management College, Ocean University of China, Qingdao 266100, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2022, 19(19), 12793; https://doi.org/10.3390/ijerph191912793
Submission received: 11 September 2022 / Revised: 30 September 2022 / Accepted: 2 October 2022 / Published: 6 October 2022

Abstract

:
Online game products have fueled the boom in China’s digital economy. Meanwhile, its public health concerns have sparked discussion among consumers on social media. However, past research has seldom studied the public health topics caused by online games from the perspective of consumer opinions. This paper attempts to identify consumers’ opinions on the health impact of online game products through non-structured text and large-size social media comments. Thus, we designed a natural language processing (NLP) framework based on machine learning, which consists of topic mining, multi-label classification, and sentimental analysis. The hierarchical clustering method-based topic mining procedure determines the compatibility of this study and previous research. Every three topics are identified in “Personal Health Effects” and “Social Health Effects”, respectively. Then, the multi-label classification model’s results show that 61.62% of 327,505 comments have opinions about the health effects of online games. Topics “Adolescent Education” and “Commercial Morality” occupy the top two places of consumer attention. More than 31% of comments support two or more topics, and the “Adolescent Education” and “Commercial Morality” combination also have the highest co-occurrence. Finally, consumers expressed different emotional preferences for different topics, with an average of 63% of comments expressing negative emotions related to the health attributes of online games. In general, Chinese consumers are most concerned with adolescent education issues and hold the strongest negative emotion towards the commercial morality problems of enterprises. The significance of research results is that it reminds online game-related enterprises to pay attention to the potential harm to public health while bringing about additional profits through online game products. Furthermore, negative consumer emotions may cause damage to brand image, business reputation, and the sustainable development of the enterprises themselves. It also provides the government supervision departments with an advanced analysis method reference for more effective administration to protect public health and promote the development of the digital economy.

1. Introduction

With the development of the Internet and computer technology, online games have gradually penetrated public life, rapidly occupying people’s fragmented time and becoming a form of leisure and entertainment. According to the 49th Statistical Report on China’s Internet Development, the number of online game users in China reached 554 million as of December 2021, 35.61 million more than that in December 2020, accounting for 53.6% of the total Internet users. Further, the total number of applications (APP) monitored in the Chinese market was 2.52 million, while the number of game APPs continued to lead with 709,000, accounting for 28.2% of all APPs [1]. According to the 2021 China game industry report, the sales revenue of China’s online game industry reached CNY 290.3 billion [2]. Under the combined action of technology and market requirements, the online game industry has developed rapidly and promoted the prosperity of China’s digital economy. It can be seen that online games have become the economic pillar industry in today’s Internet era. However, online games have changed the social economy and people’s lives and have some potential public health effects [3,4,5]. The definition of public health refers to the population’s health, with more emphasis on the health problems of the public in various societies [6]. Different social public organizations form the online game consumption market [7,8]. For instance, consumer groups play game products, game enterprises provide goods, and government departments regulate the game industry [9,10,11,12]. Consequently, the online game products set off human health and socially affect a wide range of populations, which conforms to the connotation of public health: all the problems related to human health can be understood as public health problems [13].
In 1989 the World Health Organization (WHO) redefined health as a state of complete physical, mental, social, and ethical well-being and not merely the absence of disease or infirmity. Therefore, a large number of papers investigating the impact of online games have focused on people’s health by studying the potential harmful effects, involving mental disorders [14,15,16,17] and addiction [3,18,19,20,21], with physical injuries such as obesity, back and neck pain, orthopedic/joint muscle disorders, vision problems, hearing problems, and physical inactivity [22,23]. In addition, regarding social adaptability and ethical health aspects, some research shows that chronic engagement in online gaming is associated with poor social skills, learning difficulties, aggression, ignored family responsibilities, job dereliction, and gambling [5,12,24,25,26,27,28]. Based on the existing state of research, these viewpoints highlight the possible public health problems that can result from online games. A recommendation will be made as to how preventive aspects can be adequately considered in government regulatory policies [11,29,30] and business marketing strategies [31,32,33], which aim to create a healthy and orderly online game industry market environment in China [8,34].
Notwithstanding the endeavors described, little research focuses on Chinese consumers’ opinions on public health effects data of online games products, which can provide a basis and reference for government policymakers and marketing strategy makers [35]. For instance, Cavero Esponera et al. conducted telephonic interviews to gather public opinion on Spain’s food policies to combat obesity [36]. The study found that different groups showed different levels of support for different food policies. Austhof et al. used a global warming audience segmentation tool, a secondary analysis of the consumer panel’s opinion about a public health online questionnaire survey in the online crowdsourcing market [37]. Devlin et al. studied parents’ opinions towards the water-only drink policy at junior triathlon events [38]. Research has found that parents view policy positively, which warrants further research and policy development to facilitate behavioral change. Xu et al., using an online opt-in panel in China, collected survey responses from a total of 1089 respondents to study the influential factors posited by the spiral of silence theory in shaping people’s perceptions of the overall public opinion towards food safety issues and their willingness to speak out [39].
Nevertheless, with the popularization and development of information technology, social media comments are becoming increasingly important ways for consumers to express their opinions [40,41]. Different from the traditional questionnaires or interview methods to collect opinions, data retrieved from social media can be used to obtain or directly analyze opinions [42] and break the limitations of traditional methods such as quantity, timeliness, period, space span, richness, and objective [43]. Thus, using social media comments to track, analyze, and study available opinions related to public health has become a trend in recent years. Yang et al. [44] used social media comments to explore the causal relationship between public environmental opinions and adaptation strategies. Some research based on social media comments is being applied to analyze public opinion on our responses to the COVID-19 pandemic [42,45,46,47]. The study, data collected from a social media Weibo, examined opinions towards child abuse and made recommendations for the government to improve child protection policies [48]. From the perspective of public opinion about emotion and mental health towards the built environment, we explore geolocated Weibo comments to assess the built environment at different scales and to analyze the correlations between built environment elements and sentiment intensity at different scales [49]. Simultaneously, the digitalization form of comment text on social media relies on computer-assisted analytical techniques. Methods for natural language processing (NLP) from the text aim at automatically analyzing and comprehending human language [50], allowing scholars to easily extract beneficial insights contained in textual datasets while avoiding burdensome computational work [51,52]. Based on social media comments, the diversity of NLP methods is revealed. All six primary NLP methods—text preprocessing, text representation, classification, topic modeling, emotion analysis, and deep learning—are applied in public health studies [44,45,46,47,48,49]. The above studies show that social media comments and NLP methods have given researchers unprecedented opportunities to harvest a wealth of textual data materials and advanced technology for investigating contemporary public health phenomena. However, current research is not yet from a new perspective: adopted social media data to analyze consumers’ opinions towards public health effects of online games studies.
Moreover, some studies further advocate for the effective use of consumers’ opinions on social media comments. It supports government in formulating policies and prompts businesses with adjustment strategies. Accurately reacting to people’s genuine opinions can improve government trust and increase enterprises’ loyalty [53,54]. Given this, our research focuses on consumers’ opinions about online game products bringing about public health effects, using practical and diversified NLP methods, and identifying the semantics of comments on different social media platforms. The study allows us to expand our understanding of online game product-related opinions and points to ways that NLP methodologies can be utilized to improve semantics recognition of social media comments, especially regarding the public health effects of online games. Then, understanding public health topics and emotional tendencies in consumers’ opinions towards online games may have important implications for our social well-being and prevent or mitigate continuing public health effects due to online game production. The findings from this study may help relevant role players in the online game consumer market to target government policies and corporate strategies that can be designed to respond accurately to public opinion. It can promote the online game industry’s sustainable development and virtuous cycle and ensure a win-win situation for public health and economic growth.
The structure of this article is as follows:
  • Introduce research materials, data sources and preprocessing, and three NLP tasks that form the implementation steps and models of this study.
  • Introduce experimental model setting parameters and evaluation indicators.
  • Conduct empirical results and discussions to prove the method’s feasibility and obtain advanced suggestions and means for government management and enterprise operation.

2. Materials and Methods

2.1. Data Sources and Preprocessing from Social Media Platforms

This study uses Python programming to capture 327,505 public social media comments about online games from 1 January 2020 to 15 March 2022. Data required for research are derived from 6 social media platforms, as shown in Table 1. Among them, Weibo is a social platform that emerged in the early days of the Internet (2009), where users can share brief real-time information. It has more than 340 million active users. Zhihu is the first high-quality Q&A community in China’s Internet. Since 2011, there have been more than 2.5 million active monthly paying users, more than 3 million Q&A contents, and more than 3 billion annual visits. Toutiao is the number one recommendation engine based on data mining, with 260 million monthly active users. Tiktok and Kuaishou are the top two in China’s short video publishing platforms, with 620 million and 380 million daily active users, respectively.
Data preprocessing is the key to obtaining an effective experimental database. In this study, the data preprocessing process is divided into six steps. The first step is to unify the fields. Due to the different data formats of each social platform, we unified the field names of valid content, such as comment content, topic, and comment generation time, in the process of collection. The second step is to clean up irrelevant information. The original comment data contain content irrelevant to this study, which will affect the accuracy of subsequent experiments, such as spam, useless links, punctuation, etc. We use web cleanup technology to filter out invalid comments such as ads and spam links. The third step is the correction of typos. Comments on social media are informal language environments, which are prone to typos and need to be corrected. The fourth step is the substitution of synonyms. Chinese has a lot of synonyms. For the convenience of subsequent experiments, synonyms are replaced according to the original comment data to improve the efficiency of the text mining model. The fifth step is word segmentation. We did this by using Jieba, a Chinese phrase widget in the Python computer programming framework that provides a dictionary of Chinese character prefixes. In addition, users can also customize the dictionary to include words not found in the Jieba dictionary. For example, in this study, words related to online games can be added. The new words added according to the study can ensure higher accuracy in subsequent experiments. The sixth step is stopping words processing. This paper uses the stop word list to prevent unnecessary words in the comment sentences. Stop words refer to some words that are automatically filtered out before or after processing natural language text data in order to save storage space and improve the search efficiency of information retrieval, such as “this” and “that”. The raw data are preprocessed through the above steps to form the online game public opinion corpus required by this study, which contains 327,505 comments as shown in Table 1.

2.2. Methods

Natural language processing (NLP) is a field of machine learning to identify important semantic information from large irregular or unstructured databases and extract meaningful knowledge information in the shortest period [55]. Currently, the use of NLP methods to semantically comprehend social media comments has become a popular method in management research [56]. To identify and explore consumers’ opinions about online games causing public health effects from social media comments, this study combined three tasks based on NLP methods of topic modeling, multi-label classification, and emotion analysis. The first task is to mine topics from social comments using the Latent Dirichlet Allocation (LDA) model. The second task is to combine topic mining results with a bidirectional long/short-term memory (Bi-LSTM) model for semantic identification. The third task is emotion analysis based on classified data via the SnowNLP model. According to the results of the three tasks, we can analyze public health-related topics about the online game in social media comments, and the emotional intensity of different topics. The specific methods process of this study is shown in Figure 1.

2.2.1. Topic Modeling Based on LDA Model

Topic modeling is a machine learning technique for learning, identifying, and extracting semantic topics from unstructured text corpora, which divides collections of documents into natural groups to understand them separately [57]. LDA is the most widespread and popular method for fitting topic modeling [58,59]. Some studies have used LDA to explore topics in public health related fields. For example, Paul et al. proposed a disease-related topic model that analyzed over 1.5 million public health tweets [60]. The results suggest that more detailed and comprehensive public health information can be obtained from social media. Botsis et al. developed an LDA model to extract adverse event reports from clinical features of vaccines [61], which facilitates automatic review of adverse events. Hao et al. conducted topic modeling and mining for online comments of doctors [62]. The study found that doctors’ experience, skills, and attitudes to patients were the most important factors affecting patients. Han et al. used topic modeling to mine and analyze public opinion related to COVID-19 in China [42]. As these papers prove, the LDA model has been applied to the research of public health management, and it is also suitable for this study to explore the related topics of public health issues caused by online games.
LDA is a three-level Bayesian probability model, proposed by Blei et al. in 2003 [63]. The relationship between topics and words is estimated by hypothesis in the LDA model. A topic consists of multiple words, while a document consists of multiple topics. Figure 2 presents the model structure of LDA, where nodes represent random variables and directed edges represent conditional probability dependence between random variables. In addition, LDA clearly defines the topic distribution θ and the topic word distribution as a polynomial distribution sampled from the Dirichlet priori with α and β parameters, which makes its generating process more complete and enhances the interpretability and rationality of the modeling process. Assume that corpus D = { d 1 , d 2 , , d N } , N is the contained documents, document d n composed and is expressed as M word d n = w n , 1 , w n , 2 , , w n , M , where each word w n , m ( m { 1 , 2 , , M } )   for the words in the word list, with size V. As a generative model for mining K implicit topics from corpus, LDA defines the process of generating each word w n , m in document d n as the following two steps: first, sampling an implicit topic z n , m according to the topic distribution θ n corresponding to document d n ; then sample the corresponding word w n , m based on the word distribution z n , m for the implicit topic z n , m . Specifically, for the whole corpus D, the LDA model has the following generation process:
  • For each topic K [ 1 , 2 , , K ] :
    i. Sample a word distribution z k ~ D i r ( β )
  • For each document d n , n [ 1 , N ] :
    i. Sample a topic distribution θ n ~ D i r ( α )
    ii. For each word in the Nth document w n , m , m [ 1 , M ]
      a. Sample a topic z n , m ~ Mult ( θ n )
      b. Sample a word w n , m ~ Mult ( z n , m )

2.2.2. Semantic Identification Based on Bi-LSTM Model

Social media comment data identification is a multi-label classification task. Tsoumakas et al. developed a multi-label classification method to automatically identify text [64], as a key technology to process massive text data. The multi-label classification task of text refers to text that is related to multiple labels [65,66]. For example, the comment text “Some online games easily addiction leading to delaying students’ studies, calling for national supervision” contains multiple categories of labels, namely, “online games”, “addiction”, “delay study”, and “supervision”. Moreover, a few multi-label classification models based on deep learning have been proved to improve text identification efficiency, especially in Chinese text [67,68,69,70], among them, including the long short-term memory (LSTM) model [71]. Further, its improved model, such as Bi-LSTM, came to the rescue which effectively leveraged both past and future context information. Through extensive experimentation, researchers have proved that Bi-LSTM is capable of learning way more semantic information than trivial LSTM [72,73]. Therefore, the second task, the multi-label classification Bi-LSTM model, was chosen in this article to identify sematic multiple aspects of public health towards online games in social media comments. In particular, it should be pointed out that the source of classified labels is consumers’ opinion topics that have been obtained, which is the result of the first task.
Bi-LSTM, a two-way short and long duration memory network, uses sequential forward and sequential backward outputs to consider both historical and future information at a given moment. The structure diagram of the Bi-LSTM model is shown in Figure 3. As shown in the figure, the input of each time node is passed to the forward LSTM and the reverse LSTM, respectively. Both of them can generate corresponding outputs according to their own state, and these two outputs will be connected together to the output node of Bi-LSTM to form the final output of the model. Moreover, since the timing sequence processing in different directions eventually forms the output together, its training parameters will be optimized to the appropriate value according to the contribution and loss of node output according to the gradient. In fact, the calculation of Bi-LSTM at every moment is very similar to that of ordinary LSTM, assuming that the input sequence of a comment is w 1 , w 2 , w 3 , w n w n is the n vectorized word. They are in t 1 , t 2 , t 3 , t n , as the input moment of the LSTM model. The calculation process in each LSTM unit can be expressed as:
h n = t a n h ( U w n + V h n 1 )
where, h n 1 represents the hidden layer state of the model at the previous moment of t n . U and V represent the weight parameter matrices of the model input layer and hidden layer, respectively. In the final output, the Softmax function will be used to obtain the probability distribution of classification labels, and its calculation formula is as follows:
y n = s o f t m a x ( h n ) = exp ( h n ( i ) ) j = 1 T exp ( h n ( j ) )
Softmax (.) functions as the activation function. T is the number of category labels. h n ( i ) represents the nth vector component of vector h n , whose total vector length is equal to the number of labels T.

2.2.3. Emotion Analysis Based on SnowNLP Model

Emotion analysis, the important work of opinion mining, aims to analyze people’s emotional situations towards entities such as topics, services, products, organizations, and their attributes [74,75]. Associated with social media data, automatic emotion analysis of the data is helpful to grasp public opinions in order to make corresponding decisions [76,77].
Bayesian classifiers SnowNLP (Available online: https://github.com/isnowfy/snownlp, accessed on 20 August 2022), which is the Scikit-learn Python machine learning library, have been extensively used to analyze the emotions of Chinese texts in research on public health. Zhang et al. used the SnowNLP model to analyze the sentiment data of restaurant reviews, and studied the differences in consumer preferences and emotional changes before and after COVID-19 [78]. Moreover, based on Weibo comment data, the SnowNLP model was used to analyze the emotional evolution of epidemy-related topics and visually display them in time and space [79]. Jia et al. collected Weibo public comments on returning to college after the epidemic and analyzed sentiments based on the SnowNLP method [80]. The results showed that students had positive emotions about going back to college. Zhang et al. used a Python script to collect 65,277 valid Weibo comments to assess the environmental risk perception of urban residents [81]. SnowNLP sentiment analysis was used to measure environmental risk perception in 366 cities in China, and the structural proportion characteristics and temporal and spatial differences of environmental risk perception were analyzed.
Based on the application experience of the above research, this paper also uses the SnowNLP model to analyze the consumers’ emotions towards different public health effects of online games and complete the last NLP task. For government and enterprises to grasp the opinion information related to online games’ domains, the emotion analysis task is specially designed, which is based on the results of multi-label classification, that is, to analyze the emotions from different consumer’s opinion regarding recognition results of online games.

3. Experimental Settings

3.1. Experimental Process Setting

  • The experimental process of the first task combines quantitative and qualitative analysis. First, quantitative analysis aims to retrieve topics about online games in the self-built corpus of this study; we use the LDA model to mine feature words and topics from the entire corpus and to generate a topic model. Subsequently, the important reasoning task of the researcher is to analyze the feature words mined by the LDA model to determine the topics [45,82,83,84], thus the qualitative analysis is divided into four steps. In the first step, we invited three experts to participate in the determination of the topics, one doctoral student majoring in business administration and two doctoral students majoring in public health. In the second step, each expert separately analyzed the feature words corresponding to each topic and labeled the topic according to prior knowledge. In the third step, the three experts discussed the topics marked separately and decided on the number and name of the topics by voting. In the last step, we invited two professors of related disciplines to review the topics and finally determine the results. Moreover, in order to eliminate the correlations and retain the independence of determinate topics, hierarchical cluster analysis was adopted to build a tree diagram for topic grouping. Hierarchical cluster analysis was used to convert a set of possibly correlated observations into a set of linearly uncorrelated components [44]. Based on the results of clustering, the topics were classified into different domains in consumers’ opinions on the effects of online games.
  • The experimental process of the second task is divided into three steps. In the first step, we manually annotate 10% of the corpus data by referring to the topics and feature words determined in the first task to form a data set with marked topics. In the second step, labeled data sets will be randomly divided into training, validation, and test data subsets in a ratio of 8:1:1 for the training and testing of multi-label classification Bi-LSTM model. In the third step, we set the topic mining results of the first task as classification labels and input them into the classification model in order to classify comment data into one or more topics. The classified data set obtained is one of the important results of this study and also the basis of subsequent experiments.
  • In the experimental process of the third task, the classified data set obtained in the second task was used as the data base for emotion analysis. Then, the classified data sets were input into the SnowNLP model, and emotion analysis was performed for each topic separately.

3.2. Model Parameters Setup

After optimization processes in the experiments, the parameters of the three models applied in this study are finally set as follows:
  • The LDA model experimental setup: On the basis of data preprocessing, this article uses a Python third-party module, Scikit-learn, commonly used in machine learning to train LDA subject models, and sets the super parameter alpha value to 0.5 and beta value to 0.1.
  • The Bi-LSTM model experimental setup: In this model, the word embedding vector dimension is 256, label vector dimension is 128, hidden layer dimension is 200, optimizer is Adam, batch size is 32, and dropout is 0.4.
  • The SnowNLP model experimental setup: On the basis of data preprocessing, this article uses a Python third-party module, Scikit-learn, commonly used in machine learning to train the SnowNLP model, and sets the super parameter alpha value to 0.5 and beta value to 0.1.

4. Results and Analysis

4.1. The Result and Analysis of First Task

According to the experiment steps of the first task above, we completed the feature word mining and topic determination related to the public health effects of online games in the self-built corpus. The task results are shown in Table 2. The feature word mining results of the LDA model are shown in the fourth column, in which the ten most important feature words for each topic correspond to words of the highest probability in this topic while having a lower probability in other topics. The third column indicates six topics, including “Physical Health” (T1), “Mental Health” (T2), “Individual Virtue” (T3), “Adolescent Education” (T4), “Commercial Morality” (T5), and “Governmental Regulation” (T6), which are determined by the researchers regarding the output of the LDA model. Then, the first column represents the consumer’s opinion domain clustering, including “Personal Health Effects” (D1) and “Social Health Effects” (D2), which come from hierarchical cluster analysis. Furthermore, the advantage of the topic mining task in expanding new topics, as there are two topics reflected in the last column, namely, “Individual Virtue” (T3) and “Commercial Morality” (T5).
According to results above, it is evident that the effects of online game products are related to physical health, mental health, education for young people, and the demand for government supervision. These topics are also common in existing studies. However, further research found that a considerable part of the corpus was distinctly clustered under different topics. It means that some topics paid no attention to, or were less studied before, but occurred in large numbers in the corpus, indicating that consumers have such concerns; therefore, researchers should bring it into the scope of research. First, the undesirable personal behaviors caused by online games also encourage the discussion of consumers, supported by the features words “Unfilial, Lack Ambition, Violence, Gambling, Bullying ...”, etc. It strongly implies a topic related to personal norms of conduct and morality in consumer reviews, named the “Individual Virtue” (T3) topic in this paper. At the same time, consumers also paid additional attention to game enterprises’ behaviors, which is the topic of “Commercial Morality” (T5). The word list strongly supports this topic, “Company Interests, Lure, Devaluation, Game Recharge, Token, Reward, Game Props, Lottery ...”, etc. It indicates that the current mainstream game manufacturers intend to lower the entry threshold and do not seriously verify the user identity to expand the participation population.
In summary, task 1 mined and found a series of public health topics caused by online games in consumer opinions. The total of six topics with the support word list are shown in Table 2. However, the degree of consumer concern for each topic is different, and the correlation between topics is not the same, which will be revealed in the next task.

4.2. The Result and Analysis of Second Task

We obtained semantic identification results by implementing multi-label classification in the second task, as shown in Table 3. According to the sorting of classification results, this study selected the top 30 for discussion. It can be seen from the table that about 38.38% of comments do not involve any topics. It shows that some consumers may only express their emotions about the public health effects of online games through text comments on social media, such as “I have an aversion to online games”. In other words, in social media comments, about 61.62% concern at least one topic, containing 201,821 comments as valid data for our study. In this paper, we defined each concern about the target topics as a consumer opinion. All content in one comment to support any one topic can only be counted once. However, one comment can contain multiple opinions if its content supports different topics. Thus, in the corpus, the total count of opinions from all valid comments is 388,802 divided by valid data number, 201,821, resulting in an average of about 1.93 opinions per comment. It strongly supports the fact that many Chinese consumers not only express their opinion on public health topics regarding online games but also their understanding of the relationship between different topics through social media comments.
Then, according to the results shown in Table 3, “Commercial Morality” (T5) accounts for 11.89%, ranking first among the labels with a sole topic, representing the hottest and most sensitive topic in consumer opinion. Subsequently, among the paired topics, “Adolescent Education” and “Commercial Morality” (T4 + T5) ranked first, accounting for 3.31%. It illustrates consumers’ discussions about the public health effects of online games, which often links the adolescent education effects to the behavior of online game enterprises. Furthermore, the combinatorial semantic expression of more than two topics, such as “Physical Health”, “Adolescent Education”, and “Commercial Morality” (T1 + T4 + T5, 1.15%) or “Physical Health”, “Mental Health”, “Adolescent Education”, and “Governmental Regulation” (T1 + T2 + T4 + T6, 0.69), occupy a significant proportion. It indicates that in China’s online game consumer market, many consumers have profound and comprehensive cognition towards the public health effects of online game products.
Furthermore, according to the statistics of the occurrence of permutation and combination topics, it is found that “Adolescent Education” (T4) co-occurs with other topics most frequently, with more than 10.16% related to it. Furthermore, the combinatorial semantic expression of more than two topics involving topic 4 amounts to 9.61%. It shows that adolescent education is the key factor that is closely related to other topics. Due to this, it can be revealed that many consumers are most concerned that online game products result in adolescent education issues.
Further analysis, in multi-label classification experiments, all labels and their combinations are mapped into the 2N space of the number of topics N, thus the results in Table 3 can only represent the number of comments classified under a corresponding individual label. In other words, if a comment involves multiple topics, it will be counted repeatedly. Thus, the total number of comments in the statistical result is greater than the corpus. Ultimately, the consumer concern degree of each topic situation is quantified, as visualized in Figure 4.
Figure 4 shows that the two different colors represent different public health effects caused by online games, among which the consumers’ opinions on the “Personal Health Effects” (D1) domain with 133,347 opinions, and 254,855 opinions are related to the “Social Health Effects” (D2) domain. It reflects the strength of consumers’ concerns towards the public health effects of online games in different domains. Obviously, consumers are more concerned about the social health effects caused by online games, including “Adolescent Education” (T4, 99,050), “Commercial Morality” (T5, 95,301), and “Governmental Regulation” (T6, 60,504); especially topic 4, which is the key factor among all topics, co-occurring most frequently with other topics. At the same time, by referring to the feature words of each topic in the experiment results of task 1, it can be interpreted that consumers’ opinions on online game products are mainly reflected in the fact that they hinder the realization of the social goals of adolescent education. Furthermore, it is rooted in the enterprises’ lack of self-discipline and commercial morality. Thus, consumers have no choice but to appeal for governmental regulation.
Conversely, although online games’ adverse effects on personal health are frequently mentioned in the comments, the level of concern is obviously not as high as the social health effects. It is due to the People’s Republic of China’s online game censorship regime, which lets consumers think that the existing online games have less impact on “Individual Virtue” (T3), with minor comments of 33,699. It shows that under the strict censorship of the Chinese government, game design companies and operating platforms are trying to meet regulatory policy requirements for health content. However, the consumer still perceives the physical and mental effects of online game participants. In particular, the “Mental Health” (T2, 52,330) effect is significantly more significant than “Physical Health” (T1, 47,318).
Finally, in task 2, the completed semantic recognition of consumer comments through the multi-label classification model revealed the honest opinions of consumers that online games affect many aspects of public health, further deconstructed the relationship between various health topics, and quantified the degree of consumer concern for each topic. However, consumer opinions include not only the topic expression of the effects of online games on public health but also the perception and expression of emotions. Therefore, in the next task, it is necessary to analyze each topic’s corresponding emotional intensity further and conduct a multidimensional and fine-granular analysis of consumers’ opinions towards online games’ impacts on public health.

4.3. The Result and Analysis of Third Task

The result of the third task, each topic emotion classification experiment, identifies consumers’ emotions towards online games causing public health effects. The results are visualized as shown in Figure 5. It can be seen from the figure that the emotion classification pie graph of consumers’ opinions on online games is in the middle. The classification pie graphs of the other six topics surround it, with arrows pointing to it, respectively. It intuitively shows that its classification results are calculated from the results of the six topics. The gray part represents neutral emotions in the seven pie graphs, the blue part represents positive emotions, and the orange represents negative emotions.
As shown in Figure 5, the pie graph of the general emotion of consumers’ opinions on online games shows that negative emotions account for 63%, positive emotions account for 20%, and neutral emotions account for 17%. Thus, the proportion of negative emotions is far more significant than the neutral and positive emotions, indicating that most consumers expressed negative emotions towards related opinions on the public health effects of online games through comments on social media.
Furthermore, in Figure 5, in the domain of personal health effects, the proportion of negative emotions in “Physical Health” (T1), “Mental Health” (T2), and “Individual Virtue” (T3) are 63%, 66%, and 55%, respectively, indicating that consumers have strong negative emotion to the personal health effects caused by online games. Similarly, in the domain of social health effects, the proportion of negative emotions in “Commercial Morality” (T5) accounts for 72%, which is the strongest of the six topics. Conversely, the weakest intensity of negative emotion is “Governmental Regulation” (T6), accounting for 38%. Thus, it can be explained that consumers have strong negative emotions about the public health effects, such as physical health, mental health, adolescent education, and especially commercial morality problems, caused by online games. Yet, from the aspect of government supervision, consumers’ emotional expressions are more positive, holding a supportive trend.
Combined with the results of the first and second tasks in our study, “Commercial Morality” (T5), less studied in the past, not only involves many comments but is also the most potent negative emotion coming from consumers. It indicates that the consumers’ opinions of online games-related enterprises lack self-discipline, commercial morality, and rising popular discontent. Meanwhile, although “Individual Virtue” (T3) is least expressed by consumers in the comments compared to other topics, consumers’ strong negative feelings towards topic 3 indicate that consumers also resent the negative effects of online games on their personal virtues.
Finally, it should be noted that the proportion of negative emotional comments towards “Adolescent Education” (T4) reaches 69%. According to most comments identified by the multi-label classification semantic, “Adolescent Education” (T4) is the most concern of all topics and the most co-occurred topic compared to other topics. It indicates that consumers are worried about the negative impact of online game products on education, especially among adolescents.
The first two tasks mined, identified, and quantified consumers’ opinions on the public health effects caused by online games in social media comments, according to topics. We also have an insight into the relationship between the topics. However, there are differences in consumers’ emotions towards different topics. For example, in “Governmental Regulation” (T6), consumers’ positive emotions outweigh their negative emotions. Thus, the purpose of further emotion analysis for each topic is regarding attitude in this task, whether consumers support or oppose it. The results of this paper may represent the majority of consumers’ cognition towards the health effects of online games.

5. Conclusions

5.1. Contributions

Based on social media comments, this study uses NLP technology to mine, identify, and reveal consumers’ opinions on the public health effects of online game products. The research results show that six topics related to personal and social health effects domain, respectively, in which the “Individual Virtue” (T3) and “Commercial Morality” (T5) topics are less researched before but objectively exist in consumer social media comments. Additionally, the multi-label classification model quantifies the degree of consumers’ concerns for each topic. It reveals that “Adolescent Education” (T4) is the key factor, which has a large-size number of comments and co-occurs most frequently with other topics. Finally, the emotional intensity analysis was conducted for each topic to obtain different consumers’ attitudes to each topic. It was found that consumers, in addition to “Governmental Regulation” (T6), had negative emotions towards other effects, which for “Commercial Morality” (T5) was the strongest.
This paper summarizes some theoretical contributions based on the above research results.
First, from the new perspective of China’s online game consumption market, this paper innovatively studies the content of consumer opinions on public health effects caused by online game products. It can provide a reference for the government and enterprises, promoting human health, social well-being, and economic development.
Second, a new comments database based on different social media is established in this study to obtain consumers’ opinions about online games accurately, timely, and objectively. Thus, it provides a new research basis for future researchers to study the public health effects of online games.
Third, driven by big data, a natural language recognition process for consumers’ opinions on online games was established by using scientific methods. The hierarchical clustering model in this study can practically discover new topics from unstructured text data. It has the ability for precise demonstration and discovery of research topics. The multi-label classification model quantifies the consumer’s degree of concern for each topic and reveals the relationship between topics. Nevertheless, the sentiment analysis task gives emotional intensity to each topic to further deconstruct consumer opinions on online games. The results of these experiments form a complete interpretation of the different dimensions and granularities of the research task for the public.

5.2. Practical Implications

Research in this paper has considerable practical significance. Business patterns of most of the current online game companies have the characteristics of low entry barriers and gradually increasing temptation, and have made significant commercial profits. Thus, it is unfeasible to expect the online game industry to reduce the negative impact of this business model on personal and social health through self-discipline. When the harm accumulates to a certain extent, it will inevitably lead to a public backlash. There are currently a lot of highly negative comments by Chinese consumers about the lack of business ethics in the online game industry through social media. Thus, it will bring about more serious social problems and inevitably lead to the intervention of government departments. Suppose the profit of the enterprise is mainly based on this pattern with the intention of attracting consumption and lottery, in that case, the enterprise’s profitability will inevitably decline due to the limitation of government intervention.
It is well known that China is changing from simply pursuing the goal of rapid economic growth to a country that pays more attention to social equality, the health and well-being of the population, especially the youth, and sustainable development [85]. From the perspective of government public management, understanding the relationship between industrial development and public health is conducive to achieving the government’s goals. A series of laws introduced and implemented shows that companies and industries that focus only on short-term benefits and ignore externalities and social responsibilities will be subject to increasingly strict regulations in the foreseeable future [86]. It is actually happening currently, as the Chinese government suddenly tightened restrictions on teenagers’ participation in online games in mid-2021. In 2021, the actual sales revenue of China’s online game market reached RMB 225.538 billion, an increase of RMB 15.862 billion over 2020, with a year-on-year growth of 7.57%. The stock price of Tencent, China’s largest game operator, on NASDAQ fell from about USD 75 in mid-June 2021 to less than USD 40 in mid-September 2022 [87]. While China’s consumer market for online games remains huge and is growing, its growth rate has fallen to its lowest in the past decade, far below the 32.61% growth in 2020 [2].
Moreover, this decline may be fatal to start-ups and small and medium-sized online game companies. Furthermore, since mid-2021, more than 3500 online game enterprises have already been canceled. It can be predicted that most of them are small and medium-sized online game enterprises [88]. Thus, from the perspective of the online game industry-related enterprises, in the face of consumers’ general negative emotions, reminding them of their need to pay attention to the potential harm to consumers while bringing about excess profits through online game products. Otherwise, consumers’ cognition that a large number of various online games lead to public health issues and high negative emotions may damage the overall image of the online game industry, business reputation, and the sustainable development of the enterprise itself. Thus, enterprises should seek their own development strategies, improve their sense of social responsibility, take the initiative to adapt to the new market environment, start from the content design, control the addiction mechanism, explore excellent themes, and seek development opportunities in the new online game field conducive to consumer health and youth education.
In summary, the research results in this paper can assist the needs of the above two aspects.

6. Limitations

Although the current research results have made a contribution, there are still shortcomings. That is, the timeline has not been introduced into the research to obtain the changing trend of consumer opinions. In future research, we will obtain social media comments in different periods to test whether the strategic adjustment of the enterprise is appropriate or whether the government regulatory measures are effective.
Furthermore, although the research object of this paper is online games, its research framework is also expected to be suitable for other industries. China has set new goals for its national development, which is one of the necessary conditions for enterprises to meet the requirements of social health and public well-being requirements. Therefore, limiting this article to the impact of online games on public health is far from adequate, and it needs to be expanded to other industries.

Author Contributions

Conceptualization, T.S. and J.Z.; methodology, Z.W. and T.P.; writing—original draft preparation, T.S.; writing—review and editing, W.Z. and J.Z.; supervision, Z.W. and H.J.; project administration, T.S. and J.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. The 49th Statistical Report on China’s Internet Development. Available online: www.cnnic.net.cn/hlwfzyj/hlwxzbg/hlwtjbg/202202/P020220311493378715650.pdf (accessed on 20 August 2022).
  2. 2021 China Gaming Industry Report. Available online: https://www.superpixel.com/article/18537/2021-china-gaming-industry-report-out-highlighting-worries-that-growth-cannot-continue-it-has (accessed on 20 August 2022).
  3. Wang, X.; Abdelhamid, M.; Sanders, G.L. Exploring the effects of psychological ownership, gaming motivations, and primary/secondary control on online game addiction. Decis. Support Syst. 2021, 144, 113512. [Google Scholar] [CrossRef]
  4. Pontes, H.M.; Griffiths, M.D. A new era for gaming disorder research: Time to shift from consensus to consistency. Addict. Behav. 2019, 103, 106059. [Google Scholar] [CrossRef] [Green Version]
  5. Liang, Q.; Yu, C.; Chen, Q.; Xie, X.; Wu, H.; Xing, J.; Dou, K. Exposure to community violence, affiliations with risk-taking peer groups, and internet gaming disorder among Chinese adolescents: The moderating role of parental monitoring. Front. Psychol. 2019, 10, 2074. [Google Scholar] [CrossRef] [PubMed]
  6. Rothstein, M.A. Rethinking the Meaning of Public Health. J. Law Med. Ethics 2002, 30, 144–149. [Google Scholar] [CrossRef] [PubMed]
  7. Osathanunkul, C. A Classification of Business Models in Video Game Industry. Int. J. Manag. Cases 2015, 17, 35–44. Available online: https://www.circleinternational.co.uk/wp-content/uploads/2021/01/17.1.pdf#page=35 (accessed on 20 August 2022).
  8. Tian, L.; Liu, S. The gaming industry in the development, change, and game. Youth Journal. 2021, 9, 60–62. [Google Scholar] [CrossRef]
  9. Halbrook, Y.J.; O’Donnell, A.T.; Msetfi, R.M. When and How Video Games Can Be Good: A Review of the Positive Effects of Video Games on Well-Being. Perspect. Psychol. Sci. 2019, 14, 1096–1104. [Google Scholar] [CrossRef]
  10. Syahrivar, J.; Chairy, C.; Juwono, I.D.; Gyulavári, T. Pay to Play in Freemium Mobile Games: A Compensatory Mechanism. Int. J. Retail. Distrib. Manag. 2022, 50, 117–134. [Google Scholar] [CrossRef]
  11. Han, Y.; Xu, X. Evolution and prospect of online game supervision policy from the perspective of Minor protection——Based on the analysis of policy text from 2000 to 2020. J. Shanghai Educ. Res. 2021, 415, 16–22. [Google Scholar] [CrossRef]
  12. Lischer, S.; Jeannot, E.; Brülisauer, L.; Weber, N.; Khazaal, Y.; Bendahan, S.; Simon, O. Response to the Regulation of Video Games under the Youth Media Protection Act: A Public Health Perspective. Int. J. Environ. Res. Public Health 2022, 19, 9320. [Google Scholar] [CrossRef] [PubMed]
  13. Xiao, W. Ethics of Public Health: Concept, Mission and Goal. J. Xiangtan Univ. (Philos. Soc. Sci.) 2006, 30, 107–112. [Google Scholar] [CrossRef]
  14. Yang, X.; Jiang, X.; Mo, P.K.; Cai, Y.; Ma, L.; Lau, J.T. Prevalence and Interpersonal Correlates of Internet Gaming Disorders among Chinese Adolescents. Int. J. Environ. Res. Public Health 2020, 17, 579. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Yuan, G.; Elhai, J.D.; Hall, B.J. The influence of depressive symptoms and fear of missing out on severity of problematic smartphone use and Internet gaming disorder among Chinese young adults: A three-wave mediation model. Addict. Behav. 2021, 112, 106648. [Google Scholar] [CrossRef] [PubMed]
  16. Yen, J.Y.; Lin, H.C.; Chou, W.P.; Liu, T.L.; Ko, C.H. Associations among resilience, stress, depression, and internet gaming disorder in young adults. Int. J. Environ. Res. Public Health 2019, 16, 3181. [Google Scholar] [CrossRef] [Green Version]
  17. Mihara, S.; Higuchi, S. Cross-sectional and longitudinal epidemiological studies of internet gaming disorder: A systematic review of the literature. Psychiatry Clin. Neurosci. 2017, 71, 425–444. [Google Scholar] [CrossRef] [Green Version]
  18. Kuss, D.J.; Louws, J.; Wiers, R.W. Online gaming addiction? Motives predict addictive play behavior in massively multiplayer online role-playing games. Cyberpsychol. Behav. Soc. Netw. 2012, 15, 480–485. [Google Scholar] [CrossRef] [Green Version]
  19. Tian, Y.; Yu, C.; Lin, S.; Lu, J.; Liu, Y.; Zhang, W. Sensation Seeking, Deviant Peer Affiliation, and Internet Gaming Addiction Among Chinese Adolescents: The Moderating Effect of Parental Knowledge. Front. Psychol. 2018, 9, 02727. [Google Scholar] [CrossRef] [Green Version]
  20. Lee, Z.W.Y.; Cheung, C.M.K.; Chan, T.K.H. Massively multiplayer online game addiction: Instrument development and validation. Inf. Manag. 2015, 52, 413–430. [Google Scholar] [CrossRef] [Green Version]
  21. Kuss, D.J. Internet gaming addiction: Current perspectives. Psychol. Res. Behav. Manag. 2013, 6, 125–137. [Google Scholar] [CrossRef] [Green Version]
  22. Aziz, N.; Nordin, M.J.; Abdulkadir, S.J.; Salih, M.M.M. Digital Addiction: Systematic Review of Computer Game Addiction Impact on Adolescent Physical Health. Electronics 2021, 10, 996. [Google Scholar] [CrossRef]
  23. Klasnja, A.; Milenovic, N.; Lukac, S.; Knezevic, A.; Klasnja, J.; Rakic, V.K. The Effects of Regular Physical Activity and Playing Video Games on Reaction Time in Adolescents. Int. J. Environ. Res. Public Health 2022, 19, 9278. [Google Scholar] [CrossRef] [PubMed]
  24. Yu, C.; Tang, C.; Lin, Z.; Zhang, Q. The interplay between multilevel individual and environmental factors acting on the internet gaming disorder in adolescents: Based on the latent profile analysis. Educ. Meas. Eval. 2017, 6, 33–34. [Google Scholar] [CrossRef]
  25. Turel, O.; Matt, C.; Trenz, M.; Cheung, C.M.K.; D’Arcy, J.; Qahri-Saremi, H.; Tarafdar, M. Panel report: The dark side of the digitization of the individual. Internet Res. 2019, 29, 274–288. [Google Scholar] [CrossRef]
  26. Zhai, Z.W.; Hoff, R.A.; Howell, J.C.; Wampler, J.; Krishnan-Sarin, S.; Potenza, M.N. Differences in associations between problematic video-gaming, video-gaming duration, and weapon-related and physically violent behaviors in adolescents. J. Psychiatr. Res. 2020, 121, 47–55. [Google Scholar] [CrossRef]
  27. Evren, C.; Evren, B.; Dalbudak, E.; Topcu, M.; Kutlu, N. Relationships of Internet addiction and Internet gaming disorder symptom severities with probable attention deficit/hyperactivity disorder, aggression and negative affect among university students. ADHD Atten. Deficit Hyperact. Disord. 2019, 11, 413–421. [Google Scholar] [CrossRef]
  28. Ferguson, C.J.; Jeong, E.J.; Wang, J.C.K. Pathological gaming: A longitudinal study from the perspectives of mental health problems and social stress model. J. Gen. Psychol. 2022. [Google Scholar] [CrossRef]
  29. Chen, D. On development of regulation policies on the content of online game in China. J. Lingnan Norm. Univ. 2016, 37, 38–45. [Google Scholar]
  30. Liu, S.; Zhang, X. Analyze on the routes of government administration in the online game industry. Editor. Friend 2015, 5, 24–27. [Google Scholar] [CrossRef]
  31. Wang, R.; Liu, Y. Research on social responsibility of online game provider. J. North China Univ. Technol. 2011, 23, 24–29. [Google Scholar]
  32. Liu, S. Analysis on the cultural characteristics and the promotion route of network games. J. Commun. Univ. China 2014, 36, 125–128. [Google Scholar]
  33. Gao, Y.; Wang, J.; Liu, C. Research on the path of network games social regulation. J. Northeast Norm. Univ. (Philos. Soc. Sci.) 2017, 5, 69–74. [Google Scholar] [CrossRef]
  34. Su, J.; Sun, H. The impact of online game on teenagers’ educational expectations. J. Tsinghua Univ. (Philos. Soc. Sci.) 2022, 37, 185–200+217. [Google Scholar] [CrossRef]
  35. Burstein, P. The Impact of Public Opinion on Public Policy: A Review and an Agenda. Political Res. Q. 2003, 56, 29–40. [Google Scholar] [CrossRef]
  36. Esponera, C.C.; Sánchez-Escalonilla, S.F.; Royo-Bordonada, M.Á. Public Opinion on Food Policies to Combat Obesity in Spain. Int. J. Environ. Res. Public Health 2022, 19, 8561. [Google Scholar] [CrossRef] [PubMed]
  37. Austhof, E.; Brown, H.E. Global Warming’s Six MTurks: A Secondary Analysis of a US-Based Online Crowdsourcing Market. Int. J. Environ. Res. Public Health 2022, 19, 8320. [Google Scholar] [CrossRef]
  38. Devlin, B.L.; Staley, K.J.; Trakman, G.L.; Forsyth, A.K.; Nicholson, M.G.; Cosgriff, G.; Chisholm, M.; Belski, R. Attitudes and Opinions of Parents towards Water-Only Drink Policy at Junior Triathlon Events. Int. J. Environ. Res. Public Health 2022, 19, 8529. [Google Scholar] [CrossRef]
  39. Xu, L.; Liu, J.; Kim, J.; Chon, M.-G. Are Chinese Netizens Willing to Speak Out? The Spiral of Silence in Public Reactions to Controversial Food Safety Issues on Social Media. Int. J. Environ. Res. Public Health 2021, 18, 13114. [Google Scholar] [CrossRef]
  40. Wang, J.; Shu, T.; Zhao, W.; Zhou, J. Research on Chinese Consumers’ Attitudes Analysis of Big-Data Driven Price Discrimination Based on Machine Learning. Front. Psychol. 2021, 12, 803212. [Google Scholar] [CrossRef]
  41. Shu, T.; Wang, Z.; Lin, L.; Jia, H.; Zhou, J. Customer Perceived Risk Measurement with NLP Method in Electric Vehicles Consumption Market: Empirical Study from China. Energies 2022, 15, 1637. [Google Scholar] [CrossRef]
  42. Han, X.; Wang, J.; Zhang, M.; Wang, X. Using social media to mine and analyze public opinion related to COVID-19 in China. Int. J. Environ. Res. Public Health 2020, 17, 2788. [Google Scholar] [CrossRef] [Green Version]
  43. LaValle, S.; Lesser, E.; Shockley, R.; Hopkins, M.S.; Kruschwitz, N. Big Data, Analytics and the Path from Insights to Value. MIT Sloan Manag. Rev. 2011, 52, 21–32. [Google Scholar]
  44. Yang, C.L.; Huang, C.Y.; Hsiao, Y.H. Using Social Media Mining and PLS-SEM to Examine the Causal Relationship between Public Environmental Concerns and Adaptation Strategies. Int. J. Environ. Res. Public Health 2021, 18, 5270. [Google Scholar] [CrossRef]
  45. Hou, K.; Hou, T.; Cai, L. Public attention about COVID-19 on social media: An investigation based on data mining and text analysis. Personal. Individ. Differ. 2021, 175, 110701. [Google Scholar] [CrossRef] [PubMed]
  46. Shim, J.G.; Ryu, K.H.; Lee, S.H.; Cho, E.A.; Lee, Y.J.; Ahn, J.H. Text Mining Approaches to Analyze Public Sentiment Changes Regarding COVID-19 Vaccines on Social Media in Korea. Int. J. Environ. Res. Public Health 2021, 18, 6549. [Google Scholar] [CrossRef]
  47. Zhang, Y.; Chen, N.; Du, W.; Yao, S.; Zheng, X. A New Geo-Propagation Model of Event Evolution Chain Based on Public Opinion and Epidemic Coupling. Int. J. Environ. Res. Public Health 2020, 17, 9235. [Google Scholar] [CrossRef]
  48. Lyu, Y.W.; Chow, J.C.C.; Hwang, J.J. Exploring public attitudes of child abuse in mainland China: A sentiment analysis of China’s social media Weibo. Child. Youth Serv. Rev. 2020, 116, 105250. [Google Scholar] [CrossRef]
  49. Duan, S.; Shen, Z.; Luo, X. Exploring the Relationship between Urban Youth Sentiment and the Built Environment Using Machine Learning and Weibo Comments. Int. J. Environ. Res. Public Health 2022, 19, 4794. [Google Scholar] [CrossRef] [PubMed]
  50. Manning, C.D.; Schütze, H. Foundations of Statistical Natural Language Processing; MIT Press: Cambridge, MA, USA, 1999. [Google Scholar]
  51. Collobert, R.; Weston, J.; Bottou, L.; Karlen, M.; Kavukcuoglu, K.; Kuksa, P. Natural language processing (almost) from scratch. J. Mach. Learn. Res. 2011, 12, 2493–2537. [Google Scholar] [CrossRef]
  52. Green, G.M. Pragmatics and Natural Language Understanding, 2nd ed.; Routledge: New York, NY, USA, 1996. [Google Scholar] [CrossRef]
  53. Herhausen, D.; Ludwig, S.; Grewal, D.; Wulf, J.; Schoegel, M. Detecting, preventing, and mitigating online firestorms in brand communities. J. Mark. 2019, 83, 1–21. [Google Scholar] [CrossRef]
  54. Bansal, N.; Sharma, A.; Singh, R.K. Fuzzy AHP approach for legal judgement summarization. J. Manag. Anal. 2019, 6, 323–340. [Google Scholar] [CrossRef]
  55. McDonald, D.D. Natural language generation. In Handbook of Natural Language Processing, 2nd ed.; Routledge: New York, NY, USA, 2010; pp. 121–144. [Google Scholar]
  56. Kang, Y.; Cai, Z.; Tan, C.W.; Huang, Q.; Liu, H. Natural language processing (NLP) in management research: A literature review. J. Manag. Anal. 2020, 7, 139–172. [Google Scholar] [CrossRef]
  57. Reisenbichler, M.; Reutterer, T. Topic modeling in marketing: Recent advances and research opportunities. J. Bus. Econ. 2019, 89, 327–356. [Google Scholar] [CrossRef] [Green Version]
  58. Tong, Z.; Zhang, H. A Text Mining Research Based on LDA Topic Modelling. In Proceedings of the International Conference on Computer Science, Engineering and Information Technology, Vienna, Austria, 21–22 May 2016; pp. 201–210. [Google Scholar] [CrossRef]
  59. Jelodar, H.; Wang, Y.; Yuan, C.; Feng, X.; Jiang, X.; Li, Y.; Zhao, L. Latent Dirichlet allocation (LDA) and topic modeling: Models, applications, a survey. Multimed. Tools Appl. 2019, 78, 15169–15211. [Google Scholar] [CrossRef]
  60. Paul, M.J.; Dredze, M. You Are What You Tweet: Analyzing Twitter for Public Health. In Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media, Barcelona, Spain, 17–21 July 2011; pp. 265–272. [Google Scholar]
  61. Botsis, T.; Buttolph, T.; Nguyen, M.D.; Winiecki, S.; Woo, E.J.; Ball, R. Vaccine Adverse Event Text Mining System for Extracting Features from Vaccine Safety Reports. J. Am. Med. Inform. Assoc. 2012, 19, 1011–1018. [Google Scholar] [CrossRef] [Green Version]
  62. Hao, H.; Zhang, K. The Voice of Chinese Health Consumers: A Text Mining Approach to Web-Based Physician Reviews. J. Med. Internet Res. 2016, 18, e108. [Google Scholar] [CrossRef]
  63. Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent Dirichlet Allocation. J. Mach. Learn. Res. 2003, 3, 993–1022. [Google Scholar]
  64. Tsoumakas, G.; Katakis, I. Multi-label classification: An overview. Int. J. Data Warehous. Min. (IJDWM) 2007, 3, 1–13. [Google Scholar] [CrossRef] [Green Version]
  65. Sorower, M.S. A Literature Survey on Algorithms for Multi-Label Learning; Oregon State University: Corvallis, OR, USA, 2010; pp. 1–25. Available online: https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.364.5612&rep=rep1&type=pdf (accessed on 20 August 2022).
  66. Zheng, X.; Li, P.; Chu, Z.; Hu, X. A survey on multi-label data stream classification. IEEE Access 2019, 8, 1249–1275. [Google Scholar] [CrossRef]
  67. Zhao, P. Research on Short Text. Feature Expansion and Classification Based on Topic Model and Deep Learning; Tianjin Polytechnic University: Tianjin, China, 2018. [Google Scholar]
  68. Jin, W.; Zhu, H.; Yang, G. An efficient character-level and word-level feature fusion method for Chinese text classification. J. Phys. Conf. Ser. 2019, 1229, 12057. [Google Scholar] [CrossRef]
  69. Wang, L. Research on Chinese Short Text. Classification Based on Hybrid. Neural Network; Zhejiang University of Science and Technology: Hangzhou, China, 2019. [Google Scholar]
  70. Li, F.; Yu, H. ICD coding from clinical text using multi-filter residual convolutional neural network. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; pp. 8180–8187. [Google Scholar] [CrossRef]
  71. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  72. Wan, S.; Lan, Y.; Guo, J.; Xu, J.; Pang, L.; Cheng, X. A deep architecture for semantic matching with multiple positional sentence representations. In Proceedings of the AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA, 12–17 February 2016. [Google Scholar] [CrossRef]
  73. Ibrahim, M.A.; Khan, M.U.G.; Mehmood, F.; Asim, M.N.; Mahmood, W. GHS-NET a generic hybridized shallow neural network for multi-label biomedical text classification. J. Biomed. Inform. 2021, 116. [Google Scholar] [CrossRef] [PubMed]
  74. Liu, B. Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 2012, 5, 1–167. [Google Scholar] [CrossRef]
  75. Wang, T.; Yang, W. Review of Text Sentiment Analysis Methods. Comput. Eng. Appl. 2021, 57, 11–24. [Google Scholar] [CrossRef]
  76. Zhong, J.; Liu, W.; Wang, S.; Yang, H. Review of Methods and Applications of Text Sentiment Analysis. Data Anal. Knowl. Discov. 2021, 54, 1–13. [Google Scholar] [CrossRef]
  77. Wan, Q.; Xu, X.; Zhuang, J.; Pan, B. A sentiment analysis-based expert weight determination method for large-scale group decision-making driven by social media data. Expert Syst. Appl. 2021, 185, 115629. [Google Scholar] [CrossRef]
  78. Zhang, C.; Jiang, J.; Jin, H.; Chen, T. The Impact of COVID-19 on Consumers’ Psychological Behavior Based on Data Mining for Online User Comments in the Catering Industry in China. Int. J. Environ. Res. Public Health 2021, 18, 4178. [Google Scholar] [CrossRef]
  79. Wan, J.; Liu, X.; Zuo, Y.; Luo, J. Analysis on Public Opinion Sentiment Evolution of COVID-19 Based on Weibo Data. In Proceedings of the WHICEB 2021, Wuhan, China, 28–30 May 2021; Available online: https://aisel.aisnet.org/whiceb2021/67 (accessed on 20 August 2022).
  80. Jia, L.; Li, Z. Emotional Analysis on the Public Sentiment of Students Returning to University under COVID-19. In Proceedings of the 16th International Conference on Computational Intelligence and Security (CIS), Nanning, China, 27–30 November 2020; pp. 238–242. [Google Scholar] [CrossRef]
  81. Zhang, J.; Zheng, Z.; Zhang, L.; Qin, Y.; Duan, J.; Zhang, A. Influencing Factors of Environmental Risk Perception during the COVID-19 Epidemic in China. Int. J. Environ. Res. Public Health 2021, 18, 9375. [Google Scholar] [CrossRef]
  82. Zhai, Y.; Song, X.; Chen, Y.; Lu, W. A Study of Mobile Medical App User Satisfaction Incorporating Theme Analysis and Review Sentiment Tendencies. Int. J. Environ. Res. Public Health 2022, 19, 7466. [Google Scholar] [CrossRef]
  83. Sutherland, I.; Sim, Y.; Lee, S.K.; Byun, J.; Kiatkawsin, K. Topic Modeling of Online Accommodation Reviews via Latent Dirichlet Allocation. Sustainability 2020, 12, 1821. [Google Scholar] [CrossRef] [Green Version]
  84. He, L.; Han, D.; Zhou, X.; Qu, Z. The Voice of Drug Consumers: Online Textual Review Analysis Using Structural Topic Model. Int. J. Environ. Res. Public Health 2020, 17, 3648. [Google Scholar] [CrossRef]
  85. The Outline of the 14th Five-Year Plan for Economic and Social Development and Long-Range Objectives through the Year 2035 of the People’s Republic of China. Available online: https://en.ndrc.gov.cn/policies/202203/P020220315511326748336.pdf (accessed on 28 September 2022).
  86. Law of the People’s Republic of China on Protection of Minors. Available online: http://www.china.org.cn/english/government/207410.htm (accessed on 28 September 2022).
  87. Tencent Holdings Ltd. ADR. Available online: https://www.nasdaq.com/market-activity/stocks/tcehy (accessed on 28 September 2022).
  88. 3500 Game Enterprises Were Cancelled in a Year, Small and Medium-Sized Game Companies Entered the Cold Winter Again? Available online: https://view.inews.qq.com/a/20220615A04ZYX00 (accessed on 28 September 2022).
Figure 1. The semantics identification process of consumers’ opinions towards online games.
Figure 1. The semantics identification process of consumers’ opinions towards online games.
Ijerph 19 12793 g001
Figure 2. The LDA topic model.
Figure 2. The LDA topic model.
Ijerph 19 12793 g002
Figure 3. The Bi-LSTM multi-label classification model.
Figure 3. The Bi-LSTM multi-label classification model.
Ijerph 19 12793 g003
Figure 4. Consumer concern degree of each topic.
Figure 4. Consumer concern degree of each topic.
Ijerph 19 12793 g004
Figure 5. Emotion pie graph of consumers’ opinions on online games.
Figure 5. Emotion pie graph of consumers’ opinions on online games.
Ijerph 19 12793 g005
Table 1. Data sources and corpus formation.
Table 1. Data sources and corpus formation.
Social Media PlatformComments on Online Games
Weibo34,303
Zhihu21,789
Toutiao92,017
Xigua Video36,547
Tiktok121,198
Kuaishou21,651
Total327,505
Table 2. Determined topics and domain clustering.
Table 2. Determined topics and domain clustering.
Opinion Domain ClusteringTopic CodeTopic NameTop 10 Feature Words Belonging to Each TopicAppeared in Literate
Personal Health Effects (D1)T1Physical
Health
Poor Vision, Myopia, Headache, Physical Strength, Nausea, Obesity, Insufficient Sleep, Back Pain, Exercise Less, Physical WeaknessYes
T2Mental
Health
Depression, Emotional Stress, Negative Emotions, Nervous Breakdown, Identity Crisis, Avoid Reality, Obsessed, Solitary, Loneliness, Mental DisordersYes
T3Individual VirtueUnfilial, Lack Ambition, Violence, Gambling, Porn, Nationalist Sentiment, Patriotic Feelings, Bullying, Family StabilityNo
Social Health Effects (D2)T4Adolescent EducationAcademic Record, Teenagers, Students, Cut Class, Stop Schooling, Ignore Study, Disciplining Child, Rebellious, Lack of Learning Interest, Bad Classroom DisciplineYes
T5Commercial MoralityCompany Interests, Lure, Devaluation, Game Recharge, Company Name A *, Token, Reward, Game Props, Lottery, Company Name B **No
T6Governmental RegulationIndustry Governance, Real Name Authentication, Hierarchical Management, Access Restriction, Off Shelf, Shield, Censor Content, Administrative Control, Heavy Fines and Taxes, Green Health IndustryYes
* The largest online game company. ** Another online game company.
Table 3. Semantic identified results of multi-label classification model.
Table 3. Semantic identified results of multi-label classification model.
Proportion (%)Description
138.38%Only emotion expression, not involving topics.
211.89%T5
36.43%T4
45.35%T6
53.13%T4 + T5
62.69%T2
71.94%T1
81.92%T4 + T6
91.89%T2 + T4
101.83%T1 + T4
111.47%T1 + T5
121.42%T5 + T6
131.33%T3 + T4
141.17%T2 + T5
151.15%T1 + T4 + T5
161.04%T3
170.96%T4 + T5 + T6
180.83%T2 + T4 + T5
190.80%T1 + T4 + T6
200.78%T1 + T2 + T4
210.74%T2 + T3 + T4
220.71%T2 + T4 + T6
230.69%T3 + T4 + T5
240.69%T1 + T4 + T5 + T6
250.64%T3 + T4 + T6
260.62%T3 + T5
270.58%T1 + T2 + T4 + T6
280.53%T1 + T2 + T3 + T4 + T5 + T6
290.52%T2 + T6
300.51%T1 + T2 + T4 + T5
--Those accounting for <0.5% are omitted.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Shu, T.; Wang, Z.; Jia, H.; Zhao, W.; Zhou, J.; Peng, T. Consumers’ Opinions towards Public Health Effects of Online Games: An Empirical Study Based on Social Media Comments in China. Int. J. Environ. Res. Public Health 2022, 19, 12793. https://doi.org/10.3390/ijerph191912793

AMA Style

Shu T, Wang Z, Jia H, Zhao W, Zhou J, Peng T. Consumers’ Opinions towards Public Health Effects of Online Games: An Empirical Study Based on Social Media Comments in China. International Journal of Environmental Research and Public Health. 2022; 19(19):12793. https://doi.org/10.3390/ijerph191912793

Chicago/Turabian Style

Shu, Tao, Zhiyi Wang, Huading Jia, Wenjin Zhao, Jixian Zhou, and Tao Peng. 2022. "Consumers’ Opinions towards Public Health Effects of Online Games: An Empirical Study Based on Social Media Comments in China" International Journal of Environmental Research and Public Health 19, no. 19: 12793. https://doi.org/10.3390/ijerph191912793

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop