Next Article in Journal
Psychological Capital and Self-Acceptance Modified the Association of Depressive Tendency with Self-Rated Health of College Students in China during the COVID-19 Pandemic
Previous Article in Journal
The Use of Voice Assistant for Psychological Assessment Elicits Empathy and Engagement While Maintaining Good Psychometric Properties
Previous Article in Special Issue
The Role of Mass Media in Promoting the Dual Career of the Performance Athlete
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Analysis of COVID-19-Related User Content on the Baseball Bulletin Board in 2020 through Text Mining

1
Institute of Physical Education, Health and Leisure Studies, National Cheng Kung University, No. 1, Daxue Road, East District, Tainan 701401, Taiwan
2
Department of Recreational Sport and Health Promotion, National Pingtung University of Science and Technology, No. 1, Shuefu Road, Neipu, Pingtung 912301, Taiwan
3
Faculty of Sport Sciences, Waseda University, 3-4-1 Higashifushimi STEP22 Nishitokyo, Tokyo 202-0021, Japan
*
Authors to whom correspondence should be addressed.
Behav. Sci. 2023, 13(7), 551; https://doi.org/10.3390/bs13070551
Submission received: 23 May 2023 / Revised: 29 June 2023 / Accepted: 29 June 2023 / Published: 2 July 2023

Abstract

:
The world engaged in online sport watching during COVID-19. Fortunately, in Taiwan, the pandemic was stably controlled in 2020, allowing for the continuation of the Chinese Professional Baseball League (CPBL); this attracted international attention and encouraged relevant discussions on social media in Taiwan. In the present study, through text mining, we analyzed user content (e.g., the concepts of sports service quality and social identity) on the Professional Technology Temple (PTT) baseball board—the largest online bulletin board system in Taiwan. A predictive model was constructed to assess PTT users’ COVID-19-related comments in 2020. A total of 422 articles and 21,167 comments were retrieved. PTT users interacted more frequently during the closed-door period, particularly during the beginning of the CPBL in April. Effective pandemic prevention, which garnered global attention to the league, generated a sense of national identity among the users, which was strengthened with the development of peripheral products, such as English broadcasting and live broadcasting on Twitch. We used machine learning to develop a chatbot for predicting the attributes of users’ comments; this chatbot may improve CPBL teams’ understanding of public opinion trends. Our findings may help stakeholders develop tailored programs for online spectators of sports during pandemic situations.

1. Introduction

During COVID-19, Taiwan garnered international attention because of its effective pandemic preventive strategies. This increases the possibility of Taiwan receiving the right to host international sports events in the future. We noted that the fans focused on creating content on social media to promote interaction and increase engagement during the regular season closed-door period. Online engagement and display of sports-related passion may improve spectator attendance and the social identity of fans [1]. Social identity refers to individuals’ self-identification and perceived sense of belonging to a certain group as well as active engagement in group activities [2]. A sense of social identity affects individuals’ cocreation behaviors [3]. Earlier studies have focused primarily on real-world interactions and have rarely explored the concept of social identity theory in the virtual world [4]. The analysis of specific trends on social media, such as the Professional Technology Temple (PTT: the largest online bulletin board in Taiwan) may improve our understanding of relevant opinions of professional and experienced users, which may offer important insights into different topics [5]. Thus, social media appears to be a suitable setting for exploring the social identities of users in the context of specific topics. Sport service quality is closely associated with the success of sports leagues [6]. The assessment of the two key categories (core and peripheral products) of sport service quality may help us identify the association between the categories and relevant social media content. Social media provides an alternative avenue for accessing online spectators’ positive and negative feedback on sport service quality. The online activities of social media users may be assessed in terms of their contributions (number of comments) and creations (number of posts).
In the present study, the social identity theory was divided into three levels: team identity, national identity, and player identity. The consumers online brand-related activities (COBRA) framework was used to explore users’ engagement behaviors on the PTT baseball board. After referring to the studies conducted by Buzeta et al. [7], Vale and Fernandes [8], Saridakis et al. [9], and Piehler et al. [10], we modified and deleted the consumption dimension of the COBRA because our aim was to evaluate the effects of users’ active behaviors on their motivation to use social media. This was an unobtrusive study; the PTT does not reveal the number of profiles or post visitors. Figure 1 depicts the conceptual framework used in our study.
Through text mining, we analyzed users’ COVID-19-related content posted on the PTT baseball board in 2020, which included the concepts of sports service quality and social identity. In addition, a predictive model was constructed to assess comment attributes on the basis of their COVID-19-related comments on PTT. Our study objectives were as follows: to investigate the influence of users’ social identity on their social media engagement, to identify the association between different categories of sport service quality and users’ social media engagement, and to use machine learning for constructing a model and a chatbot for the teams participating in the Chinese Professional Baseball League (CPBL).

2. Materials and Methods

2.1. Data Collection

Data were collected from the PTT baseball bulletin board by using PHP (hypertext preprocessor) curl and were managed using Excel. Although the baseball board covers various professional baseball leagues, such as Major League Baseball, Nippon Professional Baseball, and Korea Baseball Organization League, the number of posts on the CPBL exceeds those on others because it is the most popular league in Taiwan. In the present study, we focused particularly on CPBL-related articles to explore the cultural meaning of Taiwanese baseball. Data were extracted from relevant COVID-19-related posts made in 2020 (keywords: epidemic, masks, new coronavirus, closed doors, and pneumonia) on the PTT baseball board. The data set included various data, such as post times, classification, post topics, post texts, and comments (three attributes [reactions]: boo, like, and neutral).

2.2. Multivariate Regression Analysis

SPSS (version 25.0) was used to analyze the number of post days and the classification of posts using descriptive statistics. Data regarding the uses and gratifications (U&G) classification, model concepts, comment attributes, and post period were used in a multivariate regression analysis.

2.2.1. Classification

Original posters classify their content on the basis of topics before posting on the PTT. We classified the articles into the information, interaction, and entertainment categories of the U&G classification according to comment attributes.

2.2.2. Concepts

The articles were classified according to the keywords used in relevant studies. Sport service quality was divided into two categories: core and peripheral products. Furthermore, social identity was divided into three levels: team identity, player identity, and national identity.

2.2.3. Comment Attributes

On the PTT, comments are evaluated using three reactions: boo, like, and neutral. These reactions correspond to those of dislike, like, and no opinion, respectively, on other social media (e.g., Facebook).

2.2.4. Post Period

The CPBL in 2020 was divided into four periods: preseason (before the league), regular season closed door (closed-door games, audience not allowed), regular season open door (audience allowed), and nonseason (after the 2020 season).

2.3. Modeling and Evaluation

We used PHP for word segmentation and Word2vec for constructing a machine learning model. For different comment attributes, word segmentation was performed to convert sentences into words. Then, the words were processed using Word2vec to identify word vectors. Next, the predictive model was constructed, which could predict comment attributes. Finally, a chatbot was developed using Telegram Webhooks (Figure 2).

3. Results

3.1. Descriptive Statistics

We retrieved a total of 422 articles and 21,167 comments from the PTT baseball board. Table 1 summarizes the descriptive statistics of the contribution (number of comments) and creation (the number of posts) of PTT users in the contexts of social identity, sport service quality, and post period. Among the three attributes, like had the highest percentage in the contexts of social identity, sport service quality, and post period. Regarding social identity, the highest proportion of the articles was related to national identity (87%) and comments (76%). Regarding sport service quality, the highest proportion of articles was related to peripheral sport service quality (58%) and comments (55%). Regarding post period, the highest proportion of articles was related to regular season closed door (42%) and comments (54%).
We determined the monthly numbers of articles stratified by concept and collected data regarding the number of confirmed cases of COVID-19 in Taiwan in 2020 (Table 2) [11]. The top three months with the highest numbers of confirmed cases were March, November, and December. Regarding user content, discussions were focused primarily on peripheral sport service quality and national identity. The number of posts was the highest in April.

3.2. Results of Multivariate Regression Analysis

Post classification, social identity, sport service quality, and post period were regarded as independent variables in this study, whereas the three comment attributes were regarded as dependent variables. Chatting, team identity, peripheral sport service quality, and regular season closed door served as reference variables for post classification, social identity, sport service quality, and post period, respectively. The value of collinearity tolerance was >0.2 and the variance inflation factor was <5, indicating no collinearity-related problems.
Table 3 summarizes the statistics related to post classification. Under the concept of information, we evaluated the following variables: news, intelligence, sharing, and home run. News and intelligence (information concept) exerted nonsignificant effects on the boo (news: β = −0.009 and p > 0.05; intelligence: β = −0.008 and p > 0.05), like (news: β = −0.056 and p > 0.05; intelligence: β = 0.018 and p > 0.05), and neutral (news: β = 0.013 and p > 0.05; intelligence: β = 0.041 and p > 0.05) attributes. Sharing (information concept) exerted significantly positive, significantly positive, and nonsignificant effects on the boo (β = 0.135; p < 0.01), like (β = 0.142; p < 0.01), and neutral (β = 0.077; p < 0.01) attributes, respectively. Furthermore, home run exerted nonsignificant, significantly positive, and nonsignificant effects on the boo (β = −0.012; p > 0.05), like (β = 0.29; p < 0.01), and neutral (β = 0.05; p < 0.01) attributes, respectively.
Under the concept of interaction, we evaluated the following variables: questionnaires and discussions. These variables exerted nonsignificant effects on the boo (questionnaires: β = 0.012 and p > 0.05; discussions: β = 0.063 and p > 0.05), like (questionnaires: β = −0.038 and p > 0.05; discussions: β = −0.052 and p > 0.05), and neutral (questionnaires: β = −0.048 and p > 0.05; discussions: β = 0.06 and p > 0.05) attributes.
Under the concept of entertainment, we evaluated the money variable. This variable exerted nonsignificant, significant, and significant effects on the boo (β = −0.022; p > 0.05), like (β = 0.389; p < 0.01), and neutral (β = 0.03; p > 0.05) attributes, respectively.
Table 4 presents the statistics related to social identity. National identity exerted nonsignificant, significantly positive, and nonsignificant effects on the boo (β = 0.034; p > 0.05), like (β = 0.093; p < 0.01), and neutral (β = −0.019; p > 0.05) attributes, respectively. Player identity exerted nonsignificant effects on the boo (β = −0.024; p > 0.05), like (β = −0.043; p > 0.05), and neutral (β = −0.065; p > 0.05) attributes.
Table 5 presents the statistics related to sports service quality. Core sport service quality exerted nonsignificant, significant, and nonsignificant effects on the boo (β = −0.051; p > 0.05), like (β = −0.001; p < 0.05), and neutral (β = 0.01; p > 0.05) attributes, respectively.
Table 6 presents the statistics related to post period. Regarding the boo attribute, preseason exerted significant effects on the boo (β = −0.134; p < 0.05), like (β = −0.061; p < 0.05), and neutral (β = −0.077; p < 0.05) attributes. Regular season open door exerted nonsignificant, significant, and nonsignificant effects on the boo (β = −0.073; p > 0.05), like (β = −0.063; p < 0.05), and neutral (β = −0.108; p > 0.05) attributes, respectively. Furthermore, nonseason exerted nonsignificant effects on the boo (β = −0.046; p > 0.05), like (β = −0.008; p > 0.05), and neutral (β = −0.063; p > 0.05) attributes.

3.3. Machine Learning and Model Prediction

3.3.1. Basic Training

First, the data were curled using PHP. Second, natural language processing—word segmentation and Word2vec—was performed to generate training files. Third, machine learning was used to construct a predictive model, which could predict comment attributes by finding similar texts in the training files. Finally, the chatbot was developed using Telegram Webhooks.

3.3.2. System Implementation

When users input texts or sentences into the chatbot system on Telegram, the chatbot can directly respond to the comment attributes. For example, if a user inputs “precaution” into the chatbot, the system will generate an output of “like”. Furthermore, if one puts “If you have enough inferiority complex, don’t go out anymore. You are embarrassed anyway” into the chatbot, the system will generate an output of “boo”.
If the inputted text has different attributes but the same text ranking, the system will automatically use the most frequent attribute as the main attribute. The appendix presents the text contents of different comment attributes to explain the output generation by the chatbot.

4. Discussion

4.1. Theoretical Implications

The Internet promotes social gratification [12]. In the era of consumer-led marketing, the COBRA can help official stakeholders understand consumers social media [13]. The manner in which consumers use social media is highly interactive and connected [14] and may help develop public opinions [15]. Text mining can help us understand users’ real opinions [16,17].
We used social identity and sport service quality to understand COBRA engagement; this venture is new in the domain of sports research. The chatbot developed in the present study may help official stakeholders better understand the attributes of users’ comments on the PTT baseball board during COVID-19.
We found that articles with a higher number of posts attracted more comments. Therefore, contribution may be positively correlated with creation. Consumers’ interactions on social media increased gradually, shifting from consumption to contribution. Contribution is the key to future post creation [14]. Interaction positively affects the relationship between official stakeholders and consumers, and consumer engagement affects loyalty [18].

4.1.1. Social Identity

National identity was associated with the highest levels of contribution and creation. The content of national identity involved the facts that 2020 CPBL was the first baseball league organized after the COVID-19 outbreak and was broadcast globally; these factors were ascribed to Taiwan’s effective and successful pandemic prevention measures. This invoked a sense of national pride and the like comments. The words global, world, English, rename, and leave a name attracted a high number of like comments, indicating that users wanted to advertise the CPBL to the world as an expression of their national identity. These findings are different from those of earlier studies [19,20] ascribing national identity to a satisfactory performance of the national team, the participation of Taiwanese players in professional leagues abroad, and the efficient hosting of international sporting events. In the present study, the in-depth exploration of online content revealed that sport fans’ (users’) national identity was attributed to the country’s effective pandemic prevention. Thus, the CPBL became an outlet for sports fans to demonstrate their national identity. The CPBL must be made aware of the fact that offline fans are likely to share external social identities (e.g., player and team identities) with online fans, which helps expand the fan base [21]. Thus, the CPBL must promote national identity to effectively encourage its fans to attend the games in the coming season [22].

4.1.2. Sport Service Quality

Peripheral sport service quality was associated with the highest levels of contribution and creation. This category of sport service quality more easily generated like comments than the other category, core sport service quality. Words such as baseball, English, chat room, cheerleading, world, global, fans, and epidemic prevention were highly frequent in the context of peripheral sport service quality. The word baseball was found in repost articles for politicians and fans anticipating live games. The word chatroom received no boo comments and approximately 80% of all like comments. The remaining words were mostly associated with positive comments. Therefore, in 2020, PTT users had positive attitudes toward peripheral sport service quality; this finding is consistent with that reported by Tan and Lee [23] who indicated the reputation of Asian baseball improved during COVID-19 because it was the only live sports event during the pandemic.
The results of the frequency analysis revealed that baseball, world, global, and rename had high frequency in the context of core sport service quality. The word baseball received the highest numbers of both like and boo comments. Specifically, the like comments were associated with the game schedule and players, which indicated users’ support for the schedule and player performances. The words global and world also received like comments, which indicated the users’ intention to promote the CPBL; by contrast, rename received boo comments. Rename referred to adding Taiwan to the name CPBL. Thus, discussions on core sport service quality may help promote the CPBL. Furthermore, combining discussions on core sport service quality with those on national identity may help reduce the number of boo comments. The rationale of the aforementioned strategy is similar to that mentioned by Chiang and Chen [24], who reported that integrating national identity and national morale through sports recognized by other nations may help develop positive attitudes in Taiwanese individuals. Core sport service quality is a strong driver of consumer behaviors [25]; the unpredictability of sports enhances consumer focus on the core of the competition [26]. Our findings indicate the importance of core sport service quality, which attracted the highest numbers of both like and boo comments. It is a double-edged sword. Our study may offer important insights to the CPBL.
Overall, focusing on both core and peripheral sport service qualities online is essential because this represents a promising avenue for obtaining international consumers [27]. The influence of online fans is extensive and global; hence, professional sport leagues must understand online fans’ positive and negative feedback and respond immediately. The leagues’ brand names can be promoted through core and peripheral products to improve the fans’ consumption behaviors [25].

4.1.3. Post Classification

The articles related to home run, sharing, and money were more likely to attract like comments than those related to chat. This indicates that the affinity of users toward articles on players’ home run records, information shared from other platforms, and original posters encouraging users is more important than merely expressing personal opinions. These findings are similar to those reported by Muntinga et al. [13] who found that social media users are driven by entertainment, which is positively correlated with user satisfaction.
Articles related to sharing were more likely to receive boo comments than those related to chat. As information articles describe events from an objective perspective, both positive and negative public opinions are expected. Sharing is essential for establishing a relationship between official stakeholders and fans [28]. However, analysis of the content of articles related to sharing that received boo comments revealed that these articles were shared by politicians and were related to baseball-focused pandemic prevention policies. Politics-related keywords that received boo comments included cockroaches, politicians, government, and politics. The negative comments indicated users’ dissatisfaction with irrelevant information on the baseball board. Sharing helps promote users’ social experience and encourage them to browse and comment on posts [29]. Team managers should avoid or restrict irrelevant posts on their social media to reduce the number of negative comments. The topics of articles related to chat that received boo comments included pandemic prevention policies for the regular season open-door period. Although users expressed their opinions through chats, they were actively and seriously involved in relevant discussions and thus received fewer negative comments. Interaction is regarded the core of audience activities, which can improve our understanding of the requirements of social media and their users [30]. The marketing managers of sports teams must create interaction boards on team social media to explore the requirements of their fans, as encouraging discussions on crucial issues may promote the active engagement of users and reduce the number of negative comments.

4.1.4. Chatbot’s Significance

The chatbot developed in our study predicts comment attributes. Earlier studies on the use of big data in sports management focused on word frequency [31,32,33,34] and topic identification [5,28,35]. By contrast, we developed the predictive model using machine learning; this innovative model may help advance sport research. Using this chatbot, professional baseball teams can effectively manage their online fanbase by better understanding user attitudes and reacting immediately and appropriately.

4.2. Practical Implications

Using both qualitative and quantitative methods in the present study, we assessed online COVID-19-related sports content through text mining. Through the thematic analysis of words with high frequency, key points can easily be identified from an unstructured content. We found that the highest number of posts and comments were made in the regular season closed-door period. This implies that most sports fans were eager for elevated levels of social interactions through virtual platforms such as social media (e.g., PTT). Therefore, sports fans may not abandon their favorite teams even if they are temporarily not allowed to enter the stadium to watch live matches. This is a key implication for the marketing managers of sports teams. They should invest in intensive marketing and building or managing a relationship with their fans through social media [36].
Core products are a predominant and indispensable part of competitions. However, during COVID-19, a relatively high proportion of posts were related to peripheral products (i.e., English broadcasts and Twitch). The availability of English broadcasts and various platforms for the CPBL may help promote users’ national identity. In the present study, the national identity of PTT users was identified to be a strong driver of like comments, which, in turn, promotes sports leagues through word of mouth. The chatbot developed in our study can help understand the posts and comments during the pandemic period. It can also help the CPBL teams to predict public opinions and act accordingly.

4.3. Limitations and Future Directions

Our study has some limitations. First, we focused only on COVID-19-related content posted in 2020. Thus, this study does not cover widespread issues that may attract readers’ attention to professional baseball and COVID-19. For a comprehensive analysis of user content on the PTT baseball board, future studies should collect data chronologically to constitute both training and validation data sets. Thus, the model fit and accuracy of our model may be improved in future studies.
Second, the collection of data focusing on incidents [37] and new theories (i.e., cocreation) may help us better understand users’ interactions [38] and emotions to improve the representativeness and creativity of our model. This requires further studies.
Third, the PTT is a platform purely for text communication. Researchers are encouraged to collect data from social media, such as Facebook and Instagram, which contain pictures and videos for constructing predictive models that may enhance interaction and entertainment [39].
Fourth, understanding emotions underlying Chinese texts through text mining is challenging. The systematic calculation of emotion scores and the improvement of word segmentation accuracy may enhance the substantiveness of future studies [40].
Finally, people can freely create content on the Internet; thus, fake articles and comments cannot be entirely excluded [41]. Texts with extremely high frequency may be deleted to enhance the credibility of future studies.

4.4. Conclusions

With time and technological advances, the development of Web2.0 has ushered us into a whole new world—the virtual world, a platform for networking. Due to the interactivity and immediacy offered by such platforms, users become socially connected and creative consumers. Virtual networks enable social media users to contribute to the development of online public opinions through social interactions; thus, such networks may facilitate targeted problem solving and benefit organizations. The PTT has a strong influence in Taiwan. The original posters of the PTT are mostly professional and experienced fans; this factor strengthens our findings. The concept of PTT comment attributes is the same as that of upvote and downvote on Reddit; hence, the comment attributes indicate users’ attitudes toward articles posted on PTT.
Text mining may help reflect users’ real opinions because it can accurately grasp information by organizing a substantial amount of data. The three comment attributes and concepts (i.e., social identity and sport service quality) were assessed using a word cloud; this would improve professional sport leagues’ understanding of online user content and help them effectively manage their online fanbase. Furthermore, predictive models constructed using big data may help predict user opinions, which may facilitate the management of the relationship between sports teams and fans. Team managers can strengthen their fan base by posting articles to attract positive comments and rapidly resolving and clarifying negative comments.

Author Contributions

Conceptualization, S.-C.M. and C.-Y.S.; methodology, S.-C.M., C.-Y.S. and S.-F.C.; formal analysis, S.-C.M., C.-Y.S. and S.-F.C.; investigation, S.-C.M. and C.-Y.S.; resources, S.-C.M. and S.-M.M.; writing—original draft preparation, S.-C.M., C.-Y.S. and S.-F.C.; writing—review and editing, S.S., S.-F.C. and S.-M.M.; project administration, S.-C.M., C.-Y.S. and S.-F.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on reasonable request from the corresponding author.

Conflicts of Interest

The authors declare no potential conflict of interest with respect to the research, authorship, and/or publication of this article.

References

  1. Smith, L.R.; Smith, K.D. Identity in Twitter’s hashtag culture: A sport-media-consumption case study. Int. J. Sport Commun. 2012, 5, 539–557. [Google Scholar] [CrossRef]
  2. Watkins, B.A. Social identification and social media in sports. In Routledge Handbook of Sport and New Media; Billing, A.C., Hardin, M., Eds.; Routledge: New York, NY, USA, 2014. [Google Scholar]
  3. Qian, T.Y. Watching sports on Twitch? A study of factors influencing continuance intentions to watch Thursday Night Football co-streaming. Sport Manag. Rev. 2021, 23, 1–22. [Google Scholar] [CrossRef]
  4. Fan, M.; Billings, A.; Zhu, X.; Yu, P. Twitter-based BIRGing: Big data analysis of English national team fans during the 2018 FIFA World Cup. Commun. Sport 2020, 8, 317–345. [Google Scholar] [CrossRef]
  5. Chen, L.C.; Lee, C.M.; Chen, M.Y. Exploration of social media for sentiment analysis using deep learning. Soft Comput. 2019, 24, 8187–8197. [Google Scholar] [CrossRef]
  6. Tsuji, Y.; Bennett, G.; Zhang, J. Consumer satisfaction with an action sports event. Sport Market. Qtly. 2007, 16, 199–208. Available online: https://www.proquest.com/scholarly-journals/consumer-satisfaction-with-action-sports-event/docview/228090708/se-2 (accessed on 25 March 2023).
  7. Buzeta, C.; De Pelsmacker, P.; Dens, N. Motivations to use different social media. Types and their impact on Consumers’ Online Brand-Related Activities (COBRAs). J. Interact. Market. 2020, 52, 79–98. [Google Scholar] [CrossRef]
  8. Vale, L.; Fernandes, T. Social media and sports: Driving fan engagement with football clubs on Facebook. J. Strat. Market. 2018, 26, 37–55. [Google Scholar] [CrossRef] [Green Version]
  9. Saridakis, C.; Baltas, G.; Oghazi, P.; Hultman, M. Motivation recipes for brand-related social media use: A Boolean—fsQCA approach. Psychol. Market. 2016, 33, 1062–1070. [Google Scholar] [CrossRef]
  10. Piehler, R.; Schade, M.; Kleine-Kalmer, B.; Burmann, C. Consumers’ online brand-related activities (COBRAs) on SNS brand pages. Eur. J. Market. 2019, 53, 1833–1853. [Google Scholar] [CrossRef]
  11. Hsu, J.J.; Chang, C.H.; Ma, S.C.; Rotherham, I.D. Translation and cross-national validation of team identification scale. J. Taiwan Soc. Sport Manag. 2020, 20, 133–159. [Google Scholar] [CrossRef]
  12. Liu, I.L.; Cheung, C.M.; Lee, M.K. Understanding Twitter usage: What drive people continue to tweet. In Proceedings of the Pacific Asia Conference on Information Systems 2010, Taipei, Taiwan, 9–12 July 2010. [Google Scholar]
  13. Muntinga, D.G.; Moorman, M.; Smit, E.G. Introducing COBRAs: Exploring motivations for brand-related social media use. Int. J. Advert. 2011, 30, 13–46. [Google Scholar] [CrossRef]
  14. Schivinski, B.; Christodoulides, G.; Dabrowski, D. Measuring consumers’ engagement with brand-related social-media content. J. Advert. Res. 2016, 56, 64–80. [Google Scholar] [CrossRef] [Green Version]
  15. Mainieri, B.O.; Braga, P.H.C.; da Silva, L.A.; Omar, N. Text Mining of Audience Opinion in eSports Events. In Proceedings of the Paper Presentation of the SBC–Proceedings of SBGames 2017, Curitiba, PR, Brazil, 2–4 November 2017. [Google Scholar]
  16. Aarts, C.; Jiang, F.; Chen, L. A Practical Application for Sentiment Analysis on Social Media Textual Data. In Proceedings of the 24th Symposium on International Database Engineering & Applications, Seoul, Republic of Korea, 12–14 August 2020. [Google Scholar]
  17. He, W.; Zha, S.; Li, L. Social media competitive analysis and text mining: A case study in the pizza industry. Int. J. Inform. Manag. 2013, 33, 464–472. [Google Scholar] [CrossRef]
  18. Jahn, B.; Kunz, W. How to transform consumers into fans of your brand. J. Serv. Manag. 2012, 23, 344–361. [Google Scholar] [CrossRef]
  19. Penfold, T. National identity and sporting mega-events in Brazil. Sport Soc. 2019, 22, 384–398. [Google Scholar] [CrossRef]
  20. Halldorsson, V. National sport success and the emergent social atmosphere: The case of Iceland. Int. Rev. Sociol. Sport 2020, 56, 471–492. [Google Scholar] [CrossRef]
  21. Chen, K.J.; Phua, J. Self-categorization process in sport: An examination of the “Linsanity” phenomenon in Taiwan. Sport Manag. Rev. 2016, 19, 431–440. [Google Scholar] [CrossRef]
  22. Chang, T. After international tournaments: Spectator attendance at domestic professional baseball games in Taiwan. J. Glob. Sport Manag. 2019, 4, 164–184. [Google Scholar] [CrossRef]
  23. Tan, T.C.; Lee, J.W. Covid-19 and sport in the Asia Pacific region. Sport Soc. 2020, 23, 1883–1888. [Google Scholar] [CrossRef]
  24. Chiang, Y.; Chen, T.H. Adopting the diasporic son: Jeremy Lin and Taiwan sport nationalism. Int. Rev. Sociol. Sport 2015, 50, 705–721. [Google Scholar] [CrossRef]
  25. Byon, K.K.; Zhang, J.J.; Baker, T.A. Impact of core and peripheral service quality on consumption behavior of professional team sport spectators as mediated by perceived value. Eur. Sport Manag. Qtly. 2013, 13, 232–263. [Google Scholar] [CrossRef]
  26. Biscaia, R.; Yoshida, M.; Kim, Y. Service quality and its effects on consumer outcomes: A meta-analytic review in spectator sport. Eur. Sport Manag. Qtly. 2023, 23, 897–921. [Google Scholar] [CrossRef]
  27. Chiu, W.; Leng, H.K. Let’s go cycling: An analysis of tourists’ experience on online user-generated content. Int. J. Tour. Cities 2017, 3, 30–42. [Google Scholar] [CrossRef]
  28. Shreffler, M.; Hancock, M.; Schmidt, S. Organizational representation through Twitter: An examination of the WNBA. Global Sport Bus. 2015, 3, 42–52. Available online: http://www.gsbassn.com/Journal/Vol3-3/GSBJ-Vol3-Iss3-Shreffler-pp42-52.pdf (accessed on 13 April 2023).
  29. Lee, C.S.; Ma, L. News sharing in social media: The effect of gratifications and prior experience. Comp. Hum. Behav. 2012, 28, 331–339. [Google Scholar] [CrossRef]
  30. Gopinath, S.; Chintagunta, P.K.; Venkataraman, S. Blogs, advertising, and local-market movie box office performance. Manag. Sci. 2013, 59, 2635–2654. [Google Scholar] [CrossRef]
  31. Chen, Y.T.; Luan, C.J. Natural language processing and its application for word of mouth studies. Chin. J. Commun. Res. 2012, 22, 259–289. [Google Scholar] [CrossRef]
  32. Jia, S.S. Leisure motivation and satisfaction: A text mining of yoga centres, yoga consumers, and their interactions. Sustainability 2018, 10, 4458. [Google Scholar] [CrossRef] [Green Version]
  33. Chiu, W.; Bae, J.; Won, D. The experience of watching baseball games in Korea: An analysis of user-generated content on social media using Leximancer. J. Sport Tour. 2017, 21, 33–47. [Google Scholar] [CrossRef] [Green Version]
  34. Park, S.U.; Ahn, H.; Kim, D.K.; So, W.Y. Big data analysis of sports and physical activities among Korean adolescents. Int. J. Environ. Res. Public Health 2020, 17, 5577. [Google Scholar] [CrossRef]
  35. Hambrick, M.E.; Simmons, J.M.; Greenhalgh, G.P.; Greenwell, T.C. Understanding professional athletes’ use of Twitter: A content analysis of athlete tweets. Int. J. Sport Commun. 2010, 3, 454–471. [Google Scholar] [CrossRef]
  36. Mastromartino, B.; Ross, W.J.; Wear, H.; Naraine, M.L. Thinking outside the ‘box’: A discussion of sports fans, teams, and the environment in the context of COVID-19. Sport Soc. 2020, 23, 1707–1723. [Google Scholar] [CrossRef]
  37. Yazici, M.A.; Mudigonda, S.; Kamga, C. Incident detection through twitter: Organization versus personal accounts. Transp. Res. Rec. 2017, 2643, 121–128. [Google Scholar] [CrossRef]
  38. Hajli, M.; Hajli, M. Organisational development in sport: Co-creation of value through social capital. Ind. Commerc. Train. 2013, 45, 283–288. [Google Scholar] [CrossRef]
  39. Chen, C.W.; Wu, S.R.; Yu, P.H.; Yung, L. Understanding the influence of Facebook fans page to continuance intention: The use and gratifications theory and social influence perspectives. Chaoyang Bus. Manag. Rev. 2015, 14, 19–38. Available online: https://tpl.ncl.edu.tw/NclService/JournalContentDetail?SysId=A15039614 (accessed on 15 April 2023).
  40. Greco, F.; Polli, A. Emotional Text Mining: Customer profiling in brand management. Int. J. Inf. Manag. 2020, 51, 101924. [Google Scholar] [CrossRef]
  41. Ozbay, F.A.; Alatas, B. Fake news detection within online social media using supervised artificial intelligence algorithms. Phys. A Stat. Mech. Appl. 2020, 540, 123174. [Google Scholar] [CrossRef]
Figure 1. Conceptual framework.
Figure 1. Conceptual framework.
Behavsci 13 00551 g001
Figure 2. Flowcharts for data collection, text mining, and machine learning.
Figure 2. Flowcharts for data collection, text mining, and machine learning.
Behavsci 13 00551 g002
Table 1. Descriptive statistics of the COBRA model.
Table 1. Descriptive statistics of the COBRA model.
Comment EvaluationContributingCreating
BooLikeNeutralAll-
Social identityNational1165 (12%)5511 (59%)2711 (29%)9387 (76%)141 (87%)
Player88 (5%)1406 (73%)426 (22%)1920 (15%)11 (7%)
Team60 (5%)726 (66%)322 (29%)1108 (9%)10 (6%)
Total---12,415 (100%)162 (100%)
Service qualityCore922 (11%)4751 (54%)3101 (35%)8774 (45%)171 (42%)
Peripheral1417 (13%)5424 (50%)4029 (37%)10,870 (55%)239 (58%)
Total---19,644 (100%)410 (100%)
Periodpreseason581 (9%)3412 (50%)2804 (41%)6797 (32%)163 (39%)
close door1550 (14%)6225 (55%)3559 (31%)11,334 (54%)177 (42%)
Open329 (14%)1193 (55%)893 (31%)2415 (11%)66 (16%)
nonseason54 (9%)373 (60%)194 (31%)621 (3%)16 (4%)
Total---21,167 (100%)422 (100%)
Each concept in each row totals 100%.
Table 2. Descriptive statistics of the monthly numbers of articles (stratified by core concepts) and confirmed COVID-19 cases in 2020.
Table 2. Descriptive statistics of the monthly numbers of articles (stratified by core concepts) and confirmed COVID-19 cases in 2020.
ConceptSport Service QualitySocial IdentityNumber of Confirmed Cases
MonthCorePeripheralNationalPlayerTeam
Jan2 (1%)100019 (2%)
Feb5 (3%)9 (5%)00026 (3%)
Mar33 (19%)62 (25%)1 (1%)3 (11%)0330 (41%)
Apr106 (61%)89 (36%)120 (84%)22 (78%)19 (85%)61 (8%)
May6 (4%)56 (23%)14 (10%)2 (7%)09 (1%)
Jun1 (1%)4 (2%)0006 (1%)
Jul3 (2%)2 (1%)001 (5%)29 (4%)
Aug2 (1%)10 (4%)1 (1%)01 (5%)16 (2%)
Sep2 (1%)11 (1%)0025 (3%)
Oct2 (1%)6 (2%)1 (1%)1 (4%)052 (7%)
Nov6 (4%)5 (2%)3 (2%)00116 (15%)
Dec3 (2%)0001 (5%)104 (13%)
Total1712451412822793
Table 3. Regression statistics related to post classification.
Table 3. Regression statistics related to post classification.
AttributeU&G ConceptTypeβpToleranceVIF
BooInformationNews−0.0090.850.771.30
Intelligence−0.0080.820.971.03
Home run−0.0120.800.961.04
Sharing0.135 **0.010.851.17
InteractionQuestionnaires0.0120.940.921.09
Discussions0.0630.300.821.23
EntertainmentMoney−0.0220.720.991.01
LikeInformationNews−0.0560.240.771.30
Intelligence0.0180.780.971.03
Home run0.290 ***0.000.961.04
Sharing0.142 ***0.000.851.17
InteractionQuestionnaires−0.0380.220.921.09
Discussions−0.0520.160.821.23
EntertainmentMoney0.389 ***0.000.991.01
NeutralInformationNews0.0130.760.300.76
Intelligence0.0410.400.850.40
Home run0.0500.350.940.35
Sharing0.0770.121.550.12
InteractionQuestionnaires−0.0480.32−1.000.32
Discussions0.0600.291.070.29
EntertainmentMoney0.0300.560.580.56
** p < 0.01; *** p < 0.001. Reference: chatting. VIF, variance inflation factor.
Table 4. Regression statistics related to social identity.
Table 4. Regression statistics related to social identity.
AttributeSocial IdentityβpToleranceVIF
BooNational identity0.0340.08 0.99 1.01
Player identity−0.0240.78 0.99 1.01
LikeNational identity0.093 ***0.00 0.99 1.01
Player identity−0.0430.62 0.99 1.01
NeutralNational identity−0.0190.37 0.99 1.01
Player identity−0.0650.49 0.99 1.01
*** p < 0.001. Reference: team identity.
Table 5. Regression statistics related to sport service quality.
Table 5. Regression statistics related to sport service quality.
AttributeService qualityβpToleranceVIF
BooCore−0.0510.901.001.00
LikeCore−0.001 *0.021.001.00
NeutralCore0.0100.291.001.00
* p < 0.05. Reference: peripheral sport service quality.
Table 6. Regression statistics related to post period.
Table 6. Regression statistics related to post period.
AttributePeriodΒpToleranceVIF
BooPreseason−0.134 **0.01 0.85 1.18
Regular season (open)−0.0730.16 0.86 1.16
Nonseason−0.0460.26 0.95 1.05
LikePreseason−0.061 ***0.00 0.85 1.18
Regular season (open)−0.063 **0.01 0.86 1.16
Nonseason−0.0080.30 0.95 1.05
NeutralPreseason−0.077 *0.05 0.85 1.18
Regular season (open)−0.1080.19 0.86 1.16
Nonseason−0.0630.52 0.95 1.05
* p < 0.05; ** p < 0.01; *** p < 0.001. Reference: regular season closed-door period.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ma, S.-C.; Su, C.-Y.; Chen, S.-F.; Sato, S.; Ma, S.-M. Analysis of COVID-19-Related User Content on the Baseball Bulletin Board in 2020 through Text Mining. Behav. Sci. 2023, 13, 551. https://doi.org/10.3390/bs13070551

AMA Style

Ma S-C, Su C-Y, Chen S-F, Sato S, Ma S-M. Analysis of COVID-19-Related User Content on the Baseball Bulletin Board in 2020 through Text Mining. Behavioral Sciences. 2023; 13(7):551. https://doi.org/10.3390/bs13070551

Chicago/Turabian Style

Ma, Shang-Chun, Ching-Ya Su, Sheng-Fong Chen, Shintaro Sato, and Shang-Ming Ma. 2023. "Analysis of COVID-19-Related User Content on the Baseball Bulletin Board in 2020 through Text Mining" Behavioral Sciences 13, no. 7: 551. https://doi.org/10.3390/bs13070551

APA Style

Ma, S. -C., Su, C. -Y., Chen, S. -F., Sato, S., & Ma, S. -M. (2023). Analysis of COVID-19-Related User Content on the Baseball Bulletin Board in 2020 through Text Mining. Behavioral Sciences, 13(7), 551. https://doi.org/10.3390/bs13070551

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop