Next Article in Journal
Electro-Microbiology: A Green Approach for Energy and Environment Sustainability
Previous Article in Journal
White-Tailed Eagle Algorithm for Global Optimization and Low-Cost and Low-CO2 Emission Design of Retaining Structures
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The International City Image of Beijing: A Quantitative Analysis Based on Twitter Texts from 2017–2021

1
International Business School, Beijing Foreign Studies University, Beijing 100089, China
2
Institute of Art, Communication University of China, Beijing 100024, China
*
Author to whom correspondence should be addressed.
Sustainability 2022, 14(17), 10675; https://doi.org/10.3390/su141710675
Submission received: 21 July 2022 / Revised: 15 August 2022 / Accepted: 25 August 2022 / Published: 27 August 2022
(This article belongs to the Section Tourism, Culture, and Heritage)

Abstract

:
With the advent of the Internet era, users from numerous countries can express their opinions on social media platforms represented by Twitter. Unearthing people’s image perceptions of cities from tweets helps relevant organizations understand the image that cities present on mainstream social media and take targeted measures to shape a good international image, which can enhance international tourists’ willingness to travel and strengthen city’s tourism competitiveness. This paper collects nearly 130,000 tweets related to “Beijing” (“Peking”) from 2017–2021 through web-crawler technology, and uses Term Frequency-Inverse Document Frequency (TF-IDF) keywords statistics, Latent Dirichlet Allocation (LDA) topic mining, and Valence Aware Dictionary and sEntiment Reasoner (VADER) sentiment analysis to further summarize the characteristics of Beijing’s international image and propose strategies to communicate its international image. This research aims to tap into the international image of Beijing presented on Twitter, and provide data support for the relevant Chinese and Beijing authorities to develop communication strategies, as well as providing a reference for other cities aiming to manage their international image.

1. Introduction

With the increase in people’s living standards and improvements in transportation conditions, the growing tourism consumption demand have made tourism an important point of growth and a pillar industry to drive the city economy. Therefore, research on the competitiveness of city tourism has attracted extensive attention from scholars [1,2,3,4,5]. The international city image is a comprehensive reflection of its history and culture, geography and humanities, economic level and governance level. People’s perception of the city image is a key factor in the choice of tourist city. Only through scientific positioning and dynamic management of its own image can a city stand out in the increasingly fierce competition in the tourist market, and thus enhance its competitiveness. By studying the perceived international image of a city, tourism enterprises and government departments can understand people’s concerns and the positive and negative factors that affect people’s emotions. This will help in the development of targeted communication strategies and stimulate tourism’s vitality.
As the capital of China, Beijing is the political, cultural, international communication, and technology innovation center, with deep historical and cultural deposits. The success of the 2022 Beijing Winter Olympic Games made Beijing the first city to host both the Summer and Winter Olympic Games, which attracted the attention of the world and brought new opportunities to shape the Beijing’s international image. By strengthening the research and management of its own international image, Beijing can give full play to its advantages, use major platforms with worldwide influence to shape a good international image, and enhance its international tourism competitiveness.
Twitter is a social platform with wide influence, where users in many countries express their views, comment and have discussions. Unlike the traditional communication mode, the audience is no longer a passive recipient, but a participant and producer of content, and the content mostly derives from users’ views and opinions on life, which has a high persuasive power. These features make Twitter an important way to shape and spread an international city image. In addition, compared to other, more travel-oriented social media platforms, Twitter not only shows tourism images of cities, but also non-tourism images of economic trade, sports events, film and entertainment, social issues, etc. These non-tourism images also influence tourists’ choices. By mining the content of tweets, we can better understand the current international image of a city and provide a basis for government departments to make decisions and improve the international image in a targeted manner.
This paper selects the international city image of Beijing as the research object, and uses web-crawler technology to obtain tweets containing “Beijing” (“Peking”) from 2017 to 2021. The tweets are pre-processed by language recognition, language transcription, word separation, etc. The processed tweets are subjected to Term Frequency-Inverse Document Frequency (TF-IDF) keyword statistics and Latent Dirichlet Allocation (LDA) topic model construction to interpret the keywords and the themes shown in Beijing each year. This paper uses Valence Aware Dictionary and sEntiment Reasoner (VADER) sentiment analysis to further quantify the sentiment values of Beijing city at each timepoint and analyzes the trends in sentiment values from January to December 2017–2021. Finally, we obtain the international image of Beijing, and propose a communication strategy regarding Beijing’s international image in response to the analysis results.
The main contributions of this paper are as follows: (1) From the perspective of people’s perception of a city, this paper explores the ways in which cities can improve their tourism competitiveness by shaping a good international city image by mining the international city image in mainstream social media, providing a new perspective for such research. (2) As the factors affecting tourists’ choices are not only tourism images, this paper examines the international city image, rather than just the tourism image. (3) In terms of research methodology, this paper uses web-crawler technology to capture tweets and quantifies the textual information through natural language-processing techniques. (4) This paper identifies and transcribes the German text in the tweets to further expand the sample size. (5) There is little literature on the international city image of Beijing, and this paper further expands the research in this area.
This paper is structured as follows. The second section sorts the relevant literature in the field. The third section introduces the basic models and methods involved in this study. The fourth section collects nearly 130,000 tweets related to “Beijing” (“Peking ”) by web-crawlers, and uses natural language-processing techniques to quantify the collected texts. The fifth section further discusses the results of the empirical study, analyzes and summarizes the characteristics of the international city image of Beijing, and then proposes communication strategies for the international city image. The sixth section summarizes the main findings, points out the significance of the and the limitations of the current study, and then provides an outlook on future research directions.

2. Literature Review

2.1. Definition of the International City Image

The term “city image” was first proposed by American urbanist Lynch [6] in 1964, who believed that city image is an impression shared by a certain number of city residents and includes five elements: roads, boundaries, areas, nodes and landmarks. The city image proposed by Lynch was limited to the impressions that city residents had of their places. It was restricted to the perception of the physical form of the city. Later scholars placed more emphasis on the impression of non-residential conditions [7,8,9,10,11], and incorporated city culture and spirit into the concept of “city image”, i.e., city image is the public’s comprehensive evaluation of the internal and external strengths and future development of a city, reflecting its characteristics and style [12,13]. With the development of modern networks, some scholars have proposed that the construction of city image is a process of external communication, i.e., the city image is formed through the combination of mass media, personal experience and environmental factors [14]. Further, it was suggested that the media is the main tool of city image communication, and the media encodes and decodes comprehensive information regarding the city and acts on public perceptions to eventually form the city image [15,16,17,18,19]. The media has an irreplaceable role in the communication of city image due to its timeliness and comprehensiveness [20]. City image is an objective social existence and a subjective social evaluation, and the construction of a city image is also the process of the city being perceived and re-evaluated by the public [21,22]. According to the different subjects of city image perception, the city image can be further divided into international city image and domestic city image. The international city image is the content presented by the city image in the process of international communication, which is the city image in the eyes of the international public. The international city image relies on international communication, and good international communication ideas are conducive to the establishment of a good international city image [23,24,25,26]. Therefore, understanding the current city image of Beijing on international mainstream social media is an essential part of enhancing Beijing’s international image.

2.2. International City Image and City Tourism Competitiveness

In recent years, with the booming tourism industry, the relationship between the international city image and the competitiveness of urban tourism has received attention from several scholars. It has been argued that the competitiveness of a tourist destination is expressed as the perception of the tourists [27,28]. Vinyals-Mirabent [29] confirmed through a study that attraction factors such as city architecture and culture are essential features of city tourism competitiveness and help to distinguish a city’s image from that of its competitors. Kim and Lee [30] found that the dynamic, static, and concrete dimensions of a city’s image positively influence tourists’ willingness to return. Some subsequent scholars have also confirmed the view that city image is an important factor influencing the future behavioral intentions of tourists, which directly or indirectly affects their decision-making behavior [31,32,33]. In addition, some studies have investigated the relationship between city tourism competitiveness and national culture and found that components of city image such as national culture have a significant impact on city tourism competitiveness [34,35]. Destination image was also found to be an antecedent of destination personality, and destination personality directly affects individuals’ attitudes toward visiting [36,37]. Existing research on the international city image also suggests that residents’ and tourists’ perceptions of the city can influence the level of government support for tourism, with more positive perceptions of the city leading to more support for tourism in city-building [38,39,40]. The international city image can have an impact on the competitiveness of tourism, and the international city image becomes a strategic means of development for the city to improve its tourism competitiveness. Therefore, this paper examines the city’s characteristics from the perspective of Beijing’s international image, and proposes image-shaping solutions to enhance Beijing’s urban tourism competitiveness.

3. Model Methodology

3.1. Word Separation and Word Frequency Statistics

The separation of English words was mainly performed by “word_tokenize()” in the Natural Language Toolkit (NLTK) module, a Python-based platform for a natural language processing toolset. To improve the word separation effect, this paper first preprocesses the original text, including abbreviation reduction, Uniform Resource Locator (URL) deletion, emoji deletion, “@” “#” symbol deletion, word form reduction, synonym replacement, phrase recognition and deactivation removal, to obtain the word separation results. The word frequency statistics can then be completed using “collections.Counter()”.

3.2. Term Frequency-Inverse Document Frequency

The TF-IDF [41] method consists of two components: Term Frequency (TF) and Inverse Document Frequency (IDF). TF measures the frequency with which words occur in a document and can be calculated by Equation (1).
t f i , j = n i , j k n k , j
In the context of this paper, we further elaborated the meaning of the symbols represented in the formula, where i is different words, j refers to different tweets, n i , j is the number of times word i appears in tweet j, k n k , j represents the total number of times all words appear in tweet j.
IDF measures the importance of words, and its mechanism reduces the weight of common words and increases the weight of rare words. The specific calculation formula is shown in Equation (2).
i d f i = log | D | | j : t i d j | + 1
where | D | is the total number of tweets, | j : t i d j | represents the number of tweets containing the word t i . If none of the tweets contain the word t i , this leads to | j : t i d j | = 0 . To avoid the denominator of 0, we use | j : t i d j | + 1 .
Ultimately, the value of TF-IDF is obtained by multiplying the resultant value of Equation (1) with the resultant value of (2), as in Equation (3). The more important the word is, the lager its TF-IDF value will be.
t f i d f = t f i , j i d f i

3.3. Latent Dirichlet Allocation Topic Model

Word frequency statistics can simply and intuitively extract hot words, but do not consider the semantic association behind the words. Some words may be generated in the same thematic context, such as “It’s very sunny outside and there is no wind” and “I ate a lot of ice cream”, which both reflect the topic of “hot weather”. However, if we only calculate the word frequency, each word in these two sentences is not necessarily a high-frequency word, making it difficult to uncover the topic of the text.
The LDA topic model is a text representation model that takes semantic association into account. The LDA topic model considers that an article has multiple topics and the words in the text have a certain probability of belonging to these topics.
The key to applying the LDA topic model is to determine the optimal number of topics. The effect of topic extraction in the LDA topic model is directly related to the number of potential topics, and the two most common evaluation methods to determine the number of topics are based on coherence [42,43,44] or perplexity [45]. Coherence refers to the quantitative calculation of whether the semantic association of words under a topic generated by an LDA is closer, and the formula for calculating topic coherence is shown in Equation (4).
c o h e r e n c e ( T ) = ( v i , v j ) T p ( v i , v j )
where T is a topic, v i and v j are the words within the topic, and p ( v i , v j ) is the scoring function to measure the semantic closeness of the words within the topic.
In practice, the function p ( v i , v j ) is generally used in the UMass algorithm [43], with the formula shown in Equation (5).
p ( v i , v j ) = ln p ( v i , v j ) + ε p ( v i ) p ( v j )
where p ( v i , v j ) denotes the co-occurrence probability of words v i and v j , and p ( v i ) and p ( v j ) denote the occurrence probability of words v i and words v j , respectively, and ε is a smaller constant.
Perplexity can be understood as how uncertain the trained model is about the topic to which article d belongs. This degree of uncertainty is perplexity, which is calculated as in Equation (6).
p e r p l e x i t y ( D t e s t   ) = exp { d = 1 M log p ( w d ) d = 1 M N d }
In the formula, M represents the number of texts in the test set, N d denotes the number of words in the dth text, and p ( w d ) represents the probability of occurrence of each word in the dth text.
In summary, this paper chose the number of topics under larger coherence and smaller perplexity to complete the construction of the LDA topic model.

3.4. Valence Aware Dictionary and sEntiment Reasoner Sentiment Analysis

Sentiment analysis is a subfield of natural language processing that aims to extract attitudinal dispositions from text. The three most common methods of sentiment analysis are plain Bayesian, Long Short-Term Memory (LSTM), and VADER methods [46,47,48,49,50,51]. This paper uses the powerful VADER method, which uses a comprehensive, high-quality sentiment vocabulary and complex linguistic rules to generate sentiment scores [52]. In addition to the determination of sentiment words, sentiment intensity is also measured, mainly based on punctuation, degree adverbs, negation, and conjunctions. For example, in the sentence “I really can’t recommend it.”, “recommend” is the positive sentiment word, while the previous word is the negative word, “can’t”, so the emotional value of “can’t recommend” is −1.11 (−0.74 × 1.5). The previous word is the degree word, “really”. According to the principle that the degree word that is farther away from the emotion word is assigned a lower score, the assignment of really is 0.293 × 0.95 = 0.27835. The sentiment score of the whole sentence is −0.27835 − 0.74 × 1.5 = −1.38835. A detailed introduction and code can be found at https://github.com/cjhutto/vaderSentiment (accessed on 10 July 2022).

4. Empirical Research

4.1. Data Collection

This paper used Twitter (https://twitter.com/home, accessed on 5 June 2022) as the original data source for the study, and crawled tweets with “Beijing” or “Peking” between 1 January 2017 and 31 December 2021 through Python language programming. The total number of tweets with “Beijing” or “Peking” between 1 January 2017, and 31 December 2021, was 129,260, and the number of tweets per year is detailed in Table 1.

4.2. Data Processing

Firstly, this paper converts the German language in the tweets into English. Since the same two words “Beijing” and “Peking” are also used in German, the authors further used the Language Identification (LangID) module in Python to identify the German language text of each comment. Since the comment content has a small number of words and machine translation can reach a high level of accuracy, this paper used Google Translate (https://translate.google.cn, accessed on 6–7 June 2022) combined with manual proofreading and revision to complete the translation between German and English.
Subsequently, this paper used the NLTK module in Python for word separation, and the collections module in Python for word frequency (number of occurrences), and eliminated words with no real meaning based on the word frequency statistics, such as, “also”, “said”, “via”, etc. In the case of words that are supposed to be phrases but are split into individual words, phrases are written in the dictionary, e.g., “North Korea”, “Hong Kong”, “New York” etc., were considered. Finally, “Beijing” and “Peking” were excluded because all tweets contained one or the other, and this was of no practical significance for the subsequent study. After pre-processing, this paper performed a second word separation, followed by TF-IDF keywords statistics, and used the LDA topic model for topic-mining and sentiment analysis. The Python code involved in this paper was written and run using PyCharm software.

4.3. Keywords Statistics

In this paper, the TF-IDF value was calculated by the TF-IDF method introduced in Section 3.2, and the keywords were ranked according to the TF-IDF value for each year, while the number of keyword occurrences was counted according to the Word Frequency Statistics method introduced in Section 3.1.
Table 2 shows the top 50 keywords of 2017. “China” has the highest TF-IDF value and the most occurrences; except for the TF-IDF value of 2021, which is in second place, the TF-IDF value and occurrences were all in first place. This shows that Beijing, as the capital of China, is sometimes used as a pronoun for “China”. Next, “duck” ranked second in terms of TF-IDF value and number of occurrences. Peking duck is a world-renowned Beijing dish, which has apparently become an iconic food in Beijing and even China in the eyes of friends around the world. In addition, we found many adjectives in the keywords, such as “new”, “great”, “good”, “open”, “friend”. It is easy to see that the current global image of Beijing is dominated by positive adjectives, which are more related to the continuous strengthening of its international metropolis and expansions of its openness. The words “art”, “glass” and “opera” were among the top 50 keywords for 2017, indicating that Beijing’s traditional culture and arts are receiving attention from friends around the world. At the same time, many Twitter users expressed their desire to go to Beijing and eat Peking duck, so the keyword “want” ranked seventh in 2017.
According to the top 50 keywords of 2018 (Table 3), it is easy to see that the ranking of “trade” improved, moving from 44th place in 2017 to 7th in 2018, and “tariff”, “Washington”, “government”, “dispute” appeared in the top 50 keywords for 2018. At this point, Beijing was more of a referent for China, and the U.S. government’s imposition of tariffs on Chinese products sparked global concern. “duck”, “art”, “opera”, and “glass” remained in the top 50 keywords. The adjectives in the top 50 keywords are also all positive words. The top 50 keywords of 2019 (Table 4) feature 42 keywords from the top 50 keywords of 2018, and generally maintain the same focus as people had in 2018.
The coronavirus epidemic swept the world in 2020, and the 50 keywords of 2020 (Table 5) show that people paid more attention to the epidemic, such as “coronavirus”, “virus”, “corona”, “pandemic”, “COVID”. The ranking of “duck” dropped, “art”, “opera”, and “glass” no longer appeared in the top 50 keywords, and the number of positive adjectives decreased.
In 2021, as the epidemic improved, epidemic-related words no longer appeared in the top 50 keywords (Table 6). “duck” moved to first place, “opera” moved to fifth place, and “brooch” appeared in the top 50 keywords for the first time. Beijing will be the host city of the 24th Winter Olympic Games in 2022, and “Olympics” and “winter” were among the top 10 keywords. At the same time, more positive words, such as “love” and “friend”, appeared in the top 50 keywords.
This paper counted the occurrence of words that appeared in the top 50 keywords in the five-year period from 2017 to 2021, forming Figure 1, with 13 words in total. Among them, “China” and “Chinese” achieved a significant decrease in 2021, despite their increasing number in 2017–2020, which could indicate that Beijing has improved its national image and is more representative of its own city image. “duck” has become a famous dish in Beijing and is known worldwide. This was especially noticeable in 2021, when the epidemic improved, and the number of occurrences of the word “duck” significantly increased. “want” expresses some of the thoughts and aspirations of the global friends regarding Beijing, and “first” was more frequently used in tweets regarding Beijing. The three high-frequency adjectives, “new”, “good” and “great”, which were all found over the five-year period, show the positive image of Beijing. In 2021, the occurrence of the words “good” and “great” increased, but use of the word “new” decreased compared to the previous year. As Beijing has many famous universities, the word “university” is widely mentioned in tweets. By checking the original tweets, the authors found that many tweeters are studying at universities in Beijing, or hope to come to study at universities in Beijing, and some tweeters have left footprints in university libraries.

4.4. Topic Mining

Appendix A shows the coherence and perplexity of the LDA topic model for a different number of topics each year from 2017 to 2021, and this paper determined the number of topics for each year based on a larger coherence and smaller perplexity, and finally determined the number of topics for 2017–2021 as 3, 3, 4, 5, and 3, respectively, and Table 7, Table 8, Table 9, Table 10 and Table 11 shows the results of the LDA topic model for 2017–2021.
In the results presented in 2017 (Table 7), Topic1 is Beijing school culture, which includes the words “university”, “school” and some words describing school or school life. Topic2 is Beijing traditional culture, which includes Beijing traditional food “duck”, as well as “art”, “glass”, Topic3 is closely related cities, and the cities closely related to Beijing are “New York”, “Barcelona “, “Moscow”, “London”, “Shanghai”, and many of the tweeters also traveled between these cities and Beijing.
In the results presented in 2018 (Table 8), Topic1 shows the characteristics of the combination of Topic1 and Topic2 themes in 2017, mainly consisting of “duck”, “art”, “opera” and traditional culture and art, as well as “university” and “student” of campus culture, which can be said to be a wide range of Beijing cultural topics. In 2018, Topic2 is the trade friction, including “trade”, “Trump”, “tariff”, “Washington”. Topic3 presents the tourism, including some cities with close ties to Beijing and the words “museum”, “airport “, “flight”, “car” and other tourist attractions and travel tools.
In the results presented in 2019 (Table 9), Topic1 contains words such as “event”, “game”, “winter” and other words closely related to winter sports. After reviewing the content of the original tweets, these words were found more often found in tweets expressing expectations for the Beijing Winter Olympics. Topic2 is the campus culture, including “university”, “school” and “student” and “student”. Topic3 presents the same topic as Topic2 in 2018: the trade friction. Topic4 is mainly composed of the words “duck “, “glass”, “food”, “film”, and other words that form the cultural topic.
In the results presented in 2020 (Table 10), in addition to some words representing trade friction, words such as “Biden” and “election” also appear in Topic1, so it can be said that Topic1 focuses more on the Sino–US relations. Topic2 has words such as “duck” and “restaurant”, making the topic more focused on the food culture. Topic3 focuses on the epidemic, including the words “coronavirus”, “virus”, “corona”, “outbreak “, “Wuhan”, “pandemic” and other related words. Topic4 focuses on the violent impact of Hong Kong society, mainly involving “law “, “security”, “HK” and other words. Topic5 is Beijing’s traditional culture and sports, including “opera “, “palace”, “game”, “winter”, “Olympics”.
In the results presented in 2021 (Table 11), Topic1 focuses on Beijing’s food culture and traditional arts culture, with the main categories being “duck”, “opera”, “food “, “glass”, “art”, “restaurant”, “vintage”, “brooch”. Topic2 is the Winter Olympics, mainly including “Olympics”, “winter “, “game”, “Olympic” and other words. Topic3 is campus culture, including “university “, “school”, “student”.

4.5. Sentiment Analysis

The adjectives in the keyword statistics, such as “new”, “great”, “good”, “open”, “international”, and “friend”, help us to understand people’s emotional perception of Beijing. Most of them are adjectives that express positive emotional tendencies.
Through the VADER sentiment analysis method, this paper further quantified the sentiment values of Beijing city at various timepoints. Figure 2 depicts the trends in sentiment value from January to December 2017–2021, and found two timepoints showed significant decreases in August 2019 and April 2020. When the image of Beijing as the capital of China was somewhat affected by the violent social shock in Hong Kong in August 2019, and the global outbreak of the coronavirus outbreak in April 2020, some global friends changed their emotional perception of China and the city of Beijing within a short period of time. The emotional perception of a city is closely related to major events.
Compared to the mean value of 5-year sentiment, the majority of months in 2017, 2018, 2019 and 2021 showed sentiment values above the mean, with the exception of 2019 and 2020, where the sentiment trend line fluctuated somewhat, and 2017, 2018 and 2021 show relatively stable monthly changes. The 5-year sentiment mean line has mean values below 0.1000 in June, July and August, and above 0.1000 in all other months, and above the mean values in October and November. After further calculation, the annual sentiment values for 2017–2021 are all above 0, showing a positive sentiment tendency, with the annual sentiment values for 2017, 2018, 2019 and 2021 being above 0.1000.

5. Further Discussion

5.1. Analysis of the Characteristics of the International Image of Beijing City

Beijing presents a kind of ancient capital charm. The city has a long history and is a humanistic city where cultural preservation and development go hand in hand. “palace”, “museum”, and “wall” appear in keywords and the LDA topic model.
Secondly, tweets about Beijing reveal a certain cultural atmosphere. On the one hand, this is a traditional culture, with words such as “opera”, “art”, “glass”, “vintage”, “brooch” and so on becoming keywords. On the other hand, the campus culture, with words such as “university”, “school”, “student”, and the many famous universities in Beijing, make the city’s campus culture strong.
Third, “duck” was mentioned many times in tweets, and appeared in the top 10 keywords every year. Peking duck has become a famous Beijing specialty and a food card of Beijing.
Fourthly, Beijing, as the capital of China, has many opportunities to host major international events, which gives it a huge advantage in international communications. As the host city of the 2022 Winter Olympic Games, Beijing is highly anticipated worldwide.
Fifthly, through the adjectives included in keywords, we can describe the international image of Beijing city more clearly. Words such as “new”, “great”, “good”, “open”, “international” and “friend” show that Beijing is gradually becoming a modern and civilized international city, and that the friendliness and openness of Beijing attracts people from all over the world.
Sixthly, Beijing, as an international metropolis, has close ties with other cities, “New York”, “Barcelona”, “Moscow”, “London”, “Shanghai” appear in the high-frequency vocabulary many times, and tweeters repeatedly say that they travel between these cities and Beijing.
Finally, the paper also notes that the international city image of Beijing is vulnerable to national events, especially controversial events that can cause some people around the world to change their emotional disposition toward Beijing within a short period of time.

5.2. Strategy for Beijing International Image Communication

First, the city image is part of the national image, and the city image of Beijing is highly linked to its national and governmental image. Therefore, the communication strategy for Beijing should be an important part of a national policy, rather than a propaganda piece or a stand-alone project. The communication strategy regarding city image should focus on the creation of a clearer national consensus about Beijing’s positioning, making the nation understand that a country’s reputation is the property of its people, and encouraging and practicing a culture of creativity across government, culture, business, investment, education, industry, and other areas. Only by standing as a nation and encouraging creative culture, so that new ideas continue to emerge in every field, can we continue to correct and counteract existing stereotypes, reaching a situation where the image of the nation and the image of the city develop together and complement each other.
Among the adjectives in keywords, words such as “new” and “first”, which are always associated with innovation, are often associated with positive content, making people feel positively about Beijing. Therefore, it is important to pay close attention to new concepts and achievements in various areas when promoting tourism in Beijing. It is also important to note that major national events that take place in Beijing will bring more direct tourism opportunities. The word “Olympics” jumped to the sixth keyword in 2021, and the Winter Olympics provided the city with the new title of the first city in the world to host both the Summer and Winter Olympic Games, bringing the new concept of “Green Olympics” and a new market of “skiing fever”. Tourism organizations and practitioners should make full use of these new opportunities to innovate and promote tourism in Beijing and build a good international image of the city.
Secondly, the focus should be on promoting Beijing’s image with regard to its culture, entertainment and appearance. According to data from the Beijing Municipal Cultural Heritage Bureau [53], Beijing has 138 national key cultural relics in the protection list, including the Summer Palace, the Great Wall, and the Palace Museum. It also has 144 national intangible cultural heritage representative projects [54], including the Beijing Opera, Kunqu Opera and others. There are also many modern landmarks, such as National Grand Theatre, the Water Cube and others. However, according to keywords, statistics and the results of LDA topic model, these rich and colorful cultural resources are far from being fully explored and displayed, and there is less relevant discussion. Media in China can create sub-accounts with different topic contents according to audience interests, make full use of pictures and videos to show more intuitive contents, explore the filmable resources of Beijing culture, and promote city tourism through storytelling.
Moreover, keywords statistics show that international attention to Beijing mostly occurs at the abstract level (representatives of the state and government, economic and social issues, etc.), ignoring its unique feeling as a physical space. In addition to the virtual image presented on the Internet, the physical image should be paid equal attention. The government should continue to improve and create urban humanistic landmarks, enrich the cultural and recreational life of residents, and improve transportation, roads and the ecological environment. According to the concepts of absorption and immersion in tourism management, people need to obtain immersion experiences through visual and auditory sensory perceptions, which could help them to form a deep and complex city image [55]. In short, the influence of Beijing’s culture and urban landscape on its overall image should be continuously expanded, which will help to create a more positive and diverse image of Beijing and allow for people to discover its diverse charms.
Finally, openness to the outside world needs to be maintained and encouraged, and the tourism organizations in Beijing should cooperate with unofficial accounts. The shaping of an international city image is a cross-regional, cross-cultural and cross-linguistic external communication. Ordinary communication only needs to convert the original information into a message that is acceptable to the general public, while external communication needs to cross multiple barriers, such as language differences and cultural differences, which means that opinion leaders play an important role in shaping the international city image. Compared with general audiences, opinion leaders are more frequently exposed to media, and they influence audiences with lower levels of media exposure, knowledge and interest by providing information and conveying their views.
With the popularity of Internet applications, the role played by online opinion-leaders is receiving more and more attention. There are two types of information sources that can play the role of opinion leaders. One is the national media of foreign publics. The other is foreign publics with direct or indirect contact with Chinese cities. During the epidemic, the international tourism market was hit hard, which also made it difficult for the foreign public to gain direct knowledge of Chinese cities. There is a Chinese idiom that originates from a historical story: “Seeing is believing”. In the post-epidemic era, the government should establish a more scientific and efficient public health policy, promote international tourism recovery and development, organize study-abroad activities between universities, and establish international friendship cities, ultimately achieving image improvement and renewal.
In addition, the role of local citizens in spreading the city image should not be ignored. According to the Beijing Urban Master Plan (2016–2035) released by the Beijing government in 2017 [56], the blueprint for the future development of Beijing is to make Beijing an international, first-class, harmonious and livable capital. When residents feel that the city is livable, they will naturally be proud to promote their city to the outside world, ultimately creating an attractive urban atmosphere in Beijing. The official media and related institutions should actively cooperate with domestic and foreign unofficial media and cooperate with the agenda-setting function of traditional media, i.e., traditional media influences the focus of public attention and the perception of the social environment by giving prominence to various topics, and continuously carrying out data tracking and analysis.

5.3. Discussion

This paper searched papers with the topic “city image” through the Web of Science (https://www.webofscience.com/wos/woscc/basic-search, accessed on 12 August 2022). After browsing and sorting, we found that most studies focused on the city image in the country, but fewer studies studied the international image of cities. In addition, most studies on the international image of cities were conducted in a qualitative way, without sufficient data support.
Subsequently, this paper set the topic as “city image”, “Beijing” and “Twitter”, and found only six papers related to this topic. Among them, two papers used social media data to map human activities, and one paper proposed a new paradigm for urban residential sensitivity to heatwave risks based on social media Big Data. All three papers were in the field of remote sensing. In addition, one paper examined how the Burberry brand has become a major trendsetter in social media marketing. One paper examined how the Beijing Museum uses social media tools as a marketing strategy to increase its visibility among the public. Another paper proposed a fine-grained spatiotemporal dynamic pattern analysis approach and applied this to a Flickr dataset from Beijing: the paper shows that the method can also be applied to Twitter data.
In summary, international city image studies lack case studies; notably, there are few studies on the international image of Beijing. The analysis is mostly qualitative, and there are few studies on the international image of cities based on international social media platforms using text-analysis techniques. This paper further expands the research in related fields and enriches the research results. It also makes some contributions in terms of methodology, theory and practical application, including the following three aspects:
  • From the methodological perspective, this paper further mines the international image of Beijing based on Twitter comments, using keyword extraction through the TF-IDF method, topic analysis through the LDA model and sentiment analysis through the VADER method. The combination of the three methods allowed for a more comprehensive analysis of Twitter users’ perception of Beijing’s city image. A social media platform text-mining framework was formed for city-image research.
  • From the theoretical perspective, this paper extended the research related to the international image of cities and city tourism competitiveness, emphasizing that tourism competitiveness is enhanced through the city image, which is not limited to the city tourism image but includes a more comprehensive international image of cities, including economic trade, sports events, film and entertainment, and social issues.
  • From the perspective of practical application, Beijing is currently in the promotional period of international communication center construction, and better shaping the international image of the city plays a crucial role in the future development of Beijing. This study provides powerful data support and communication strategies to allow for relevant departments to make decisions, which, in turn, helps Beijing to establish a good international image.

6. Conclusions

This paper collected tweets about “Beijing” (“Peking”) on Twitter from 2017 to 2021 through web-crawler technology, and analyzed the international city image presented through TF-IDF keywords statistics, LDA topic mining, and VADER sentiment analysis. Based on the analysis results, this paper proposed a strategy to communicate the international city image, further stimulate tourism’s vitality and enhance the city’s competitiveness in terms of tourism.
According to the research analysis, the international city image of Beijing in Twitter presents these characteristics: (1) Beijing has some cultural connotations, but, as an ancient capital with a long history, there are still many cultural resources that are not fully displayed. (2) Beijing, as the host city of the 2022 Winter Olympic Games, is attracting the attention of people around the world. (3) Beijing, as an international metropolis, has close ties with other cities. Many keywords also show that Beijing is gradually becoming a modern and civilized international city, and its friendliness and openness attract people from all over the world. (4) As Beijing is the capital of China, the world people’s emotional tendency towards the city is easily influenced by the national events in China.
Through an analysis of the international city image of Beijing and the presented characteristics, the corresponding strategies were further proposed based on communication, including the following steps: (1) encouraging creative culture and making use of major international activities and events to build a good image; (2) deeply exploring and promoting the content of city image focusing on culture, entertainment and city style, so that people can discover the diverse charm of Beijing; (3) insisting on and encouraging openness to the outside world and actively developing cooperation with domestic and foreign unofficial media.
The main significance of this study is as follows. (1) It further enriches the research perspective of urban tourism competitiveness. The international city image of Beijing was analyzed through user comments on the mainstream social media platform Twitter. Based on the analysis, image enhancement strategies to attract more tourists and improve urban tourism competitiveness were proposed. (2) The international city image was measured based on the TF-IDF method, LDA method and VADER method, which enriched the image perception research theory and method, and provided references for relevant research. (3) Taking Beijing as a case study site, powerful data support and communication strategies were provided to support governmental decision making, as well as references for other cities manage their international city image.
However, we should note the limitations of the current study further research directions. (1) The research in this paper mainly uses text data, and the original data types could be further expanded in the future to include pictures, videos, audios, etc. Furthermore, multiple data types could be studied from multiple perspectives. (2) Due to the limitation of the languages learned by the authors, it was not possible to proofread the English results obtained by the machine translation of tweets in other languages. From the perspective of data rigor, the tweets collected in this paper were mainly in English and German. The research results mainly reflect the countries and regions that use these languages, but do not reflect the perceptions of other countries and regions that speak other languages. Therefore, there is a need to expand the multilingual tweets in the future to present a more comprehensive international perception of Beijing’s city image. (3) In addition, the authors believe that the prediction of keyword statistics, LDA topics, and sentimental analysis value based on machine learning methods can be further explored in the future, and the Empirical Mode Decomposition (EMD) could be applied to reduce noise in non-stationary and non-linear data. (4) Finally, the international image of Beijing as presented on a wider range of social media platforms should be a future research direction.

Author Contributions

Conceptualization, H.N. and Z.Z.; methodology, H.N. and Z.Z.; writing—original draft preparation, Z.Z., M.L. and Z.L.; writing—review and editing, Z.Z., M.L., Z.L. and H.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by Key Project of Beijing Social Science Foundation for Decision-making and Consulting (No. 21JCB044).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data in this study can be obtained by contacting the author: [email protected].

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Figure A1. (a) Coherence of the 2017 LDA Topic Model; (b) Perplexity of the 2017 LDA Topic Model; (c) Coherence of the 2018 LDA Topic Model; (d) Perplexity of the 2018 LDA Topic Model; (e) Coherence of the 2019 LDA Topic Model; (f) Perplexity of the 2019 LDA Topic Model; (g) Coherence of the 2020 LDA Topic Model; (h) Perplexity of the 2020 LDA Topic Model; (i) Coherence of the 2021 LDA Topic Model; (j) Perplexity of the 2021 LDA Topic Model.
Figure A1. (a) Coherence of the 2017 LDA Topic Model; (b) Perplexity of the 2017 LDA Topic Model; (c) Coherence of the 2018 LDA Topic Model; (d) Perplexity of the 2018 LDA Topic Model; (e) Coherence of the 2019 LDA Topic Model; (f) Perplexity of the 2019 LDA Topic Model; (g) Coherence of the 2020 LDA Topic Model; (h) Perplexity of the 2020 LDA Topic Model; (i) Coherence of the 2021 LDA Topic Model; (j) Perplexity of the 2021 LDA Topic Model.
Sustainability 14 10675 g0a1aSustainability 14 10675 g0a1b

References

  1. Boivin, M.; Tanguay, G.A. Analysis of the determinants of urban tourism attractiveness: The case of Québec City and Bordeaux. J. Destin. Mark. Manag. 2019, 11, 67–79. [Google Scholar] [CrossRef]
  2. Rodríguez-Díaz, B.; Pulido-Fernández, J.I. Sustainability as a key factor in tourism competitiveness: A global analysis. Sustainability 2019, 12, 51. [Google Scholar] [CrossRef]
  3. Khanzadeh, H.; Rahnavard, F.; Bamdad, N.; Mahmoudzadeh, S.M. Explanation Model of Tourism Competitiveness in Iran Touristic Cities. New Mark. Res. J. 2021, 11, 45–68. [Google Scholar]
  4. Jia, Z.; Jiao, Y.; Zhang, W.; Chen, Z. Rural tourism competitiveness and development mode, a case study from Chinese township scale using integrated multi-source data. Sustainability 2022, 14, 4147. [Google Scholar] [CrossRef]
  5. Sirkis, G.; Regalado-Pezúa, O.; Carvache-Franco, O.; Carvache-Franco, W. The Determining Factors of Attractiveness in Urban Tourism: A Study in Mexico City, Buenos Aires, Bogota, and Lima. Sustainability 2022, 14, 6900. [Google Scholar] [CrossRef]
  6. Lynch, K. The Image of the City, 1st ed.; The MIT Press: Cambridge, MA, USA, 1964; pp. 46–90. [Google Scholar]
  7. Hunt, J.D. Image as a Factor in Tourism Development. J. Travel Res. 1975, 13, 1–7. [Google Scholar] [CrossRef]
  8. Crompton, J.L. An assessment of the image of Mexico as a vacation destination and the influence of geographical location upon the image. J. Travel Res. 1979, 17, 18–23. [Google Scholar] [CrossRef]
  9. Embacher, J.; Buttle, F. A Repertory Grid Analysis of Austria’s Image as A Summer Vacation Destination. J. Travel Res. 1989, 27, 3–7. [Google Scholar] [CrossRef]
  10. Chon, K.S. The role of destination image in tourism: A review and discussion. Tourist Rev. 1990, 45, 2–9. [Google Scholar] [CrossRef]
  11. Baloglu, S.; McCleary, K.W. A model of destination image formation. Ann. Tourism Res. 1999, 26, 868–897. [Google Scholar] [CrossRef]
  12. Hong, S.; Kim, J.; Jang, H.; Lee, S. The roles of categorization, affective image and constraints on destination choice: An application of the NMNL model. Tourism Manag. 2006, 27, 750–761. [Google Scholar] [CrossRef]
  13. Chen, X.; Li, J.; Han, W.; Liu, S. Urban Tourism Destination Image Perception Based on LDA Integrating Social Network and Emotion Analysis: The Example of Wuhan. Sustainability 2021, 14, 12. [Google Scholar] [CrossRef]
  14. Foot, J.M. From boomtown to bribesville; the images of the city. Milan, 1980–1997. Urban Hist. 1999, 26, 393–412. [Google Scholar] [CrossRef]
  15. Al-ghamdi, S.A.; Al-Harigi, F. Rethinking image of the city in the information age. Procedia Comput. Sci. 2015, 65, 734–743. [Google Scholar] [CrossRef]
  16. Martí, P.; Serrano-Estrada, L.; Nolasco-Cirugeda, A. Social media data: Challenges, opportunities and limitations in urban studies. Comput. Environ. Urban 2019, 74, 161–174. [Google Scholar] [CrossRef]
  17. Priporas, C.V.; Stylos, N.; Kamenidou, I.E. City image, city brand personality and generation Z residents’ life satisfaction under economic crisis: Predictors of city-related social media engagement. J. Bus. Res. 2020, 119, 453–463. [Google Scholar] [CrossRef]
  18. Lin, M.S.; Liang, Y.; Xue, J.X.; Pan, B.; Schroeder, A. Destination image through social media analytics and survey method. Int. J. Contemp. Hosp. Manag. 2021, 33, 2219–2238. [Google Scholar] [CrossRef]
  19. Pan, X.; Rasouli, S.; Timmermans, H. Investigating tourist destination choice: Effect of destination image from social network members. Tourism Manag. 2021, 83, 104217. [Google Scholar] [CrossRef]
  20. Huang, J.; Obracht-Prondzynska, H.; Kamrowska-Zaluska, D.; Sun, Y.; Li, L. The image of the City on social media: A comparative study using “Big Data” and “Small Data” methods in the Tri-City Region in Poland. Landscape Urban Plan. 2021, 206, 103977. [Google Scholar] [CrossRef]
  21. Luque-Martinez, T.; Del Barrio-García, S.; Ibanez-Zapata, J.A.; Molina, M.Á.R. Modeling a city’s image: The case of Granada. Cities 2007, 24, 335–352. [Google Scholar] [CrossRef]
  22. Adamus-Matuszyńska, A.; Michnik, J.; Polok, G. A systemic approach to city image building. The case of Katowice city. Sustainability 2019, 11, 4470. [Google Scholar] [CrossRef] [Green Version]
  23. Avraham, E. Media strategies for improving an unfavorable city image. Cities 2004, 21, 471–479. [Google Scholar] [CrossRef]
  24. Alvarez, M.D. Creative cities and cultural spaces: New perspectives for city tourism. Int. J. Cult. Tour. Hosp. Res. 2010, 4, 5–171. [Google Scholar] [CrossRef]
  25. Das, D.K. Engendering creative city image by using information communication technology in developing countries. Urban Plan. 2016, 1, 1–12. [Google Scholar] [CrossRef]
  26. Lu, X. The Current Situation and Strategy of the International Communication of Shaanxi City Image. J. Front. Soc. Sci. Technol. 2021, 1, 69–73. [Google Scholar]
  27. Novais, M.A.; Ruhanen, L.; Arcodia, C. Destination competitiveness: A phenomenographic study. Tourism Manag. 2018, 64, 324–334. [Google Scholar] [CrossRef]
  28. Fernández, J.A.S.; Azevedo, P.S.; Martin, J.M.M.; Martin, J.A.R. Determinants of tourism destination competitiveness in the countries most visited by international tourists: Proposal of a synthetic index. Tour. Manag. Perspect. 2020, 33, 100582. [Google Scholar] [CrossRef]
  29. Vinyals-Mirabent, S. European urban destinations’ attractors at the frontier between competitiveness and a unique destination image. A benchmark study of communication practices. J. Destin. Mark. Manag. 2019, 12, 37–45. [Google Scholar] [CrossRef]
  30. Kim, H.B.; Lee, S. Impacts of city personality and image on revisit intention. Int. J. Tourism Cities 2015, 1, 50–69. [Google Scholar] [CrossRef]
  31. Bigne, J.E.; Sanchez, M.I.; Sanchez, J. Tourism image, evaluation variables and after purchase behaviour: Inter-relationship. Tourism Manag. 2001, 22, 607–616. [Google Scholar] [CrossRef]
  32. Bongkoo, L.; Choongki, L.; Jaeseok, L. Dynamic nature of destination image and influence of tourist overall satisfaction on image modification. J. Travel Res. 2014, 53, 239–251. [Google Scholar]
  33. Chen, C.F.; Tsai, D.C. How destination image and evaluative factors affect behavioral intentions. Tourism Manag. 2007, 28, 1115–1122. [Google Scholar] [CrossRef]
  34. Nazarian, A.; Atkinson, P.; Foroudi, P. Influence of national culture and balanced organizational culture on the hotel industry’s performance. Int. J. Hosp. Manag. 2017, 63, 22–32. [Google Scholar] [CrossRef]
  35. Kumar, S.; Dhir, A. Associations between travel and tourism competitiveness and culture. J. Destin. Mark. Manag. 2020, 18, 100501. [Google Scholar] [CrossRef]
  36. Souiden, N.; Ladhari, R.; Chiadmi, N.E. Destination personality and destination image. J. Hosp. Tour. Manag. 2017, 32, 54–70. [Google Scholar] [CrossRef]
  37. Afshardoost, M.; Eshaghi, M.S. Destination image and tourist behavioural intentions: A meta-analysis. Tourism Manag. 2020, 81, 104154. [Google Scholar] [CrossRef]
  38. Ramkissoon, H.; Nunkoo, R. City image and perceived tourism impact: Evidence from Port Louis, Mauritius. Int. J. Hosp. Tour. Admin. 2011, 12, 123–143. [Google Scholar] [CrossRef]
  39. Stylidis, D.; Biran, A.; Sit, J.; Szivas, E.M. Residents’ support for tourism development: The role of residents’ place image and perceived tourism impacts. Tourism Manag. 2014, 45, 260–274. [Google Scholar] [CrossRef]
  40. Zuo, B.; Gursoy, D.; Wall, G. Residents’ support for red tourism in China: The moderating effect of central government. Ann. Tourism Res. 2017, 64, 51–63. [Google Scholar] [CrossRef]
  41. Jones, K.S. A statistical interpretation of term specificity and its application in retrieval. J. Doc. 1972, 28, 11–20. [Google Scholar] [CrossRef]
  42. Newman, D.; Bonilla, E.V.; Buntine, W. Improving Topic Coherence with Regularized Topic Models. In Proceedings of the 25th Annual Conference on Neural Information Processing Systems, Granada, Spain, 12–14 December 2011. [Google Scholar]
  43. Stevens, K.; Kegelmeyer, P.; Andrzejewski, D.; Buttler, D. Exploring topic coherence over many models and many topics. In Proceedings of the Empirical Methods in Natural Language Processing 2012, Jeju Island, Korea, 12–14 July 2012. [Google Scholar]
  44. Röder, M.; Both, A.; Hinneburg, A. Exploring the space of topic coherence measures. In Proceedings of the WSDM 2015: Eighth ACM International Conference on Web Search and Data Mining, Shanghai, China, 2–6 February 2015. [Google Scholar]
  45. Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent Dirichlet Allocation. J. Mach. Learn. Res. 2013, 3, 993–1022. [Google Scholar]
  46. Kumar, S.; Yadava, M.; Roy, P.P. Fusion of EEG response and sentiment analysis of products review to predict customer satisfaction. Inform. Fusion 2019, 52, 41–52. [Google Scholar] [CrossRef]
  47. Kumar, S.; Gahalawat, M.; Roy, P.P.; Dogra, D.P.; Kim, B.G. Exploring impact of age and gender on sentiment analysis using machine learning. Electronics 2020, 9, 374. [Google Scholar] [CrossRef]
  48. Dang, N.C.; Moreno-García, M.N.; De la Prieta, F. Sentiment analysis based on deep learning: A comparative study. Electronics 2020, 9, 483. [Google Scholar] [CrossRef] [Green Version]
  49. Mujahid, M.; Lee, E.; Rustam, F.; Washington, P.B.; Ullah, S.; Reshi, A.A.; Ashraf, I. Sentiment analysis and topic modeling on tweets about online education during COVID-19. Appl. Sci. 2021, 11, 8438. [Google Scholar] [CrossRef]
  50. Nemes, L.; Kiss, A. Information Extraction and Named Entity Recognition Supported Social Media Sentiment Analysis during the COVID-19 Pandemic. Appl. Sci. 2021, 11, 11017. [Google Scholar] [CrossRef]
  51. Kamal, S.; Sharma, S.; Kumar, V.; Alshazly, H.; Hussein, H.S.; Martinetz, T. Trading Stocks Based on Financial News Using Attention Mechanism. Mathematics 2022, 10, 2001. [Google Scholar] [CrossRef]
  52. Hutto, C.J.; Gilbert, E.E. VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text. In Proceedings of the Eighth International Conference on Weblogs and Social Media (ICWSM-14), Ann Arbor, MI, USA, 1–4 June 2014. [Google Scholar]
  53. Beijing Municipal Cultural Heritage Bureau. Available online: http://wwj.beijing.gov.cn/bjww/wwjzzcslm/1737418/1738088/1742737/index.html (accessed on 17 July 2022).
  54. China Intangible Cultural Heritage Network & China Intangible Cultural Heritage Digital Museum. Available online: https://www.ihchina.cn/news2_details/24607.html (accessed on 17 July 2022).
  55. Pine, B.J.; Gilmore, J.H. The Experience Economy: Work is Theater and Every Business A Stage, 1st ed.; Harvard Business School Press: Boston, UK, 1998; pp. 56–58. [Google Scholar]
  56. The People’s Government of Beijing Municipality. Available online: http://www.beijing.gov.cn/gongkai/guihua/wngh/cqgh/201907/t20190701_100008.html (accessed on 17 July 2022).
Figure 1. Trends of keywords occurring in all 5 years.
Figure 1. Trends of keywords occurring in all 5 years.
Sustainability 14 10675 g001
Figure 2. Trends in sentiment value from January to December 2017–2021.
Figure 2. Trends in sentiment value from January to December 2017–2021.
Sustainability 14 10675 g002
Table 1. Statistics on the number of tweets from 2017 to 2021.
Table 1. Statistics on the number of tweets from 2017 to 2021.
YearNumber of Tweets
201725,806
201826,258
201926,334
202026,113
202124,749
Total129,260
Table 2. Top 50 keywords of 2017.
Table 2. Top 50 keywords of 2017.
No.KeywordsTF-IDF ValueOccurrencesNo.KeywordsTF-IDF ValueOccurrences
1China0.0323464526art0.0046406
2duck0.0246192127video0.0045325
3Chinese0.0139153628best0.0044308
4new0.0122120529trip0.0044299
5airport0.009054830open0.0044320
6Trump0.008674831glass0.0044406
7want0.008167132Berlin0.0042290
8world0.008175433visit0.0042322
9first0.007159734Nanjing0.0042203
10great0.006953635president0.0042345
11North Korea0.006756136car0.0041334
12flight0.006752037opera0.0040294
13game0.006648438photo0.0040271
14university0.006657939New York0.0040304
15night0.006438040friend0.0038278
16city0.006145841course0.0038134
17train0.006037342party0.0038298
18Peking Duk0.005943043show0.0038281
19Shanghai0.005844944trade0.0037281
20good0.005639745home0.0034231
21Moscow0.005541246foreign0.0034293
22people0.005245747capital0.0034246
23air0.005138448Xi0.0034264
24smog0.004731749tour0.0034222
25German0.004733250congress0.0034239
Table 3. Top 50 keywords of 2018.
Table 3. Top 50 keywords of 2018.
No.KeywordsTF-IDF ValueOccurrencesNo.KeywordsTF-IDF ValueOccurrences
1China0.0322600326country0.0040456
2Chinese0.0163246727war0.0049452
3duck0.0243220928trip0.0046447
4new0.0132168629art0.0045442
5university0.0107130830right0.0043422
6world0.0097121331government0.0040418
7trade0.0097114832Washington0.0047415
8first0.008592533dispute0.0043414
9Trump0.007786534work0.0041412
10great0.007779835minister0.0038411
11president0.006778936best0.0047404
12people0.006474737foreign0.0038399
13want0.007170738Xi0.0037395
14Shanghai0.006570439national0.0038391
15tariff0.006268540team0.0042385
16city0.006367341opera0.0045379
17good0.006966942company0.0036378
18visit0.005962443event0.0038374
19show0.006262144glass0.0038371
20airport0.007054545open0.0040367
21game0.005754046tour0.0039367
22international0.004953247car0.0038366
23flight0.005450948night0.0041365
24state0.004850449Moscow0.0040364
25student0.004649250meeting0.0036361
Table 4. Top 50 keywords of 2019.
Table 4. Top 50 keywords of 2019.
No.KeywordsTF-IDF ValueOccurrencesNo.KeywordsTF-IDF ValueOccurrences
1China0.0322619926Berlin0.0049413
2duck0.0196163427show0.0049471
3Chinese0.0168268128flight0.0048446
4new0.0145187729student0.0047519
5airport0.010499230war0.0047457
6world0.0104133131German0.0046426
7trade0.0103123332Moscow0.0044415
8university0.0099119833Germany0.0044432
9people0.0085106834visit0.0043450
10want0.008388835photo0.0043361
11first0.007890236work0.0042417
12city0.007278837tariff0.0042435
13government0.006878938dispute0.0042391
14right0.006674639team0.0042402
15Trump0.006572040video0.0040334
16good0.006462041open0.0040342
17great0.006465542minister0.0039431
18Shanghai0.006364743night0.0039302
19international0.006269244deal0.0039385
20president0.005869445USA0.0038384
21country0.005564446foreign0.0038412
22Washington0.005249747game0.0038357
23company0.005052248trip0.0037342
24state0.005052549business0.0037339
25event0.004943650meeting0.0037369
Table 5. Top 50 keywords of 2020.
Table 5. Top 50 keywords of 2020.
No.KeywordsTF-IDF ValueOccurrencesNo.KeywordsTF-IDF ValueOccurrences
1China0.0349755826Germany0.0055565
2Chinese0.0179303627Washington0.0054550
3new0.0150213128national0.0054633
4duck0.0148109829good0.0053525
5world0.0116155230president0.0053588
6people0.0109150831Shanghai0.0053541
7government0.0089114732pandemic0.0051546
8want0.0087101433Wuhan0.0051570
9coronavirus0.008697434Germany0.0051520
10right0.008196935Moscow0.0051457
11law0.008098536flight0.0049497
12Trump0.007993537party0.0048494
13city0.007993938million0.0048537
14first0.007791839official0.0046520
15corona0.007678840global0.0046514
16state0.007385741crisis0.0045467
17country0.007190642company0.0044491
18security0.007083443foreign0.0044511
19virus0.006672744EU0.0044412
20Biden0.005949645human0.0044453
21case0.005867846COVID0.0043111
22outbreak0.005861047work0.0043450
23university0.005763848medium0.0043452
24show0.005660449great0.0043407
25international0.005663050life0.0042391
Table 6. Top 50 keywords of 2021.
Table 6. Top 50 keywords of 2021.
No.KeywordsTF-IDF ValueOccurrencesNo.KeywordsTF-IDF ValueOccurrences
1duck0.0446382926great0.0046424
2China0.0269417227author0.0046237
3Chinese0.0180263428beat0.0046236
4university0.0143174829human0.0045473
5opera0.0134144130brooch0.0045337
6Olympics0.0128129231government0.0044472
7new0.0111124632Peking Duk0.0044256
8winter0.008983933school0.0043455
9world0.008194934encyclopedia0.0043196
10right0.008084835national0.0042444
11first0.007377836team0.0041378
12people0.007281237house0.0041304
13good0.007258438official0.0041407
14want0.006851439restaurant0.0039298
15game0.006863840happy0.0039282
16Biden0.006359141international0.0039373
17Olympic0.005551142old0.0038296
18city0.005251043night0.0038291
19love0.005139244photo0.0038294
20food0.005141045show0.0038359
21country0.005055146eat0.0037207
22site0.005031547friend0.0036295
23best0.004935848party0.0035331
24life0.004831749work0.0035336
25state0.004744650dinner0.0035231
Table 7. LDA topic model results of 2017.
Table 7. LDA topic model results of 2017.
Topic1 WordsProbability DistributionTopic2 WordsProbability DistributionTopic3 WordsProbability Distribution
new0.0226duck0.0838China0.0947
university0.0225China0.0341Chinese0.0254
friend0.0194airport0.0208New York0.0202
first0.0183glass0.0200Peking Duk0.0188
flight0.0155Trump0.0185great0.0186
Chinese0.0144world0.0174want0.0133
air0.0129North Korea0.0169new0.0112
good0.0111Chinese0.0160Barcelona0.0098
international0.0109art0.0147Moscow0.0098
happy0.0107city0.0134London0.0098
photo0.0105opera0.0115trip0.0095
best0.0100Moscow0.0112car0.0091
gold0.0089right0.0105Shanghai0.0090
open0.0088green0.0105old0.0074
school0.0080game0.0104video0.0072
Table 8. LDA topic model results of 2018.
Table 8. LDA topic model results of 2018.
Topic1 WordsProbability DistributionTopic2 WordsProbability DistributionTopic3 WordsProbability Distribution
duck0.0383China0.0661China0.0204
China0.0275trade0.0217Shanghai0.0145
university0.0196Trump0.0169airport0.0128
Chinese0.0167Chinese0.0164museum0.0127
new0.0119president0.0139world0.0114
great0.0097new0.0130Chinese0.0107
first0.0097tariff0.0121want0.0099
show0.0094people0.0094right0.0094
world0.0088world0.0087New York0.0082
national0.0085war0.0086million0.0080
tour0.0082state0.0084Germany0.0079
student0.0075visit0.0084flight0.0078
art0.0063Washington0.0083car0.0078
best0.0062Moscow0.0081game0.0076
opera0.0055air0.0076largest0.0075
Table 9. LDA topic model results of 2019.
Table 9. LDA topic model results of 2019.
Topic1 WordsProbability DistributionTopic2 WordsProbability DistributionTopic3 WordsProbability DistributionTopic4 WordsProbability Distribution
China0.0165China0.0270China0.0773duck0.0599
world0.0144Chinese0.0245trade0.0254airport0.0334
city0.0143university0.0156Chinese0.0153new0.0281
event0.0124government0.0099Trump0.0141China0.0153
Shanghai0.0114people0.0089want0.0135Chinese0.0152
game0.0099school0.0067new0.0114world0.0117
great0.0097new0.0063president0.0103open0.0096
air0.0090student0.0062war0.0101largest0.0096
fan0.0082country0.0062Washington0.0093international0.0089
photo0.0079police0.0058tariff0.0084flight0.0074
night0.0076first0.0057right0.0084glass0.0073
first0.0073good0.0056deal0.0081car0.0068
show0.0073city0.0056USA0.0078food0.0068
video0.0068world0.0055minister0.0078first0.0066
winter0.0067company0.0052people0.0076film0.0063
Table 10. LDA topic model results of 2020.
Table 10. LDA topic model results of 2020.
Topic1 WordsProbability DistributionTopic2 WordsProbability DistributionTopic3 WordsProbability Distribution
China0.0613duck0.0420China0.0455
Trump0.0205Chinese0.0287new0.0280
Chinese0.0176China0.0285coronavirus0.0179
want0.0172university0.0116corona0.0168
world0.0148new0.0095case0.0159
Biden0.0131company0.0084virus0.0137
Washington0.0109Silk Road0.0079outbreak0.0131
trade0.0106good0.0079Wuhan0.0120
president0.0103park0.0078million0.0111
war0.0085want0.0071woman0.0109
deal0.0084great0.0068people0.0106
Moscow0.0080restaurant0.0065Chinese0.0105
government0.0073love0.0063city0.0094
medium0.0071wall0.0061pandemic0.0092
election0.0065video0.0061first0.0080
Topic4 wordsProbability distributionTopic5 wordsProbability distribution
China0.0380China0.0181
law0.0194summer0.0140
Chinese0.0181world0.0132
right0.0176city0.0124
security0.0151opera0.0118
people0.0114first0.0115
government0.0110palace0.0111
national0.0098game0.0104
human0.0098winter0.0101
state0.0092Shanghai0.0093
new0.0073Olympics0.0091
international0.0066photo0.0084
freedom0.0065flight0.0084
country0.0061people0.0081
HK0.0054airport0.0081
Table 11. LDA topic model results of 2021.
Table 11. LDA topic model results of 2021.
Topic1 WordsProbability DistributionTopic2 WordsProbability DistributionTopic3 WordsProbability Distribution
duck0.0951Olympics0.0328China0.0393
opera0.0438China0.0274university0.0276
Chinese0.0215winter0.0212Chinese0.0212
food0.0105game0.0160new0.0139
glass0.0096right0.0146school0.0119
defend0.0091Olympic0.0141people0.0111
art0.0090Chinese0.0105world0.0080
love0.0088growing0.0094great0.0066
green0.0078first0.0089quite0.0065
restaurant0.0076human0.0088Chan0.0062
good0.0075world0.0079national0.0060
vintage0.0075state0.0071city0.0056
want0.0071team0.0070student0.0056
brooch0.0070summer0.0067house0.0053
old0.0068president0.0066friend0.0049
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zhang, Z.; Luo, M.; Luo, Z.; Niu, H. The International City Image of Beijing: A Quantitative Analysis Based on Twitter Texts from 2017–2021. Sustainability 2022, 14, 10675. https://doi.org/10.3390/su141710675

AMA Style

Zhang Z, Luo M, Luo Z, Niu H. The International City Image of Beijing: A Quantitative Analysis Based on Twitter Texts from 2017–2021. Sustainability. 2022; 14(17):10675. https://doi.org/10.3390/su141710675

Chicago/Turabian Style

Zhang, Zhishuo, Manting Luo, Ziyu Luo, and Huayong Niu. 2022. "The International City Image of Beijing: A Quantitative Analysis Based on Twitter Texts from 2017–2021" Sustainability 14, no. 17: 10675. https://doi.org/10.3390/su141710675

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop