Accommodation Experience in the Sharing Economy: A Comparative Study of Airbnb Online Reviews

: Current research investigating the accommodation experience in the sharing economy in China is limited, especially from a cross-cultural perspective. To ﬁll this gap, this study examined the accommodation experience of Airbnb guests using text-mining techniques and compared the accommodation experience perception between two culturally di ﬀ erent groups: domestic Chinese and foreign English-speaking Airbnb guests. The results showed that the two groups shared eight common dimensions, including “Convenience / Location”, “Amenities”, “Feel at home”, “Check-in / out”, “Experience”, “Availability / Transportation”, “Host”, and “Style / Decoration”. However, there are di ﬀ erences in the relative importance of each dimension of accommodation experience between the domestic and foreign Airbnb guests. For example, the foreign guests more often mentioned homeliness, location / convenience, and availability / transportation, while the domestic guests showed greater interest in check-in procedures and style / decoration. Additionally, the two groups have several unique dimensions. The dimensions unique to foreign guests are “Recommendation” and “Booking ﬂexibility”, while the dimensions unique to domestic guests are “Revisit” and “Cleanliness”. This study provides both theoretical and practical implications for peer-to-peer accommodation hosts and platforms. For example, Airbnb hosts can improve the satisfaction of Airbnb guests by improving several common extracted topics (e.g., amenities quality and host response) and the fact that foreign guests care more about homeliness, while domestic guests pay more attention to the check-in process and house design and decoration.


Introduction
The rapid growth of the sharing economy or collaborative consumption has rapidly changed the trends of consumers, which has sparked mounting interest from both academia and industry professionals. The sharing economy is a peer-to-peer activity that allows people to share or obtain goods and services, often through online platforms [1,2]. In the tourism and hospitality sector, Airbnb is a prominent example of peer-to-peer accommodation, enabling people to offer their unoccupied houses or rooms for short-term rental. As of 2019, Airbnb has served 500 million people with over 6 million Airbnb listings in more than 191 countries and regions [3]. Peer-to-peer accommodation platforms such as Airbnb have become an indispensable part of the tourism and hospitality industry.
Alongside the gaining popularity of peer-to-peer accommodation, much research has been undertaken. Recent studies have examined various topics on peer-to-peer accommodation such as its advantages and challenges [4][5][6] and questions of trust and reputation [7,8]. In addition, studies have also explored measuring the effects of peer-to-peer accommodation on the hotel industry [9,10], Sustainability 2020, 12, 10500 2 of 11 identifying factors affecting price [2,11,12], and exploring geographic patterns [13,14]. However, limited, yet still growing, studies have been devoted to accommodation experiences in the sharing economy, although it has been commonly recognized that the growing popularity of peer-to-peer accommodation can be partly attributed to its providing guests with unique stay experiences through direct interactions with local communities [5].
Currently, there is little understanding of the accommodation experience in the sharing economy in China, especially from a cross-cultural perspective. To fill this gap, in our study we attempt to examine Airbnb guest's accommodation experience in terms of the hidden dimensions presented in online reviews (180,460 review comments from 28,452 Airbnb listings in Beijing, China). The specific objectives are (1) To identify the dominant topics in Airbnb accommodation experience presented in review texts for both domestic and foreign guests; and (2) To compare the extracted dimensions in Airbnb accommodation experience of domestic and foreign guests.
The rest of this study is organized as follows. Section 2 presents a review of the extant literature related to accommodation experiences in the sharing economy and user-generated content (UGC). In Section 3, we describe the data and methodology used in this article. Then, the main results are shown in Section 4, and the conclusions are presented in Section 5.

Accommodation Experience in the Sharing Economy
The rapid growth of peer-to-peer accommodation such as Airbnb can be partly attributed to its offering an alternative accommodation experience for guests. Peer-to-peer accommodation provides guests a local experience by enabling guests to interact with the hosts or neighbors. Moreover, peer-to-peer accommodation tends to be more scattered than conventional hotels, thus making it possible for the user to stay in a non-tourist area, or even a local area [5].
Many studies have explored people's accommodation experiences in the sharing economy, and among these studies, there are three main fruitful lines. The first line is identifying the dimensions/attributes of accommodation experience in the sharing economy [15][16][17][18]. Earlier studies have mainly used survey data or interview data to identify the dimensions/attributes that form the basis of or affect the accommodation experience. For example, [15] investigated Airbnb guests' perceptions of service quality and found that convenience and assurance are important service quality factors. Wang and Jeong [18] showed that amenities and the host-guest relationship could affect users' satisfaction with the Airbnb stay. More recent studies have started to analyze user-generated content (UGC) with text-mining techniques to explore accommodation experiences in the sharing economy. For example, Luo and Tang [17] identified five important aspects that impact accommodation satisfaction: communication, experience, location, product/service, and value. Cheng and Jin [16] found three key attributes ('location', 'amenities', and 'host') that affected Airbnb users' accommodation experiences, drawing from 181,263 Airbnb review comments in Sydney, Australia.
Examining the factors affecting accommodation experiences is another theme in the sharing economy research [19][20][21][22][23]. Factors such as authenticity, social interaction, location, and trust have been found to significantly affect guests' accommodation experiences in the sharing economy. For example, Sthapit and Jiménez-Barreto [19] interviewed twenty people who had positive Airbnb experiences and found that the positive accommodation experiences were significantly associated with the social interactions with the host, the attitude of the hosts, and the location of the accommodation.
The third line is comparing the accommodation experience in the sharing economy with other sectors such as hotels [24][25][26][27]. Several studies have found that there can be significant differences in accommodation experiences for peer-to-peer accommodation and hotels. For example, Belarmino, Whalen, Koh, and Bowen [25] investigated and compared the key attributes/dimensions of peer-to-peer accommodation and hotels with the guests' online reviews. It was found that the guests of peer-to-peer accommodation care more about interaction with hosts, and the guests of hotels care more about the hotel amenities. Additionally, Zhang, Cui, Cheng, Zhang, and Li [26] explored the guests' perceived differences between peer-to-peer accommodation such as Airbnb and hotels, and the key attributes identified of significant difference were pets, atmosphere, flexibility, value for money, and quality assurance.
To date, previous studies have mainly focused on Western tourists' accommodation experiences in the sharing economy; limited yet rising studies have investigated Eastern tourists' accommodation experiences, especially those of Chinese tourists [28,29]. China is one of the biggest markets for Airbnb, and Airbnb has said that China would be its largest market by 2020. Therefore, it is necessary to understand the customers' accommodation experiences in the sharing economy in China. To this end, this study aims to explore Airbnb guests' accommodation experiences in China and the similarities and differences in accommodation experience between domestic guests and foreign guests.

User-Generated Content as a Resource for Accommodation Experience Research
In past decades, the development of technology and social media has created a great deal of user-generated content (UGC) on a wide range of platforms like blogs, forums, and consumer review websites. UGC is commonly considered an important resource to unpack user experience that could deepen the understanding of user behavior and satisfaction both in academia and the tourism/hospitality industry.
Accommodation experience is one of the main topics in hospitality research that has received much attention. In particular, many studies used online consumer reviews, a type of evaluation of product or service created by consumers based on their personal experiences [30], as the primary data to investigate consumers' experience in accommodation experience research. Consumer reviews were always focused on a specialized topic and contained the customers' evaluation of products or services, which makes them an appropriate and reliable way to understand costumers' feelings and perceptions about their experiences [31]. For example, Zhou et al. [32] identified seventeen hotel attributes affecting customer satisfaction by analyzing online reviews from upscale hotels. Barreda and Bilgihan [33] found that travelers' online reviews contained both positive and negative hotel experiences and identified the main dimensions that motivate travelers to evaluate their hotel experiences.
In the sharing economy, a growing number of studies have also explored the accommodation experience with UGC. Brochado et al. [34] examined 1776 Airbnb web reviews in three nations, US, Portugal, and India, and tested the convergence of consumer experience using thematic analysis. Cheng and Jin [16] investigated the attributes affecting Airbnb users' experiences with a set of online review comments. Three attributes, "location", "amenities", and "host", were found to be the key influencers. With online reviews, Zhang, Cui, Cheng, Zhang, and Li [26] identified the key similarities and differences between peer-to-peer accommodation and hotels, such as the ability to bring pets, atmosphere, flexibility, and value for money.
Currently, few studies have investigated the impact of cultural dissimilarity on people's accommodation experiences in the sharing economy. Although previous studies have explored accommodation experiences with UGC, none of them are from a cross-cultural perspective. Moreover, culture was commonly recognized as an important factor affecting people's travel and accommodation experiences [35,36], which could play an important role in the development of the sharing economy, but it has yet not been deeply studied. Our study contributes to the literature by comparing the online reviews of two culturally different groups and the aspects they value in their accommodation experiences with text-mining techniques, especially the Latent Dirichlet Allocation (LDA) analysis, a common and useful method in topic modeling.
In this study, we argue that UGC offers great potential for understanding users' accommodation experiences in the sharing economy, particularly for foreign visitors. As the number of inbound tourists in China is growing, understanding the preferences of foreign visitors is becoming more and more important. Therefore, by analyzing millions of online reviews from Airbnb guests, our study sheds light on such matters and hence provides valuable insights into the preferences and differences of domestic and foreign visitors.

Data and Methodology
The Airbnb data for Beijing, China was obtained from Inside Airbnb website (http://insideairbnb. com/get-the-data.html), which provides open dataset about Airbnb listings and reviews. As of April 2019, we collected 180,452 review comments from 15,749 Airbnb listings in Beijing, China. Figure 1 displays the daily growth of the reviews. As shown, a clear and constant increasing pattern in reviews per day was observed from 2010 to 2019. In particular, the reviews per day have started to increase significantly since 2016, which may be due to the increasing acceptance of Airbnb in China. Additionally, we examined the language of all reviews and found that the top five frequent languages in our sample are Chinese (68.1%), English (14.9%), Korean (14.8%), French (0.3%), and Traditional Chinese (0.2%), which account for about 98.3% of the total collected Airbnb reviews. sheds light on such matters and hence provides valuable insights into the preferences and differences of domestic and foreign visitors.

Data and Methodology
The Airbnb data for Beijing, China was obtained from Inside Airbnb website (http://insideairbnb.com/get-the-data.html), which provides open dataset about Airbnb listings and reviews. As of April 2019, we collected 180,452 review comments from 15,749 Airbnb listings in Beijing, China. Figure 1 displays the daily growth of the reviews. As shown, a clear and constant increasing pattern in reviews per day was observed from 2010 to 2019. In particular, the reviews per day have started to increase significantly since 2016, which may be due to the increasing acceptance of Airbnb in China. Additionally, we examined the language of all reviews and found that the top five frequent languages in our sample are Chinese (68.1%), English (14.9%), Korean (14.8%), French (0.3%), and Traditional Chinese (0.2%), which account for about 98.3% of the total collected Airbnb reviews. In this study, we assumed that the Chinese reviews were written by domestic guests, and the English reviews were written by English-speaking foreign guests. To compare the Airbnb accommodation experience of foreign and domestic guests, we chose the 122,914 Chinese reviews and 26,280 English reviews to form the English and Chinese corpus.
First, text pre-processing was conducted before the Latent Dirichlet Allocation (LDA) analysis. Text pre-processing was done with several python modules (e.g., Natural Language Toolkit, Jieba), including text tokenization, word stemming, and stop words removing. Then, the LDA method was applied to extract the experiential dimensions of accommodation experience derived from Airbnb reviews. In this study, we assumed that the Chinese reviews were written by domestic guests, and the English reviews were written by English-speaking foreign guests. To compare the Airbnb accommodation experience of foreign and domestic guests, we chose the 122,914 Chinese reviews and 26,280 English reviews to form the English and Chinese corpus.
First, text pre-processing was conducted before the Latent Dirichlet Allocation (LDA) analysis. Text pre-processing was done with several python modules (e.g., Natural Language Toolkit, Jieba), including text tokenization, word stemming, and stop words removing. Then, the LDA method was applied to extract the experiential dimensions of accommodation experience derived from Airbnb reviews.
LDA is a generative probabilistic model used for discovering the underlying topics in discrete data such as text documents [37]. It is the most commonly used method in topic modeling. The basic Sustainability 2020, 12, 10500 5 of 11 idea is that documents are represented as random mixtures over latent topics, where each topic is characterized by a distribution over words. For the LDA method, a set of terms is defined. It is assumed that a sequence of N words constitutes a review, which is referred to as a "document" in the literature, w = {w 1 , w 2 , . . . , w N }, whilst M reviews form a corpus, D = {w 1 , w 2 , . . . , w M }. Additionally, we assume that there is K number of dimensions across the corpus comprising all the M reviews.
Based on the above definitions, LDA assumes the following generative process for each document w in a corpus D, with N d words in each document w using K topics [37]: Choose θ d ∼ Dir(α) for a document's topic distribution, where α is the k-vector parameter of the Dirichlet prior to the per-document topic distributions; 3.
For each of the N words w n ∈ {1, 2 . . . , N d } for document d: • Choose a topic z d, n ∼ Multinomial(θ d ); • Choose a word w d,n from p(w d,n z d,n , β) , a multinomial probability conditioned on the topic z d,n , where β is a parameter of the Dirichlet prior on the per-topic word distribution.
Based on Bayesian probability principle, we can compute the probability of a corpus by taking the product of the marginal probabilities of a single document: Following the abovementioned LDA process, we can extract the guests' most valued topics of accommodation experience in the sharing economy and the probability of each topic occurring in Airbnb online reviews.

Dimensions Extracted from Airbnb Reviews
With the LDA method, we extracted the top ten dimensions (topics) of Airbnb accommodation experiences for both domestic and foreign guests. Within each topic, we also obtained the associated words and their relative weights. To name each dimension, we conducted the following process: first, one researcher named the dimension; then the second researcher confirmed the name. The name of the dimension was identified based on the logical connection between the most frequent words for a topic. For example, in Table 1, the topics "Amenities" and "Availability/public transportation" were identified based on the top 15 frequent words. For topic "Amenities", we got several keywords, like "water", "bathroom", "shower", and "kitchen", which are connected and can be generalized as topic "Amenities". Similarly, for the topic "Availability/transportation", the top frequency words like "convenient", "subway", "station", "transportation", and "walking" are related to the availability of public transportation.  Tables 2 and 3 present the top ten dimensions (topics) and associated words extracted from Chinese reviews and English reviews, respectively. The ten extracted dimensions for English reviews are "Convenience/Location", "Amenities", "Feel at home", "Check-in/out", "Experience", "Availability/Transportation", "Host", "Style/Decoration", "Recommendation", and "Booking flexibility". Three dimensions ("Feel at home", "Experience", and "Recommendation") represent consumers' perception of his or her experience and the resulting response. Two dimensions ("Convenience/Location" and "Availability/Transportation") indicate consumer preference for the location and convenience of Airbnb listing; the remaining dimensions show the quality of Airbnb listings, including service quality and facility quality. Similarly, the ten extracted dimensions for Chinese reviews are "Convenience/Location", "Amenities", "Feel at home", "Check-in/out", "Experience", "Availability/Transportation", "Host", "Style/Decoration", "Revisit", and "Cleanliness". The foreign guests and domestic guests shared eight common dimensions in accommodation experiences in Airbnb.  Table 3. Extracted dimensions and associated words from Chinese reviews.

Dimension Top 10 Words for Each Dimension
Convenience/Location

Dimension Comparison between Domestic Guests and Foreign Guests
Then, we compared the extracted dimensions from Airbnb English reviews with the dimensions extracted from Airbnb Chinese reviews. The common dimensions between Airbnb English and Chinese reviews are "Convenience/Location", "Amenities", "Feel at home", "Check-in/out", "Experience", "Availability/Transportation", "Host", and "Style/Decoration". These topics reflect common aspects that affect consumers' satisfaction with the accommodation experience. For example, a convenient location and easy transportation are important for both domestic and foreign guests.
However, the importance of each common dimension varies slightly between domestic and foreign guests. Figure 2 showed the comparison result of the relative importance of each dimension in terms of the practical frequency of the occurrence of each extracted dimension. As shown, for topics like "Amenities", "Host", and "Experience", foreign and domestic guests had similar mentions, since they are common yet important aspects of the accommodation experience. The foreign guests made greater mention of homeliness, location/convenience, and availability/transportation. This makes sense, since they will be in a significantly different and new environment, and they would like to choose Airbnb listings with convenient locations and easy transportation. They also care more about homeliness, since they want to have the feeling of being at home. Domestic guests more often mentioned check-in and checkout as well as style/decoration. This suggests that domestic guests are more likely to interact with Airbnb hosts about check-in issues. Additionally, domestic guests pay more attention to the details of Airbnb listings, especially the style and design.
In addition, several unique dimensions were found to be common to both Chinese reviews and English reviews. The dimensions unique to foreign guests are "Recommendation" and "Booking flexibility", while the dimensions unique to domestic guests are "Revisit" and "Cleanliness". The domestic guests tend to revisit, while the foreign guests would like to recommend. This makes sense, since it is easier for domestic guests to revisit. Foreign guests would like to have flexibility in booking, since their schedules may change easily. The domestic guests care about cleanliness, indicating their high expectations of Airbnb. sense, since they will be in a significantly different and new environment, and they would like to choose Airbnb listings with convenient locations and easy transportation. They also care more about homeliness, since they want to have the feeling of being at home. Domestic guests more often mentioned check-in and checkout as well as style/decoration. This suggests that domestic guests are more likely to interact with Airbnb hosts about check-in issues. Additionally, domestic guests pay more attention to the details of Airbnb listings, especially the style and design. In addition, several unique dimensions were found to be common to both Chinese reviews and English reviews. The dimensions unique to foreign guests are "Recommendation" and "Booking flexibility", while the dimensions unique to domestic guests are "Revisit" and "Cleanliness". The domestic guests tend to revisit, while the foreign guests would like to recommend. This makes sense,

Conclusions
This study examined the accommodation experiences of guests on Airbnb using text-mining techniques (LDA method) and extracted latent topic dimensions from large but unstructured online reviews. Then, a cross-cultural comparative study was conducted between two culturally different groups, the domestic Chinese and foreign English-speaking guests. The results showed that the two groups shared eight common dimensions in accommodation experience, including "Convenience/Location", "Amenities", "Feel at home", "Check-in/out", "Experience", "Availability/Transportation", "Host", and "Style/Decoration". However, there were differences in the relative importance of each dimension of accommodation experience between the domestic and foreign Airbnb guests. For example, the foreign guests more often mentioned homeliness, location/convenience, and availability/transportation, while domestic guests more often mentioned check-in and style/decoration. In addition, the two groups have several unique dimensions. The dimensions unique to foreign guests are "Recommendation" and "Booking flexibility", while the dimensions unique to domestic guests are "Revisit" and "Cleanliness". In summary, there exists a subtle nuance between the accommodation experiences of domestic and foreign guests, and the cultural variable could be used to explain the difference.
The results of this study have both theoretical and practical implications. Theoretically, this study contributes to the literature by utilizing text mining techniques to examine the hidden features from online reviews and comparing experience perceptions between culturally different groups. In addition, this study extracted important dimensions of the accommodation experience in the sharing economy for two culturally different groups, and future research could investigate how consumer decisions in the sharing economy would be affected by these dimensions and how those influences differ among culturally different groups. Practically, this study can provide several implications for both Airbnb hosts and peer-to-peer accommodation platforms. First, this study helps Airbnb hosts better understand customers' accommodation needs, thus improving their competitiveness in the industry. For Airbnb hosts, the research findings suggest that Airbnb hosts can improve the satisfaction of Airbnb guests by improving several common extracted topics (e.g., amenities quality and host response). For example, Airbnb hosts should be responsive and provide a smooth check-in process for guests. The findings also reiterate the importance of location and convenience, which is impossible for Airbnb hosts to improve, but can be partly improved or controlled by peer-to-peer accommodation platforms. Additionally, for culturally different groups, the findings have different implications. Foreign guests care more about homeliness; thus, Airbnb hosts can provide a comfortable and homelike environment to improve guest satisfaction and emphasize such features on their websites to attract the foreign guest. Domestic guests pay more attention to the check-in process and house design and decoration; thus, to improve domestic guests' satisfaction, Airbnb hosts should be more responsive to guests in terms of check-in issues and improving their house design and decoration.
Several limitations should be mentioned, which also provide directions for future research. First, this study selected Beijing, China as a case study to investigate the accommodation experience in the sharing economy, which means that the results are not meant to be generalized to other cities. Second, we only analyzed the online reviews on Airbnb, which means that we could explore other peer-to-peer accommodation platforms like HomeAway. Furthermore, the data was collected from Inside Airbnb website, which may present information of which the public are unable to judge the levels of accuracy. Nevertheless, our assumption would not affect the findings of this study. In the future, datasets from other platforms should also be collected and analyzed. Third, we only examined the Airbnb reviews from two culturally different groups (the domestic Chinese and foreign English-speaking customers). Future studies could also examine reviews in other languages and explore the experiential perception of customers from other cultural backgrounds. Last, the LDA method used in this study assumed there was no correlation between topics, which can be further improved by accounting for correlations between topics with advanced methods.
Author Contributions: Z.Z. provided literature reviews and quantitative analyses. R.J.C.F. guided the project with direction and ideas, added value to methods and recommendations, and edited the manuscript. All authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.