Analysis of Customer Satisfaction in Tourism Services Based on the Kano Model

Zhou, Kailin; Yao, Zhong

doi:10.3390/systems11070345


by Kailin Zhou ¹ and Zhong Yao ¹,²,*

¹ School of Economics and Management, Beihang University, Beijing 100191, China

² Beijing Key Laboratory of Urban Operation Emergency Support Simulation Technology, Beijing 100191, China

* Author to whom correspondence should be addressed.

Systems 2023, 11(7), 345; https://doi.org/10.3390/systems11070345

Submission received: 19 May 2023 / Revised: 15 June 2023 / Accepted: 30 June 2023 / Published: 6 July 2023

(This article belongs to the Section Systems Practice in Social Science)


Abstract:

Understanding customer needs is of great significance to enhance service quality and competitive advantage. However, for the tourism industry, it is still unclear how to mine service improvement strategies from tourist-generated online reviews. This paper aims to develop a data-driven approach to conduct a fine-grained dimension analysis of customer satisfaction with tourism services. First, this paper uses Latent Dirichlet Allocation to explore the key dimensions of tourist satisfaction from online reviews. Next, based on the Chinese sentiment dictionary, tourists’ emotional attitudes towards each service dimension can be identified. Then, the backpropagation neural network is used to measure the complex relationship between tourists’ sentiment orientations towards different dimensions and their satisfaction. Finally, according to the improved Kano model, multi-dimensional attribute classification is realized to support the strategic analysis of tourism service quality improvement. The proposed method is empirically verified through a real tourism review dataset. The results exhibit the theoretical and practical implications of our method.

Keywords:

customer satisfaction; Kano model; backpropagation neural network; online reviews

1. Introduction

Customer satisfaction is a psychological state achieved by customers based on their subjective judgment of the degree to which products or services fulfill their requirements [1]. The inherent characteristics of tourism services make customer satisfaction an important reference for evaluating their quality. Firstly, tourism services are experience-oriented, and their quality largely depends on the experience and feelings of the tourists themselves, which are difficult to evaluate with consistent testing standards [2]. Secondly, tourists participate in and experience the whole service process, and have a tangible perception of its advantages and disadvantages. Thirdly, customer satisfaction is strongly correlated with recommendation intention [3], and improving customer satisfaction is an important driving force for the economic development of tourism enterprises. Existing studies have shown that effective product improvement or research and development analysis can be conducted by modeling or measuring customer satisfaction, supplemented by other relevant models [4,5]. Consumer surveys or experiments are a typical way of measuring customer satisfaction [6]. However, this approach requires rigorous design and appropriate procedures to ensure the quality of participants’ responses. This is costly in money and time, and the data can quickly become outdated. In the tourism service industry, consumption is typically one-off, mobile, and hedonic, which makes customer satisfaction surveys even more difficult to carry out. Therefore, how to use accumulated online tourism data to measure customer satisfaction with tourism services is worth further research.

User-generated content (UGC) is public content spontaneously generated by users in the Web 2.0 environment, and includes various forms of data such as audio, video, text, and pictures. UGC contains a wealth of information on people’s attitudes and opinions, and research on mining it continues to deepen. Among these forms, text data have become one of the most important elements reflecting market sentiment, as well as one of the main forms of tourism Big Data [7]. Online travel reviews objectively reflect tourists’ real perception of the attractions and services of a tourist destination; they are one of the most important channels of online word-of-mouth communication and an authentic portrayal of tourists’ views, feelings, and sentiments, including their attitudes towards the service elements of tourism destinations. Therefore, online travel reviews contain rich information, which is of great value for managers and researchers seeking to understand customer satisfaction [8]. In addition, compared with survey or experimental research, online review data are publicly available, low cost, spontaneously generated, insightful, and drawn from a large number of participants [9], which makes them more suitable for constructing comprehensive customer satisfaction models. With these advantages in mind, online travel reviews are a potentially powerful data resource for understanding customer satisfaction with tourism services.

Some studies in the field of tourism have confirmed that customers’ travel sentiments have a significant impact on their satisfaction. For example, a survey on customer satisfaction of diving services shows that high customer satisfaction is closely related to sentiments, such as excitement, pleasure, awe, surprise, etc. All themes in the theoretical framework of diver satisfaction determined in this study are regulated by sentiments [10]. Geetha et al. [11] studied the relationship between customer sentiment and customer ratings in online reviews of the hotel industry, and found consistency between customer ratings and customer feelings for advanced and economic hotels, and that customer sentiment explained significant differences in customer ratings for the two hotel categories. In fact, the overall satisfaction of customers is a comprehensive consideration of the various tourism services provided. Similar to the research of Bi et al. [7], this paper regards customer satisfaction with tourism services as the result of the comprehensive effect of the perception (emotion) of customers in the multiple dimensions of tourism services they experience, which is consistent with our intuition. However, it is still unclear how to design service improvement strategies according to the association between customer satisfaction and multiple dimensions of service quality perception. This paper focuses on developing a data-driven approach to conduct a fine-grained dimension analysis of customer satisfaction with tourism services. We will achieve this aim by answering three sub-issues:

(1) how to explore the key quality factors affecting customer satisfaction from the online travel review data of tourist attractions?

(2) how to measure the complex impact of tourists’ emotional attitude in each key factor on overall customer satisfaction?

(3) based on the influence of each key factor obtained, how to classify their attribute types and gain insight into how these key factors play a role in improving customer satisfaction, especially when faced with severe challenges, such as the COVID-19 pandemic?

To solve these problems, this paper establishes a framework for constructing a customer satisfaction model from online travel reviews. First, the Latent Dirichlet Allocation (LDA) model is used to extract the important tourism service features that customers care about from online reviews of tourist attractions. Second, based on the Chinese sentiment dictionary, emotional attitudes (positive or negative) towards these features are identified in the review data. Third, to measure how customer sentiment towards each feature affects customer satisfaction, a measurement method based on the backpropagation neural network (BPNN) is proposed. On this basis, combined with the Kano model, the dimensions of customer satisfaction are classified. Finally, this paper uses review data of tourist attractions in Guizhou province collected since the outbreak of COVID-19 to verify the proposed method and analyze customer satisfaction with tourism services in Guizhou. Guizhou is a leading tourism province in China, and rich tourist review resources have accumulated on online platforms, which is why it was chosen as a typical example for this paper. Tourism services in other regions can be analyzed in the same way by following our method, which suggests that it generalizes.

The rest of this paper is organized as follows. Section 2 reviews the relevant literature. Section 3 presents our proposed research method. Section 4 explains our method through empirical research. Section 5 demonstrates the theoretical and practical implications, and presents the limitations of our study with directions for future research.

2. Literature Review

In this section, we will review the research on customer satisfaction measurement, then introduce the Kano model.

2.1. Research on Customer Satisfaction Measurement

With the accumulation of massive online review data, many studies are emerging to analyze customer satisfaction directly or indirectly from online reviews. These studies can be mainly divided into three main streams: (1) to explore the key attributes affecting customer satisfaction from online reviews and conduct sentiment analysis; (2) research on the relationship between product/service attribute performances and customer satisfaction; (3) research on customer satisfaction model based on online reviews.

2.1.1. Attribute Extraction and Sentiment Analysis Based on Online Reviews

With the rapid development of the Internet, massive UGC has accumulated continuously. UGC contains rich and real customer perspective information and is regarded as a data resource with a strong potential for understanding and managing customer demands [12]. However, online review data exist in the form of free text. As a kind of unstructured data, text data cannot be directly analyzed. Therefore, the conversion of text review data into structured data that can be directly utilized is the basis for subsequent analysis. Customer opinion mining from online reviews is one of the most pursued areas of research, which is mainly carried out based on attribute extraction and sentiment analysis [13].

Attribute extraction refers to extracting topics that users pay frequent attention to from online reviews, as well as keywords related to topics. There are two main categories of attribute extraction methods: (1) The statistical model-based methods, such as association rule mining [14], Hidden Markov Model [15], LDA model [8], etc. Among them, the LDA model becomes one of the most widely used models. For example, the study of Tirunillai and Tellis used the improved LDA model to propose a unified framework for extracting tourism service attributes from online reviews [16]. Another study utilized the LDA model to identify the key dimensions of customer satisfaction from 266,554 pieces of online review data [8]. (2) The rule-based method, which formulates the corresponding extraction rules according to the characteristics of the review text and the research goal to realize the extraction of attributes. For example, Kang and Zhou [17] proposed an unsupervised rule-based approach to identify subjective and objective characteristics from online reviews. Rana and Cheah et al. [18] defined a rules-based sequential pattern for online review mining and proposed a rules-based two-stage extraction model for dimensional extraction.

Sentiment analysis can mine the sentiment information hidden in online reviews to help understand customers’ emotional attitudes toward product and service attributes. Sentiment analysis methods are mainly divided into lexical sentiment analysis, such as dictionary-based sentiment analysis and corpus-based sentiment analysis [19] and sentiment analysis based on machine learning, such as support vector machine (SVM), Naive Bayes, unsupervised machine learning [20], etc.

2.1.2. Research on the Influence of Attribute Performance on Customer Satisfaction

On the basis of attribute and sentiment mining, the subsequent research mainly focuses on analyzing the impact of tourism service attribute performances on customer satisfaction in online reviews. It is worth noting that since online reviews express customers’ views and feelings, attribute performances in related studies refer to customers’ feelings on product/service attributes, which are usually expressed by customers’ emotional attitudes towards attributes. In the field of tourism, the several present studies mainly focus on the hotel industry. For example, a recent study examined how cultural traits affect the role of attribute-level experience on tourist satisfaction, and used a deep learning algorithm to propose an attribute-level sentiment analysis model to extract tourists’ attribute-level experience from online reviews. An empirical study based on nearly 50,000 online reviews collected by TripAdvisor found that positive/negative attribute experience has different impacts on customers with different cultural traits [21]. In addition, to understand the customers’ demands of five-star hotels, Bi et al. [7] proposed an online review mining method from the perspective of attribute importance–performance analysis (IPA). In this method, LDA is first used to find several useful hotel attributes from online reviews, then SVM is used to analyze customers’ performance feelings on these attributes in the reviews, and then an integrated neural network model is used to calculate the importance of attributes. Finally, an IPA diagram is constructed according to the results to analyze customer demand. From the perspective of service improvement, Zhang et al. [22] based on the existing research on the relationship between service performance and customer satisfaction, and considering the influence of consumer expectations and subjective opinions of management, proposed an online review-driven method to determine the priority of hotel service resource allocation. 
In this method, the LDA model is used to extract service attributes, and the recursive neural tensor network is used to divide the attribute sentiments involved in reviews into five categories. Then, the traditional PRCA model is improved to analyze the asymmetric relationship between attribute performances and customer satisfaction. On this basis, the customer mention frequency is calculated, the customer satisfaction function is constructed, and finally, the improvement strategy analysis is realized under the framework of the Kano model.

These studies have made outstanding contributions to the tourism field to gain insight into customer satisfaction from online reviews. However, there are still limitations in these research methods. First of all, for the research that extracts service attributes from online reviews for performance analysis, there is no quantitative measurement of the impact of these attributes on customer satisfaction, such as the following study [7]. Second, even if the influence of service attributes on customer satisfaction is measured, these studies are usually based on the assumption that customer satisfaction (online ratings) follows a Gaussian distribution, and the relationship between satisfaction and all attribute tendencies follows additive independence, such as the following study [22]. In fact, in many real situations, customer satisfaction follows a positive skew, asymmetric, bimodal (or J-shaped) distribution [23,24]. At the same time, because the attributes automatically mined from reviews are not as rigorous as those in a well-designed questionnaire, there may be more complex multilinear or nonlinear relationships between different attributes and customer satisfaction. Third, some studies do not pay attention to the categories of service attributes, such as the following study [21]. However, some studies have confirmed that service attributes can be divided into different categories, and the attributes of different categories will affect customer satisfaction in different ways [25,26]. For example, performance attributes will cause dissatisfaction when they are not implemented, but satisfaction when they are implemented; however, a higher degree of realization of the reverse attributes will lead to an increase in tourist dissatisfaction. Recently, the empirical study of Xu et al. [27] also showed that the attribute type moderates the impact of perceived attribute experience on overall satisfaction. 
Therefore, identifying the category of tourism service attributes is helpful to provide a clearer improvement direction for promoting tourism satisfaction and realize a more effective allocation of tourism service resources, which is necessary for further research.

In summary, the extant related literature does not provide an effective scheme to design tourism service improvement strategies from abundant UGC. We will develop a comprehensive approach including multiple techniques to model the complex relationship between customer satisfaction and multiple dimensions of service quality perception, and to classify the types of service attributes.

2.2. Kano Model

The Kano model is a two-dimensional demand analysis model that classifies product/service attributes into different categories. According to the realization degree of attributes and their impact on customer satisfaction, Kano divides attributes into five types, as shown in Figure 1 [25]:

(1) Performance attributes: The realization of such attributes is positively related to tourist satisfaction. When such attributes are not implemented, tourists will be dissatisfied; when implemented, they will bring satisfaction to tourists;

(2) Excitement attributes: Such attributes will make tourists satisfied when they are realized, but will not lead to dissatisfaction when they are not realized;

(3) Must-be attributes: The realization of such attributes is taken for granted by tourists and will not improve satisfaction; however, not realizing them will lead to dissatisfaction;

(4) Reverse attributes: The higher the realization of such attributes, the more dissatisfied tourists will be;

(5) Indifferent attributes: The realization degree of such attributes has nothing to do with tourist satisfaction, or has only a very small impact.
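The five definitions above can be read as a decision rule on two signed effects: the change in satisfaction when an attribute is realized and the change when it is not. The following Python sketch illustrates only that reading. The function name, the `eps` threshold, and the order of the checks are assumptions made for illustration and are not the classification rule used later in this paper.

```python
from enum import Enum


class KanoCategory(Enum):
    PERFORMANCE = "performance"
    EXCITEMENT = "excitement"
    MUST_BE = "must-be"
    REVERSE = "reverse"
    INDIFFERENT = "indifferent"


def classify_attribute(effect_fulfilled: float, effect_unfulfilled: float,
                       eps: float = 0.05) -> KanoCategory:
    """Map two signed effects on satisfaction to a Kano category.

    effect_fulfilled:   change in satisfaction when the attribute is realized
    effect_unfulfilled: change in satisfaction when it is not realized
    eps:                hypothetical threshold below which an effect counts as none
    """
    gain = effect_fulfilled > eps        # realization raises satisfaction
    harm = effect_fulfilled < -eps       # realization lowers satisfaction
    loss = effect_unfulfilled < -eps     # non-realization lowers satisfaction

    if harm:
        return KanoCategory.REVERSE
    if gain and loss:
        return KanoCategory.PERFORMANCE
    if gain:
        return KanoCategory.EXCITEMENT
    if loss:
        return KanoCategory.MUST_BE
    return KanoCategory.INDIFFERENT
```

For instance, an attribute that raises satisfaction when realized and has no noticeable effect otherwise falls into the excitement category under this reading.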

3. An Approach to Modeling Tourist Satisfaction in Online Travel Reviews

In this section, we present our methodological framework and then detail each of its parts.

3.1. Methodological Framework

This section introduces a methodological framework for modeling tourist satisfaction from online travel reviews, as shown in Figure 2. To introduce the framework more clearly, we first define some basic concepts used in this article.

Online travel reviews: An online travel review is text generated by a tourist, including the tourist’s opinions on the tourism services he/she experienced;
Tourist satisfaction: Tourist satisfaction is a kind of psychological state, resulting from tourists’ overall subjective evaluation of tourism services based on their expectations and actual performance [7]. In previous studies, online ratings of customers are usually used to represent customers’ satisfaction with products and services [8,16,28,29]. Following these studies, this paper will use online ratings of tourists to express their satisfaction levels with the tourism services they experienced;
Tourist satisfaction dimension (TSD): Tourists usually evaluate tourism services based on their perception of some important attributes of the tourism services they experienced. Similar to the research of Guo et al. [8], this paper defines these important attributes of tourism services as TSDs;
Classification of TSDs: In this paper, under the classification framework of the Kano model, TSDs are divided into five categories, namely, performance TSD, excitement TSD, must-be TSD, reverse TSD, and indifferent TSD, whose meanings correspond to the five basic attributes of Kano model.

Based on Figure 2 and the basic concepts explained above, we now describe the three main parts of the framework.

3.1.1. Mining Tourists’ Sentiments toward TSDs from Online Reviews

The collected online review data exist in the form of free text, which cannot be directly analyzed. In this part, through various text processing technologies, tourists’ positive or negative sentiments on different TSDs are mined from online text reviews, so as to transform unstructured online text reviews into a structured data matrix for modeling tourist satisfaction. Specifically, this part consists of two stages: (1) mine TSDs from online reviews using the LDA model; (2) identify the sentiment orientations of the review data for each TSD based on the sentiment dictionary.
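As a rough picture of the structured data matrix mentioned above, each review can be stored as one row with a positive and a negative indicator per TSD. The Python sketch below assumes that per-TSD sentiment labels are already available; the function name, the `"pos"`/`"neg"` labels, and the example TSD names are hypothetical.

```python
def encode_review(tsd_sentiments, tsd_labels):
    """Encode per-TSD sentiment labels as a flat 0/1 row.

    tsd_sentiments maps a TSD label to "pos" or "neg"; TSDs that the review
    does not mention are simply absent. The row layout is
    [pos_1, neg_1, pos_2, neg_2, ...] following the order of tsd_labels.
    """
    row = []
    for label in tsd_labels:
        sentiment = tsd_sentiments.get(label)
        row.extend([int(sentiment == "pos"), int(sentiment == "neg")])
    return row


# Hypothetical TSD names; a review praising the scenery and criticizing the price.
labels = ["scenery", "price", "service"]
row = encode_review({"scenery": "pos", "price": "neg"}, labels)
```

A TSD that is not mentioned contributes two zeros, so at most one of the two indicators of a TSD is set in any row.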

3.1.2. Measuring the Influence of Each TSD Sentiment on Tourism Service Satisfaction

Tourist satisfaction is expressed by the online rating given by tourists. When giving the overall rating, tourists comprehensively evaluate the performance of the tourism services in various aspects. Thus, tourist satisfaction can be viewed as a complex combination of tourist sentiments regarding the multiple TSDs covered in their reviews. Based on the structured review data obtained in the last stage, as well as overall tourist satisfaction (online ratings), this paper uses the BPNN to depict the influence of tourists’ positive and negative sentiments towards TSDs on their satisfaction, and then puts forward a method for calculating these effects from the weight parameters obtained by model training.

3.1.3. Identifying the Category of Each TSD

Identifying the corresponding attribute category of each TSD under the Kano framework is conducive to improving tourism services more effectively and thus improving tourist satisfaction. According to the importance calculated in the previous step, we can use the effect-based Kano model (EKM) to assign the extracted TSDs to five categories (performance TSD, excitement TSD, must-be TSD, reverse TSD, and indifferent TSD). Finally, the identified TSD categories inform the improvement strategy for tourism services.

We now go through each of these parts in detail, following the overall framework in Figure 2.

3.2. Excavating Tourists’ Sentiments on TSDs

3.2.1. Extracting TSDs Based on LDA

Previous studies have proved that LDA is an effective topic extraction method for online reviews [8,30]. LDA is a three-level Bayesian model, which assumes that each item in the set contains a finite number of topics. In the LDA model, words, topics, and documents are three important concepts. The word is a basic unit. Each document consists of multiple words and contains one or more topics. In this context, each online review can be viewed as a document, so the words in the document are the words in the review. According to the frequency of each word in each review, we can obtain the topic distribution of the review, and the word distribution of the topic by training the LDA model.

Extracting TSDs from travel reviews using LDA mainly includes two steps: preprocessing text reviews and extracting TSDs from review data:

1. Preprocessing text review data

In the Chinese travel text review data, there are not only words related to the required TSDs, but also a large number of noise words and irrelevant words, which will not become target TSDs and will aggravate the data sparsity problem. Therefore, it is necessary to improve the effect of the LDA model through preprocessing. First, we segment the Chinese text review data into words. Then, we filter the corresponding words in the sentences according to the stop word dictionary (HIT Stop Word List, Chinese Stop Word List), negation dictionary, degree adverb dictionary, and sentiment dictionary. Let $R = \{r_1, r_2, \dots, r_M\}$ denote the online review dataset, where $r_m$ ($m = 1, 2, \dots, M$) denotes the $m$th text review and $M$ is the number of reviews in the dataset. By counting the occurrence frequency of each word in each preprocessed text review, we can obtain the review-word matrix $X_{M \times N}$, where $N$ denotes the number of words appearing in all the preprocessed review data.

2. TSDs extraction based on LDA

By using the obtained $X_{M \times N}$ matrix as input, the LDA model can be trained. The output of the trained LDA model has three parts: the review-topic matrix, the topic-word matrix, and the topic list. Because there are noise words in the obtained topics, and some topic words may have similar meanings, we can manually filter the noise words and merge topic words with similar meanings to obtain more reasonable results. Then, we select the appropriate topics from the results and assign a tag to each topic. As in some existing studies, each topic can be regarded as a TSD [7,16]. Let $I$ denote the number of TSDs (i.e., topics), each consisting of multiple frequent words, and let $C_i$ denote the number of frequent words under the $i$th topic, so that the $i$th TSD can be denoted as $t_i = \{word_{i1}, word_{i2}, \dots, word_{iC_i}\}$, where $word_{ij}$ ($i = 1, 2, \dots, I$; $j = 1, 2, \dots, C_i$) denotes the $j$th frequent word in the $i$th TSD.
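The construction of the review-word matrix in the preprocessing step can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the reviews are taken to be already segmented into words, English tokens stand in for Chinese ones, and the function name and the tiny stop word set are hypothetical. Training the LDA model on the resulting matrix is left to a topic-modeling library and is not shown.

```python
from collections import Counter


def build_review_word_matrix(tokenized_reviews, stop_words):
    """Return (vocabulary, X), where X[m][n] counts word n in review m."""
    # Drop stop words before counting, as in the preprocessing step.
    filtered = [[w for w in review if w not in stop_words]
                for review in tokenized_reviews]
    vocabulary = sorted({w for review in filtered for w in review})
    index = {w: n for n, w in enumerate(vocabulary)}
    matrix = []
    for review in filtered:
        row = [0] * len(vocabulary)
        for word, count in Counter(review).items():
            row[index[word]] = count
        matrix.append(row)
    return vocabulary, matrix


# Two toy reviews, already segmented into words (an assumption).
reviews = [
    ["the", "scenery", "is", "beautiful", "scenery"],
    ["the", "ticket", "price", "is", "high"],
]
vocab, X = build_review_word_matrix(reviews, stop_words={"the", "is"})
```

Here `X` has M = 2 rows and N = 5 columns, one column per distinct word that survives filtering.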

3.2.2. Dictionary-Based TSD Granularity Sentiment Recognition

Review text decomposition

Typically, a single online review may contain several sentences related to different TSDs. Table 1 shows several examples of online travel review text. At the same time, the TSDs mentioned in different reviews may be different. To identify tourists’ emotional attitudes towards different attributes, it is necessary to extract the sentences containing each attribute from the original reviews, that is, to decompose the original reviews into components under different TSDs. First, we divide the online reviews in $R$ into clauses according to punctuation marks.

Then, according to the obtained $t_i = \{word_{i1}, word_{i2}, \dots, word_{iC_i}\}$, we extract from $R$ the sentences that contain $word_{ij}$ ($i = 1, 2, \dots, I$; $j = 1, 2, \dots, C_i$), and obtain the review set $R_i$.
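The decomposition described above, splitting reviews into clauses at punctuation marks and collecting the clauses that mention a TSD's frequent words, might look as follows in Python. The delimiter set, the function name, and the example TSDs are assumptions for illustration. Substring matching is used because Chinese text has no spaces between words, and an English review is used here only for readability.

```python
import re

# Clause delimiters: common Chinese and Western punctuation marks (an assumed set).
CLAUSE_DELIMITERS = r"[，。！？；,.!?;]+"


def decompose_review(review, tsd_keywords):
    """Split a review into clauses and group them by the TSD they mention.

    tsd_keywords maps a TSD label to its frequent words (t_i in the text).
    A clause mentioning words from several TSDs is assigned to each of them.
    """
    clauses = [c.strip() for c in re.split(CLAUSE_DELIMITERS, review) if c.strip()]
    grouped = {label: [] for label in tsd_keywords}
    for clause in clauses:
        for label, words in tsd_keywords.items():
            if any(word in clause for word in words):
                grouped[label].append(clause)
    return grouped


# Hypothetical TSDs and frequent words.
tsds = {
    "scenery": ["scenery", "view"],
    "price": ["ticket", "price"],
}
review = "The scenery is stunning, but the ticket price is too high. Staff were polite!"
parts = decompose_review(review, tsds)
```

Clauses that mention none of the frequent words, such as the last one in this example, are not assigned to any TSD.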

3.3. Evaluate the Impact of Each TSD’s Sentiment Orientation on Tourist Satisfaction

In this section, we propose a method based on backpropagation neural network (BPNN) to measure how tourists’ sentiments towards TSDs affect their satisfaction.

Most current studies that model tourist satisfaction from online review data rest on the following assumptions [28,29]: (1) tourist satisfaction (i.e., online rating) follows a Gaussian distribution; (2) tourist satisfaction is a linear combination of tourists’ emotional attitudes towards TSDs; (3) the multicollinearity between different TSDs is low. However, in many practical problems, these assumptions are not satisfied. In practice, the TSDs mined from online reviews and the online ratings of tourists usually have the following characteristics: (1) tourist satisfaction (online ratings) usually follows a positively skewed, asymmetric, bimodal (or J-shaped) distribution [23]; (2) tourist satisfaction may be a nonlinear combination of sentiments toward TSDs; (3) there may be multicollinearity between the different attributes automatically mined from the reviews, and there may be a complex nonlinear relationship between the attributes and tourist satisfaction. In fact, tourist satisfaction is a complex combination of tourists’ emotional attitudes towards the full range of TSDs involved in the reviews. Therefore, considering the above characteristics, this section proposes a new method to measure the impact of tourists’ TSD sentiments on tourist satisfaction.

A neural network (NN) is an effective prediction method. In some complex data environments (such as non-normal data, nonlinear relationships, and multicollinearity), an NN performs significantly better than a multiple regression model because it is not affected by collinear independent variables and does not require a linear relationship between the input variables and the dependent variable [7,33]. Although NNs were proposed for prediction tasks, some studies have shown that they can also be used to determine the weight information of input variables [34]. For example, artificial NNs were utilized in [35] to evaluate the relative importance of the factors influencing consumer acceptance of behavioral-targeted advertising services. Therefore, an NN is a competitive alternative for measuring the influence of tourists’ positive and negative TSD sentiments on their satisfaction. BPNN is one of the most popular NN models and is used as the importance measurement technique in this paper. Specifically, this paper builds a BPNN model with a three-layer network structure, consisting of an input layer, a hidden layer, and an output layer, as shown in Figure 3. Generally, training a BPNN involves two processes: forward information propagation and reverse error propagation. In the forward process, the input nodes transmit the tourists’ sentiment information about each TSD to the hidden nodes, and the hidden nodes then transmit the corresponding information to the output layer through the activation function. In the reverse process, the gradient descent method is used to minimize the error between the model output and the real result, and the model parameters (namely the weights) are updated accordingly [36].
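To make the forward and reverse processes concrete, the following Python sketch trains a very small three-layer network with plain stochastic gradient descent. It is not the configuration used in this paper: the sigmoid hidden layer, the linear output node, the omission of bias terms, the learning rate, the number of hidden nodes, and the toy data (ratings rescaled to the range 0 to 1) are all assumptions chosen to keep the example short.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_bpnn(samples, n_hidden=3, lr=0.1, epochs=2000, seed=0):
    """Train a one-hidden-layer network with stochastic gradient descent.

    samples: list of (x, y), with x a list of 0/1 inputs and y a float target.
    Returns (w_in, w_out, losses): w_in[i][h] are input-to-hidden weights,
    w_out[h] are hidden-to-output weights, losses is the mean squared error
    per epoch. Bias terms are omitted to keep the sketch short.
    """
    rng = random.Random(seed)
    n_in = len(samples[0][0])
    w_in = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)] for _ in range(n_in)]
    w_out = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
    losses = []
    for _ in range(epochs):
        total = 0.0
        for x, y in samples:
            # Forward pass: inputs -> sigmoid hidden layer -> linear output.
            hidden = [sigmoid(sum(x[i] * w_in[i][h] for i in range(n_in)))
                      for h in range(n_hidden)]
            y_hat = sum(hidden[h] * w_out[h] for h in range(n_hidden))
            err = y_hat - y
            total += err * err
            # Backward pass: propagate the error and update the weights.
            for h in range(n_hidden):
                grad_hidden = err * w_out[h] * hidden[h] * (1.0 - hidden[h])
                w_out[h] -= lr * err * hidden[h]
                for i in range(n_in):
                    w_in[i][h] -= lr * grad_hidden * x[i]
        losses.append(total / len(samples))
    return w_in, w_out, losses


# Toy inputs laid out as [E1_pos, E1_neg, E2_pos, E2_neg] with rescaled ratings.
samples = [
    ([1, 0, 1, 0], 1.0),
    ([1, 0, 0, 1], 0.6),
    ([0, 1, 1, 0], 0.4),
    ([0, 1, 0, 1], 0.0),
    ([0, 0, 1, 0], 0.7),
    ([1, 0, 0, 0], 0.8),
]
w_in, w_out, losses = train_bpnn(samples)
```

The returned `w_in` and `w_out` play the role of the input-to-hidden and hidden-to-output weights that the importance calculation in this section relies on.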

Let $x_m = (E_{1m}^{Pos}, E_{1m}^{Neg}, \dots, E_{im}^{Pos}, E_{im}^{Neg}, \dots, E_{Im}^{Pos}, E_{Im}^{Neg})$ denote the structured data of $r_m$, that is, the emotional attitudes of tourists towards each TSD (a row of data in Table 5), where $E_{im}^{Pos}, E_{im}^{Neg} \in \{0, 1\}$ and $E_{im}^{Pos} + E_{im}^{Neg} \leq 1$ ($i = 1, 2, \dots, I$; $m = 1, 2, \dots, M$). In addition, let $y_m$ denote the tourist satisfaction corresponding to the review $r_m$. Then, each training sample is composed of $x_m$ and $y_m$, and the training set can be denoted as $S = \{(x_1, y_1), (x_2, y_2), \dots, (x_M, y_M)\}$. The following describes in detail the method of calculating the impact of tourists’ TSD sentiments on tourist satisfaction based on BPNN training.

(1) Let $b$ denote the trained BPNN model. Let $w_{ih}^{Pos}$ and $w_{ih}^{Neg}$ denote the weights between the input nodes $E_i^{Pos}$ and $E_i^{Neg}$ in BPNN $b$ and the $h$th hidden node, respectively (the blue line in Figure 3), where $i = 1, 2, \dots, I$ and $h = 1, 2, \dots, D$. Let $w_h$ denote the weight between the $h$th hidden node in $b$ and the output node (the yellow line in Figure 3), $h = 1, 2, \dots, D$. Let $W_i^{Pos}$ and $W_i^{Neg}$, respectively, denote the influence of positive and negative sentiments of tourists towards each TSD on tourist satisfaction. $W_i^{Pos}$ and $W_i^{Neg}$ can be obtained from Formulas (2) and (3), respectively.

$$W_i^{Pos} = \frac{\sum_{h=1}^{D} (w_{ih}^{Pos} \times w_h)}{\sum_{i=1}^{I} \sum_{h=1}^{D} |w_{ih}^{Pos} \times w_h| + \sum_{i=1}^{I} \sum_{h=1}^{D} |w_{ih}^{Neg} \times w_h|}, \quad (i = 1, 2, \dots, I), \tag{2}$$

$$W_i^{Neg} = \frac{\sum_{h=1}^{D} (w_{ih}^{Neg} \times w_h)}{\sum_{i=1}^{I} \sum_{h=1}^{D} |w_{ih}^{Pos} \times w_h| + \sum_{i=1}^{I} \sum_{h=1}^{D} |w_{ih}^{Neg} \times w_h|}, \quad (i = 1, 2, \dots, I). \tag{3}$$

(2) To reduce overfitting and enhance the reliability of the results, we conduct 10-fold cross-validation on the dataset. According to Formulas (2) and (3), we calculate the sentiment weight information of the 10 trained BPNN models, respectively, and take their average value as the final required result, denoted as $\bar{W}_i^{Pos}$ and $\bar{W}_i^{Neg}$, $i = 1, 2, \dots, I$.

(3) Based on the calculated $\bar{W}_{i}^{Pos}$ and $\bar{W}_{i}^{Neg}$, we can evaluate the total impact (relative importance weight) of the $i$th TSD on tourist satisfaction. Let $R_{i}$ denote the range of influence of the $i$th TSD on tourist satisfaction. $R_{i}$ can be calculated by Formula (4):

R_{i} = | \bar{W}_{i}^{Pos} - \bar{W}_{i}^{Neg} |, (i = 1, 2, \dots, I).

(4)

(4) Let $TE_{i}$ denote the total impact (importance weight) of the $i$th TSD on tourist satisfaction, $i = 1, 2, \dots, I$. Then, following the studies [37,38], $TE_{i}$ is calculated by Formula (5):

TE_{i} = \frac{R_{i}}{\sum_{k = 1}^{I} R_{k}}, (i = 1, 2, \dots, I).

(5)
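Steps (2) to (4) can be illustrated with a short sketch: average the per-fold weights, take the range in Formula (4), and normalize as in Formula (5). The helper names and the two-fold toy data are hypothetical (the study uses 10 folds).

```python
from typing import List, Sequence


def average_over_folds(per_fold: Sequence[Sequence[float]]) -> List[float]:
    """Average each dimension's weight over the cross-validation folds."""
    n_folds = len(per_fold)
    return [sum(column) / n_folds for column in zip(*per_fold)]


def total_impact(w_pos_bar: Sequence[float], w_neg_bar: Sequence[float]) -> List[float]:
    """Return TE_i for every dimension from the averaged weights."""
    # Formula (4): range of influence of each dimension.
    ranges = [abs(p - n) for p, n in zip(w_pos_bar, w_neg_bar)]
    total = sum(ranges)
    if total == 0:
        raise ValueError("all ranges are zero")
    # Formula (5): normalize so that the importance weights sum to 1.
    return [r / total for r in ranges]


# Toy per-fold weights (hypothetical): 2 folds, 2 dimensions.
pos_bar = average_over_folds([[0.1, 0.5], [0.3, 0.3]])      # about [0.2, 0.4]
neg_bar = average_over_folds([[-0.1, -0.3], [-0.3, -0.1]])  # about [-0.2, -0.2]
te = total_impact(pos_bar, neg_bar)
print(te)
```

A dimension whose positive and negative weights are far apart receives a large share of the total importance, regardless of the signs of the individual weights.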

3.4. TSD Category Recognition Based on Kano Model

According to the obtained $\bar{W}_{i}^{Pos}$ and $\bar{W}_{i}^{Neg}$, as well as the basic principle of the Kano model, we propose an Effect-based Kano Model (EKM), which identifies the category of each TSD from the perspective of tourists. The core idea of EKM is shown in Figure 4.

(1) In Figure 4a, positive sentiment indicates that the performance of the TSD meets the requirements of tourists (the green rectangle in Figure 4a); in contrast, negative sentiment indicates that the performance of the TSD does not meet the requirements of tourists (the red rectangle in Figure 4a). In addition, the online ratings of tourists indicate their overall satisfaction with the tourism services they experienced.

(2) In Figure 4b, with the introduction of $\bar{W}_{i}^{Pos}$ and $\bar{W}_{i}^{Neg}$, the traditional Kano model framework is divided into two parts. The right side relates to positive sentiments, that is, the requirements for the TSD are fulfilled; here, $\bar{W}_{i}^{Pos}$ can be regarded as the influence of $t_{i}$ on the overall satisfaction of tourists when TSD $t_{i}$ is satisfied. Accordingly, the left side of Figure 4b relates to negative sentiments, that is, the requirements of the TSD are not fulfilled; here, $\bar{W}_{i}^{Neg}$ can be regarded as the influence of $t_{i}$ on the overall satisfaction of tourists when TSD $t_{i}$ is not satisfied. The detailed meanings of $\bar{W}_{i}^{Pos}$ and $\bar{W}_{i}^{Neg}$ are as follows:

(i) $\bar{W}_{i}^{Pos} > 0$ indicates that the overall satisfaction of tourists will increase when their requirements for $t_{i}$ are satisfied;

(ii) $\bar{W}_{i}^{Pos} \leq 0$ indicates that the overall satisfaction of tourists will not increase when their requirements for $t_{i}$ are satisfied;

(iii) $\bar{W}_{i}^{Neg} \geq 0$ indicates that the overall satisfaction of tourists will not decrease when their requirements for $t_{i}$ are not satisfied;

(iv) $\bar{W}_{i}^{Neg} < 0$ indicates that the overall satisfaction of tourists will decrease when their requirements for $t_{i}$ are not satisfied.

(3) In Figure 4c, $\bar{W}_{i}^{Pos}$ and $\bar{W}_{i}^{Neg}$ are taken as the horizontal axis and vertical axis, respectively. Therefore, a TSD represented as a curve in Figure 4b can be converted into a point in Figure 4c. Thus, according to the basic principles of the Kano model, combined with $\bar{W}_{i}^{Pos}$ and $\bar{W}_{i}^{Neg}$, TSDs can be divided into five types in Figure 4c. The detailed explanation is as follows:

(i) If both $| \bar{W}_{i}^{Pos} | < \frac{1}{10 \times I}$ and $| \bar{W}_{i}^{Neg} | < \frac{1}{10 \times I}$, this indicates that $t_{i}$ has a very small effect on the overall satisfaction of tourists, and $t_{i}$ is an indifferent attribute. It is worth noting that $\frac{1}{10 \times I}$ is the threshold that determines whether a TSD is an indifferent attribute; the classification conditions for the other cases, in which $t_{i}$ is not indifferent, are given in (ii)–(v);

(ii) If $\bar{W}_{i}^{Pos} \leq 0$ and $\bar{W}_{i}^{Neg} < 0$, then $t_{i}$ is a must-be attribute, that is, when the requirements of tourists for $t_{i}$ are satisfied, the overall satisfaction of tourists will not increase. When they are not satisfied, the overall satisfaction of tourists will decrease;

(iii) If $\bar{W}_{i}^{Pos} \leq 0$ and $\bar{W}_{i}^{Neg} \geq 0$, then $t_{i}$ is a reverse attribute, that is, when the requirements of tourists for $t_{i}$ are satisfied, the overall satisfaction of tourists will not increase. When they are not satisfied, the overall satisfaction of tourists will not decrease;

(iv) If $\bar{W}_{i}^{Pos} > 0$ and $\bar{W}_{i}^{Neg} < 0$, then $t_{i}$ is a performance attribute, that is, when the requirements of tourists for $t_{i}$ are satisfied, the overall satisfaction of tourists will increase. When they are not satisfied, the overall satisfaction of tourists will decrease;

(v) If $\bar{W}_{i}^{Pos} > 0$ and $\bar{W}_{i}^{Neg} \geq 0$, then $t_{i}$ is an excitement attribute, that is, when the requirements of tourists for $t_{i}$ are satisfied, the overall satisfaction of tourists will increase. When they are not satisfied, the overall satisfaction of tourists will not decrease.
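The five-way EKM classification can be sketched as a small decision function. This is an illustration under stated assumptions rather than the authors' code: the function name `classify_tsd` is hypothetical, the indifference test uses the threshold 1/(10 × I), and the remaining categories are decided by the signs of the averaged weights, following the verbal description of each category (whether satisfaction increases when requirements are met, and whether it decreases when they are not). The input pairs are the first two numeric columns of Table 7, read here as the averaged positive and negative weights.

```python
from collections import Counter
from typing import Dict, Tuple


def classify_tsd(w_pos: float, w_neg: float, n_dims: int) -> str:
    """Return the EKM category of one TSD from its averaged sentiment weights."""
    threshold = 1.0 / (10 * n_dims)  # indifference threshold 1/(10 x I)
    if abs(w_pos) < threshold and abs(w_neg) < threshold:
        return "indifferent"
    if w_pos <= 0:
        # Meeting the requirement does not raise satisfaction.
        return "must-be" if w_neg < 0 else "reverse"
    # Meeting the requirement raises satisfaction.
    return "performance" if w_neg < 0 else "excitement"


# (positive weight, negative weight) pairs copied from Table 7.
TABLE_7: Dict[str, Tuple[float, float]] = {
    "t1": (-0.0034, -0.0069), "t2": (-0.0077, -0.0096), "t3": (-0.0018, -0.0076),
    "t4": (-0.0087, -0.0030), "t5": (-0.0051, -0.0042), "t6": (-0.0168, -0.0070),
    "t7": (0.0013, -0.0010), "t8": (0.0008, -0.0064), "t9": (-0.0087, 0.0023),
    "t10": (-0.0089, -0.0069), "t11": (-0.0036, -0.0038), "t12": (-0.0196, -0.0117),
    "t13": (-0.0027, -0.0020), "t14": (-0.0196, -0.0105), "t15": (-0.0034, 0.0016),
    "t16": (0.0053, -0.0062), "t17": (-0.0059, -0.0022), "t18": (-0.0040, -0.0079),
}

categories = {
    tsd: classify_tsd(w_pos, w_neg, len(TABLE_7))
    for tsd, (w_pos, w_neg) in TABLE_7.items()
}
print(Counter(categories.values()))
```

With these inputs the sketch yields 10 must-be, 5 indifferent, 2 performance, and 1 reverse TSD, and no excitement TSD, which matches the classification reported in Section 4.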

4. Empirical Research Based on Online Review Data

This section verifies the method proposed in Section 3 using online reviews posted by tourists. The study was carried out on a PC configured with 8 GB of memory, a 1.8 GHz dual-core Intel Core i5 processor, and an iOS operating system. Data processing and method implementation use Python. The following subsections first introduce the experimental data and then explain the experimental procedures and some important experimental results.

4.1. Data Source

The tourism industry is a pillar industry in Guizhou, China. For this study, we crawled 10,307 online reviews of tourist attractions in Guizhou posted between 1 January 2020 and 31 May 2021 from Ctrip (www.ctrip.com, accessed on 8 June 2021), one of the most popular online travel platforms in China. The dataset spans the initial period of the COVID-19 outbreak through the period of normal management of the pandemic, so it can reflect the views of tourists on the services of tourist attractions in Guizhou during the COVID-19 period. Figure 5 is an example of the data collected. As can be seen from the figure, the collected data include the review text of tourists, the ratings of tourists (overall satisfaction), the date of the reviews, etc.

4.2. Exploring Tourist Sentiments towards TSDs

Following the TSD extraction process described in Section 3.1.1, we use the LDA model to extract TSDs from the 10,307 reviews. The LDA parameters are num_topics = 20, alpha = 0.1, eta = 0.01, and passes = 2000. By filtering the noise words under each topic, merging topics with similar meanings, and assigning a label to each topic, we finally obtain 18 TSDs, as shown in Table 6. In Table 6, $C_{i}$ $(i = 1, 2, \dots, 18)$ is the number of frequent words included in $t_{i}$ (i.e., TSD), and the total frequency is the total number of times the words $word_{ij}$ in $t_{i}$ appear in all reviews. As can be seen from the table, ‘Huangguoshu Waterfall’ and ‘sightseeing traffic of tourist attractions’ are the two TSDs about which tourists are most concerned, each appearing more than 7000 times in the review data. Next, the two TSDs ‘overall feeling’ and ‘ticket service’ appear 6427 and 4850 times, respectively. Finally, the four TSDs ‘Pingba Cherry Blossom Garden’, ‘Zunyi Red culture’, ‘Fanjing Mountain’, and ‘Minority Cultures’ each appear more than 3000 times.

According to Section 3.1.2, the sentiment orientation of the 10,307 reviews with respect to the 18 TSDs can be determined. The statistical results for the sentiment orientation of each TSD across all reviews are shown in Figure 6. As can be seen from the figure, for all TSDs, positive reviews outnumber negative reviews. Among them, $t_{16}$, i.e., the dimension ‘overall feeling’, shows the largest number of positive sentiments, far more than the other dimensions. In contrast, $t_{5}$, i.e., the dimension ‘sightseeing traffic of tourist attractions’, has the most negative sentiments. Next, we convert the nominal encoded data into structured data according to Formula (1).
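The conversion from nominal codes to structured input can be sketched as follows, assuming the mapping shown in Tables 4 and 5: ‘Positive’ becomes the pair (1, 0), ‘Negative’ becomes (0, 1), and ‘Null’ becomes (0, 0). The names `ENCODING` and `encode_review` are hypothetical, and the example uses three dimensions instead of 18 for brevity.

```python
from typing import Dict, List, Tuple

# Each TSD contributes two input nodes: (E_pos, E_neg).
ENCODING: Dict[str, Tuple[int, int]] = {
    "Positive": (1, 0),
    "Negative": (0, 1),
    "Null": (0, 0),
}


def encode_review(orientations: List[str]) -> List[int]:
    """Flatten the per-TSD sentiment labels of one review into a 0/1 vector."""
    vector: List[int] = []
    for label in orientations:
        vector.extend(ENCODING[label])
    return vector


# One review that is silent on t1, positive on t2, and negative on t3.
x = encode_review(["Null", "Positive", "Negative"])
print(x)  # [0, 0, 1, 0, 0, 1]
```

Under this mapping at most one of the two nodes for a dimension is set, so the sum of the positive and negative indicators for any dimension never exceeds 1.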

4.3. Measuring the Influence of TSD Sentiment on Tourist Satisfaction

We used the obtained structured data to train the BPNN model with 37 nodes in the hidden layer, a learning rate of 0.6, a maximum allowable error of 0.01, and 1000 iterations. To mitigate overfitting and enhance the reliability of the results, we performed 10-fold cross-validation on the dataset and recorded the weight parameters of the 10 trained BPNN models. The RMSE values of the 10 models are shown in Figure 7. It can be seen from the figure that the RMSE values of the 10 models are all small, indicating good prediction performance. Finally, according to Formulas (2) and (3), we calculate $W_{i}^{Pos}$ and $W_{i}^{Neg}$ $(i = 1, 2, \dots, 18)$ for each TSD in each of the 10 BPNN models and take their averages as the final $\bar{W}_{i}^{Pos}$ and $\bar{W}_{i}^{Neg}$ of each TSD, as shown in Table 7.
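The evaluation protocol described above (10 folds, one RMSE per fold) can be sketched without any particular neural network library. The helpers `k_fold_indices` and `rmse` are hypothetical, and the contiguous, unshuffled split is an assumption; the text does not state how the folds were drawn.

```python
import math
from typing import List, Sequence, Tuple


def k_fold_indices(n_samples: int, k: int = 10) -> List[Tuple[List[int], List[int]]]:
    """Split range(n_samples) into k contiguous folds of near-equal size.

    Returns one (train_indices, test_indices) pair per fold.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    # The first n_samples % k folds receive one extra sample.
    sizes = [n_samples // k + (1 if f < n_samples % k else 0) for f in range(k)]
    splits = []
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        splits.append((train, test))
        start += size
    return splits


def rmse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Root mean squared error between ratings and model predictions."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


splits = k_fold_indices(10307, k=10)
print([len(test) for _, test in splits])
print(rmse([5.0, 4.0], [5.0, 2.0]))
```

In practice one model would be trained on each `train` index list and scored on the matching `test` list, giving the ten RMSE values of the kind plotted in Figure 7.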

4.4. TSD Category Recognition Based on the Kano Model

Based on the results in Table 7 and the EKM attribute classification process in Figure 4, the TSD categories of Guizhou tourism services are obtained, as shown in Figure 8. In the figure, 0.0056 (that is, 1/(10 × 18)) is the threshold that determines whether a TSD is indifferent (the blue area in Figure 8). As can be seen from the figure, most TSDs of Guizhou tourism services are must-be attributes, and no TSD is identified as an excitement attribute. Specifically, the indifferent TSDs are ‘Huangguoshu Waterfall’, ‘Karst cave scenery’, ‘Huaxi Wetland Park’, ‘Zunyi Red culture’, and ‘sightseeing traffic of tourist attractions’. The must-be TSDs are ‘Fanjing Mountain’, ‘the Hmong village’, ‘natural scenery’, ‘landmark buildings of Guiyang City’, ‘tour guide and hotel’, ‘ticket service’, ‘night view of Maotai Town’, ‘Pingba Cherry Blossom Garden’, ‘Minority Cultures’, and ‘ancient town tourism’. The performance TSDs are ‘Qianling Mountain’ and ‘overall feeling’. The only reverse TSD is ‘adverse environmental factors’.

According to the classification results of TSDs, we can determine the TSD priority order in the formulation of the tourism service promotion strategy. First of all, the above 10 must-be TSDs represent the tourists’ basic requirements for Guizhou tourism services. When these requirements are not fulfilled, tourists will feel very dissatisfied. Therefore, relevant tourism managers should first examine must-be TSDs and try to fulfill the requirements associated with them. Then, since tourist satisfaction is directly proportional to the level of performance TSDs, relevant managers should consider fulfilling tourists’ requirements related to performance TSDs on the premise of fulfilling the must-be TSDs. Finally, for reverse TSDs, managers should carefully examine these factors to avoid such situations as much as possible.

5. Conclusions

Online tourism platforms provide an open and convenient channel for tourists to share their travel experiences. At the same time, for the tourism industry, these online travel reviews also contain valuable information related to the quality of tourism services. This paper focuses on how to measure the key factors of customer satisfaction from online travel review data and then classify those factors to provide a basis for tourism service quality improvement strategies. To this end, this paper proposed a framework for modeling customer satisfaction from free review texts. First, this paper used the LDA model to explore the key dimensions of tourist satisfaction from online reviews. Next, based on the sentiment dictionary, we identified tourists’ sentiment attitudes towards all dimensions from these reviews. Then, we used a backpropagation neural network (BPNN) to measure the complex relationship between tourists’ sentiment orientations towards different dimensions and their satisfaction. Finally, according to the improved Kano model, namely EKM, multi-dimensional attribute classification was realized to support the analysis of tourism service quality improvement.

5.1. Theoretical Contribution

The contribution of this paper has three aspects. First, this paper contributes to the research on online product/service reviews in the field of Information Systems (IS). Most of the related literature studies the impact of online reviews on customer purchases [39,40], explores the antecedents of the usefulness of product reviews [41,42], or studies the factors influencing online review behavior [43]. This paper instead focuses on mining customers’ service experience from online reviews, expands the scope of IS research, and puts forward an effective and feasible method to transform a large amount of online review data into useful business intelligence. Specifically, compared to [7,21,22,23,24], we model customer satisfaction more accurately through BPNN and classify the types of service attributes through the quantitative calculation process of EKM.

Secondly, this study contributes to the literature in the field of tourism by providing an effective methodological framework for analyzing tourist satisfaction from the perspective of customer demand. Customer satisfaction is usually evaluated by conducting market surveys or experiments to collect and analyze customer preference data. However, such preference surveys often require considerable time and cost. The framework provided in this study instead supports a dimension-level evaluation of tourism service satisfaction based on online travel review data. In particular, the BPNN-based calculation of the influence of each tourist satisfaction dimension can model more complex data and does not require the assumption that satisfaction (online rating) follows a Gaussian distribution or that the attribute sentiments are additively independent, which is more realistic.

Thirdly, this study contributes to the literature in the field of market analysis. This study proposes an Effect-based Kano Model (EKM), which is oriented by tourist satisfaction and realizes the classification and prioritization of tourists’ requirements in the dimension of service attributes. By integrating the influence of customer sentiment on satisfaction in each service dimension with the traditional Kano model, EKM can analyze and visualize customer requirements for tourism services. The validity of EKM is demonstrated empirically using the dataset in this paper.

5.2. Practical Contribution

This article also offers several practical insights for tourism management. Using the proposed method, this paper mines four types of information from online travel reviews: the TSDs of tourism services, tourist sentiments towards these TSDs, the impact of those sentiments on tourist satisfaction, and the category of each TSD. These four types of information can help tourism industry managers and provide a reference analysis method for other product/service industries.

Firstly, the TSDs mined from online reviews usually represent the aspects of tourism services that tourists care about most. Therefore, according to the extracted TSDs, managers can know which tourism service dimensions are of most concern to tourists.

Secondly, tourists’ views or sentiments on tourism service TSDs reflect their multi-dimensional perception of tourism services and can be regarded as the actual performance of the tourism service dimension provided. Therefore, managers can evaluate the performance of tourism services in each dimension based on customers’ views or sentiments on the service dimensions. On this basis, managers can further understand the advantages and disadvantages of tourism services.

Thirdly, the proposed method enables managers to assess the influence of tourists’ TSD sentiment on their satisfaction. This influence can be regarded as aggregated customer preferences, which are important for planning and decision-making related to service quality optimization.

Fourthly, the method in this paper can enable managers to understand the different categories of tourism service TSDs, which is crucial for managing tourist demands and deciding service optimization strategies. Specifically, the service characteristics of tourism are divided into five categories: performance, excitement, must-be, reverse, and indifferent. This understanding can help decision makers in the tourism industry decide the priority of TSD development or promotion and make more effective development plans for tourism service quality.

5.3. Limitations and Future Research Directions

This study also has several limitations that deserve further attention. Firstly, this study assumes that all online reviews obtained from tourism social platforms are authentic. However, spam reviews, which may be manipulated or fake, are common on online platforms. At present, fake review recognition is receiving increasing research attention [44]. Therefore, combining effective fake review recognition methods with this study is a necessary and interesting line of research. Secondly, a potential problem of this study is the representativeness of reviewers on online tourism platforms; it is widely recognized that online reviewers may not be representative of all target groups. Therefore, the tourism service demands inferred from online reviews cannot reasonably be generalized to all tourists. To reduce this concern, one possible direction for future research is to incorporate review data from multiple sources (multiple online tourism platforms) to reduce bias related to a single platform. Alternatively, we can collect characteristic information about reviewers and adopt statistical methods to address the self-selection problem in online review data. Thirdly, this study analyzes all tourists as a whole. In fact, different consumer types often have different requirements for service attributes. In the future, we can further consider the customer type factor based on the current study. Fourthly, a potential research direction is to study other, potentially more effective methods for analyzing tourist sentiment, such as building a professional sentiment dictionary for the field of tourism, so as to improve the accuracy of the analysis results.

Author Contributions

Conceptualization, Z.Y.; formal analysis, K.Z.; methodology, K.Z.; supervision, Z.Y.; visualization, K.Z.; writing—original draft, Z.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Oliver, R.L. A cognitive model of the antecedents and consequences of satisfaction decisions. J. Mark. Res. 1980, 17, 460–469.
2. Al-Ababneh, M.M. Service quality and its impact on tourist satisfaction. Inst. Interdiscip. Bus. Res. 2013, 164–177.
3. Sepehr, S.; Head, M. Understanding the Role of Competition in Video Gameplay Satisfaction. Inf. Manag. 2018, 55, 407–421.
4. Iqbal, Z.; Grigg, N.; Govindaraju, K.; Campbell-Allen, N. A distance-based methodology for increased extraction of information from the roof matrices in QFD studies. Int. J. Prod. Res. 2016, 54, 3277–3293.
5. Jia, W.; Liu, Z.; Lin, Z.; Qiu, C.; Tan, J. Quantification for the importance degree of engineering characteristics with a multi-level hierarchical structure in QFD. Int. J. Prod. Res. 2016, 54, 1627–1649.
6. Groves, R.M. Nonresponse rates and nonresponse bias in household surveys. Int. J. Public Opin. Q. 2006, 70, 646–675.
7. Bi, J.W.; Liu, Y.; Fan, Z.; Zhang, J. Wisdom of crowds: Conducting importance-performance analysis (IPA) through online reviews. Tourism Manag. 2019, 70, 460–478.
8. Guo, Y.; Barnes, S.; Jia, Q. Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent Dirichlet allocation. Tourism Manag. 2017, 59, 467–483.
9. Cui, R.; Gallino, S.; Moreno, A.; Zhang, D. The operational value of social media information. Prod. Oper. Manag. 2018, 27, 1749–1769.
10. Ince, T.; Bowen, D. Consumer satisfaction and services: Insights from dive tourism. Serv. Ind. J. 2011, 31, 1769–1792.
11. Geetha, M.; Singha, P.; Sinha, S. Relationship between customer sentiment and online customer ratings for hotels - An empirical analysis. Tourism Manag. 2017, 61, 43–54.
12. Park, S.; Lee, J.S.; Nicolau, J. Understanding the dynamics of the quality of airline service attributes: Satisfiers and dissatisfiers. Tourism Manag. 2020, 81, 104163.
13. Fan, Z.P.; Li, G.M.; Liu, Y. Processes and methods of information fusion for ranking products based on online reviews: An overview. Inf. Fusion 2020, 60, 87–97.
14. Kangale, A.; Kumar, S.K.; Naeem, M.A.; Williams, M.; Tiwari, M.K. Mining consumer reviews to generate ratings of different product attributes while producing feature-based review-summary. Int. J. Syst. Sci. 2016, 47, 3272–3286.
15. Wong, T.; Lam, W. Hot item mining and summarization from multiple auction web sites. In Proceedings of the Fifth IEEE International Conference on Data Mining (ICDM’05), Houston, TX, USA, 27–30 November 2005; pp. 797–800.
16. Tirunillai, S.; Tellis, G.J. Mining marketing meaning from online chatter: Strategic brand analysis of big data using latent Dirichlet allocation. J. Mark. Res. 2014, 51, 463–479.
17. Kang, Y.; Zhou, L. RubE: Rule-based methods for extracting product features from online consumer reviews. Inf. Manag. 2017, 54, 166–176.
18. Rana, T.A.; Cheah, Y.-N. A two-fold rule-based model for aspect extraction. Expert Syst. Appl. 2017, 89, 273–285.
19. Liu, Y.; Bi, J.; Fan, Z. A method for ranking products through online reviews based on sentiment classification and interval-valued intuitionistic fuzzy TOPSIS. Int. J. Inf. Technol. Decis. Mak. 2017, 16, 1497–1522.
20. Mankad, S.; Han, H.S.; Goh, J.; Gavirneni, S. Understanding online hotel reviews through automated text analysis. Serv. Sci. 2016, 8, 124–138.
21. Wei, Z.; Zhang, M.; Ming, Y. Understanding the effect of tourists’ attribute-level experiences on satisfaction - A cross-cultural study leveraging deep learning. Curr. Issues Tour. 2023, 26, 105–121.
22. Zhang, C.; Xu, Z.; Gou, X.; Chen, S. An online reviews-driven method for the prioritization of improvements in hotel services. Tourism Manag. 2021, 87, 104382.
23. Hu, N.; Pavlou, P.; Zhang, J. Why do product reviews have a J-shaped distribution? Commun. ACM 2009, 52, 144–147.
24. Hu, N.; Pavlou, P.; Zhang, J. On self-selection biases in online product reviews. MIS Q. 2017, 41, 449–471.
25. Kano, N.; Seraku, N.; Takahashi, F.; Tsuji, S. Attractive quality and must-be quality. J. Jpn. Soc. Qual. Control 1984, 14, 39–48.
26. Ji, P.; Jin, J.; Wang, T.; Chen, Y. Quantification and Integration of Kano’s model into QFD for optimising product design. Int. J. Prod. Res. 2014, 52, 6335–6348.
27. Xu, X. Examining the role of emotion in online consumer reviews of various attributes in the surprise box shopping model. Decis. Support Syst. 2020, 136, 113344.
28. Decker, R.; Trusov, M. Estimating aggregate consumer preferences from online product reviews. Int. J. Res. Mark. 2010, 27, 293–307.
29. Farhadloo, M.; Patterson, R.; Rolland, E. Modeling customer satisfaction from unstructured data using a Bayesian approach. Decis. Support Syst. 2016, 90, 1–11.
30. Huang, Z.; Dong, W.; Ji, L.; Gan, C.; Lu, X.; Duan, H. Discovery of clinical pathway patterns from event logs using probabilistic topic models. J. Biomed. Inf. 2014, 47, 39–57.
31. Xu, L.; Lin, H.; Pan, Y.; Ren, H.; Chen, J. Constructing the affective lexicon ontology. J. China Soc. Sci. Tech. Inf. 2008, 27, 180–185.
32. Zhang, Y.; Lin, Z. Predicting the helpfulness of online product reviews: A multi-lingual approach. Electron. Commer. Res. Appl. 2018, 27, 1–10.
33. Phillips, P.; Zigan, K.; Silva, M.; Schegg, R. The interactive effects of online reviews on the determinants of Swiss hotel performance: A neural network analysis. Tourism Manag. 2015, 50, 130–141.
34. Hao, W.; Lu, Z.; Wei, P.; Feng, J.; Wang, B. A new method on ANN for variance based importance measure analysis of correlated input variables. Struct. Saf. 2012, 38, 56–63.
35. Wang, G.; Tan, G.W.H.; Yuan, Y.; Ooi, K.B.; Dwivedi, Y.K. Revisiting TAM2 in behavioral targeting advertising: A deep learning-based dual-stage SEM-ANN analysis. Technol. Forecast. Soc. Chang. 2022, 175, 121345.
36. Deng, W.; Chen, W.; Pei, W. Back-propagation neural network based importance–performance analysis for determining critical service attributes. Expert Syst. Appl. 2008, 34, 1115–1125.
37. Green, P.E.; Srinivasan, V. Conjoint analysis in consumer research: Issues and outlook. J. Consum. Res. 1978, 5, 103–123.
38. Qi, J.; Zhang, Z.; Jeon, S.; Zhou, Y. Mining customer requirements from online reviews: A product improvement perspective. Inf. Manag. 2016, 53, 951–963.
39. Wang, Y.; Wang, W.; Liu, Z. An empirical study on the influence of online negative reviews on potential consumers’ purchasing intention. Inf. Sci. 2018, 36, 10.
40. Zhang, Z.; Guo, C.; Goes, P. Product comparison networks for competitive analysis of online word-of-mouth. ACM Trans. Manag. Inf. Syst. 2013, 3, 1–22.
41. Korfiatis, N.; García-Bariocanal, E.; Sánchez-Alonso, S. Evaluating content quality and helpfulness of online product reviews: The interplay of review helpfulness vs. review content. Electron. Commer. Res. Appl. 2012, 11, 205–217.
42. Yin, D.; Bond, S.; Zhang, H. Anxious or angry? Effects of discrete sentiments on the perceived helpfulness of online reviews. MIS Q. 2014, 38, 539–560.
43. Balaji, M.S.; Wei, K.; Chong, A. Determinants of negative word-of-mouth communication using social networking sites. Inf. Manag. 2016, 53, 528–540.
44. Ahmed, H.; Traore, I.; Saad, S. Detection of online fake news using n-gram analysis and machine learning techniques. In Intelligent, Secure, and Dependable Systems in Distributed and Cloud Environments; ISDDC; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2017; p. 10618.

Figure 1. Traditional Kano model [25].

Figure 2. Framework for modeling tourist satisfaction based on online reviews.

Figure 3. Schematic diagram of BPNN structure.

Figure 4. Effect-based Kano Model (EKM).

Figure 5. An online review posted by a tourist.

Figure 6. Statistical results of TSD sentiment orientations in the review data.

Figure 7. RMSE for 10-fold cross-validation.

Figure 8. Classification results of Guizhou TSDs.

Table 1. Three examples of online travel text reviews.

No.	Review Content
1	Huangguoshu Waterfall lives up to its name and is spectacular.
2	This is a very huge water cave, beautiful scenery, can see very high, the cave is very cool, different sky, lights a dozen colorful, especially good looking, the ticket is not expensive. Welcome everyone here.
3	Although the service is not great, I have to say that the scenery is good, especially the waterfalls and the river valley are very commendable, but it is recommended to visit during the off-season.

Table 2. Examples of short sentence extraction about TSDs.

No.	TSDs
No.	Huangguoshu Waterfall	Ticket	Scenery	Overall	Service
1	Huangguoshu Waterfall lives up to its name	-	-	-	-
2	-	the ticket is not expensive	beautiful scenery	-	-
3	especially the waterfalls and the river valley are very commendable	-	-	the scenery is good	the service is not great

Table 3. Examples of Dalian University of Technology Chinese Emotion Vocabulary Ontology Database.

Words	Part of Speech	Classification of Emotion	Polarity
dirty	adj	NN	2
rescue	verb	PH	1
forces	adj	NN	2

Table 4. Nominal coded data for each TSD in online travel reviews.

Online Reviews	TSDs
Online Reviews	$t_{1}$	$t_{2}$	…	$t_{I}$
$r_{1}$	Positive	Null	…	Null
$r_{2}$	Null	Positive	…	Negative
…	…	…	…	…
$r_{M}$	Null	Null	…	Positive

Table 5. Structured online travel reviews.

Online Reviews	TSDs
	$t_{1}$		$t_{2}$		…	$t_{I}$
	$E_{1}^{Pos}$	$E_{1}^{Neg}$	$E_{2}^{Pos}$	$E_{2}^{Neg}$	…	$E_{I}^{Pos}$	$E_{I}^{Neg}$
$r_{1}$	1	0	0	0	…	0	0
$r_{2}$	0	0	1	0	…	0	1
…	…	…	…	…	…	…	…
$r_{M}$	0	0	0	0	…	1	0

Table 6. The extracted tourist satisfaction dimension based on LDA.

No.	TSD	$C_{i}$	Total Frequency	No.	TSD	$C_{i}$	Total Frequency
$t_{1}$	Natural scenery	11	2505	$t_{10}$	Ticket Service	12	4850
$t_{2}$	Night view of Maotai Town	14	1897	$t_{11}$	Zunyi Red culture	21	3691
$t_{3}$	Landmark buildings of Guiyang City	10	2471	$t_{12}$	Ancient town tourism	8	1140
$t_{4}$	The Hmong village	9	1566	$t_{13}$	Huaxi Wetland Park	7	1356
$t_{5}$	Sightseeing traffic of tourist attractions	13	7131	$t_{14}$	Minority Cultures	15	3203
$t_{6}$	Pingba Cherry Blossom Garden	12	3907	$t_{15}$	Huangguoshu Waterfall	10	7546
$t_{7}$	Karst cave scenery	15	2785	$t_{16}$	Overall feeling	12	6427
$t_{8}$	Qianling Mountain	14	2446	$t_{17}$	Fanjing Mountain	11	3333
$t_{9}$	Adverse environmental factors	14	1477	$t_{18}$	Tour guide and hotel	4	988

Table 7. The final $\bar{W}_{i}^{Pos}$ and $\bar{W}_{i}^{Neg}$ for each TSD.

TSD	$\bar{W}_{i}^{Pos}$	$\bar{W}_{i}^{Neg}$	$TE_{i}$	TSD	$\bar{W}_{i}^{Pos}$	$\bar{W}_{i}^{Neg}$	$TE_{i}$
$t_{1}$	−0.0034	−0.0069	0.0410	$t_{10}$	−0.0089	−0.0069	0.0231
$t_{2}$	−0.0077	−0.0096	0.0222	$t_{11}$	−0.0036	−0.0038	0.0018
$t_{3}$	−0.0018	−0.0076	0.0686	$t_{12}$	−0.0196	−0.0117	0.0096
$t_{4}$	−0.0087	−0.0030	0.0669	$t_{13}$	−0.0027	−0.0020	0.0083
$t_{5}$	−0.0051	−0.0042	0.0111	$t_{14}$	−0.0196	−0.0105	0.1074
$t_{6}$	−0.0168	−0.0070	0.1150	$t_{15}$	−0.0034	0.0016	0.0584
$t_{7}$	0.0013	−0.0010	0.0268	$t_{16}$	0.0053	−0.0062	0.1355
$t_{8}$	0.0008	−0.0064	0.0843	$t_{17}$	−0.0059	−0.0022	0.0438
$t_{9}$	−0.0087	0.0023	0.1293	$t_{18}$	−0.0040	−0.0079	0.0468

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
