Next Article in Journal
Multi-Objective Optimization of the Organization’s Performance for Sustainable Development
Previous Article in Journal
A Study on the Impact of Boundary-Spanning Search on the Sustainable Development Performance of Technology Start-Ups
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Methodology for Machine-Learning Content Analysis to Define the Key Labels in the Titles of Online Customer Reviews with the Rating Evaluation

by
Ayat Zaki Ahmed
and
Manuel Rodríguez Díaz
*
Department of Economics and Business, Faculty of Economy, Business and Tourism, University of Las Palmas de Gran Canaria, 35017 Las Palmas de Gran Canaria, Spain
*
Author to whom correspondence should be addressed.
Sustainability 2022, 14(15), 9183; https://doi.org/10.3390/su14159183
Submission received: 25 June 2022 / Revised: 19 July 2022 / Accepted: 22 July 2022 / Published: 27 July 2022
(This article belongs to the Section Economic and Business Aspects of Sustainability)

Abstract

:
Online reputation is of great strategic importance to companies today. Customers share their emotions and experiences about the service received or the product acquired through online opinions in the form of quantitative variables or text comments. Although quantitative variables can be analyzed using different statistical methods, the main limitation of comment content analysis lies in the statistical analysis because the texts are qualitative. This study proposes and applies a methodology to develop a machine learning designed to identify the key labels related to the quantitative variables in the general rating of the service received from an airline. To this end, we create a quantitative dichotomous variable from zero to one from a database of comment title labels, thus facilitating the conversion of titles into quantitative variables. On this basis, we carry out a multiple regression analysis where the dependent variable is the overall rating and the independent variables are the labels. The results obtained are satisfactory, and the significant labels are determined, as well as their signs and coefficients with the general ratings. Findings show that the significant labels detected in titles positively influence the prediction of the overall rating of airline. This paper is a new approach to applying cluster analysis to the text content of customers’ online reviews in an airline. Thus, the proposed methodology results in a quantitative value for the labels that determines the direction and intensity of customers’ opinions. Moreover, it has important practical implications for managers to identify the weakness and the strengths of their services in order to increase their positioning in the market by developing meaningful strategies.

1. Introduction

The online reputation of a company, brand, product, or service is composed of a set of opinions, experiences, and evaluations that customers share through different social media [1]. Online reputation is extremely important in a company’s strategy because customers increasingly use information obtained on the Internet to make decisions about buying services such as those provided by hotels or airlines [2,3,4,5,6,7,8,9]. Moreover, the information available on the Internet about the quality of services and the value perceived by customers, both of a given company and its competitors, is quite relevant for measuring the degree of compliance with the objectives established [10,11,12,13,14]. Specific authors, such as Chun [15] and Hernández Estárico et al. [16], recognize that the online reputation is an effective means of measuring the level of service quality perceived by customers and their degree of satisfaction, and it also has a direct influence on the level of companies’ income and profits [17].
The online reputation on specialized websites such as TripAdvisor is determined through two means. On the one hand, websites obtain information from customers using a scale of quantitative variables that measure the perceived quality of the service and customers’ perceived value [1,18,19]. According to Torres [20], service quality is a process oriented towards an evaluable result, and so service quality is assessed through items that measure its main attributes. In contrast, this author also points out that customer satisfaction is an overall evaluation of a user’s experience or emotions with the service received. From this perspective, customer comments are qualitative variables used to evaluate customer satisfaction [21]. Hence, companies give great importance to classifying the content of online reviews in order to take measures to strengthen the relationship with their customers [22].
The airline industry needs to continuously analyze the online customer reviews on its service. previous studies demonstrate that online reviews shared by consumers are more persuasive than information shared by marketers, as customers have no interest and are therefore more credible and independent [23,24,25,26].
According to Araque et al. [27], the increase in user-generated content (UGC) on websites and social media such as TripAdvisor, Facebook, or Twitter has enhanced social media’s influence on decisions to purchase services, products, or brands. The analysis of content has become an essential tool for companies to use to find out their customers’ assessments, developing different techniques such as Sentiment Analysis [22,28,29], whose purpose is to classify the opinions and feelings expressed by users in text comments. Sentiment analysis involves determining the meaning and intensity of what is expressed in a text, which may be a positive, negative, or neutral feeling (or polarity) about a particular product or service [30]. Some authors have based their sentiment analyses on machine learning approaches, where an assessment is made of the words that influence the polarity and intensity of users’ comments [31,32,33]. In the airline industry, online feedback from customers is also becoming crucial because it determines their satisfaction with the basic dimensions of the service offered [34].
The majority of studies have focused on the influence of online customer reviews on customer’s decisions. A few investigators have attempted to examine some different methods to use intelligence extracted from UGC to gather information for an organization [35,36]. This study fills this gap and aims to propose a methodology for performing machine learning, based on determining the key labels that drive customers’ online ratings in an airline. The process consists of different stages that start with the creation of a database of labels, in order to later quantitatively evaluate their relationship with the general rating variable on TripAdvisor. Through descriptive and multiple regression analyses, key labels related to the overall rating, sign, and intensity are established. Cluster analysis is also carried out to test whether groups of clients can be identified based on the vocabulary they use in their comments. A review of the specialized literature is first carried out to achieve this objective, and then the proposed methodology is described. The following section shows the results obtained, ending with a description of the main conclusions reached.

2. Literature Review

Airlines use content analysis of online (positive or negative) reviews to evaluate customers’ satisfaction. Travelers can write their comments and post online ratings to describe, narrate, recommend, or criticize their travel experiences [37,38,39]. Reviewers use and evaluate products or services and disseminate their evaluations, making them an early link in an innovative diffusion process [40]. Reviews are especially important when a product or service has substantial experience attributes, making it difficult to carry out pre-consumption quality assessment. Clients seek purchase recommendations from external sources [40,41]. Marketers also want to understand the significant and predictive powers of reviews. Knowing whether reviews are positive or negative allows marketers to forecast their products’ sales. The airline industry is one of the essential factors in Tourism competitiveness because companies have to deal with various challenges, such as fuel prices, increasing security precautions, low-cost carriers, economic crisis, and restrictive government regulations [42,43]. It is crucial for managers of airlines to not only correctly perceive what their customers want and expect, but also to appropriately manage their resources in meeting their customers’ expectations [44,45,46,47,48,49]. If reviews are aggressive, corrective measures, such as revising promotional strategies or redesigning the brand, may be instituted [50].
In the airline industry, various studies have been dedicated to investigating the relationship between service quality and related issues such as customer satisfaction and brand [34,51,52,53,54,55,56]. Farooq et al. [55] had studied the quality of service in Malaysia airlines using second generation PLS-SEM tools to get robust results, their findings showed that airlines should focus on all aspects of service quality, with particular focus on image and personnel service in order to enhance their customer satisfaction. Research from Noviantoro and Huang [57] using data mining method to examine U.S airline passenger satisfaction mention that (1) online boarding, (2) inflight Wi-Fi service, (3) baggage handling, and (4) inflight entertainment are the principal four services to be improved by the airline to gain passenger satisfaction. For any airline, these are the main factors in overcoming the economic crisis by understanding and facilitating flexible new ways of dealing with their customers, such as rapid response on social media, airline refunds, and flight change and overbooking policies [58,59,60,61]. From this perspective, many researchers have proposed various scales to examine the service quality in the aviation industry with different aspects such as courtesy of staff, seat comfort, empathy and reliability, ground staff, flight experience, flight schedule, inflight comfort, flight timeliness and airline image [62,63,64,65,66,67,68].
Online reviews and overall ratings form part of the most common UGC models [11]. A review is a sample of textual opinion that describes the traveler’s experience with the airline in a qualitative manner [69]. Unlike a review, the rating is based on a five-point scale, measured in a quantitative way that describes whether and to what extent a customer is satisfied with the airline [70]. Investigated together, review and rating can design a complete traveler response image and synergistically maximize analysis validity [71,72]. However, showing the relationship between online reviews and general ratings is a complicated task because the two sources of data are typologically different. Whereas rating data can be measured using the simple model consisting of structured and quantitative numbers that can efficiently be measured using statistical tools [73], reviews consist of natural language and sentiment summarization. This means that reviews are disorganized and qualitative, which makes it difficult to understand and analyze them using traditional tools [74,75,76,77].
The relationship between customer ratings and reviews, facilitated by the quantification of textual review, has also been analyzed in customer satisfaction studies (Table 1). For instance, Büscken and Allenby [72] correlated the frequency of customer review topics with ratings using multiple linear regressions. In line with this study, if the coefficient of the frequency of a topic is positive, the customers are relatively satisfied with it, and so more frequently mentioning this topic will lead to a positive increase in the rating. Hao et al. [78] successfully distinguished satisfactory titles (topics) from unsatisfactory ones by comparing the appearance of topics associated with positive and negative ratings. In sum, managers can discover not only whether consumers are happy or unhappy, but also why, which has provided an essential methodological foundation for this investigation.
Class differentiation is the main problem of Sentiment Analysis that is being addressed by learning machine algorithms. At first, it is much easier to differentiate between a negative and positive opinion than between a negative and very negative one. Moreover, this latter difference may be more precise for one person than for another because these evaluations are subjective. The solution to this type of problem is to define the critical characteristics of the text that contribute to identifying the difference between them in each of the defined classes. To do this, Insúa Yáñez [79] used two main approaches. The first is the Bag of Words (BOW) model, which is the simplest and is based on the creation of a vector of characteristics where each variable corresponds to a word in the domain. Its value is a series of occurrences in the text or a statistical value that reflects the word’s relationship with its context. Some of the machine learning methods applied are Naïve Bayes [80], K-nearest neighbor [81], Maximum Entropy [82], Adaboost [83], Decision Tree [84], Winnow [85], and Support Vector Machine [86].
In the absence of a label database, a sentiment classification model can be constructed based on a lexicon that defines the words’ polarity [32]. Depending on the terms’ frequency, a document will be classified as positive, negative, or even neutral. An additional method proposed by Liu et al. [87] is the lexicon-based sentimental classification model with unlabeled data that defines words’ polarity in a semi-supervised learning model. Another alternative that has been developed is the ensemble technique, where the results of various classification models are combined to arrive at an integrated result [88]. The other approach is Word Embeddings (WE), where a composed vector is created through training in a neural network that receives a broad set of texts as input and tries to learn the similarity relations between the words they contain. Each word’s result is a vector of real digits of a given length, which facilitates the search for similarity relations, or even addition and subtraction operations between the vectors that represent the words, yielding another word as a result [89,90].The methodology applied in this study does not start from any initial lexicon that indicates the meaning of the words. Moreover, a database is created from the words used by airlines users in the titles of online comments on TripAdvisor. Next, the database is debugged, and a vector-based numerical database is created with dichotomous variables (zero and one). This database is used to carry out a linear regression where the dependent variable is the overall rating and the independent variables are the labels. This provides a sense of each significant label, measured by a positive or negative sign, and the intensity evaluated by the coefficient is obtained.
Table 1. Recent research on overall ratings from online reviews in the aviation industry and tourism.
Table 1. Recent research on overall ratings from online reviews in the aviation industry and tourism.
StudyVariablesResearch ContextKey Findings
Chang et al.
(2022) [91]
Online reviews & Overall ratings of airlinesTripAdvisor
191,123 reviews
-
Analyze airline reviews and understand passenger satisfaction before and during COVID-19
-
This study applies BERT to learn linguistic features and aspect level ratings.
-
Findings reveal that both review and ratings drop significantly during the pandemic.
Dhar & Bose
(2022) [92]
Online reviews &
Star rating
Mobile app
146,914
-
Sentiment and emotion analysis of the review text and reaction analysis of the emojis.
-
The consolidated sentiment and happiness emotion in reviews strongly impact their star ratings.
-
The proposed perception score closely reflects the emotions, sentiments and reactions embedded in the reviews.
Stamolampros et al.
(2020) [93]
Online reviews of Airlines& average airline ratingsTriAdvisor
380,000 reviews
-
Passengers express higher overall rating for domestic carriers than international carriers.
-
Using Hofstede’s framework shows that cultural dimensions affect the intensity of domestic bias.
Park et al.
(2020) [56]
Online reviews of Airlines & overall satisfaction ratingsTripAdvisor
157,035 reviews
20 U.S airlines
-
The quality of airline service, such as cleanliness, food and beverages, and inflight entertainment, affects the variation of positive ratings as a satisfier.
-
Other airline service attributes, such as customer service and check-in and boarding influence negative ratings as a dissatisfier.
Lucini et al.
(2020) [34]
Online reviews of Airlines & overall satisfaction ratingsAir travel review ATR
-
OCRs can be used to measure customer satisfaction
-
Airline recommendation by customers was predicted with an accuracy of 79.95%
Song et al.
(2020) [94]
Online sentiment reviews & the ratings of airlinesSKYTRAX
24,165 online reviews
-
User sentiment analysis show that there is a significant and negative correlation between the users’ emotions and their flight delay experiences.
-
Sentiment analysis based on a sentiment dictionary is used to classify user reviews.
-
Co-occurrence analysis is used to identify passengers’ concerns on different aspects of service in the aviation industry.
Sharma et al.
(2020) [95]
User sentiment review &overall review of flightTripAdvisor
157,036 reviews
20 US airlines
-
It analyzes the relation between ratings and review sentiment by using prospect theory.
-
Variations in ratings closer to the reference point result in higher marginal impact on sentiment than equivalent variations.
Tsai et al.
(2020) [96]
Online hotel review &
The overall ratings
TripAdvisor
1009 US hotels
23,430 reviews
-
Proposed a novel approach to generate high quality summaries of online hotel reviews.
-
Both review helpfulness and hotel features were considered before review summarization.
-
Online hotel reviews were collected in experimental evaluation.
Korfiatis et al.
(2019) [97]
Online reviews of Airlines & overall satisfaction ratingsTripAdvisor
557,208 reviews
-
Using (STM) structural topic model, the review text is coupled with numerical ratings.
-
Online reviews give a solution through quality features which better predict variation in passenger preferences and competition.
Sezgen et al.
(2019) [53]
Online reviews of airlinesTripAdvisor
5120 reviews
50 airlines
-
Using (LSA) Latent Semantic Analysis, findings show that passenger satisfaction and dissatisfaction dimensions differ depending on airline service class
-
Friendly and helpful staff, value and low fares are the main important factors for economy and Premium class passenger.
Punel et al.
(2019) [98]
Online reviews
Airlines ratings
Skytrax
40,510 reviews
-
The geographical regions shaped by the country of residence of passengers impact travel experiences, perception, and evaluation of airline services.
-
North American passengers complain more about their national airline.
-
Americans care more about money and less about in-flight services.
Siering et al.
(2018) [99]
Online reviews &
Overall airline rating
Airlinequality.com
1000 airlines reviews
195 airlines
-
Explanation of service recommendation based on core and augmented service aspects.
-
Prediction of the recommendation decision by means of machine learning techniques.
-
Analyze the role of airline service models (low cost vs. full service carriers.

3. Research Methodology

These three steps will be followed to obtain and process the necessary information to achieve the proposed research objectives: (1) Obtain the online customer reviews from the specialized website; (2) prepare the database from the information obtained; and (3) perform the statistical analysis of the database. Regarding the first step, the study was carried out based on the online opinions of 5278 customers about Iberia airline in the period of 2016–2018. All the comments were made in the Spanish language, but the methodology developed in this article is indifferent to the type of language used, as in the case of English. Customers are mainly from Spain and Latin American countries. The data were obtained from the TripAdvisor website, where information is provided on the general rating given by the customer, the title, and the comment. The general rating is a quantitative variable with five options ranging from poor to excellent (1—poor; 2—bad; 3—normal; 4—very good; 5—excellent). The title is made up of a limited number of words that describe the general meaning of the user’s opinion, whereas the comment presents all the aspects the customer wants to highlight, whether positive or negative. Both the title and the comment are qualitative variables that must be treated in a content analysis. To this end, a methodology and software have been designed to treat the information and prepare it for statistical analysis in quantitative terms.
The second step in the methodology consists of elaborating the database in order to treat it statistically. This requires, first, creating the labels from the words used in the titles of the reviews. Second, all the labels that are not considered relevant to the general rating must be deactivated, as in the case of the articles the, a, or an, or verbs such as to be or to have, for example. Third, it is necessary to define whether the labels will be integrated depending on the words detected in the titles or the words’ root. The latter makes it possible to reduce the number of tags because, for example, singular and plural words will be assigned to the same tag. Likewise, regular verbs or superlatives should also be assigned to the same common root. In this context, lemmatization is the way words are reduced to their elemental form, excluding final inflections [100]. Generally, because the software was developed in English, a specific program adapted to the Spanish language is used in this study. Finally, a database is created where each row is a review, and the columns represent the total number of labels, so that if a label is in the title of the comment, it will be assigned a one, and if not a 0. This is a method of converting a text into a vector of dichotomous variables, 0 or 1, that can be treated with quantitative statistical procedures. In this context, vector space models (VSMs) represent words as a vector in multidimensional space. An incredibly efficient model from a computational point of view is Word2vec, where the texts are transformed into vectors. Therefore, if a text includes a specific word, its variable is given a value of 1, and otherwise a value of 0 [101].
Finally, a database is created where each row is a comment and the columns show the total number of labels, so that if a label is in the title of the comment, it will be assigned a 1, and if not a 0. This is a method of converting a text into a vector of dichotomous variables that can be treated with quantitative statistical procedures. In this study, 2567 labels were detected once the irrelevant ones had been deactivated. To simplify them, we developed a program where the labels were assigned to the same root. To do so, first, the plurals or endings of regular verbs were eliminated. Second, a minimum number of characters was determined to form a common root because, if a minimum number is chosen, the program can detect many coincidences. For example, if we are not searching for similar words, the “bad” label can be detected in multiple words that start with bad, such as badminton. Therefore, the program was set to search for tags based on the root when it has six characters or more. If the tag has five or fewer characters, the text is searched for similar words. Using this method, we reduced the labels to 1523.
Once the databases have been created, three types of statistical analyses are carried out. The first analysis is to determine the number of times each label appears in the titles of the comments. Next, the average of the ratings obtained in the titles of reviews where each label appears is obtained, and so it is possible to evaluate to what extent it has a positive influence, that is, it obtains an average of more than 3, or a negative influence when it achieves an average of less than 3. The second statistical analysis is linear regression, where the dependent variable is the overall rating and the independent variables are the labels. This multiple regression analysis is carried out for the database of all the labels and the roots of the labels. The results allow us to determine if there is a significant relationship between each of the labels and the comment’s overall rating. Finally, cluster analysis with the significant variables is carried out to test whether users can be classified according to the types of words in their online comments. Confirming clusters among significant users would be a first step in determining whether the type of vocabulary used by different users is similar.
Based on the above, the machine learning model proposed in this article to determine the key labels related to overall online customer ratings follows these steps (see Figure 1):
(1)
Obtaining customers’ online reviews containing their general rating, title, and comment shared on a social network.
(2)
The tag database with the words contained in the titles of the comments could also be created with the comments themselves, but the number of tags would increase enormously, which would make the statistical study difficult. Furthermore, it can be assumed that the most relevant words to define the customer experience are found in the titles.
(3)
Debugging the labels to deactivate those that do not directly impact the sense of the customers’ online ratings. This task must be carried out by a specialized team or person who will transfer their knowledge to the machine learning system as the number of labels increases. From here, the final database of the labels to be used in the statistical analysis will be obtained.
(4)
Create a numerical database where each row corresponds to an online comment from the customers and the columns correspond to the variables that, first, consist of the general rating given by the customers (variables between 1 and 5). In contrast, the rest of the variables are the active labels, which will be given a value of 1 if they are found in the title of the comment, and 0 otherwise.
(5)
If necessary, create a new numerical database of labels based on the root of the words to reduce the number of variables without losing the meaning of the words.
(6)
Conduct a linear or other type of regression if considered necessary, where the dependent variable is the general rating and the independent variables are the activated labels. A descriptive analysis is also carried out of the averages of the general rating obtained in titles with the same label, which helps to relate the meaning and intensity of the tags to the general rating.
(7)
Check whether the regression model is consistent through the adjusted R square obtained. Likewise, the labels with a significant relationship, positive or negative, with the general rating will be determined. Furthermore, the minimum number of times a label appears can be established for use in subsequent analyses. By establishing the significant labels and the minimum number of times they have to appear in the titles or comments, a final database of significant labels can be created.
(8)
Once the meaningful labels have been determined, cluster analysis can be conducted to check whether a classification is adequate. The variables would be the labels, initial, root, or significant, as decided in the research, resulting in a classification of customers according to the types of words they use in their titles or online comments.
(9)
Continue to obtain information from new online customer reviews.
(10)
Verify whether the words in the new titles are already in the original label database. If so, the next step is to be taken. If not, a new label will be created and evaluated to determine whether it should be deactivated, depending on its impact on the customers’ general assessment. Once these decisions have been made, the next step is carried out.
(11)
Assess whether the words in the title of the new review correspond to any significant label. If so, an overall rating can be predicted. In this case, the difference between the predicted rating and the actual rating is evaluated to establish how accurate the prediction is. Another statistical analysis that can be performed in this step is to assign the new customer who makes the online comment to one of the predefined clusters to classify the user and start to define a profile based on the words s/he uses.
(12)
When a relevant number of predictions have been made, a joint analysis is carried out to validate the regression model applied and the assignment of users to predefined clusters.
(13)
When a significant number of new online reviews have been added, we move to Step 4 to perform a new regression analysis to confirm whether the labels are still significant, whether they maintain the coefficients, and whether there are new labels that are significant. A new cluster analysis can also be carried out to check whether the defined clusters are still maintained or, conversely, have to be reidentified because substantial changes have occurred. A comparative analysis is performed to determine whether there are changes in the vocabulary used by customers to assess the service received.
In this study, the proposed machine learning model’s primary application is carried out to detect the key labels that significantly influence the overall rating of online customer reviews. This is the essential part of the model because introducing new comments would involve detecting whether they incorporate the significant labels and, therefore, whether the rating assessment can be predicted. All activities related to the feedback of new data are not covered in this article, although from a practical perspective, they are relatively simple to solve using software.
First, detecting whether a new word in a title or comment is listed in the first tag database only requires a search. If it is found, nothing else has to be done. However, if the word is not in the database, it has to be included, and a specialist will have to determine whether or not it is significant in predicting the overall rating. In this part of the process, currently available knowledge from a specialized person is transferred to machine learning. As the vocabulary increases, the system will automatically detect whether a word is to be activated or deactivated. Once the new word enters the first database as off, or in the debugger as an active label, the system prepares to perform a new statistical analysis when it reaches a certain number of new online reviews.
Another function to be performed in new research is predicting a new online review. Knowing the labels significantly related to the general rating and their coefficient could establish a prediction of the possible rating given by the user making the comment. The next step is to carry out a simple comparison with the user’s real rating, data that will be integrated into the existing database as two new variables, that is, the predicted rating would be one variable and the difference from the real rating the other. When a certain number of new comments is reached, a joint analysis of the variable where the differences are collected will be conducted to determine whether an acceptable level of success has been achieved. Finally, once a certain number of new comments have increased the database, the statistical analysis will be performed again to assess whether the labels are still meaningful or any new ones should be added. The new coefficients will also be calculated and compared with the existing ones, which will determine whether there are changes in the intensity with which users use their words to rate the service. A new database must be created to perform this function, or variables must be added to the database of already existing labels, specifying the level of significance, the coefficient obtained, and the date on which the statistical analysis is conducted. With this information, it is relatively simple to elaborate on comparative reports.

4. Analysis of Results

The analysis to validate the proposed model has two parts. In the first part, a descriptive analysis of the labels is carried out to determine the number of times they appear and the average value of the rating obtained in these comments. Second, a multiple regression analysis is conducted with the labels created from a 6-character word root, i.e., a label is considered a root when it has six or more characters; in other cases, the label is treated as a word. Finally, a cluster analysis is performed to check whether user groups can be identified based on the type of vocabulary used to write their feedback online.

4.1. Descriptive Analysis of the Labels

The first analysis is descriptive, and the labels have been chosen that appear in at least three different comment titles. The purpose of this study is, first, to determine the number of times a tag appears in a title. This allows us to establish how often it is used and whether it is a word that is employed a lot or a little to describe customers’ perceptions. Second, the average of the reviews’ general ratings for each label detected in their titles is a fascinating index because it gives us a clue as to where user ratings are oriented. Considering that the mean value of the overall rating is 3, all averages that are significantly higher will indicate that the label is used in positive feedback titles. Conversely, if the average is below 3, the label is usually found in negative comments.
This analysis allows us to detect the key labels that direct a customer’s assessment toward a positive, negative, or neutral pole. Hence, the next step is to determine whether the relationship between a label’s appearance in a title and the general rating is significant. This will be done in the following section, where a multiple regression analysis will be performed. Regarding the results obtained in Appendix A, it can be seen that there are more significant labels above the average of 3 than below it. In order to simplify Annex A, we have chosen to represent only those labels that have been used in at least three different comments.
The descriptive analysis conducted is very enlightening because it obtains the average rating from user ratings where each label appears. As Appendix A shows, the pleasure, notable, and printing labels are found in comments where the average rating is 4.75, confirming their direct relationship with a high rating of the service received. It can be seen that the rest of the adjectives and words that are linked to a positive aspect of the brand or the service received obtain an average rating above 4. On the other hand, the labels linked to ratings below 1 are chaos, avoidance, unpresentable, motive, terror, and thrown away. Another aspect highlighted is that a significant number of labels are positioned in intermediate rating averages, which either means that they are not the key labels to define the rating’s orientation and intensity or that they are linked to a neutral rating.
Another exciting fact shown in Appendix A is the number of times a label appears in the comments’ titles. These data are complementary to the previous ones because if a label is used a lot and related to a specific rating, it would confirm the rating obtained. For example, the label excellent appears in 327 comments with an average rating of 4.64, and the word punctual is used in 468 comment titles with an average rating of 4.15, which denotes a relationship with positive aspects of the airline. In contrast, labels such as pessimistic appear 45 times, with an average rating of 1.47, and baggage appears 51 times, with an average rating of 2.24, which means that when a customer remembers baggage, it is because something has not gone as expected.

4.2. Regression Analysis

In this step, a multiple regression analysis is performed where the dependent variable is the rating, and the independent variables are the labels. The purpose is to establish the tags that are significantly related to the rating. The regression analysis has a double advantage over other types of prediction methods, such as neural networks. On the one hand, it determines which labels are significant and which are not, so that key labels can be identified. On the other hand, it gives a coefficient as a result, indicating the direction of the relationship, positive or negative, and the intensity of the relationship based on the size of the value.
Appendix B shows the significant labels at 5% and 10%, with a total of 369. Likewise, the adjusted R square was 0.546, which is a high value in the social sciences. Therefore, it can be stated that the relationships obtained between the labels and the ratings are consistent and demonstrate a logic between the words used to describe the service or image of the airline and the quantitative assessment reflected in the customer’s rating. The adjusted R square of the original labels is 0.614. As can be seen, this is a higher result than for the root of the labels, but the latter is also satisfactory and allows for a simplification of the content analysis.
In Appendix B, the labels have been organized according to the coefficient achieved, from highest to lowest. Among the labels with positive coefficients, there are labels with the root of places, maintain, congratulations, relax, personalized. It should be emphasized that there are labels, such as wrong, whose meaning can change when there is a word of negation next to it; for example, you are not wrong. In addition, some labels have a positive sign, such as indecent, which appeared only once or twice, because it is not found in the descriptive analysis. In contrast, labels with a negative coefficient include disappointment, rudeness, or defect.

4.3. Cluster Analysis

The K-means cluster analysis was performed using the significant tags. This study can be considered exploratory because this type of study has not usually been carried out from this perspective. For this reason, we decided to define five clusters because the rating variable has five response alternatives. It is necessary to highlight that the formation of clusters from the type of words used by users to share their opinions may be a more complex analysis. However, it is a first step that offers us alternatives and suggestions for developing future, in-depth research on this exciting topic. Therefore, it should be considered a first approach to applying cluster analysis to the text content of customers’ online opinions.
The number of cases assigned to these five clusters was as follows: Cluster 1, 598 cases; Cluster 2, 600 cases; Cluster 3, 129 cases; Cluster 4, 617 cases; and Cluster 5, 3334 cases. As can be seen, Cluster 5 has a very high number of cases, which means further research into the characteristics of this cluster is needed in order to identify possible new criteria for classifying clients. However, the purpose of carrying out this statistical study is to demonstrate whether clusters can be identified based on the words used by customers in the titles of the comments. To the extent that good results are obtained, the objective pursued is validated. In this case, future research should seek the best statistical methods and techniques to classify customers according to their words. Table 2 shows the results obtained in the Chi-square test between the members of each of the clusters, and the rating assigned by the users was 565.475, with a significance level of 0.000.
Table 2 shows that the users who tend to give a good rating to the airline were assigned to Cluster 1, given that the rating of 4 includes 40.0% of the customers and the rating of 5 includes 42.3%. Therefore, the customers classified in this cluster have a high rating of the airline’s service. Cluster 2 is similar to Cluster 1, but with the difference that half of the users give a rating of 4 (51.3%), followed by a rating 5 (27.2%). Further analysis will examine whether there are any relevant differences between customers’ words in the two clusters that are generally satisfied with the airline.
On the one hand, Cluster 3 classifies customers who give a lower rating to the services offered by the airline. In rating 1, 31.0% of customers are found, in rating 2, 26.4%, and in rating 3, 31.0%. Hence, it is clear that the most dissatisfied customers tend to share a common vocabulary to rate the service they have received. On the other hand, Cluster 4 obtains a distribution of percentages similar to the total average, except in ratings 1 (4.9%) and 2 (6.6%), where the allocation of customers is significantly lower than the total average. Finally, Cluster 5 is the largest, with 3334 cases, which shows a tendency toward medium or low ratings of the service received, with half of the assigned clients concentrated in ratings 1, 2, and 3.
Appendix C shows the results obtained in the clusters identified according to the significant labels. The number of times each label appears in the titles of the comments is shown. Thus, it is possible to determine the words that most identify the users assigned to each cluster. In Cluster 1, the most frequently used tag is travel (362), followed by excellent (260). Other labels that also identify this group of clients are punctual (31), comfortable (24), not (16), always (15), calm (14), pleasant (12), normal (12), more (12), but (11), experience (10), and pleasure (10). The excellent label matches the positive ratings of this cluster, which is associated with the travel tag. Other adjectives used to rate the airline positively are punctual, comfortable, quiet, pleasant, experience, or pleasure. In contrast, a label that can be considered negative is no, which is usually used a large number of times, and, in this cluster, only appears 16 times.
The labels that appear most in Cluster 2 are flight (88), punctual (70), good (65), travel (54), experience (38), comfortable (27), not (21), and but (20). These are generally positive tags, especially related to flight punctuality. However, some clients use the words not and but, which are typically used to make a negative remark or detract from some positive attribute. These results are related to the great concentration in the value 4 rating in this cluster. In contrast, in Cluster 3, the most frequently used labels are worse (77) and space (53), with fewer labels assigned. This result means that the customers assigned to this cluster are unhappy with the poor performance of different airline services, but especially with the space between the seats.
Cluster 4 groups clients who use varied and generally positive language. The most commonly used labels are excellent (50), punctual (49), calm (30), normal (30), and pleasant (25), which are, in general, positive adjectives. However, labels are also used that negatively rate the airline’s service, such as delayed (23) and uncomfortable (12), in contrast to other positive qualifiers such as perfect (18) and comfortable (18). Therefore, these customers are generally satisfied, but some show dissatisfaction with the plane’s delays and discomfort. Cluster 5, the most numerous, is the one with the most extensive variety of labels to express customers’ qualitative ratings. The labels used most are no (199), experience (118), more (103), comfortable (95), good (90), better (89), normal (86), delay (83), always (80), bad (60), quality (52), wait (51) and badly (50). These results show that this cluster is the worst, and defined by using varied qualifiers and with the opposite sense. Another visual description of the cluster analysis on the identified labels is displayed in Table 3 to provide airlines with proper managerial solutions and recommendations based on interpretation in Appendix C.

5. Discussion of Results

Based on the research carried out, it is clear that the proposed methodology for designing a learning machine model that detects the key tags in the titles of customers’ online comments is validated. Moreover, it is a methodology that has a learning process, only requiring specific updates to improve the tags that make up the database to analyze the contents of the users’ online comments. The initial process of data capture is the previous step to carrying out the statistical analyses. The first step was the descriptive analysis, which verified that the comparison of the linguistic sense of the labels with the averages of the ratings of the titles of the comments where they appear was logical. Thus, the labels that show positive aspects of the airline tend to obtain high scores around the value of 4, whereas the negative labels orient the averages towards low values between 1 and 2.
Another fundamental aspect of the descriptive analysis is establishing the number of times a label appears in the comments. These data are essential for designing and debugging a label database for content analysis. When tags appear only a few times, for example, 1 or 2 times, it is not possible to validate the relationship between the tag and the average of the ratings obtained. Moreover, it is possible that if the label is complemented by others that may change its meaning, such as a negation, it may give an average that does not correspond to the logic of the label when reading it in isolation. This is the case of good or not good, where the meaning of the label without the context has a specific orientation, but when linked to another label, its meaning can change diametrically. Despite these appreciations that serve as a basis for future research, the results obtained are satisfactory because the relationship between the key labels and the rating is demonstrated only by analyzing the averages obtained in the ratings and the number of times they appear in the customer reviews. Likewise, labels that do not have a significant influence on ratings are observed. The descriptive results obtained, along with the results of the regression analysis, make a fundamental contribution to the development of learning machine algorithms for detecting keywords in the content analysis of online ratings.
In this line, regression analysis confirms that the machine’s approach to learning the key labels is valid. The high number of significant labels and the high adjusted square R validate the proposed methodology. It is confirmed that the linear regression provides essential information for the development of a learning machine and artificial intelligence program, given that it provides two essential data for assessing the labels: (1) the level of significance and (2) the coefficient. Together with the descriptive data of the average ratings and the number of times they appear in the titles of the comments, these data make it possible to develop an algorithm that learns continuously.
On the one hand, the significant labels are determined, but we must not forget the other labels, which, in time, can be of great interest in the constant improvement of the learning algorithm. In this study, only the labels are analyzed individually. However, in future research, the study should be combined with other key labels that can modify or consolidate the relationship between the comment title and the rating given by a customer. Another aspect to consider is that, because the participation of a specialist feeds the label database, a moment will come when the vocabulary available in the database will be so extensive that it will require only a small number of improvements. In this context, the regression is also presented as a useful tool to study the interrelations between labels that appear in the same title or comment.
In this research, a first step is taken because key labels are identified. These are qualitative variables, and they are related to a quantitative variable of maximum strategic interest, such as the airline’s global assessment and its service. Moreover, a means of assessing the labels is discovered that is scientifically tested through descriptive and multiple regression analysis. The results obtained confirm that, as expected, there is a logical relationship between the words used to evaluate a specific service, company, or brand and a variable that measures the perception or attitude, depending on the degree of interaction between the passenger and the airline rating measured in TripAdvisor. Moreover, in this type of variable, this logic has a positive or negative sign and an intensity, depending on the coefficient obtained.
Regarding the cluster analysis carried out, even though some of its results are not conclusive, it can be considered an exploratory study of interest for future research. First, four clusters of clients were identified who use similar words to evaluate their online opinions. From this perspective, the analysis demonstrated that the type of label can be used to categorize customers. Regarding Cluster 5, it was the worst defined, given that almost half of the users were assigned to it. Nevertheless, this result does not disqualify this study because it shows that it is an exciting and fruitful field of study where the classification of customers must advance, not only by using different techniques, but also by improving the process of dividing and, if necessary, subdividing the clusters created.
Likewise, it would be interesting to use other types of classificatory variables, if possible, such as age, social class, nationality, and level of study, if they contribute to clarifying the characteristics of the typologies defined. From the results obtained, it can be stated that the methodology for preparing labels and determining which ones are significant in the general rating is satisfactory for carrying out a classification of customers. Along these lines, depending on the information stored in the databases and processed with the appropriate statistical methods, it will be possible to pre-classify some of the users according to the type of vocabulary they use to express their opinions on the Internet.
The results of this process has been the understanding and identification of clusters with keywords that are similar to each other in terms of content and unlike other clusters. Moreover, comparing the results of this study with previous research which has defined key labels and topics extracted from online reviews of airlines (34,56,94,97,99) has shown similarities in terms of the key labels identified upon modeling algorithms process in identifying repetitive labels in the reviews. However, our findings demonstrate that using cluster analysis can be identified based on the words used by passengers in the title of comments. In addition, they show the most important aspects of services identified by the passengers assigned to each cluster group rating of Iberia, and the main aspects is about (1) quality of customer service which refers to the important of the performance of this service, (2) inflight experience, (3) punctuality of flight, (4) flight delays, (5) check-in, (6) personnel service. Research from Masorgo et al. [102] using expectancy disconfirmation theory EDT, they found that both arrival delays and involuntary denied boarding negatively affect customer satisfaction, their results show the importance of managing passenger expectation about airline service and the inflight experience.
Finally, an essential aspect of implementing this type of methodology to develop learning machines and artificial intelligence in content analysis is the way the label databases are constructed. As the proposed model shows, this has to be a continuous process where information is transferred from the Internet and a specialist to the software developed for this purpose. Two alternatives have been proposed in the study. One is to create the database based on the different words that appear in the comments’ titles without additional processing. The other alternative is to establish the roots of the words used in order to reduce the number of labels without losing the meaning of the words. In this case, the software was created to generate and identify the roots of the words with six or more characters. The results showed that many of these types of labels have a significant relationship with the general rating, and the descriptive analysis contributed to clarifying the meaning and intensity of the labels.
Several contrasting programs have already been used to simplify the words according to their English root, but not in Spanish. Thus, it was necessary to develop and implement specific software to apply to the Spanish comments about Iberia airline. Despite obtaining satisfactory results, the program can be improved, for example, by grouping the conjugations of irregular verbs within the same basic label. We create a variable that is the reference for that tag when the database is created. For example, the past tense of the verb to go is went, which cannot be related because they share a minimum number of familiar characters. For these cases, a variable could be added where the base label is assigned. It would also be interesting to determine when the system creates acts autonomously without consulting a specialist. In this context, a periodic validation should be carried out of whether its processes and forecasts align with reality.

6. Conclusions

This study’s main conclusion is that content analysis of online customer opinions can be transformed into dichotomous variables (0, 1) and then related to a quantitative variable such as the overall rating. Through this methodology, coefficients and signs of the labels are obtained that facilitate their interpretation by a learning machine or artificial intelligence. In this context, the vocabulary people use tends to vary over time, as well as the assessment given to each word. Therefore, the results obtained in this study are of great interest for developing this line of research because they offer methodological support for giving a quantitative value to the labels.
The creation of the label database is an essential aspect of carrying out this type of study. It requires the development of specific software to solve the problems that arise in the language. In this context, the study demonstrated that a tag database based on the root of the words obtains satisfactory results and simplifies these types of studies and algorithms. In this regard, alternatives that could improve the learning machine for future research were observed, such as the insertion of irregular verbs and simple word structures, which would make it possible to better predict the meaning and intensity of the customers’ opinions.
Based on the two aspects mentioned above, a learning machine structure is designed that, at first, obtains information from a specialized person, but as its knowledge increases, can become autonomous. In this context, utilizing descriptive statistical analysis and multiple regressions, it starts an internal process of permanent learning that can be the basis for developing an artificial intelligence designed to manage the strategic aspect of online relationships with customers. The linear regression offers two fundamental data to build a continuous learning algorithm, that is, the sign of each label’s relationship with the general rating and the intensity measured by the coefficient obtained. This analysis must complement the descriptive analysis, where the frequency with which the words are used is a key factor in the evaluation of the words and their relationship with the average rating obtained in the comments where each label appears.
The results obtained in the multiple linear regression show that a high adjusted R square is achieved, demonstrating that the logic of language can be translated into quantitative terms related to customers’ online opinions. Moreover, the regression provides essential information for developing a learning machine and artificial intelligence in the language used in online user feedback. In this context, the study demonstrates the great importance of having a sign of a relationship regarding a quantitative variable that values customers’ perceptions or attitudes toward a brand or service. Likewise, each label’s coefficient is a measure of intensity of the general rating, which makes it an essential element in assessing customer interactions. From this perspective, the linear regimen offers more information than neural networks, where the key labels, their sign, and their intensity cannot be determined.
The limitations of this study provide opportunities for carrying out future research and improvements in this field. On the one hand, the need to improve the algorithm to find the roots of words to simplify the learning process in languages other than English has already been mentioned. This process should be as automated as possible, so that a specialist’s involvement is concentrated in the earliest stages of learning. Furthermore, the joint analysis of descriptive and multiple regression results needs to be improved. Thus, the frequency of use of words should be taken into account, so that the coefficients and signs that are statistically assigned are not biased. A minimum level of frequency needs to be established to validate the results obtained.
Regarding the cluster analysis, the results indicated that exploratory research shows an exciting future for this type of investigation. Along these lines, it might be possible to define clients’ profiles with greater precision by taking other variables into account such as flight class (first class, business class, economy class) [53], culture dimensions [93], domestic and international flights [93], geographical regions [98], low cost and full service carriers [99], using different platforms to gather information such as Twitter [103], mobile app [92], and SKYTRAX [94,98]. This research can be incorporated in more airlines in Europe and can provide comparative investigation between airlines. It is also necessary to validate the proposed methodology over time, especially in forecasting the rating evaluation when a customer gives an opinion and the typology to which s/he is most likely to belong. This methodology can be apply in multiple fields like hotels, hospitals and google play store.
In conclusion, we must emphasize the importance of having a reference variable that allows us to evaluate the meaning and intensity of the words in online reviews. In this case, we have used TripAdvisor’s general rating, but other variables can be used to facilitate deciphering how customers think and what they value when they comment on a company, brand, or service. Regarding the competitive market of airlines and their various services, this study can be considered as a guide for airlines that need to keep up their positioning in the market by developing marketing strategies that attract new passengers and increase their online reputation. The results of this research give business insights implications and recommendations for predicting passenger satisfaction by analyzing online reviews in the titles. Along these lines, the possibility of merging quantitative and qualitative customer assessments as two sides of the same coin is essential for developing learning algorithms and artificial intelligence that can help employees to manage companies more competitively. The contribution of this investigation correlates to the evolution of a new approach to spot the strengths and weakness of airline’ services depending on the identified key labels in order to help managers to improve passenger experience.

Author Contributions

Investigation, A.Z.A. and M.R.D. All authors have read and agreed to the published version of the manuscript.

Funding

Fundación C. Universitaria de Las Palmas.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Descriptive analysis of labels.
Table A1. Descriptive analysis of labels.
LabelNumberMeanLabelNumberMeanLabelNumberMean
ideal35bus943.61assistance33
pleasure124.75turbulence53.6height33
notable44.75lots203.6seat1372.99
printing44.75according53.6time482.98
magnificent144.71resolved53.6regular392.97
fantastic104.7queue53.6nothing502.96
marvelous104.7assistant53.6individual202.95
practical34.67acceptable253.6port212.95
defraud34.67on board193.58plaza1332.95
focus34.67whole73.57pain202.9
excellent3274.64rest73.57average92.89
latest54.6vacation163.56checking82.88
impeccable594.58crew863.56educate72.86
flawless154.53quite323.56where72.86
exceptional184.5age113.55bathroom72.86
phenomenal64.5air193.53down72.86
chair44.5service4583.52total132.85
fan144.43touch43.5then132.85
unbeatable54.4topic83.5case132.85
agreed54.4seriousness43.5airport202.85
have34.33feel 43.5web62.83
pass34.33route43.5repeat62.83
modern124.33point123.5half62.83
entertaining34.33lend83.5less122.83
effective34.33order63.5cheap62.83
appreciate34.33menu63.5attention112.82
chord34.33frequent103.5straits112.82
pleasurable234.3replicate43.5narrow172.82
TRUE44.25although183.5go52.8
serviceable44.25aspect43.5arrive52.8
sensation44.25current83.5uncomfortable542.76
even44.25cabin213.48value42.75
like84.25deal863.47sun442.75
favor44.25food723.47follow42.75
astonishing44.25something133.46manner42.75
cross44.25airline1383.46mother42.75
home184.22minute93.44low-cost42.75
cute54.2without2503.43justify42.75
cordial114.18entre563.43thanks42.75
leap64.17swindle123.42works82.75
drinks64.17price913.42tough42.75
fast324.16lines123.42passenger232.74
punctual4684.15connection193.42reenter112.73
professional224.14job53.4old222.73
efficiency74.14also53.4after1402.73
tourist184.11gate53.4date72.71
trading94.11routine53.4enough32.67
careful194.11legs153.4appear92.67
nice1004.11request53.4palm32.67
relation224.09need53.4or32.67
great524.08row53.4check in32.67
quiet594.07escape53.4control32.67
class344.06subsequent to53.4tariff32.67
always1144.05inside153.4fault32.67
quality804.04return513.39high382.66
summer34mess163.38already332.64
last74amazing83.38bad3812.64
journey144during83.38desire82.63
usually84detail83.38delay1032.61
startles44but1703.37between52.6
simply54period113.36dice202.6
sure164all113.36ok192.58
relaxed54bread223.36side192.53
regional64on203.35hours562.52
reasonable64much523.35respect62.5
promotion34highlight33.33slow42.5
predicted44put33.33can62.5
prefer34power63.33equal42.5
Fare-quality34couple33.33constant42.5
pilot84inconvenient63.33behavior42.5
film44provision63.33see632.41
displays44different33.33lack292.41
opinion64portion63.33responsibility52.4
require34whatever33.33operated52.4
Christmas34which63.33operate102.4
deserves54short93.33crummy52.4
pet34face243.33loose52.4
praise34mouth33.33close52.4
language34alike1173.32change52.4
big44yet283.32why82.38
fleet54some193.32baggage522.38
easy54small103.3badly822.38
excellence44bridge143.29common82.38
spectacular54delicious183.28zero82.38
charm34neither2483.28world142.36
in time104never223.27scammed32.33
say44offer653.26sir32.33
price-quality54flag193.26remains92.33
cabotage34only83.25priority32.33
help34transport43.25predisposition32.33
friendly274system43.25minimum32.33
bueno1083.96availability43.25mini62.33
very6053.95cutout43.25Information152.33
comfort393.95post43.25difficulty32.33
incidence163.94possibility43.25pause92.33
general303.93position43.25deficient62.33
fly393.92conventional83.25review32.33
food1833.92latinoamerica43.25obtain32.33
ate113.91delete43.25fare42.25
business553.91depends83.25let42.25
duration413.9basic43.25care162.25
international103.9acompaña43.25check-in42.25
end193.89give253.24luggage512.24
proposal83.88if7913.22util52.2
reach83.88land143.21dirty102.2
concerning73.86none243.21inexpensive52.2
Mexico143.86difference143.21fatal52.2
setback73.86withdraw53.2guilt52.2
right753.85revisit53.2ancient102.2
luck63.83recommend53.2reservation172.18
preferential63.83normal1383.2contribute172.18
moment63.83final53.2cancelation62.17
intercontinental63.83additional53.2attitude62.17
factor63.83rule1403.19leave192.16
traverse63.83more1923.19employee72.14
thing63.83space783.18suitcase82.13
premium113.82responsible63.17organization102.1
expectations113.82to get better483.17possible152.07
surprise263.81special123.17attention142.07
direct273.81choice63.17client512.06
degrade53.8foot323.16reply32
incidents53.8stewardess203.15page32
reliable103.8life213.14online82
executive53.8medium223.14kids42
effectiveness53.8unexceptional73.14hands52
classic53.8low283.14subsequently42
when93.78go after163.13unpunctual42
since263.77corporation83.13impossible142
flight8173.75scarce93.11grade32
pears43.75standard103.1exists82
place43.75no7113.1gave112
plenty43.75economic103.1charge82
such3333.75from1033.1ticket112
long463.74before393.1water72
holiday453.73expensive223.09exhaust32
weight113.73same123.08disappointment221.91
new223.73airplane1333.08sardines71.86
a lot263.73by1633.06poor71.86
warranty113.73towering173.06plus51.8
comfortable1603.73pay213.05left51.8
option393.72express263.04advertising41.75
and14143.71boarding263.04higher41.75
transatlantic73.71form303.03cheated41.75
clothes423.71wifi43buy41.75
interests73.71tv43infant41.75
going6543.71crossing33regrettable151.73
better2733.7transoceanic63glass31.67
attention3183.69contact33still31.67
vip33.67pulling33probable31.67
satisfied33.67type43hair61.67
except63.67persons53overbooking91.67
peace63.67think33horror61.67
reduced33.67part83explanation61.67
money33.67pay103disaster211.62
background33.67gold133nightmare51.6
number33.67leisure73painful51.6
frequency33.67level83warn51.6
defect33.67local33month71.57
status33.67bring63worse41.5
entertainment243.67together43pessimistic451.47
select63.67irregular33error91.44
cash33.67internal43human51.4
destination93.67try33shame81.38
must33.67until93terrible61.33
costa33.67gentle93prepotent31.33
puddle33.67cold43inexistent31.33
charco33.67stay33non-compliance31.33
coast153.67scale173void31.33
thank33.67link33abuse31.33
problem643.66efficient93mistreatment41.25
motto653.66lag33fraud241.08
experience1823.66mean33thrown away31
departure233.65cost63fright41
company1463.65catering63motive31
extra113.64pricey33unpresentable51
able to83.63fits43avoidance31
organized83.63transportation33chaos41
habitual83.63respite33app31

Appendix B

Table A2. Regression analysis with rating as independent variable (r2 adjusted 0.546).
Table A2. Regression analysis with rating as independent variable (r2 adjusted 0.546).
VariablesBetaTSig.VariablesBetaTSig.
(constant)3.739131.6280.000 ***space−0.535−3.9090.000 ***
seat3.8642.2360.025 *reservation−0.562−1.9710.049 *
keep3.7702.2030.028 *never−0.576−2.6700.008 **
additional3.5394.0280.000 ***client−0.607−4.0160.000 ***
tower3.2892.4410.015 *if−0.617−3.1440.002 **
deliver3.2763.3830.001 **nothing−0.628−3.1140.002 **
husband3.1132.9540.003 **charge−0.637−1.9420.052
add up3.1013.1240.002 **lost−0.672−3.2950.001 **
then3.0152.2090.027 *improved−0.698−3.2580.001 **
distribution2.9253.4090.001 **pain−0.721−2.9920.003 **
slow2.8932.1640.031 *baggage−0.769−4.2370.000 ***
forecast2.8834.6080.000 ***all−0.772−2.2290.026 *
restrict2.8602.1990.028 *regular−0.779−5.0140.000 ***
agency2.8362.7920.005 **disgust−0.786−2.1340.033 *
gender2.5972.1510.032 *stewardess−0.788−2.8540.004 **
congratulations2.5372.6460.008 **level−0.813−2.3120.021 *
whole2.5242.3130.021 *impossible−0.818−2.8960.004 **
find2.4902.8300.005 **until−0.821−2.3320.020 *
natural2.3552.4390.015 *no−0.840−10.6880.000 ***
alien2.3052.3530.019 *absolute−0.840−2.2540.024 *
centimeter2.2202.1840.029 *high−0.854−2.3070.021 *
affair2.1691.7140.087horror−0.900−3.3550.001 **
evident2.1531.7330.083give−0.902−1.8340.067
barbarian2.1163.5080.000 ***vary−0.904−2.3770.017 *
fan2.1012.3360.020 *delay−0.907−8.2220.000 ***
tracks2.0882.3340.020 *lack−0.930−4.4300.000 ***
price2.0582.7970.005 **uncomfortable−0.965−8.2570.000 ***
number2.0151.8350.067drinks−0.969−1.7920.073
beverage1.9431.6440.100organization−1.017−2.9090.004 **
mistake1.9292.5570.011 *buy−1.023−3.8200.000 ***
light1.9151.8000.072quality−1.054−1.8680.062
indecent1.8902.0250.043 *think−1.064−2.7780.005 **
documentation1.8711.9780.048 *solve−1.137−2.1680.030 *
curious1.8541.6470.100communicate−1.150−2.3220.020 *
defraud1.8372.1690.030 *lowcost−1.171−2.7090.007 **
electronic1.8011.8230.068old−1.175−3.8940.000 ***
royal1.7281.7210.085warn−1.196−2.7970.005 **
little bit1.7091.9010.057complicate−1.197−1.8920.059
place1.6962.5150.012 *delay−1.221−3.4580.001 **
instructions1.6861.8570.063appear−1.224−3.3280.001 **
mini1.6821.8040.071badly−1.230−10.6790.000 ***
talk1.5721.6990.089telephone−1.260−1.9090.056
relax1.5431.7650.078plus−1.274−3.0390.002 **
personalized1.5121.8190.069pause−1.279−2.8860.004 **
remedy1.4721.7970.072chaotic−1.289−2.4620.014 *
round1.4541.7090.087terrible−1.328−3.4830.001 **
road1.4261.7230.085arrival−1.356−2.0690.039 *
agile1.4111.6890.091want−1.375−2.0490.041 *
warm1.3822.5010.012 *bad−1.416−12.0930.000 ***
have1.3402.5540.011 *unpunctual−1.419−2.6850.007 **
touch1.3292.0530.040 *error−1.441−2.5940.010 *
after1.3162.1020.036 *discriminate−1.453−2.3540.019 *
easy1.3001.6510.099thrown away−1.489−2.3160.021 *
agree1.2701.9790.048 *mistreatment−1.507−3.2140.001 **
opera1.2432.0140.044 *possibility−1.523−2.2930.022 *
system1.2391.9470.052water−1.539−2.4680.014 *
luxury1.1842.0060.045 *ticket−1.540−2.7870.005 **
magical1.1801.9950.046*forget−1.556−3.2680.001 **
web1.1791.9380.053negligent−1.577−1.7080.088
meeting1.1671.9740.048 *wide−1.582−1.8310.067
maximum1.1511.9860.047 *avoidance−1.583−3.2860.001 **
moment1.1372.6070.009 **any−1.590−2.3220.020 *
unforgettable1.1281.9430.052sell−1.600−3.4380.001 **
behavior1.1121.9560.051sardines−1.620−2.9600.003 **
predisposition1.0402.0640.039 *down−1.623−2.5900.010 *
remove1.0241.7300.084non-existent−1.630−2.6590.008 **
remarkable1.0172.2470.025 *particular−1.648−2.8040.005 **
magnificent0.9664.4030.000 ***painful−1.659−5.0140.000 ***
fantastic0.9153.5620.000 ***frightening−1.671−3.3870.001 **
great0.8863.6070.000 ***cheated−1.678−4.8300.000 ***
brilliant0.8634.0980.000 ***poor−1.699−4.8340.000 ***
extraordinary0.8561.6880.092lousy−1.707−5.2110.000 ***
excellent0.85216.0560.000 ***load−1.712−1.9290.054
end0.8251.9670.049 *deficient−1.715−4.3780.000 ***
perfect0.7927.3190.000 ***breach−1.739−2.1230.034 *
charm0.7863.1300.002 **decline−1.739−3.0000.003 **
phenomenal0.7552.2160.027 *money−1.739−2.1230.034 *
enjoyment0.7493.3400.001 **vulgar−1.739−2.1230.034 *
compete0.7341.7320.083terror−1.747−4.8560.000 ***
wonderful0.7152.7320.006 **worse−1.755−16.3310.000 ***
house0.6992.9140.004 **exhausting−1.761−2.9990.003 **
according0.6871.8570.063scammed−1.772−2.8030.005 **
like0.6661.8050.071rate−1.782−3.3910.001 **
faultless0.6422.9050.004 **win−1.787−2.1560.031 *
unbeatable0.6371.6790.093sad−1.804−2.3160.021 *
neither0.6222.2590.024 *countryside−1.812−2.1670.030 *
first0.5862.9160.004 **half−1.817−3.3450.001 **
big0.5854.0860.000 ***shame−1.839−5.3300.000 ***
better0.5764.8070.000 ***fatal−1.866−4.4910.000 ***
overcome0.5182.1250.034 *disaster−1.867−11.1700.000 ***
almost0.5171.8360.066obsolete−1.889−3.0190.003 **
time0.4662.2070.027 *disappointment−1.900−10.2920.000 ***
passenger0.4651.8520.064reason−1.902−2.5730.010 *
trust0.3251.6390.101hair−1.925−4.5070.000 ***
fast0.3122.1540.031 *precarious−1.926−2.3390.019 *
fly0.2791.7980.072value−1.948−3.6790.000 ***
always0.2752.8970.004 **guilt−1.965−2.9080.004 **
very0.2576.1690.000 ***shatter−1.970−2.3490.019 *
good0.2333.0680.002 **awful−1.996−2.4340.015 *
comfortable0.2323.1310.002 **regrettable−2.010−8.3230.000 ***
punctual0.2194.3810.000 ***recognize−2.028−2.4540.014 *
pleasant0.2172.2540.024 *abusive−2.048−3.7400.000 ***
calm0.2011.9450.052pessimistic−2.054−18.1030.000 ***
but0.1972.1790.029 *leave−2.061−3.6220.000 ***
travel0.1392.7990.005 **cancel−2.077−5.2670.000 ***
experience0.1281.8320.067impolite−2.107−3.1680.002 **
flight0.1092.6640.008 **unpresentable−2.127−5.4690.000 ***
and0.0801.6220.105bored−2.135−2.6000.009 **
more−0.177−1.8270.068distorted−2.136−3.6390.000 ***
wait−0.262−2.0730.038 *transfer−2.201−2.5910.010 *
man−0.337−1.9320.053abandonment−2.221−3.7960.000 ***
respect−0.453−1.7510.080spoil−2.227−1.6610.097
low−0.489−1.6170.106filth−2.265−2.1360.033 *
clean−0.518−1.6990.089inoperative−2.266−1.9290.054
scarce−0.588−2.1510.032 *zero−2.266−5.0120.000 ***
return−0.594−1.9080.056abuse−2.273−3.3800.001 **
yet−0.602−1.6250.104date−2.281−1.8100.070
cheap−0.608−2.0770.038 *antihuman−2.285−2.2830.022 *
mediocre−0.673−1.8920.059reschedule−2.316−3.7720.000 ***
consider−0.713−1.9270.054anger−2.324−2.7950.005 **
fair−0.748−1.6130.107overbooking−2.335−6.0420.000 ***
dirty−0.751−1.6270.104terminal−2.344−1.9310.054
disorganize−0.808−2.3390.019 *fraud−2.346−12.1240.000 ***
operated−0.847−1.6520.099meters−2.352−1.8190.069
explanation−0.861−2.1370.033 *strict−2.397−2.5580.011 *
standard−0.875−1.8360.066apparatus−2.434−2.0160.044 *
cargo−0.885−2.0570.040 *uncomfortable−2.440−2.9010.004 **
loose−0.929−1.8980.058general−2.446−2.0060.045 *
response−0.939−1.6540.098doing−2.461−3.9360.000 ***
adapt−1.007−1.6840.092lies−2.474−4.2290.000 ***
fault−1.082−2.2240.026 *divide−2.503−2.7120.007 **
damage−1.100−1.7860.074dire−2.594−7.6870.000 ***
ignore−1.222−2.4480.014 *delete−2.640−2.7620.006 **
app−1.232−1.7700.077unhuman−2.641−3.7490.000 ***
word−1.252−1.8570.063run away−2.694−3.2470.001 **
remember−1.268−2.1140.035 *oversold−2.711−3.2790.001 **
satellite−1.273−1.8560.063expression−2.712−2.6550.008 **
null−1.451−2.3780.017 *jinx−2.714−3.0660.002 **
baby−1.521−2.4930.013 *revulsion−2.739−3.3430.001 **
gain−1.543−1.6520.099tyranny−2.739−3.3430.001 **
letter−1.563−1.7070.088unforgivable−2.739−3.3430.001 **
clouds−1.571−1.8290.067fiasco−2.739−3.3430.001 **
exorbitant−1.588−1.7760.076failure−2.739−3.3430.001 **
infinite−1.647−1.6240.104queue−2.739−3.3430.001 **
infimum−1.657−1.9080.056authorize−2.739−3.3430.001 **
negligence−1.696−1.8700.062outrageous−2.739−3.3430.001 **
request−1.725−1.8270.068authentic−2.739−3.3430.001 **
suffer−1.739−2.1230.034 *xenophobic−2.739−3.3430.001 **
frozen−1.739−2.1230.034 *criminals−2.739−3.3430.001 **
pate−1.739−2.1230.034 *very poor−2.739−3.3430.001 **
hate−1.739−2.1230.034 *garbage−2.739−3.3430.001 **
riot−1.739−2.1230.034 *incompetent−2.739−3.3430.001 **
mortal−1.739−2.1230.034 *misery−2.739−3.3430.001 **
unsatisfied−1.739−2.1230.034 *navel−2.749−2.9210.004 **
free−1.739−2.1230.034 *tremendous−2.819−3.4340.001 **
go around−1.739−2.1230.034 *inefficiency−2.819−3.4340.001 **
perfume−1.739−2.1230.034 *court−2.865−3.0970.002 **
penalize−1.739−2.1230.034 *chaos−2.920−5.3150.000 ***
owners−1.739−2.1230.034 *horny−2.927−3.0200.003 **
unconcern−1.757−1.9100.056true−2.935−3.5610.000 ***
go down−1.766−2.0860.037 *scam−2.950−5.9370.000 ***
truck−1.787−2.1560.031 *default−2.950−2.1560.031 *
ancient−1.794−2.1060.035 *delight−2.954−2.1210.034 *
fed up−1.795−2.0220.043 *award−2.992−2.8210.005 **
regret−1.816−2.0300.042 *still−3.023−1.8910.059
waste−1.832−2.2170.027 *rue−3.080−2.8790.004 **
stumbling−1.848−2.2540.024 *delicacy−3.081−2.3100.021 *
liquor−1.869−1.8830.060juice−3.139−2.0700.039 *
refrain−1.878−2.2890.022 *secondary−3.204−3.7390.000 ***
divert−1.891−2.2080.027 *frustration−3.221−3.7510.000 ***
canned−1.891−1.6320.103car−3.311−4.0190.000 ***
admit−1.899−2.3090.021 *botched−3.324−3.9980.000 ***
believe−1.899−2.3090.021 *rude−3.409−2.8900.004 **
see−1.907−1.8400.066converted−3.437−3.1380.002 **
useless−1.943−2.1800.029 *confusion−3.636−3.0520.002 **
functional−1.973−2.2350.025 *permitted−3.693−2.9050.004 **
remember−1.996−1.8950.058ambient−3.708−2.3660.018 *
private−2.001−1.7520.080chair−3.763−2.7130.007 **
turbulent−2.072−2.5210.012 *occupied−3.826−2.3670.018 *
sweetie−2.111−2.5030.012 *find out−4.170−3.0480.002 **
converting−2.151−1.9100.056omitted−4.207−2.4950.013 *
modification−2.177−2.5110.012 *nobody−4.425−3.2230.001 **
airplane−0.285−2.1750.030 *relapsing−4.531−3.5450.000 ***
hours−0.323−2.0810.037 *lay off−4.681−2.9300.003 **
normal−0.532−6.7220.000 ***indifferent−5.076−3.6350.000 ***
Significance level: * p < 0.05; ** p < 0.01, *** p < 0.001.

Appendix C

Table A3. Cluster analysis based on labels.
Table A3. Cluster analysis based on labels.
Cluster 1Cluster 4Cluster 5 (Continuation)
LabelNumberLabelNumberLabelNumber
nice12pleasant25absolute7
airplane8water2abusive5
well9void2enjoyable36
characteristic9plane10water4
warm9warn2tall6
load2stewardess2old8
client3economic3backwardness6
comfortable27well9plane49
buy3ticket2warn5
disaster2quality9stewardess12
charm3trait8cheap8
distance4cancel9drinks3
exceptional3almost2good90
excellent260comfortable18ticket6
experience10buy4quality52
marvelous5frustration2cancel3
like2disaster2chaos4
scheduled5attraction2chaotic3
horror2scarce3Position5
embarrassing8gap5house15
horrible3remain3almost9
wrong4awesome4zero6
luggage2excellent50client45
more12adventure9charge13
best6explanation2comfortable95
very2fantastic6compete4
blank2date2buy20
neither2unusual3communicate4
Not16general3consider6
normal12gender3any3
never3great6guilt5
accurate3hours8damage3
but11pure2give7
pessimistic9impossible2disappointment20
pleasure10uncomfortable12deficient3
first5unbeatable3defraud5
punctual31regrettable2leave7
fast5clean2disgust6
standar3magnificent2disaster28
respect3atrocious7disrupt5
prorogation2bags7money3
if7Wonderful3discriminate3
always15larger14charm6
beats2better5find3
calm14nought2cheated8
crew4neither3scarce3
travel362no way10frightening3
return4normal30wait51
Cluster 2operated3fraud8
LabelNumberworse6amazing5
pleasant18lost6experience118
old2perfect18explanation4
airplane15nevertheless5extra-ordinary3
stewardess6pessimistic4lack25
drinks2enjoyment7wonderful5
good65first4fatal5
feature4punctual49phenomenal3
warm4fast9end7
client2delayed23loose3
comfortable27if2cold3
according3continuously5general18
deficient3overcome2gender19
disrupt2pause2great15
scarce3time7big34
space17calm30relish3
outstanding15personnel5until6
experience38journey4hours37
overall9Cluster 5horror8
sex10LabelNumberirreproachable10
hours5want6impossible9
flawless2fast20unpresentable5
awkward15regular29unpunctual3
shitty17reservation16mortifying29
awful10respect13non-existent3
rather3remove3inhuman3
nil3delay83regrettable12
neither4sardines7clean9
not21divide3low cost4
normal9if30place3
faultless5always80pleasant12
but20solve8abysmal42
pessimistic2standard3badly50
enjoyment11overcome15impolite4
punctual70postponement7baggage45
fast2fare3lousy6
if3terrible5mistreatment5
habitually12terror6wonderful8
duration2weather25more103
wholly3scam3maximum4
soundless11thrown away3mediocre8
staff5all6better89
travel54quiet20improvable19
fly3behind4moment5
flight88crew23nothing34
Cluster 3value4dire6
LabelNumbervariar7ni23
airplane2sell4level7
comfortable5shame7no199
disaster2fly35normal86
scarce3return6remarkable3
space53web4number3
fraud2bad60never18
experience7delight13obsolete3
Lack2seat5forget4
horror3poor5organization6
impossible3possibility3overbooking7
uncomfortable2first12appear10
crappy6prompt306pasajero20
mejor2pensar5hair6
no means4Lost32Pain16
worse77impeccable39painful7
pero9however111
precise4
fast2
regularly2
trip3

References

  1. Rodríguez-Díaz, M.; Espino-Rodríguez, T.F. Determining the reliability and validity of online reputation databases for lodging: Booking.com, TripAdvisror, and HolidayCheck. J. Vacat. Mark. 2018, 24, 261–274. [Google Scholar] [CrossRef]
  2. Horster, E.; Gottschalk, C. Computer-assisted webnography: A new approach to online reputation management in tourism. J. Vacat. Mark. 2012, 18, 229–238. [Google Scholar] [CrossRef]
  3. Yacouel, N.; Fleischer, A. The role of cybermediaries in reputation building and price premiums in the online hotel market. J. Travel Res. 2012, 51, 219–226. [Google Scholar] [CrossRef]
  4. Li, H.; Ye, Q.; Law, R. Determinants of customer satisfaction in the hotel industry: An application of online review analysis. Asia Pac. J. Tour. Res. 2013, 18, 784–802. [Google Scholar] [CrossRef]
  5. Gössling, S.; Hall, C.M.; Andersson, A.C. The manager’s dilemma: A conceptualization of online review manipulation strategies. Curr. Issues Tour. 2018, 21, 484–503. [Google Scholar] [CrossRef]
  6. Yusuf, A.S.; Busalim, A.H. Influence of e-WOM engagement on consumer purchase intention in social commerce. J. Serv. Mark. 2018, 32, 493–504. [Google Scholar] [CrossRef]
  7. Akram, U.; Junaid, M.; Zafar, A.U.; Li, Z.; Fan, M. Online purchase intention in Chinese social commerce platforms: Being emotional or rational? J. Retail. Consum. Serv. 2021, 63, 102669. [Google Scholar] [CrossRef]
  8. Bahtar, A.Z.; Muda, M. The impact of User–Generated Content (UGC) on product reviews towards online purchasing–A conceptual framework. Procedia Econ. Financ. 2016, 37, 337–342. [Google Scholar] [CrossRef] [Green Version]
  9. Xu, Z.; Dang, Y.; Wang, Q. Potential buyer identification and purchase likelihood quantification by mining user-generated content on social media. Expert Syst. Appl. 2022, 187, 115899. [Google Scholar] [CrossRef]
  10. Vermeulen, I.E.; Seegers, D. Tried and tested: The impact of online hotel reviews on consumer consideration. Tour. Manag. 2009, 30, 123–127. [Google Scholar] [CrossRef]
  11. Ye, Q.; Law, R.; Gu, B.; Chen, W. The influence of user-generated content on traveler behavior: An empirical investigation on the effects of e-word-of-mouth to hotel online bookings. Comput. Hum. Behav. 2011, 27, 634–639. [Google Scholar] [CrossRef]
  12. Kim, W.G.; Lim, H.; Brymer, R.A. The effectiveness of managing social media on hotel performance. Int. J. Hosp. Manag. 2015, 44, 165–171. [Google Scholar] [CrossRef]
  13. Lee, S.; Ro, H. The impact of online reviews on attitude changes: The differential effects of review attributes and consumer knowledge. Int. J. Hosp. Manag. 2016, 56, 1–9. [Google Scholar] [CrossRef]
  14. Rodríguez-Díaz, M.; Espino-Rodríguez, T.F.; Rodríguez-Díaz, R. A model of market positioning based on value creation and service quality in the lodging industry: An empirical application of online customer reviews. Tour. Econ. 2015, 21, 1273–1294. [Google Scholar] [CrossRef]
  15. Chun, R. Corporate reputation: Meaning and measurement. Int. J. Manag. Rev. 2005, 7, 91–109. [Google Scholar] [CrossRef]
  16. Estárico, E.H.; Medina, L.F.; Marrero, S.M. Una aproximación a la reputación en línea de los establecimientos hoteleros españoles. Pap. Tur. 2012, 52, 63–88. [Google Scholar]
  17. Varini, K.; Sirsi, P. Social media and revenue management; where should the two meet? J. Technol. Manag. Grow. Econ. 2012, 3, 33–46. [Google Scholar] [CrossRef] [Green Version]
  18. Ye, Q.; Li, H.; Wang, Z.; Law, R. The influence of hotel price on perceived service quality and value in e-tourism: An empirical investigation based on online traveler reviews. J. Hosp. Tour. Res. 2014, 38, 23–39. [Google Scholar] [CrossRef]
  19. Rodríguez-Díaz, M.; Espino-Rodríguez, T.F. A methodology for a comparative analysis of the lodging tourism destinations based on online customer review. J. Destin. Mark. Manag. 2018, 8, 147–160. [Google Scholar] [CrossRef]
  20. Torres, E.N. Deconstructing service quality and customer satisfaction: Challenges and directions for future research. J. Hosp. Mark. Manag. 2014, 23, 652–677. [Google Scholar] [CrossRef]
  21. Oliver, R.L. Satisfaction: A Behavioural Perspective on the Consumer; McGraw-Hill: New York, NY, USA, 1997. [Google Scholar]
  22. Salminen, J.; Yoganathan, V.; Corporan, J.; Jansen, B.J.; Jung, S.G. Machine learning approach to auto-tagging online content for content marketing efficiency: A comparative analysis between methods and content type. J. Bus. Res. 2019, 101, 203–217. [Google Scholar] [CrossRef]
  23. Chevalier, J.A.; Mayzlin, D. The effect of word of mouth on sales: Online book reviews. J. Mark. Res. 2006, 43, 345–354. [Google Scholar] [CrossRef] [Green Version]
  24. Reimer, T.; Benkenstein, M. When good WOM hurts and bad WOM gains: The effect of untrustworthy online reviews. J. Bus. Res. 2016, 69, 5993–6001. [Google Scholar] [CrossRef]
  25. Park, D.H.; Lee, J.; Han, I. The effect of on-line consumer reviews on consumer purchasing intention: The moderating role of involvement. Int. J. Electron. Commer. 2007, 11, 125–148. [Google Scholar] [CrossRef]
  26. Plotkina, D.; Munzel, A. Delight the experts, but never dissatisfy your customers! A multi-category study on the effects of online review source on intention to buy a new product. J. Retail. Consum. Serv. 2016, 29, 1–11. [Google Scholar]
  27. Araque, O.; Corcuera-Platas, I.; Sánchez-Rada, J.F.; Iglesias, C.A. Enhancing deep learning sentiment analysis with ensemble techniques in social applications. Expert Syst. Appl. 2017, 77, 236–246. [Google Scholar] [CrossRef]
  28. Liu, B. Sentiment Analysis: Mining Opinions, Sentiments, and Emotions; Cambridge University Press: Cambridge, UK, 2015. [Google Scholar]
  29. Balducci, B.; Marinova, D. Unstructured data in marketing. J. Acad. Mark. Sci. 2018, 46, 557–590. [Google Scholar] [CrossRef]
  30. Kiritchenko, S.; Zhu, X.; Mohammad, S.M. Sentiment analysis of short informal texts. J. Artif. Intell. Res. 2014, 50, 723–762. [Google Scholar] [CrossRef]
  31. Pang, B.; Lee, L.; Vaithyanathan, S. Thumbs up? Sentiment classification using machine learning techniques. arXiv 2002, preprint. arXiv:cs/0205070. [Google Scholar]
  32. Melville, P.; Gryc, W.; Lawrence, R.D. Sentiment analysis of blogs by combining lexical knowledge with text classification. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, 28 June–1 July 2009; pp. 1275–1284. [Google Scholar]
  33. Wang, S.I.; Manning, C.D. Baselines and bigrams: Simple, good sentiment and topic classification. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju Island, Korea, 8–14 July 2012; Volume 2, pp. 90–94. [Google Scholar]
  34. Lucini, F.R.; Tonetto, L.M.; Fogliattp, F.S.; Anzanello, M.J. Text mining approach to explore dimensions of airlines customer satisfaction using online customer reviews. J. Air Transp. Manag. 2020, 83, 101760. [Google Scholar] [CrossRef]
  35. Alzate, M.; Arce-Urriza, M.; Cebollada, J. Mining the text of online consumer reviews to analyze brand image and brand positioning. J. Retail. Consum. Serv. 2022, 67, 102989. [Google Scholar] [CrossRef]
  36. Palese, B.; Usai, A. The relative importance of service quality dimensions in E-commerce experiences. Int. J. Inf. Manag. 2018, 40, 132–140. [Google Scholar] [CrossRef]
  37. González-Rodríguez, M.R.; Martínez-Torres, R.; Toral, S. Post-visit and pre-visit tourist destination image through eWOM sentiment analysis and perceived helpfulness. Int. J. Contemp. Hosp. Manag. 2016, 28, 2609–2627. [Google Scholar] [CrossRef]
  38. Kwok, L.; Xie, K.L.; Richards, T. Thematic framework of online review research. Int. J. Contemp. Hosp. Manag. 2017, 29, 307–354. [Google Scholar] [CrossRef]
  39. Nieto-García, M.; Muñoz-Gallego, P.A.; González-Benito, Ó. Tourists’ willingness to pay for an accommodation: The effect of eWOM and internal reference price. Int. J. Hosp. Manag. 2017, 62, 67–77. [Google Scholar] [CrossRef] [Green Version]
  40. West, P.M.; Broniarczyk, S.M. Integrating multiple opinions: The role of aspiration level on consumer response to critic consensus. J. Consum. Res. 1998, 25, 38–51. [Google Scholar] [CrossRef]
  41. Rogers Everett, M. Diffusion of Innovations, 5th ed.; Free Press: New York, NY, USA, 2003. [Google Scholar]
  42. Dolnicar, S.; Grabler, K.; Grün, B.; Kulnig, A. Key drivers of airline loyalty. Tour. Manag. 2011, 32, 1020–1102. [Google Scholar] [CrossRef] [Green Version]
  43. Calisir, N.; Basak, E.; Calisir, F. Key drivers of passenger loyalty: A case of Frankfurt–Istanbul flights. J. Air Transp. Manag. 2016, 53, 211–217. [Google Scholar] [CrossRef]
  44. Chow, C.K.W. On-time performance, passenger expectations, and satisfaction in the Chinese airline industry. J. Air Transp. Manag. 2015, 47, 39–47. [Google Scholar] [CrossRef]
  45. Park, J.W.; Robertson, R.; Wu, C.L. The effect of airline service quality on passengers’ behavioural intentions: A Korean case study. J. Air Transp. Manag. 2004, 10, 435–439. [Google Scholar] [CrossRef]
  46. Guo, Y.; Barnes, S.J.; Jia, Q. Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent dirichlet allocation. Tour. Manag. 2017, 59, 467–483. [Google Scholar] [CrossRef] [Green Version]
  47. Forgas, S.; Moliner, M.A.; Sánchez, J.; Palau, R. Antecedents of airline passenger loyalty: Low-cost versus traditional airlines. J. Air Transp. Manag. 2010, 16, 229–233. [Google Scholar] [CrossRef]
  48. O’Connell, J.F.; Williams, G. Passengers’ perceptions of low cost airlines and full service carriers: A case study involving Ryanair, Aer Lingus, Air Asia and Malaysia Airlines. J. Air Transp. Manag. 2005, 11, 259–272. [Google Scholar] [CrossRef] [Green Version]
  49. Truong, D.; Pan, J.Y.; Buaphiban, T. Low cost carriers in Southeast Asia: How does ticket price change the way passengers make their airline selection? J. Air Transp. Manag. 2020, 86, 101836. [Google Scholar] [CrossRef]
  50. Basuroy, S.; Chatterjee, S.; Ravid, S.A. How critical are critical reviews? The box office effects of film critics, star power, and budgets. J. Mark. 2003, 67, 103–117. [Google Scholar] [CrossRef]
  51. Hussain, R.; Al Nasser, A.; Hussain, Y.K. Service quality and customer satisfaction of a UAE-based airline: An empirical investigation. J. Air Transp. Manag. 2015, 42, 167–175. [Google Scholar] [CrossRef]
  52. Tahanisaz, S. Evaluation of passenger satisfaction with service quality: A consecutive method applied to the airline industry. J. Air Transp. Manag. 2020, 83, 101764. [Google Scholar] [CrossRef]
  53. Sezgen, E.; Mason, K.J.; Mayer, R. Voice of airline passenger: A text mining approach to understand customer satisfaction. J. Air Transp. Manag. 2019, 77, 65–74. [Google Scholar] [CrossRef]
  54. Bellizzi, M.G.; Eboli, L.; Forciniti, C.; Mazzulla, G. Passengers’ Expectations on Airlines’ Services: Design of a Stated Preference Survey and PreliminaryOutcomes. Sustainability 2020, 12, 4707. [Google Scholar] [CrossRef]
  55. Farooq, M.S.; Salam, M.; Fayolle, A.; Jaafar, N.; Ayupp, K. Impact of service quality on customer satisfaction in Malaysia airlines: A PLS-SEM approach. J. Air Transp. Manag. 2018, 67, 169–180. [Google Scholar] [CrossRef]
  56. Park, S.; Lee, J.S.; Nicolau, J.L. Understanding the dynamics of the quality of airline service attributes: Satisfiers and dissatisfiers. Tour. Manag. 2020, 81, 104163. [Google Scholar] [CrossRef]
  57. Noviantoro, T.; Huang, J.P. Investigating airline passenger satisfaction: Data mining method. Res. Transp. Bus. Manag. 2022, 43, 100726. [Google Scholar] [CrossRef]
  58. Yoon, M.G.; Lee, H.Y.; Song, Y.S. Linear approximation approach for a stochastic seat allocation problem with cancellation & refund policy in airlines. J. Air Transp. Manag. 2012, 23, 41–46. [Google Scholar]
  59. Dalalah, D.; Ojiako, U.; Chipulu, M. Voluntary overbooking in commercial airline reservations. J. Air Transp. Manag. 2020, 86, 101835. [Google Scholar] [CrossRef]
  60. Ma, J.; Tse, Y.K.; Wang, X.; Zhang, M. Examining customer perception and behaviour through social media research–An empirical study of the United Airlines overbooking crisis. Transp. Res. Part E Logist. Transp. Rev. 2019, 127, 192–205. [Google Scholar] [CrossRef]
  61. Seo, E.J.; Park, J.W. A study on the effects of social media marketing activities on brand equity and customer response in the airline industry. J. Air Transp. Manag. 2018, 66, 36–41. [Google Scholar] [CrossRef]
  62. Boubker, O.; Naoui, K. Factors affecting airline brand love, passengers’ loyalty, and positive word-of-mouth. A case study of Royal Air Maroc. Case Stud. Transp. Policy 2022, 10, 1388–1400. [Google Scholar] [CrossRef]
  63. Singh, B. Predicting airline passengers’ loyalty using artificial neural network theory. J. Air Transp. Manag. 2021, 94, 102080. [Google Scholar] [CrossRef]
  64. Koklic, M.K.; Kukar-Kinney, M.; Vegelj, S. An investigation of customer satisfaction with low-cost and full-service airline companies. J. Bus. Res. 2017, 80, 188–196. [Google Scholar] [CrossRef]
  65. Sandada, M.; Matibiri, B. An investigation into the impact of service quality frequent flier programs and safety perception on satisfaction and customer loyalty in the airline industry in Southern Africa. South East Eur. J. Econ. Bus. 2016, 11, 41–53. [Google Scholar] [CrossRef] [Green Version]
  66. Ali, F.; Dey, B.L.; Filieri, R. An assessment of service quality and resulting customer satisfaction in Pakistan International Airlines: Findings from foreigners and overseas Pakistani customers. Int. J. Qual. Reliab. Manag. 2015, 32, 486–502. [Google Scholar] [CrossRef]
  67. Suki, N.M. Passenger satisfaction with airline service quality in Malaysia: A structural equation modeling approach. Res. Transp. Bus. Manag. 2014, 10, 26–32. [Google Scholar] [CrossRef]
  68. Gupta, H. Evaluating service quality of airline industry using hybrid best worst method and VIKOR. J. Air Transp. Manag. 2018, 68, 35–47. [Google Scholar] [CrossRef]
  69. Lee, T.Y.; Bradlow, E.T. Automated marketing research using online customer reviews. J. Mark. Res. 2011, 48, 881–894. [Google Scholar] [CrossRef]
  70. Anderson, M.; Magruder, J. Learning from the crowd: Regression discontinuity estimates of the effects of an online review database. Econ. J. 2012, 122, 957–989. [Google Scholar] [CrossRef]
  71. Wang, H.; Lu, Y.; Zhai, C. Latent aspect rating analysis on review text data: A rating regression approach. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 24–28 July 2010; pp. 783–792. [Google Scholar]
  72. Büschken, J.; Allenby, G.M. Sentence-based text analysis for customer reviews. Mark. Sci. 2016, 35, 953–975. [Google Scholar] [CrossRef] [Green Version]
  73. Zhang, Z.; Zhang, Z.; Law, R. Positive and negative word of mouth about restaurants: Exploring the asymmetric impact of the performance of attributes. Asia Pac. J. Tour. Res. 2014, 19, 162–180. [Google Scholar] [CrossRef]
  74. Lerman, K.; Blair-Goldensohn, S.; McDonald, R. Sentiment summarization: Evaluating and learning user preferences. In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), Athens, Greece, 30 March–3 April 2009; pp. 514–522. [Google Scholar]
  75. Ganu, G.; Elhadad, N.; Marian, A. Beyond the stars: Improving rating predictions using review text content. In Proceedings of the WebDB 2009, Providence, RI, USA, 28 June 2009; Volume 9, pp. 1–6. [Google Scholar]
  76. Liu, X.; Singh, P.V.; Srinivasan, K. A structured analysis of unstructured big data by leveraging cloud computing. Mark. Sci. 2016, 35, 363–388. [Google Scholar] [CrossRef]
  77. Titov, I.; McDonald, R. A joint model of text and aspect ratings for sentiment summarization. In Proceedings of the ACL-08: HLT, Columbus, OH, USA, 20 June 2008; pp. 308–316. [Google Scholar]
  78. Dun, Y.; Wang, N.; Wang, M.; Hao, T. Revealing learner inter ests through topic mining from question-answering data. Int. J. Distance Educ. Technol. (IJDET) 2017, 15, 18–32. [Google Scholar] [CrossRef]
  79. Insua Yañez, A. Sistema Deep Learning para el Análisis de Sentimientos en Opiniones de Productos para la Ordenación de Resultados de un Buscador Semántico; Universidad de La Coruña: A Coruña, Spain, 2019. [Google Scholar]
  80. McCallum, A.; Nigam, K. A comparison of event models for naive bayes text classification. In Proceedings of the AAAI-98 Workshop on Learning for Text Categorization, Menlo Park, CA, USA, 27 July 1998; Volume 752, pp. 41–48. [Google Scholar]
  81. Yang, Y.; Liu, X. A re-examination of text categorization methods. In Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA, USA, 15–19 August 1999; pp. 42–49. [Google Scholar]
  82. Berger, A.; Della Pietra, S.A.; Della Pietra, V.J. A maximum entropy approach to natural language processing. Comput. Linguist. 1996, 22, 39–71. [Google Scholar]
  83. Freund, Y.; Schapire, R.E. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 1995, 55, 119–139. [Google Scholar] [CrossRef] [Green Version]
  84. Quinlan, J. C4.5: Programs for Machine Learning; Morgan Kaufmann Publisher: San Mateo, CA, USA, 1993. [Google Scholar]
  85. Zhang, T. Regularized winnow methods. In Advances in Neural Information Processing Systems; 2000; pp. 703–709. Available online: https://proceedings.neurips.cc/paper/2000/file/9a1de01f893e0d2551ecbb7ce4dc963e-Paper.pdf (accessed on 24 June 2022).
  86. Joachims, T. Text categorization with support vector machines: Learning with many relevant features. In Proceedings of the European Conference on Machine Learning, Chemnitz, Germany, 21–23 April 1998; Springer: Berlin/Heidelberg, Germany, 1998; pp. 137–142. [Google Scholar]
  87. Liu, B.; Li, X.; Lee, W.S.; Yu, P.S. Text classification by labeling words. In Proceedings of the 19th National Conference on Artifical Intelligence, San Jose, CA, USA, 25–29 July 2004; Volume 4, pp. 425–430. [Google Scholar]
  88. Xia, R.; Zong, C.; Li, S. Ensemble of feature sets and classification algorithms for sentiment classification. Inf. Sci. 2011, 181, 1138–1152. [Google Scholar] [CrossRef]
  89. Levy, O.; Goldberg, Y. Dependency-based word embeddings. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Baltimore, MD, USA, 22–27 June 2014; Volume 2, pp. 302–308. [Google Scholar]
  90. Kusner, M.; Sun, Y.; Kolkin, N.; Weinberger, K. From word embeddings to document distances. In Proceedings of the International Conference on Machine Learning, Lille, France, 6–11 July 2015; pp. 957–966. [Google Scholar]
  91. Chang, Y.C.; Ku, C.H.; Le Nguyen, D.D. Predicting aspect-based sentiment using deep learning and information visualization: The impact of COVID-19 on the airline industry. Inf. Manag. 2022, 59, 103587. [Google Scholar] [CrossRef]
  92. Dhar, S.; Bose, I. Walking on air or hopping mad? Understanding the impact of emotions, sentiments and reactions on ratings in online customer reviews of mobile apps. Decis. Support Syst. 2022, 113769. [Google Scholar] [CrossRef]
  93. Stamolampros, P.; Dousios, D.; Korfiatis, N. Evaluating domestic bias on airline passengers’ ratings: The moderating effect of cultural value orientation. Int. J. Hosp. Manag. 2020, 87, 102466. [Google Scholar] [CrossRef]
  94. Song, C.; Guo, J.; Zhuang, J. Analyzing passengers’ emotions following flight delays-a 2011–2019 case study on SKYTRAX comments. J. Air Transp. Manag. 2020, 89, 101903. [Google Scholar] [CrossRef]
  95. Sharma, A.; Park, S.; Nicolau, J.L. Testing loss aversion and diminishing sensitivity in review sentiment. Tour. Manag. 2020, 77, 104020. [Google Scholar] [CrossRef]
  96. Tsai, C.F.; Chen, K.; Hu, Y.H.; Chen, W.K. Improving text summarization of online hotel reviews with review helpfulness and sentiment. Tour. Manag. 2020, 80, 104122. [Google Scholar] [CrossRef]
  97. Korfiatis, N.; Stamolampros, P.; Kourouthanassis, P.; Sagiadinos, V. Measuring service quality from unstructured data: A topic modeling application on airline passengers’ online reviews. Expert Syst. Appl. 2019, 116, 472–486. [Google Scholar] [CrossRef] [Green Version]
  98. Punel, A.; Hassan, L.A.H.; Ermagun, A. Variations in airline passenger expectation of service quality across the globe. Tour. Manag. 2019, 75, 491–508. [Google Scholar] [CrossRef]
  99. Siering, M.; Deokar, A.V.; Janze, C. Disentangling consumer recommendations: Explaining and predicting airline recommendations based on online reviews. Decis. Support Syst. 2018, 107, 52–63. [Google Scholar] [CrossRef]
  100. Feldman, R.; Sanger, J. The text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data; Cambridge University Press: Cambridge, UK, 2007. [Google Scholar]
  101. Ruiz de Villa, G. Introducción a Word2vec (Skip Gram Model). 2018. Available online: https://medium.com/@gruizdevilla/introducci%C3%B3n-a-word2vec-skip-gram-model-4800f72c871f (accessed on 24 June 2022).
  102. Masorgo, N.; Mir, S.; Hofer, A.R. Expectations vs Experience: Managing the Adverse Effects of Service Failures on Customer Satisfaction in the Airline Industry. Transp. J. 2022, 61, 231–262. [Google Scholar] [CrossRef]
  103. Punel, A.; Ermagun, A. Using Twitter network to detect market segments in the airline industry. J. Air Transp. Manag. 2018, 73, 67–76. [Google Scholar] [CrossRef]
Figure 1. Machine learning model of significant labels in online customer reviews.
Figure 1. Machine learning model of significant labels in online customer reviews.
Sustainability 14 09183 g001
Table 2. Cluster analysis of rating.
Table 2. Cluster analysis of rating.
ClusterRatingTotal
12345
1Frequency153061239253598
% File2.5%5.0%10.2%40.0%42.3%100.0%
2Frequency153777308163600
% File2.5%6.2%12.8%51.3%27.2%100.0%
3Frequency403440114129
% File31.0%26.4%31.0%8.5%3.1%100.0%
4Frequency3041137244165617
% File4.9%6.6%22.2%39.5%26.7%100.0%
5Frequency41237485110966013334
% File12.4%11.2%25.5%32.9%18.0%100.0%
TotalTotal5125161166189811865278
% Total file9.7%9.8%22.1%36.0%22.5%100.0%
Pearson Chi-squared: 565.475. df: 16. Significance: 0.000.
Table 3. Managerial recommendation for Iberia.
Table 3. Managerial recommendation for Iberia.
IBERIAGroup1 (The quality of service–inflight experience)
In general, there is an excellence satisfaction level with Iberia’ services in the reviews, however the company needs to redouble its efforts to improve the in-flight entertainment experience for passengers.
Group2 (The quality of service–flight punctuality)
Iberia accomplished good results when it comes to punctuality of flight
Group3 (Comfort of seats)
The key labels are (worse and space). Managers have to pay more attention to passenger seat space, design and comfort in order to reduce customer dissatisfaction on the mentioned topic
Group4 (Flight problems: delays, cancellations, reschedule, and overbooking)
Passenger satisfaction with Iberia’ service is at a good level, but the negative reviews are about delay. It is recommended that managers take decisions regarding the time of procedures such as check-in before flight.
Group5 (The quality of service, check-in and boarding, baggage delivering, personnel service)
Most of the negative reviews are about bad quality of service, delays to check-in baggage, and the attitude of the crew staff, which should be re-evaluated. Managers can identify the cause of passenger dissatisfaction with Iberia staff service and training new customer communication to meet up with their needs before and during the flight (check-in, boarding and handling service).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zaki Ahmed, A.; Rodríguez Díaz, M. A Methodology for Machine-Learning Content Analysis to Define the Key Labels in the Titles of Online Customer Reviews with the Rating Evaluation. Sustainability 2022, 14, 9183. https://doi.org/10.3390/su14159183

AMA Style

Zaki Ahmed A, Rodríguez Díaz M. A Methodology for Machine-Learning Content Analysis to Define the Key Labels in the Titles of Online Customer Reviews with the Rating Evaluation. Sustainability. 2022; 14(15):9183. https://doi.org/10.3390/su14159183

Chicago/Turabian Style

Zaki Ahmed, Ayat, and Manuel Rodríguez Díaz. 2022. "A Methodology for Machine-Learning Content Analysis to Define the Key Labels in the Titles of Online Customer Reviews with the Rating Evaluation" Sustainability 14, no. 15: 9183. https://doi.org/10.3390/su14159183

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop