Abstract
In the face of problems such as information overload and the information cocoon resulting from big data, it is a key point of current research to solve the problem of semantic fuzziness of online reviews and improve the accuracy of personalized recommendation algorithms by using online reviews. Based on the advantage of the probabilistic language term set to deal with fuzzy information and the historical data of online hotel reviews, this paper proposes a collaborative filtering recommendation algorithm for hotels. Firstly, the text data of hotel online reviews are crawled by a crawler and processed by jieba and TF-IDF tools. Secondly, the hotel evaluation attribute set is constructed, and the sentiment analysis of the review statements is carried out with the help of the HowNet sentiment dictionary and manual annotation method. The probabilistic language term set is used to classify the data and derive statistics, and the maximum deviation method is used to determine the weight of each attribute. Then, the cosine similarity formula is fused with the modified cosine similarity formula to calculate the similarity and construct the decision matrix. Finally, combined with the historical data of the user’s hotel selection, the hotel recommendation results are generated. This paper collected review data from 10 hotels in Macau from the official “Ctrip” website. The proposed recommendation algorithm model was then applied to process and analyze the data, resulting in the generation of a ranked list of hotel recommendations. To validate the accuracy and effectiveness of this research, the recommendation results were compared with those produced by other algorithms.
Keywords:
hotel recommendation; probabilistic language term set; modified cosine similarity; cosine similarity; online review MSC:
68T09; 94A16
1. Introduction
With the continuous development of Internet technology and the widespread application of e-commerce, people’s consumption patterns have gradually shifted from offline to online. As the pandemic subsides, there is a growing demand for travel, and users can book hotels online through various platforms []. However, the explosive growth of data has led to the increasingly serious problem of “information overload,” making the extraction of useful information from massive data increasingly important []. Online reviews can assist users in hotel selection, but when faced with a large amount of hotel information and online reviews, users often find themselves in a dilemma. Personalized recommendation systems can help users choose suitable hotels by mining user preferences and hotel features. Research has shown that collaborative filtering recommendation algorithms are more suitable and widely used in hotel recommendations [].
Online reviews contain information such as user preferences and hotel features and have strong interpretability. They have been widely used in hotel recommendation algorithms. Determining how to extract user preferences and hotel features from online reviews is crucial to improving the accuracy of hotel recommendation algorithms [,,]. Tao et al. [] conducted an instance analysis of hotel online reviews using the λ-fuzzy measure and the Choquet integral operator, considering the correlation between different features, and generated hotel recommendation rankings for travelers. Wang et al. [] used statistical methods to construct a recommendation index based on online reviews to generate recommendation results. Cui et al. [] built a scenario-based user profile, calculated similarity using the improved P-J correlation coefficient, and proposed a collaborative filtering recommendation algorithm. Qian et al. [] applied uncertain variables and uncertain sets to design a sentiment analysis model, analyzed user preferences, and designed a new nearest-neighbor search method based on the sentiment analysis model to generate recommendation results. Li et al. [] proposed a hybrid approach that used the LSTM (long short-term memory) algorithm to calculate user preference information from online reviews and combined it with matrix decomposition recommendation algorithms to improve recommendation accuracy. Cao et al. [] established a CNN sentiment classifier to classify emotions in online review text, then merged the classification results into user ratings, improving collaborative filtering algorithms.
However, due to the strong semantic ambiguity of online review information and the presence of malicious reviews, the above methods have their limitations. In the field of natural language processing, scholars such as Pang [] have proposed the concept of the probabilistic linguistic term set (PLTS) as an effective tool for handling fuzzy information. PLTS can describe the importance of different evaluation terms and has been used to solve problems such as sentiment analysis, text classification, and information retrieval, and has been widely applied in hotel reviews [], engineering evaluation, logistics management, medical services [], and other fields. Some researchers have combined PLTS with recommendation algorithms. Zhou et al. [] used PLTS to extract user emotions from review information, proposed a VIKOR-based probabilistic linguistic multi-criteria decision-making method, and conducted experimental analysis using movie information recommendations. Chen et al. [] proposed a user-friendly function to transform personalized semantics of online reviews into probability distributions corresponding to user ratings. They constructed a probability language rating matrix and used integrated probabilities to represent the fuzziness of trust, proposing a user collaborative filtering recommendation algorithm based on probability language and dual trust networks. Cui et al. [] used PLTS to aggregate user preferences from online reviews while eliminating ambiguity in online reviews. They calculated similarity using the cosine similarity formula and generated recommendation results. In the above studies, there is room for improvement in the richness of the user sentiment hierarchy extracted from online reviews (using five-granularity linguistic terms), and the similarity calculation formulas used did not consider the influence of the evaluation scale, resulting in limited differentiation of evaluation results.
To address the above issues, this paper proposes a collaborative filtering recommendation algorithm based on a probabilistic linguistic term set (PLTS). When extracting user sentiment from online reviews, a seven-granularity linguistic term is used to more accurately reflect user preferences in online reviews and improve the accuracy of recommendation results. In attribute weight calculation, the maximum deviation method is used to determine attribute weights. In similarity calculation, given that a single cosine similarity calculation tends to be optimistic [], while corrected cosine similarity calculation tends to be pessimistic, a fusion similarity calculation formula is used, taking into account the influence of the evaluation scale, to improve the quality of the recommendation algorithm.
2. Preliminaries
2.1. Probabilistic Language Term Set
In 2016, Pang et al. [] proposed the probabilistic linguistic term set (PLTS), which uses probability distributions to represent different levels of importance of linguistic evaluation. Lin and Xu [] proposed the standardization of PLTS, the distance formula, and other related theories.
Definition 1.
Assuming that is a Linguistic Term Set (LTS) where is a positive integer, then the probabilistic linguistic term set can be defined as:
where represents the th term, is the probability of the th term, and represents the language term with a probability value . represents the number of terms in the probabilistic language term set .
With reference to [], it is assumed that there is a linguistic term set:
If 40% of the users give a “” rating of the hotel, 45% give a “” rating of the hotel, and 15% give a “” rating of the hotel, then there is .
The sum of the term probabilities in PLTS is. When , this PLTS is called the complete probabilistic language term set; when , this PLTS is called the incomplete probabilistic language term set. In PLTS operation or information aggregation, the existence of an incomplete PLTS will lead to the loss of the final information and the inaccuracy of the result, so it is necessary to standardize the incomplete PLTS before the operation or information aggregation. Pang’s standardized formula is as follows:
where .
Definition 2
[]. When the number of terms in two probabilistic language term sets is different, it will increase the difficulty of calculation and comparison. Therefore, it is necessary to standardize the number of terms in the two probabilistic language term sets, fill in the missing terms, and make the number of terms in the two probabilistic language term sets the same. The supplementary process is as follows:
Suppose there are two arbitrary sets of probabilistic language terms and . , and the number of terms for and respectively. When the number of the two is not equal, the missing parts needs to be completed. If , then the language terms are added to , and the added element is the smallest term in , and its probability is 0, and vice versa.
Definition 3
[]. Assume that the probability of two arbitrary linguistic term set and , , , and , the distance and as follows:
where and represent subscripts of the language terms and , respectively, and satisfy the following properties:
- (1)
- ;
- (2)
- .
2.2. Cosine Similarity
Cosine similarity is used to calculate the distance between two vectors, mainly by calculating the cosine value of the angle between two vectors to calculate the difference between them [].
Definition 4
[]. Assume that there exists an arbitrary set of probabilistic language terms , where . Set represents the properties of the hotel, ; then, according to the cosine similarity formula, under the same attribute , the formula for the similarity between two probabilistic language term sets is as follows:
where and represent the respective score sets of hotel and hotel under the attribute , respectively. The cosine similarity satisfies the following properties:
- (1)
- ;
- (2)
- .
2.3. Modified Cosine Similarity
Modified cosine similarity is a variant of cosine similarity. Although cosine similarity can be used to measure the difference between two vectors [], it pays more attention to the relationship between vector angles and is insensitive to the absolute values of specific data such as size and length. Therefore, in 2001, Sarvar et al. [] proposed the concept of modified cosine similarity, and subtracted the average value of corresponding features in the calculation to reduce the problems caused by these deficiencies. Considering the influence of the degree of scoring bias on the score [], the calculation of similarity is more reasonable.
Definition 5
[]. Assume the existence of an arbitrary set of probabilistic language terms , where , Set C represents the properties of the hotel, . Under the same attribute , according to the modified cosine similarity formula, can obtain the similarity between the two probabilistic language term sets ; the formula is as follows:
where and represent the respective average score sets of hotel and hotel , respectively, and the modified cosine similarity satisfies the following properties:
- (1)
- ;
- (2)
- .
3. Materials and Methods
3.1. Model Proposal
With the widespread development and adoption of various travel apps and websites, tourists often refer to evaluations of hotels by other users. They combine these evaluations with their own preferences for hotel attributes such as service, price, and location when assessing and choosing hotels. However, due to time constraints and the abundance of hotel review information, tourists can only access a portion of the information, leading to a lack of comprehensive and objective assessments of hotels. Addressing the challenge of providing valuable information to tourists from vast amounts of data is essential, and recommendation algorithms are a commonly used approach to solve this problem.
Considering the aforementioned issues, current research on hotel recommendation algorithms rarely integrates users’ historical data, such as textual reviews. Furthermore, probabilistic linguistic term sets (PLTSs) exhibit favorable characteristics. They can effectively capture users’ perceptions and intentions from textual data, enabling precise recommendations. Hence, this paper proposes a model that combines PLTS with online reviews to enhance the accuracy of hotel recommendation algorithms. This model consists of three main modules.
Module 1: Sentiment Analysis.
Using the jieba tool for tasks such as word segmentation and part-of-speech tagging, the original review text is processed. Sentiment words and intensity adverbs are analyzed to construct the PLTS for the review text.
Module 2: Attribute Weight Calculation and Similarity Computation.
Based on the PLTS of the review text, attribute weights and similarities are computed to obtain a hotel similarity evaluation matrix.
Module 3: Prediction and Recommendation.
Utilizing the hotel similarity evaluation matrix based on the probabilistic linguistic dataset and incorporating user historical data, the model generates a recommendation list.
3.2. Material and Methods
With the large-scale development and popularization of various tourism software packages and websites, tourists prefer to read other users’ evaluation of hotels and evaluate and choose hotels based on their own preferences for hotel attributes, such as service, price, and transportation. However, due to the limited time and the large amount of hotel review information, tourists can only obtain partial information and lack a comprehensive and objective evaluation of the hotel. Determining how to provide tourists with valuable information from a large number of data is an urgent problem to be solved, and recommendation algorithms are common used to solve this problem.
In this paper, the set will be employed to represent the set of hotels, while the set will represent the hotel attributes that users consider when referencing other users’ review information during the hotel selection process. A large number of online review texts are processed based on the probabilistic language term set, and after obtaining useful information, the system will recommend multiple hotels according to the historical data of hotel selection by tourists. The specific idea is shown in Figure 1.
Figure 1.
Research idea.
3.3. Data Acquisition and Preprocessing
After determining the research content, the data will be crawled and processed to prepare for the subsequent recommendation algorithm. The specific steps are shown in Figure 2.
Figure 2.
Idea of data acquisition and preprocessing.
3.3.1. Data Crawling
In this paper, Python 3.6 is used to crawl the hotel review data of a tourism website in the same period of time, and the set represents the hotel set, that is, , where is the number of hotels. Each hotel is crawled for review information, which is exported to a csv file.
3.3.2. Online Comment Text Processing
After obtaining the review data, the jieba tool performs word segmentation and parts-of-speech tagging processing on the original review text. In addition, we select and refer to the stop word table UTF-8 provided by IKAnalyzer to delete words with little or no value, such as modal words and function words, so as to reduce the interference of stop words on sentence analysis, and the sentences with ambiguous semantics are processed manually. After word segmentation and stop word processing, the words describing hotel attributes and evaluation in the review are obtained. In this paper, represents the word set of the comment statement of the hotel, and represents the word in the word set .
3.3.3. Word Analysis of Hotel Evaluation
- (1)
- Construct word knowledge base
As there is currently no sentiment dictionary specifically for hotel review analysis, this paper developed its own vocabulary for this model, which was constructed by a professional familiar with the web terminology of online hotel reviews, by manually examining the review content. Tourists’ online evaluations of hotels generally involve multiple attributes of the hotel and represent different attitudes, such as the attitude towards hotel facilities and services, as shown in Table 1. By looking at online reviews of various hotels, we can identify the attribute objects evaluated in the reviews and judge their tendency fields. After consulting experts, we finally built a word knowledge base for analyzing hotel reviews [].
Table 1.
Categories of words for hotel evaluation.
- (2)
- Establish standard attribute words
In the model, the attribute words in the reviews reflect the tourists’ attention to some aspects of the hotel when choosing the hotel, but the expression of the attributes is slightly different. Therefore, before the analysis and processing of the attribute words, the word frequency of the reviews is analyzed, the standard attribute words are selected, and the common attribute words are classified. The properties of the hotel are represented by the set , , and is used to represent the set of attribute words involved in the comment of the hotel, . Assuming that represents the common attribute words in the comment, and represents the selected standard attribute words, the similarity between and is calculated according to the calculation method of semantic similarity in the Synonymous Word Forest [], and the common attribute words are transformed into standard attribute words.
- (3)
- Determine sentiment analysis rules
Since the expression of comments is highly flexible, the HowNet sentiment dictionary and manual annotation methods are used for sentiment analysis of comment statements []. In the analysis of hotel emotion words, in addition to the positive and negative emotion analysis, 7-granularity is also adopted to distinguish the degree of emotion. As shown in Table 2 below, the positive and negative of emotion words express the tourists’ positive, neutral, or even negative attitudes towards the hotel, while the degree adverbs describing emotion are mainly used to describe the intensity of the attitude, such as “extremely”, “very” and other degree adverbs.
Table 2.
Reference for emotion analysis and intensity ranking of emotion word classification scale.
In this paper, we construct a more detailed 7-granularity classification of emotion words, transform emotion adverbs into 7-granularity language terms, and explore the intensity of emotion words. Suppose is a 7-granularity size scale set, , where the positive and negative signs of the s subscript represent positive and negative emotions, respectively, represents a neutral attitude, and the number of subscripts represents the degree of attitude. The larger the number, the stronger the attitude. represents “very poor”, represents “very poor”, represents “bad”. represents “general”, represents “good”, represents “very good”, represents “very good”.
In order to make the degree of positive and negative emotions more accurate, the division method follows the following rules:
- (a)
- Positive or negative words without degree adverbs are classified as or ;
- (b)
- Other degree adverbs are uniformly classified as or according to the classification of emotion words;
- (c)
- Emotion intensity words containing “extreme”, “invincible”, “only”, and “very” are classified as or according to the positive and negative direction of emotion words.
3.3.4. Conversion of Online Comments
After the hotel reviews are preprocessed, the data are described and counted using probabilistic language term sets (PLTSs). After data crawling, the attributes in the q review data of each hotel are transformed with language terms, and all attributes and the frequency of language terms are counted. This is represented by a probabilistic language term set, which is divided into the following three steps, as shown in Figure 3:
Figure 3.
Steps for converting comments into a probabilistic language term set.
Step 1: Assume that hotel exists, analyze its review text set , attribute word set , and then transform the emotion words in the evaluation text into 7-granularity language terms according to the rules.
Step 2: Calculate the frequency of each evaluation language term in each attribute word, and count the total frequency of each attribute .
Step 3: Calculate the evaluation term probability of attribute words. The calculation formula is shown in Formula (6). After calculating the of all attribute words, use the probabilistic language term set to describe the evaluation set of the th attribute of the hotel.
3.4. Steps for Recommendation Algorithms
After the comments are converted into a probabilistic language term set, the recommendation algorithm will be constructed. The specific steps are shown in Figure 4.
Figure 4.
Idea of recommendation algorithm.
- Step 1: Construct evaluation matrix
The evaluation matrix of the hotel is constructed to describe the obtained probabilistic language term set. is used to represent the evaluation information about the hotel attribute . , and are integers.
- Step 2: Determine the attribute weights
Each tourist has different needs for hotels, so they will pay different degrees of attention to the attributes of hotels []. In this paper, the maximum deviation method is adopted to determine the attribute weight. In general, the greater the attribute weight, the greater the deviation degree between hotels, and the stronger their attribute differentiation degree []. Assuming that there is a hotel , under the same attribute , the deviation degree between the hotel and other hotels is first calculated, and then the total deviation degree between the hotel and other hotels is calculated. Secondly, the total deviation of attributes in the evaluation matrix is calculated, and the maximum deviation optimization model is constructed. Finally, the Lagrange function is used to solve the problem, and the standardized attribute weights are obtained. Specific calculations are as follows:
Suppose that there is an attribute weight set . According to Formula (3), under attribute , the deviation degree between hotel and other hotels is:
where .
The total deviation degree between hotel and each hotel is:
In the evaluation matrix , the total deviation degree between all attributes is:
The maximum deviation optimization model is constructed as follows:
Then, the Lagrange function is constructed to solve the model:
Finally, the normalized attribute weights are obtained:
- Step 3: Construct the similarity matrix
The method of weighted similarity is used to calculate the similarity between hotels, that is, the similarity of two hotels under the same attributes is first calculated, and then the weight of attributes is added to calculate the weighted similarity, and finally the similarity matrix between hotels is constructed, from which the similarity between the two matrices can be directly observed.
In view of the fact that the cosine similarity calculation results tend to be optimistic [], while the modified cosine similarity calculation results tend to be pessimistic, this paper adopts a compromise and an improved similarity calculation formula, as shown in Formula (13), where α is the adjustment coefficient:
- (1)
- According to Formula (13), calculate the similarity of of two hotels under the same attribute , ,
- (2)
- The weighted similarity between the two hotels is obtained according to the attribute weights.
- (3)
- Construct the hotel similarity matrix M:
- Step 4: Hotel recommendation ranking
According to the similarity matrix obtained in Step 3 and the historical hotel stays of tourists, the hotels that reach the threshold are recommended and sorted according to the similarity size, and the hotel recommendations are made to tourists. Assume that is the hotel where the visitor has previously stayed, is the hotel subscript, and the recommended threshold is . When , the system will recommend the hotel to the tourist; otherwise, the hotel will not be recommended.
4. Case Application
4.1. Data Crawling
In the case analysis, this paper randomly selected 10 hotels in Macau according to their popularity, including five-star, four-star, and three-star hotels. In accordance with the principle of priority of recent reviews, Python was used to crawl 200 pieces of recent online review information, and “hotel name” and “review content” were selected as valid fields for subsequent analysis. Table 3 shows the 10 hotels selected, namely two three-star hotels, three four-star hotels and five five-star hotels. The hotel collection can be used as .
Table 3.
Selected hotels in Macau.
4.2. Definition of Hotel Standard Attributes
Based on the word frequency analysis of reviews and the analysis of historical residents’ concerns about hotels, hotel attribute words can be divided into the following five attribute sets , as shown in Table 4, from which it can be seen that there are five standard attribute words, such as for “service”, for “location”, etc.
Table 4.
Standard hotel property categories.
4.3. Data Preprocessing
Firstly, word segmentation of the online review data retrieved by jieba is performed, and then the stop words are deleted to obtain the attribute words, emotion words, and degree adverbs of emotion in the review, and the attribute set is formed. Then, according to the method of synonym merging, attribute words are converted into standard attribute words, and emotion words and emotion degree adverbs are analyzed and converted into 7-granularity language terms. The number of occurrences of each evaluation term in each standard attribute word is counted, and the total number of occurrences of each attribute is counted. For example, for the hotel , Table 5 shows the analysis results.
Table 5.
Statistics of the occurrence of evaluation terms of hotel h1.
Thus, its probabilistic language term set is represented as follows:
4.4. Hotel Recommendation Result Generation
Step 1: After the preliminary data processing, the evaluation matrix of 10 hotels can be constructed according to the probabilistic language term set.
Step 2: First, the probabilistic language term set is standardized, and then the maximum deviation method is used to calculate the weight of each attribute.
For example, for hotel and , for the same standard properties, the standardized treatments of and are as follows:
According to Formula (7), under the standard attribute , the difference between hotel and is . In the same way, the mutual difference between different attributes of each hotel can be calculated, and finally the deviation degree of each attribute can be obtained: , , , , . Then, according to Formula (8), the total deviation degree can be obtained as follows:
The maximum deviation degree optimization model is constructed and solved using the Lagrange function . The normalized attribute weights are as follows:
Step 3: Calculate the similarity between pairwise matrices and construct the similarity matrix.
Taking hotels and as an example, the cosine similarity and modified cosine similarity of the two hotels can be calculated according to Formulas (4) and (5), and then the final similarity of the two hotels can be calculated according to Formula (13) (Table 6).
Table 6.
Similarity calculation process of hotel and for the same attribute.
Based on the formula and attribute weights, the final similarity between the two hotels is 0.8650. Similarly, the similarity between the two hotels is calculated, and the similarity matrix is constructed.
Step 4: Set a threshold to generate the hotel recommendation order based on the historical check-in data of the passenger. If there are two hotels with the same similarity, the recommendation order is the same.
Assuming that the preset recommendation threshold is , if the passenger has used hotel before, the sequence of hotel recommendation is , and the same can be obtained in other cases.
Assuming that the preset recommendation threshold is , if the passenger has used hotel before, the sequence of hotel recommendation is: , and the same can be obtained in other cases. For hotel , since the similarity between and is the same, hotel and hotel are recommended in the same order. Similarly, for hotel , there is the same recommendation ranking situation.
5. Comparative Analysis
In this paper, the hotel recommendation algorithm based on the probabilistic language term set can make use of the online review information of users, intelligently provide users with hotel recommendations that match their preferences, and at the same time sort multiple recommended hotels. In order to highlight the excellent performance of this algorithm, we conducted an in-depth analysis and comparison with the recommended algorithms proposed by Cui et al. [] and Zhou et al. [], as shown in Table 7. In order to prove the feasibility of the algorithm, we performed the calculations. According to the law, for the hotel, the recommended hotels are ranked. To rank the recommended hotels, , the results are consistent with the outcomes obtained using the algorithm proposed in this paper. According to the attribute weight formula () and the VIKOR method criteria from the literature, the recommended hotel ranking is , which aligns with the results obtained by the algorithm presented in this paper. Except for hotel , the optimal hotel recommendations are the same as those in this paper, effectively validating the feasibility of the proposed method.
Table 7.
Algorithm comparison.
As observed from the table above, this paper employs the maximum deviation method, consistent with Cui’s literature algorithm, for handling weights. Meanwhile, Zhou’s study primarily relies on the frequency of occurrence of keywords, which can be easily influenced by the quantity of keywords. Regarding attribute similarity, this paper combines cosine similarity with modified rectangular similarity, while Cui’s article relies solely on cosine similarity. Concerning the sorting method, this paper primarily ranks the hotels based on their similarity to generate a recommendation list. In terms of the recommendation approach, this paper combines user’s historical hotel selection data with the generated hotel similarity list for recommendations. In summary, the method proposed in this paper exhibits superiority.
6. Conclusions and Future Research Directions
In response to the application challenges of a large volume of hotel online review data in recommendations, this paper employs a probabilistic language terminology set to express hotel attributes and user preferences within online reviews. It uses a seven-level granularity language terminology to more precisely capture the fuzzy emotional semantics in online reviews and proposes a hotel collaborative filtering recommendation algorithm based on PLTS. We utilize jieba for text segmentation, conduct sentiment analysis using the HowNet sentiment dictionary, and effectively transform the review terms into PLTS using the TF-IDF statistical method. We compile probability language information by constructing an evaluation matrix, determine attribute weights using the maximum deviation method, calculate the weighted similarity between hotels using an improved fusion similarity computation formula, and present the recommendation results. Via a case analysis with examples from Ctrip, the effectiveness of the recommendation algorithm proposed in this paper is validated. Furthermore, via a comparative analysis with other methods, the relative advancement of the recommendation approach presented in this paper is demonstrated.
Limitation and Future Work: The data acquisition and attributes in the case study are relatively small, and the selection of hotel attributes relies on subjective experience. In the future, these shortcomings will be addressed; we plan to extract hotel attributes from online reviews based on the model, and use larger-scale data to optimize the algorithm.
Author Contributions
Conceptualization, E.W.; Methodology, E.W. and Y.C.; Software, Y.L.; Investigation, Y.C. and Y.L.; Data curation, Y.C.; Writing—original draft, Y.C.; Writing—review & editing, E.W.; Project administration, E.W. All authors have read and agreed to the published version of the manuscript.
Funding
This study was supported by the Research on Tourism Product Recommendation Algorithm in the Key Fields of Guangdong General Universities (Digital Economy; No. 2021ZDZX3010).
Data Availability Statement
Both data and algorithms are listed in the article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Government of the Macao Special Administrative Region. Bureau of Statistics and Census [DB/OL]. Available online: https://www.dsec.gov.mo/zh-CN/Statistic?id=401 (accessed on 10 January 2023).
- Shokeen, J.; Rana, C. A study on features of social recommender systems. Artif. Intell. Rev. 2020, 53, 965–988. [Google Scholar] [CrossRef]
- Liu, S.; Chen, Z.; Li, X. Time-semantic-aware Poisson tensor factorization approach for scalable hotel recommendation. Inf. Sci. 2019, 504, 422–434. [Google Scholar] [CrossRef]
- Linden, G.; Smith, B.; York, J. Amazon.com recommendations: Item-to-item collaborative filtering. IEEE Internet Comput. 2003, 7, 76–80. [Google Scholar] [CrossRef]
- Shi, Y.; Larson, M.; Hanjalic, A. Collaborative filtering beyond the user-item matrix: A survey of the state of the art and future challenges. ACM Comput. Surv. (CSUR) 2014, 47, 1–45. [Google Scholar] [CrossRef]
- Zhu, P.; Cheng, L.; Gao, C.; Wang, Z.; Li, X. Locating multi-sources in social networks with a low infection rate. IEEE Trans. Netw. Sci. Eng. 2022, 9, 1853–1865. [Google Scholar] [CrossRef]
- Tao, L.; You, T.; Yuan, Y. Hotel Selection Based on Hotel Feature Information and Online Review Information. J. Northeast Univ. Nat. Sci. 2019, 40, 1667–1672. [Google Scholar]
- Wang, S.; Wu, S. Personalized Attraction Recommendation Using Online Reviews. J. Huaqiao Univ. Nat. Sci. 2018, 39, 467–472. [Google Scholar]
- Cui, C.; Wang, X.; Li, W. Research on Contextual Environment-Based User Profile Tourism Product Recommendation Algorithm. Pract. Underst. Math. 2019, 49, 122–131. [Google Scholar]
- Lihua, S. Advanced collaborative filtering recommendation model based on sentiment analysis of online review. J. Shandong Univ. Eng. Sci. 2019, 49, 47–54. [Google Scholar]
- Li, X.J.; Deng, G.S.; Wang, X.Z.; Wu, X.L.; Zeng, Q.W. A hybrid recommendation algorithm based on user comment sentiment and matrix decomposition. Inf. Syst. 2023, 117, 102244. [Google Scholar] [CrossRef]
- Cao, H.; Kang, J. Study on improvement of recommendation algorithm based on emotional polarity classification. In Proceedings of the 2020 5th International Conference on Computer and Communication Systems (ICCCS), Shanghai, China, 15–18 May 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 182–186. [Google Scholar]
- Pang, Q.; Wang, H.; Xu, Z. Probabilistic linguistic term sets in multi-attribute group decision making. Inf. Sci. 2016, 369, 128–143. [Google Scholar] [CrossRef]
- Zhang, Y.; Liang, D.; Xu, Z. Cross-platform hotel evaluation by aggregating multi-website consumer reviews with probabilistic linguistic term set and Choquet integral. Ann. Oper. Res. 2022, 1–35. [Google Scholar] [CrossRef] [PubMed]
- Celotto, A.; Loia, V.; Senatore, S. Fuzzy linguistic approach to quality assessment model for electricity network infrastructure. Inf. Sci. 2015, 304, 1–15. [Google Scholar] [CrossRef]
- Zhou, H.; Ma, H.; Liu, J. Research on Film and Television Recommendation Algorithm Integrating Sentiment Analysis and Probabilistic Language. Inf. Stud. Theory Pract. 2020, 43, 180–186. [Google Scholar]
- Chen, S.; Zhang, C.; Zeng, S.; Wang, Y.; Su, W. A probabilistic linguistic and dual trust network-based user collaborative filtering model. Artif. Intell. Rev. 2023, 56, 429–455. [Google Scholar] [CrossRef]
- Cui, C.; Wang, X.; Li, W. Research on Tourism Attraction Recommendation Algorithm Based on User Online Reviews. Syst. Sci. Math. 2020, 40, 1103. [Google Scholar]
- Cui, C.; Wei, M.; Che, L.; Wu, S.; Wang, E. Hotel recommendation algorithms based on online reviews and probabilistic linguistic term sets. Expert Syst. Appl. 2022, 210, 118503. [Google Scholar] [CrossRef]
- Lin, M.; Xu, Z. Probabilistic linguistic distance measures and their applications in multi-criteria group decision making. Soft Comput. Appl. Group Decis.-Mak. Consens. Model. 2018, 357, 411–440. [Google Scholar]
- Chen, Z.; Huang, C.; Zhou, R.; Wang, Z.; Cao, F. Crowdsourcing Task Recommendation Algorithm Based on Collaborative Filtering. Inf. Technol. Informatiz. 2021, 8, 119–121. [Google Scholar]
- Luo, S.Z.; Zhang, H.Y.; Wang, J.Q.; Li, L. Group decision-making approach for evaluating the sustainability of constructed wetlands with probabilistic linguistic preference relations. J. Oper. Res. Soc. 2019, 70, 2039–2055. [Google Scholar] [CrossRef]
- Sarwar, B.; Karypis, G.; Konstan, J.; Riedl, J. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th international conference on World Wide Web, Hong Kong, China, 1–5 April 2001; pp. 285–295. [Google Scholar]
- Chu, H.; Liu, Q.; Mu, C. A Collaborative Filtering Recommendation Algorithm Improved for Modified Cosine Similarity. J. Yantai Univ. Nat. Sci. Eng. Ed. 2021, 34, 330–336. [Google Scholar]
- Jiang, L.; Zhang, Q. Research on Personalized Recommendation Strategy Based on Sentiment Analysis of Reviews—A Case Study of Douban Movie Reviews. Inf. Stud. Theory Pract. 2017, 40, 99–104. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).