Next Article in Journal
Design Optimization of Alloy Wheels Based on a Dynamic Cornering Fatigue Test Using Finite Element Analysis and Multi-Additional Sampling of Efficient Global Optimization
Next Article in Special Issue
Maximum Coverage by k Lines
Previous Article in Journal
Classification of Genus Three Zero-Divisor Graphs
Previous Article in Special Issue
Map Construction and Path Planning Method for Mobile Robots Based on Collision Probability Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Tour Route Recommendation Model by the Improved Symmetry-Based Naive Bayes Mining and Spatial Decision Forest Search

1
College of Computer Science, Sichuan University, Chengdu 610065, China
2
Institute of Geospatial Information, PLA Strategic Support Force Information Engineering University, Zhengzhou 450001, China
*
Authors to whom correspondence should be addressed.
Symmetry 2023, 15(12), 2168; https://doi.org/10.3390/sym15122168
Submission received: 30 October 2023 / Revised: 24 November 2023 / Accepted: 27 November 2023 / Published: 6 December 2023
(This article belongs to the Special Issue Computer Science and Symmetry/Asymmetry: Feature Papers)

Abstract

:
In machine learning, classifiers have the feature of constant symmetry when performing the attribute transformation. In the research field of tourism recommendation, tourists’ interests should be mined and extracted by the symmetrical transformation in founding the training dataset and creating the classifier, so as to ensure that the recommendation results meet the individualized interests and needs. In this paper, by applying the feature of constant symmetry in the classifier and analyzing the research background and existing problems of POI tour routes, we propose and construct a tour route recommendation model using improved symmetry-based Naive Bayes mining and spatial decision forest search. First, the POI natural attribute classification model is constructed based on text mining to classify the natural attributes of the destination POIs. Second, the destination POI recommendation model based on the improved symmetry-based Naive Bayes mining and decision forest algorithm is constructed, outputting POIs that match tourists’ interests. On this basis, the POI tour route recommendation model based on a spatial decision tree algorithm is established, which outputs the optimal tour route with the lowest sub-interval cost and route interval cost. Finally, the validation and comparative experiments are designed to output the optimal POIs and tour routes by using the proposed algorithms, and then the proposed algorithm is compared with the commonly used route planning methods, GDM and 360M. Experimental results show that the proposed algorithm can reduce travel costs by 4.56% and 10.36%, respectively, on the optimal tour route compared to the GDM and 360M and by 2.94% and 8.01%, respectively, on the suboptimal tour route compared to the GDM and 360M, which verifies the advantages of the proposed algorithm over the traditional route planning methods.

1. Introduction

1.1. Research Background and Problem Discussion

POI tour route recommendation is a hot topic in smart tourism research, with the goal of recommending popular tour routes to tourists and reducing tourists’ work and effort to plan the tour routes by themselves. It tries to recommend the most satisfying POIs and tour routes to tourists in order to improve their travel experience and satisfaction. At present, the traditional methods used for POI and tour route recommendation include the user-based collaborative filtering method, the item-based collaborative filtering method, and so on [1,2]. These traditional recommendation methods have certain drawbacks. The user-based collaborative filtering method is an approximate-style recommendation method that mainly searches for historical tourists with similar interests to the current tourists and directly recommends POIs and routes that the historical tourists have once visited to the current tourists. The item-based collaborative filtering method is to judge the preferences of the current tourists, match the similarity of POIs that the tourists have once visited, and recommend similar POIs to the tourists. Both of these methods are fuzzy recommendations with uncertainty. The user-based collaborative filtering method focuses on the similarity between the users and establishes the relationship between the users by judging their feature attributes. For tourism recommendation, it directly recommends POIs that the historical users prefer and have once visited to the current tourists. The drawback is that the recommended POIs cannot fully represent the interests of current tourists. The results may have a significant deviation from the current tourists’ interests, which could reduce their satisfaction. The item-based collaborative filtering method usually uses scoring methods to establish a user preference measurement model for items. Users determine their preferences for items through scoring, rather than measuring their interests based on their preference for item feature attributes. For tourism recommendations, searching for POIs with similar scoring to those once visited by tourists is mainly based on tourists’ subjective perceptions of POIs. It does not represent that the tourists are fully familiar with POI feature attributes, nor does it represent an objective evaluation of tourists’ ability to match their interests with POI feature attributes. Therefore, directly using high or low scoring as the criteria for recommending POIs is also a fuzzy recommendation with uncertainty. As to the POI tour route recommendation, the traditional methods directly recommend routes visited by historical tourists to current tourists, which is also a fuzzy recommendation method with uncertainty that cannot fully match tourists’ interests [3,4].

1.2. Problem Solving Methods

By analyzing the existing problems in traditional recommendation methods, it can be concluded that in order to recommend POIs and tour routes that match tourists’ interests, the following problems must be solved, and it is necessary to obtain the POI recommendations from the perspective of mining tourists’ interests and POI feature attributes.
First, obtain the tourists’ interests from the perspective of tourist interest mining. The acquisition of tourists’ interests should be based on POI feature attributes, and the tourist interest demand vector should be determined by mining POI feature attributes rather than simply obtaining POI evaluating scores. The mining of tourist interest data requires subdividing POI feature attributes and determining the weight for each feature attribute. By calculating and judging the weight, the degree of tourist interest in each feature attribute could be measured.
Second, the division of POIs in destination cities should match tourists’ interests and conform to the symmetry feature. There are significant differences in the interests and needs of different tourists, resulting in discrepant final division results. In the aspect of symmetry-based data mining, the POIs that a tourist will visit should have the symmetrical and neatest features of the preferred POIs he has visited, because the preferred POIs could symmetrically reflect the tourist’s interests. Thus, it is possible to establish a POI classification model based on symmetry-based historical interests by obtaining and mining the POIs he has once visited. The classification model consists of two parts: one is the classification of the POI natural attributes, and the other is the classification of the POI tourism attributes. Among them, the classification of POI natural attributes is based on the natural features and functions of POIs, while the classification of POI tourism attributes is based on tourists’ preferences for the previously visited attractions. By training a classification model, it is possible to establish a tourism destination POI classification algorithm based on tourists’ interests and accurately classify tourism destination POIs.
Third, by using the constructed POI classification model, it is possible to establish a spatial decision forest algorithm for recommending POIs. The purpose is to obtain the recommendation degree of each POI based on the tourists’ interests and the POI classification algorithm. The recommendation degrees are used as nodes to establish a spatial decision tree and spatial decision forest and to recommend the POIs that best match tourists’ interests. Fourth, in response to the POI tour route recommendation problem, an optimal tour route search algorithm should be constructed based on the recommended POIs so that the final searched route contains the recommended POIs and has the lowest spatial cost, allowing tourists to meet their travel interests while minimizing the travel cost and improving their satisfaction.

2. Related Works

There are many previously studied methods for tourism POI and tour route recommendation. Lin et al. [5] established a POI recommendation model based on generalized regression neural networks, which deeply mines the low-level feature information of the model and better trains the model parameters on high-dimensional sparse datasets so that the recommendation results are no longer overly generalized. Bin et al. [6] proposed a neural multi-scenario modeling framework that learns the tourism characteristics of tourists and tourist attractions by modeling multiple tourism scenarios. It recommends personalized attractions by calculating the similarity between tourists and candidate attractions in the potential tourism spaces. Li et al. [7] established a POI recommendation method based on stratified sampling statistics and singular value decomposition. The stratified sampling statistics are used to obtain the user preferences for different group attributes, and the singular value decomposition method is used to predict the user scores. These two methods are combined to recommend POIs for users. Mizutani et al. [8] designed a tourist attraction recommendation system that takes into account the changes in user needs. The system integrates Web GIS (geographic information system), pairing system, evaluation system, and recommendation system into one system and connects with external SNS (social network services: Twitter and Facebook) to recommend qualified POIs for users. Huang et al. [9] proposed a seasonal perception tourist attraction recommendation method based on seasonal theme preferences and dual trust relationships, which captures seasonal preferences from tourists’ historical travel behaviors and obtains the evaluation data trusted by users, thereby predicting POI scores for tourists and making recommendations. Kethorn et al. [10] established a POI recommendation method based on tourist check-in data, which is based on the historical behavioral data of tourists. Remigijus et al. [11] established a greedy genetic algorithm based on travel constraints and personalized scorings of tourists to search for the optimal POIs and recommend them to tourists. Zhang et al. [12] established a POI recommendation method based on the relationship between users’ preferred images and POIs, using the travel images as a data source for mining users’ travel preferences. By establishing the relationship between the users’ travel preferences and the tourism images through a European algorithm, POIs are recommended for tourists. Liang et al. [13] introduced the long-term and short-term memory networks for feature extraction of contextual information, tourist attraction information, tourist comments, etc., to analyze users’ online behaviors and long-term interest preferences and recommend POIs for users. Han et al. [14] proposed a tourism POI recommendation model based on the geotagged photos, which integrates the spatial, temporal, and visual embedding methods to cluster the tourism photos and recommend POIs for tourists.
According to the analysis of the related works, the current tourism POI and route recommendation methods mainly focus on the following points: One is to improve and optimize the recommendation algorithm to improve its accuracy, recall, and other indicators. It aims to improve the performance of recommendation algorithms, with the goal of designing algorithms with better performance and recommendation efficiency, but it ignores the tourists’ personalized interests and the natural and tourism attributes of POIs. Second, the dependence on historical behavioral data of tourists is still too high, with a bias towards obtaining tourists’ interests from historical behavioral data such as check-in locations, photos, and travel trajectories. This method of obtaining interests is not precise enough, since there is neither mining on POI feature attributes nor recommending POIs from the perspective of tourists’ interests on POI feature attributes, resulting in inaccurate recommendation results. Third, the recommendation on POI tour routes is based on historical tour trajectories and photos, and there is a lack of research on the optimal route searching method under the current geographical and spatial constraints of the destination city. There is no concern for the cost incurred by tourists traveling along the route; thus, the recommended POI tour routes are detached from real-world tourism scenarios, making it difficult to meet the interests and demands of tourists.
Based on the research background, the discussed problem-solving methods, and the related works, this paper constructs a tour route recommendation model using improved symmetry-based Naive Bayes mining and spatial decision forest search. The structure and main contributions of the work are as follows: Figure 1 shows the overall framework and flowchart of the research content.
(1)
The POI natural attribute classification model based on text mining is constructed. Set up a text mining algorithm to classify the natural attributes of tourist destination POIs, making the process of recommending POIs rely on each natural attribute classification. Searching for the POIs with the highest label weights from the classified POIs can improve the accuracy of the recommending results.
(2)
A method model for mining tourist interests is established. The model sets the POIs that the tourists have once visited as a source of interest. Tourists’ preferences for POI tourism attributes are obtained by tourists’ determining and judging once-visited POIs. This method can obtain tourists’ interests and preferences for POIs from the perspective of tourism attributes rather than depending on the POIs provided by historical tourists or subjective scoring provided by tourists, and it has higher accuracy.
(3)
The destination POI recommendation model based on the improved symmetry-based Naive Bayes mining and spatial decision forest algorithms is established. The POIs provided by tourists as well as their interest data are used to construct the improved symmetry-based Naive Bayes mining model. Then the destination POIs are classified based on tourists’ interest tendencies, so that the POIs that belong to different natural attribute classifications could be divided into different tourism attribute classifications by tourists’ preferences, and finally the POIs with the highest recommendation degrees are precisely recommended.
(4)
The POI tour route recommendation model based on a spatial decision tree search algorithm is established. By constructing the route vector, route sub-interval, and route interval models, the sub-interval cost function and interval cost function are designed. Then the spatial decision tree model is constructed for the sub-interval cost function to output the optimal sub-interval. Based on the cost of each sub-interval, the interval decision tree algorithm is constructed to output a decision tree with interval costs as the tree nodes; thereby, the optimal travel routes for tourists are recommended.
(5)
The validation experiment and comparative experiment are performed using the proposed algorithm to output the optimal POIs and tour routes. The experiment compares the proposed algorithm (PRA) with the commonly used route planning methods (GDM and 360M) and verifies the advantages of the proposed algorithm over the traditional route planning methods. The output POI tour routes can effectively reduce travel costs.

3. Methodology

3.1. POI Natural Attribute Classification Model Based on Text Mining

Dividing the natural attributes of POIs in tourist destination cities is the key to accurately recommending POIs, since tourists’ interests in POIs first come from their preferences for the natural attributes of POIs. According to the characteristics of urban tourist attractions and relevant definitions in tourism, the natural attributes of POIs are divided into categories such as “natural scenery”, “cultural history”, “leisure shopping”, “amusement parks and venues”, etc. The category of natural attribute depends on the texts describing the POI. By constructing a text mining algorithm, the tendency of POI to natural attribute classification is obtained, and its natural attribute category is determined by comparing the tendency degrees [15]. In this section, we construct a POI natural attribute classification model based on text mining. Here are the relative definitions.
Definition 1.
Tourism destination POI P a ( i ) . When tourists get to a certain tourist destination for tourism activities, the POI of the tourist destination with natural attributes and tourism attributes is defined as the tourism destination POI, denoted as P a ( i ) , in which a is the tourist destination label, i is the tourist destination POI code, m is the quantity of tourist destination POIs, and satisfies 0 < i m , i , m N .
Definition 2.
POI natural attribute label λ N ( i ) and sub-label λ N ( i ,   j ) . In the tourism perspective, in order to quantify POI feature attributes and construct a recommendation algorithm, the basic item features and tourism function attributes of POI are defined as natural attributes, and their text labels are marked as λ N ( i ) , 0 < i k , i , k N . Make topology on each label λ N ( i ) and set l number of secondary labels that express natural attributes λ N ( i ) for the text frequency statistics on the destination POI P a ( i ) . Define the secondary label of the destination POI P a ( i ) as the natural attribute sub-label, denoted as λ N ( i ,   j ) .
Definition 3.
POI natural attribute vector λ N ( i ) and natural attribute matrix λ N ( i ,   j ) . The k × 1 dimension column vector composed of POI natural attribute labels λ N ( i ) is defined as the POI natural attribute vector λ N ( i ) , which is used to express and quantify the natural attributes of POI. Based on the row elements λ N ( i ) of vector λ N ( i ) , the sub-labels λ N ( i ,   j ) are mapped to the related vertical column j of each row i to obtain a k × l dimension topological matrix λ N ( i ,   j ) , and the matrix is defined as the POI natural attribute matrix λ N ( i ,   j ) . According to the definition, the vector λ N ( i ) and the matrix λ N ( i ,   j ) are constructed as the Formulas (1) and (2).
λ N ( i ) = λ N ( 1 ) , λ N ( 2 ) , , λ N ( k ) T
λ N ( i ,   j ) = λ N ( 1 , 1 ) λ N ( 1 , 2 ) λ N ( 1 , l ) λ N ( 2 , 1 ) λ N ( 2 , 2 ) λ N ( 2 , l ) λ N ( k , 1 ) λ N ( k , 2 ) λ N ( k , l )
Definition 4.
POI label word frequency t f ( λ N ( i ) ) , inverse text frequency i d f ( λ N ( i ) ) and label weight t f i d f ( λ N ( i ) ) . POI label word frequency t f ( λ N ( i ) ) represents the statistical word frequency of sub-labels λ N ( i ,   j ) corresponding to a natural attribute λ N ( i ) of POI in the encyclopedia big data text. The inverse text frequency i d f ( λ N ( i ) ) represents the reciprocal of the text frequency, while the text frequency is the statistical count of the times that documents containing sub-labels λ N ( i ,   j ) corresponding to a natural attribute λ N ( i ) of POI appears in all documents in the corpus. The label weight t f i d f ( λ N ( i ) ) is used to calculate the statistical weight of sub-labels λ N ( i ,   j ) corresponding to a natural attribute λ N ( i ) of POI in the encyclopedia big data text.
According to the definition, the label word frequency t f ( λ N ( i ) ) , inverse text frequency i d f ( λ N ( i ) ) and label weight t f i d f ( λ N ( i ) ) for POI natural attribute label λ N ( i ) are constructed as Formulas (3)–(5). In Formula (3), n λ N ( i ,   j ) represents the frequency of the sub-label λ N ( i ,   j ) for label λ N ( i ) of the row i and column j in the matrix λ N ( i ,   j ) appearing in the encyclopedia big data text. s : λ N ( i ) d s indicates the number of documents where the label λ N ( i ) λ N ( i ,   j ) appear in the total number of documents | D | , 0 < i k , 0 < j l , i , k , j , l N .
t f ( λ N ( i ) ) = j = 1 l n λ N ( i , j ) i = 1 k j = 1 l n λ N ( i , j )
i d f ( λ N ( i ) ) = log | D | s : λ N ( i ) d s + 1
t f i d f ( λ N ( i ) ) = t f ( λ N ( i ) ) × i d f ( λ N ( i ) )
Definition 5.
POI P a ( i ) natural attribute classification G N ( i ) . Through text mining and classification algorithm, the final k number of classifications for m number of tourist destination POIs is obtained. Each classification is defined as the natural attribute classification of POIs P a ( i ) , denoted as G N ( i ) , 0 < i k , i , k N . According to the definition of the natural attribute matrix λ N ( i ,   j ) , the row rank r a n k   ( λ N ( i ,   j ) ) of the matrix is the k number of classifications G N ( i ) , and the row of the matrix corresponds to one classification G N ( i ) .
Based on the text mining model, the POI P a ( i ) natural attribute structure tree classification algorithm based on the optimal label weight t f i d f ( λ N ( i ) ) searching is constructed. The goal is to determine the natural classification of the destination POI P a ( i ) by searching for the maximum label weight t f i d f ( λ N ( i ) ) , and ultimately construct an initial structure tree Tree   G N ( i ) containing k number of POI classifications G N ( i ) for m number of tourism destination POIs. The algorithm is constructed as follows:
Step 1: Initialize P a ( i ) , label λ N ( i ) , corresponding sub-labels λ N ( i ,   j ) . For arbitrary POI P a ( i ) , obtain its popular science text from Baidu encyclopedia big data. Establish a corpus with a total | D | number of documents, including POI’s Baidu encyclopedia big data document.
Step 2: Count the frequency of sub-labels λ N ( i ,   j ) corresponding to each label λ N ( i ) in the encyclopedia big data text, and calculate the t f ( λ N ( i ) ) for labels λ N ( i ) . Count the number of documents s : λ N ( i ) d s with corresponding sub-labels λ N ( i ,   j ) of label λ N ( i ) in the total | D | number of documents, and calculate the i d f ( λ N ( i ) ) of labels λ N ( i ) .
Step 3: Calculate the label weights t f i d f ( λ N ( i ) ) based on the t f ( λ N ( i ) ) and the i d f ( λ N ( i ) ) of labels λ N ( i ) . The POI natural attribute classification structure tree Tree   G N ( i ) is initialized as Figure 2. The weight of each label t f i d f ( λ N ( i ) ) in the figure is denoted as t · ( λ N ( i ) ) . The tree node is composed of a data linked list, and the list header is for classification G N ( i )   ~   λ N ( i ) , the list content is t · ( λ N ( i ) ) . The POI natural attribute classification algorithm based on the classification tree structure Tree   G N ( i ) is constructed as Figure 2.
Step 3.1: Establish an initialized vector T i n i to store the k number of label weights t · ( λ N ( i ) ) for λ N ( i ) , with a vector dimension of 1 × k . Store the k number of label weights t · ( λ N ( i ) ) for λ N ( i ) to the k number of elements T i n i ( u ) of the vector T i n i , 0 < u k , u , k N .
Step 3.2: The search for the elements of the initial layer L G N ( i ) ( 1 ) of the structure tree Tree   G N ( i ) .
(1)
Take λ N ( 1 ) and λ N ( 2 ) , compare t · ( λ N ( 1 ) ) and t · ( λ N ( 2 ) ) . If t · ( λ N ( 1 ) ) t · ( λ N ( 2 ) ) , store λ N ( 1 )   ~   t · ( λ N ( 1 ) ) into L G N ( i ) ( 2 ) element λ N ( x 1 ) ; If t · ( λ N ( 1 ) ) < t · ( λ N ( 2 ) ) , store λ N ( 2 )   ~   t · ( λ N ( 2 ) ) into L G N ( i ) ( 2 ) element λ N ( x 1 ) .
(2)
Take λ N ( 3 ) and λ N ( 4 ) , compare t · ( λ N ( 3 ) ) and t · ( λ N ( 4 ) ) . If t · ( λ N ( 3 ) ) t · ( λ N ( 4 ) ) , store λ N ( 3 )   ~   t · ( λ N ( 3 ) ) into the L G N ( i ) ( 2 ) element λ N ( x 2 ) ; Then If t · ( λ N ( 3 ) ) < t · ( λ N ( 4 ) ) , store λ N ( 4 )   ~   t · ( λ N ( 4 ) ) into L G N ( i ) ( 2 ) element λ N ( x 2 ) .
(3)
In line with the same method, compare t · ( λ N ( u ) ) and t · ( λ N ( u + 1 ) ) . If t · ( λ N ( u ) ) t · ( λ N ( u + 1 ) ) , store λ N ( u )   ~   t · ( λ N ( u ) ) into the L G N ( i ) ( 2 ) element λ N ( x α ) ; Then If t · ( λ N ( u ) ) < t · ( λ N ( u + 1 ) ) , store λ N ( u + 1 )   ~   t · ( λ N ( u + 1 ) ) into L G N ( i ) ( 2 ) element λ N ( x α ) . Traverse u   ~   ( 0 , k ] N .
(4)
Output the second layer L G N ( i ) ( 2 ) of the structure tree Tree   G N ( i ) .
Step 3.3: The search in the elements of the second layer L G N ( i ) ( 2 ) of the structure tree Tree   G N ( i ) .
(1)
Take λ N ( x 1 ) and λ N ( x 2 ) , compare t · ( λ N ( x 1 ) ) and t · ( λ N ( x 2 ) ) . If t · ( λ N ( x 1 ) ) t · ( λ N ( x 2 ) ) , store λ N ( x 1 )   ~   t · ( λ N ( x 1 ) ) into the L G N ( i ) ( 3 ) element λ N ( y 1 ) ; Then If t · ( λ N ( x 1 ) ) < t · ( λ N ( x 2 ) ) , store λ N ( x 2 )   ~   t · ( λ N ( x 2 ) ) into L G N ( i ) ( 3 ) element λ N ( y 1 ) .
(2)
Take λ N ( x 3 ) and λ N ( x 4 ) , compare t · ( λ N ( x 3 ) ) and t · ( λ N ( x 4 ) ) . If t · ( λ N ( x 3 ) ) t · ( λ N ( x 4 ) ) , store λ N ( x 3 )   ~   t · ( λ N ( x 3 ) ) into the L G N ( i ) ( 3 ) element λ N ( y 2 ) ; Then If t · ( λ N ( x 3 ) ) < t · ( λ N ( x 4 ) ) , store λ N ( x 4 )   ~   t · ( λ N ( x 4 ) ) into L G N ( i ) ( 3 ) element λ N ( y 2 ) .
(3)
In line with the same method, compare t · ( λ N ( x u ) ) and t · ( λ N ( x u + 1 ) ) , If t · ( λ N ( x u ) ) t · ( λ N ( x u + 1 ) ) , store λ N ( x u )   ~   t · ( λ N ( x u ) ) into L G N ( i ) ( 3 ) element λ N ( y α ) ; If t · ( λ N ( x u ) ) < t · ( λ N ( x u + 1 ) ) , store λ N ( x u + 1 )   ~   t · ( λ N ( x u + 1 ) ) into L G N ( i ) ( 3 ) element λ N ( y α ) . The number of the node x u for the second layer L G N ( i ) ( 2 ) is half of that in the initial layer L G N ( i ) ( 1 ) . x u traverses x u   ~   ( 0 , 0.5   k ) N .
(4)
Output the third layer L G N ( i ) ( 3 ) of the structure tree Tree   G N ( i ) .
Step 3.4: In line with the same method as the step 3.2–step 3.3, output the No. p layer L G N ( i ) ( p ) of the structure tree Tree   G N ( i ) . The number of the node x u for the No. p layer L G N ( i ) ( p ) is half of that in the parent layer L G N ( i ) ( p 1 ) . According to the iteration algorithm, the number N n o d e of the node x u for the No. p layer L G N ( i ) ( p ) meets the model of Formula (6). In the formula, one extra node is kept to meet the condition in case of the odd number on the node. Traverse p   ~   ( 0 , max p ) N , until the root node number of L G N ( i ) ( p max ) in the tree Tree   G N ( i ) meets N n o d e = 1 .
N n o d e = 1 + ( 0.5 ) p 1 k
Step 4: Output the data list for the root node in the No. p layer L G N ( i ) ( p max ) of the tree Tree   G N ( i ) . The list content is the related label weight t f i d f ( λ N ( i ) o p t ) . The algorithm ends. The natural attribute G N ( i )   ~   λ N ( i ) relating to the root node is the natural attribute classification that the POI belongs to.

3.2. POI Recommendation Model Based on the Improved Symmetry-Based Naive Bayes Mining and Spatial Decision Forest Algorithm

3.2.1. The Improved Symmetry-Based Naive Bayes Classification Algorithm Based on the Once-Visited POIs

In machine learning, classifiers have the feature of symmetry in extracting attributes from the training set. In tourism recommendation, when a classifier is founded, the training data set is mined to perform symmetry transformation in tourists’ interests so as to obtain accurate recommendation results that match tourists’ interests. A Naive Bayes classifier is a typical method in machine learning to perform symmetry-based interest transformation when extracting tourists’ attributes, interests, and requirements. It is suitable to precisely obtain the classification based on POI tourism attributes and tourists’ interests. The establishment of the Naive Bayes algorithm is based on the Bayes theorem and requires that the different features be independent of each other. The basic principle is to construct a classification model based on Bayesian posterior probability by utilizing the classified samples and their independent feature attributes. The values of the feature attributes are set as numerical ranges. By calculating the probability of the object being classified as belonging to each classification, the object is ultimately classified into the class with the highest probability [16]. Based on this classification modeling idea, the POIs that the tourists have once visited are listed as the training samples. Each POI contains tourism attributes, and each tourism attribute is the factor with which tourists are most concerned when planning their travel itineraries, while the tourism attributes must be independent from each other. Tourists evaluate the preferences of POIs they have once visited, and the preferences are used as the classification criterion. By confirming the preferences on POIs and the quantified tourism attributes, we construct an improved symmetry-based Naive Bayes classification algorithm based on the once-visited POIs. Here are the relative definitions.
Definition 6.
The once-visited POI P b ( i ) . The n number of POIs provided by the tourist that he has once visited and has certain preferences for is defined as the once-visited POI, denoted as P b ( i ) . The b is the label for the once-visited POI, which is used to distinguish the destination POI P a ( i ) . The i is the code for the POI P b ( i ) , 0 < i n , i , n N . The POIs P b ( i ) and their quantified tourism attributes are designed as the training set to construct the symmetry-based Naive Bayes classification algorithm.
Definition 7.
POI tourism attribute label λ T ( i ) and the quantified sub-interval λ T ( i ,   j ) . The attributes that affect tourists’ choice of POIs for tourism planning are defined as the tourism attributes, and each tourism attribute is marked as a label λ T ( i ) , 0 < i g , i , g N . The g represents the number of tourism attribute labels λ T ( i ) . The tourism attributes include indicators such as “travel cost”, “travel time”, “POI- A Class” and “POI popularity”, each of which directly affects tourists’ choice of POIs and POI classification results. In order to construct the symmetry-based Naive Bayes classification model, each label indicator λ T ( i ) is quantified as a corresponding sub-interval, denoted as λ T ( i ,   j ) , 0 < j l , j , l N , and l represents the number of sub-intervals λ T ( i ,   j ) . The sub-interval λ T ( i ,   j ) of the label λ T ( i ) is represented as a numerical range.
Definition 8.
POI tourism attribute vector λ T ( i ) and tourism attribute matrix λ T ( i ,   j ) . The g × 1 dimension column vector composed of POI tourism attribute labels λ T ( i ) is defined as the POI tourism attribute vector λ T ( i ) , which is used to express and quantify the tourism attributes of POIs. Based on the row elements λ T ( i ) of vectors λ T ( i ) , the quantified sub-intervals λ T ( i ,   j ) are mapped to the related vertical column j of each row i to obtain a g × l dimension topological matrix λ T ( i ,   j ) , and the matrix is defined as the POI tourism attribute matrix λ T ( i ,   j ) . According to the definition, the vector λ T ( i ) and the matrix λ T ( i ,   j ) are constructed as the Formulas (7) and (8). In order to construct the symmetry-based Naive Bayes classification algorithm, the matrix λ T ( i ,   j ) is recorded in the form of a data table. Arbitrary POI P b ( i ) corresponds to a matrix λ T ( i ,   j ) quantization value, and once the matrix λ T ( i ,   j ) is confirmed, the vector λ T ( i ) is immediately determined, representing the tourism attribute quantization vector of the POI P b ( i ) .
λ T ( i ) = λ T ( 1 ) , λ T ( 2 ) , , λ T ( g ) T
λ T ( i ,   j ) = λ T ( 1 , 1 ) λ T ( 1 , 2 ) λ T ( 1 , l ) λ T ( 2 , 1 ) λ T ( 2 , 2 ) λ T ( 2 , l ) λ T ( g , 1 ) λ T ( g , 2 ) λ T ( g , l )
Definition 9.
The P a ( i ) Bayesian posterior probability model P ( C ( i ) | P a ( i ) ) . The probability that a tourism destination POI P a ( i ) belongs to a classification C ( i ) is defined as Bayesian posterior probability, denoted as P ( C ( i ) | P a ( i ) ) . The higher the P a ( i ) Bayesian posterior probability value P ( C ( i ) | P a ( i ) ) is, the higher the probability of POI P a ( i ) belonging to the classification C ( i ) will be. Formula (9) is the constructed P a ( i ) Bayesian posterior probability model P ( C ( i ) | P a ( i ) ) .
P ( C ( i ) | P a ( i ) ) = P ( P a ( i ) | C ( i ) ) P ( C ( i ) ) P ( P a ( i ) )
According to Definitions 6–9 as well as the quantified matrix λ T ( i ,   j ) , we construct the symmetry-based Naive Bayes classification algorithm based on the training set POI P b ( i ) and tourism attributes.
Step 1: The tourist confirms n number of P b ( i ) and divides each P b ( i ) into preference classifications C ( i ) . The quantity of C ( i ) meets 0 < i w , i , w N . According to the preference degrees of the tourist to the POIs P b ( i ) , the C ( i ) could be defined as C ( 1 ) : “Most favorite”; C ( 2 ) : “Favorite” and C ( 3 ) : “Like”, three classifications in total. Expand the quantified interval data table for the vector λ T ( i ) , and construct the new vector with the classification indicator, C ( i ) as Formula (10) shows.
λ T ( i ) * = λ T ( 1 ) , λ T ( 2 ) , , λ T ( g ) , C ( i ) T
Step 2: Problem confirmation. The standard for solving the problem that a POI P a ( i ) belongs to a certain classification C ( x ) is calculating P ( C ( x ) | P a ( i ) ) and satisfies that: for arbitrary C ( y ) , excluding C ( x ) , there is always the P ( C ( x ) | P a ( i ) ) > P ( C ( y ) | P a ( i ) ) , 0 < x , y w , x , y , w N , x y .
Step 3: Problem transformation: Since the P ( P a ( i ) ) to any classification C ( i ) is a constant, to obtain the maximum value P ( C ( i ) | P a ( i ) ) is transformed into obtaining the maximum value of the denominator P ( P a ( i ) | C ( i ) ) P ( C ( i ) ) . According to the actual tourism conditions, there are differences in the probabilities of POI classifications C ( i ) . According to the Bayesian theorem, the model P ( C ( i ) ) is constructed as the Formulas (11) and (12), n ( i ) is the number of samples belonging to the classification C ( i ) in the training sample set P b ( i ) and n is the total number of samples in the training sample set P b ( i ) .
P ( C ( i ) ) = n ( i ) n
P ( C ( i ) ) = n ( i ) i = 1 w n ( i )
Step 4: The conditional probability P ( λ T ( i ) | C ( i ) ) of the tourism attribute label λ T ( i ) for the POI P a ( i ) is constructed in Formula (13), the mark n ( i ,   j ) is the frequency of label λ T ( i ,   j ) appearing in the classification C ( i ) in the training sample set P b ( i ) and n ( i ) is the number of samples belonging to the classification C ( i ) in the training sample set P b ( i ) . To avoid the possibility that the conditional probability might be 0, increase the value of n ( i ,   j ) by 1 item and set that P ( λ T ( i ,   j ) | C ( i ) ) could not exceed 1.
P ( λ T ( i ,   j ) | C ( i ) ) = n ( i ,   j ) + 1 n ( i )
Step 5: The model constructed for the conditional probability P ( P a ( i ) | C ( i ) ) P ( C ( i ) ) is shown as Formula (14). The mark g is the number of tourism attribute label λ T ( i ) , the mark λ T ( i ,   j ) represents the label that belongs to the quantified matrix λ T ( i ,   j ) .
P ( P a ( i ) | C ( i ) ) P ( C ( i ) ) = i = 1 g P ( λ T ( i ,   j ) | C ( i ) ) · P ( C ( i ) )
Step 6: Take the recommendation degree δ N B = max   P ( P a ( i ) | C ( i ) ) P ( C ( i ) ) , then the classification C ( i ) relating to the δ N B is the classification that the POI P a ( i ) belongs to. The classification algorithm for P a ( i ) ends.
Calculate the recommendation degrees δ N B for m number of POIs P a ( i ) in the destination POI set P a ( i ) . Classify the m number of POIs P a ( i ) into w number of classifications C ( i ) , count the POI P a ( i ) number in each C ( i ) , denoted as h ( i ) .

3.2.2. Improved POI Recommendation Degree Model Based on Tourism Attribute Interest Network

The Naive Bayes classification algorithm calculates the conditional probabilities of the m number of destination POIs in a set P a ( i ) from the perspective of the probability that the once-visited POIs belong to different interest clusters C ( i ) , and uses it as the recommendation degree δ N B . The modeling conditions of the Naive Bayes classification algorithm require that each tourism attribute λ T ( i ) be independent of the others. The proposed Naive Bayes classification algorithm uses tourism attributes such as “travel cost”, “travel time”, “POI A-Class”, and “POI popularity”. Each attribute is independent in terms of functional properties, while there is correlation in the subjective evaluation and selection of different tourists, and it will have an impact on the final recommendation of destination POIs in the set P a ( i ) . Tourists have different attitudes and judgments in the evaluation correlations and interest weights of POI tourism attributes when choosing POIs for sightseeing; that is, their demand tendencies for different tourism attributes are different. To make the recommendation results output by the Naive Bayes classification algorithm more accurate, we introduce the model of tourism attribute interest weight, construct a tourism attribute interest network based on historical visited POIs, and introduce it into the conditional probability model output by the Naive Bayes classification algorithm to further optimize the POIs that accurately match tourists’ interests.
Definition 10.
Once-visited POI interest weight ε ( i ) and tourism attribute interest weight ω ( i ) . Tourists evaluate and set weights for n number of POIs in the once-visit POI set P b ( i ) based on their own interests. We define this weight as the once-visited POI interest weight ε ( i ) , 0 < ε ( i ) < 1 , ε ( i ) R . Tourists evaluate POI tourism attributes based on their own interests and set interest weights. This weight is defined as the tourism attribute interest weight ω ( i ) , 0 < ω ( i ) < 1 , ω ( i ) R . The interest weight of a tourism attribute determines the degree to which tourists attach importance to tourism attributes. The higher the weight ω ( i ) is, the higher the tourist’s interest tendency towards attribute λ T ( i ) will be, and vice versa. According to the definition, the relationship model among tourism attributes is constructed from the perspective of interest weight, as shown in Formula (15), g is the number of tourism attributes λ T ( i ) .
i = 1 g ω ( i ) = 1
Definition 11.
Tourism attribute interest network N e t · λ T ( i ) . The network composed of n number of POIs in the POI set P b ( i ) and their tourism attribute quantification values is defined as the tourism attribute interest network, denoted as N e t · λ T ( i ) . The final interests of tourists in various tourism attributes are determined by the network N e t · λ T ( i ) . Figure 3 shows the constructed tourism attribute interest network N e t · λ T ( i ) . In the network, the horizontal network N e t r o represents the quantified values of tourism attributes for POIs P b ( i ) , while the vertical network N e t c o represents the iteration of each POI by the quantified values of a single tourism attribute.
According to Definitions 10 and 11, and the tourism attribute interest network N e t · λ T ( i ) , the tourism attribute interest iterative model λ T ( i ) * is constructed by introducing weight ε ( i ) and weight ω ( i ) , as shown in Formula (16), and N o r m · represents the normalization function, g represents the number of tourism attributes λ T ( i ) , and n represents the number of once-visited POIs P b ( i ) .
λ T ( i ) * = i = 1 g j = 1 n ε ( j ) · ω ( i ) · N o r m λ b · T ( i ) n
According to the quantified values λ a · T ( i ) of tourism attributes in the destination POIs P a ( i ) , the tourist interest matching recommendation degree δ M A is constructed as shown in Formula (17). Due to the introduction of weight ε ( i ) and weight ω ( i ) , the coefficient value ζ of the recommendation degree δ M A is set as ζ = 0.1 .
δ M A = 1 i = 1 g λ T ( i ) * ζ · N o r m λ a · T ( i ) p 1 p
To optimize the accuracy of the Naive Bayes classification algorithm in recommending POIs, the tourist interest matching recommendation degree δ M A is introduced into Formula (14) recommendation degree δ N B = max   P ( P a ( i ) | C ( i ) ) P ( C ( i ) ) , and the destination POI P a ( i ) recommendation model δ ( i ) is constructed as shown in Formula (18).
δ ( i ) = δ N B   ×   δ M A
The destination POI P a ( i ) recommendation model δ ( i ) introduces a tourist interest weight and tourism attribute interest network based on an improved Naive Bayes classification algorithm, and the output POIs will be closer to the tourists’ interests.

3.2.3. POI Recommendation Model Based on the Spatial Decision Forest Algorithm

The structural tree Tree   G N ( i ) generated by the text mining algorithm is used to classify the natural attributes of the destination POIs. At the same time, based on the tourism attributes of the once-visited POIs and considering the factors in tourism planning, a symmetry-based Naive Bayes classification algorithm is constructed to classify the tourism attributes of the destination POIs. The degrees of tourists’ preferences for POIs come from the judgment on the natural attributes and tourism attributes of the once-visited POIs, that is, the comprehensive consideration of POI categories such as “natural scenery”, “cultural history”, “leisure shopping”, “amusement parks and venues”, as well as indicators such as “travel cost”, “travel time”, “POI- A Class, and “POI popularity”. When tourists judge the POI as “Most Favorite”, it indicates that they have a high level of interest in the natural attributes and tourism attributes of the POI [17]. The goal of recommending POIs and tour routes for tourists is to search for d number of POIs with the highest tourist interests among the m number of POIs and construct a route search algorithm to output the route with the lowest travel cost. Based on the modeling objectives, relevant definitions are provided as follows:
Definition 12. 
Recommendation decision matrix  T d ( i ) Based on the natural attribute classification results output by the text mining algorithm and the tourism attribute classification results output by the Naive Bayes algorithm, a global optimal search algorithm is constructed to list the destination POIs  P a ( i )  in the classification  C ( i )  by the natural attributes and their recommendation degree  δ ( i ,   j ) 0 < i k 0 < j h ( i )  and store them in a certain data structure in a  k × h ( i )  dimension matrix. This matrix is defined as a recommendation decision matrix, denoted as  T d ( i ) . The matrix consists of  k  number of rows and  h ( i )  number of columns.  k  represents the number of natural attribute classifications and  h ( i )  represents the number of destination POIs included in the tourism attribute classification  C ( i ) . An arbitrary row of the matrix represents a natural attribute classification  G N ( i )  in the tourism attribute classification  C ( i ) . The matrix  T d ( i )  code  i  corresponds to the encoding of the tourism attribute classification  C ( i ) , that is, one classification  C ( i )  corresponds to one matrix  T d ( i ) 0 < i w i , w N w  is the number of classification  C ( i ) . Formula (19) is the constructed recommendation decision matrix for the classification  C ( i ) .
T d ( i ) = δ ( 1 ,   1 ) δ ( 1 ,   2 ) δ ( 1 ,   h ( i ) ) δ ( 2 ,   1 ) δ ( 2 ,   2 ) δ ( 2 ,   h ( i ) ) δ ( k ,   1 ) δ ( k ,   2 ) δ ( k ,   h ( i ) )
Definition 13. 
Recommendation decision tree  Tree   C ( i )  and recommendation decision forest  F o r e s t   C ( i ) . In the process of constructing a recommendation decision matrix  T d ( i )  for classification  C ( i )  by using the global optimal search algorithm, the destination POIs  P a ( i )  and recommendation degrees  δ ( i ,   j )  are grown from the child nodes “ G N ( i ) ” assigned by the root node “ C ( i ) ”, and the binary tree structure is extended to the lower level. The improved binary tree derived from the matrix  T d ( i )  is defined as the recommendation decision tree, denoted as  Tree   C ( i ) . A decision forest composed of a  w  number of decision trees  Tree   C ( i )  corresponding to classifications  C ( i )  is defined as a recommendation decision forest, denoted as  F o r e s t   C ( i ) . Decision trees and decision forest are the visual representations of constructing an optimal destination POI recommendation algorithm. The decision tree  Tree   C ( i )  meets the following conditions:
(1)
The root node represents the classification  C ( i ) , and the growth node represents the classification   G N ( i ) ;
(2)
The recommendation degree  δ ( i ,   j )  of any child node   N o d e ( x ,   y )  in the previous layer must be higher than that of any child node  N o d e ( x + 1 ,   y )  in the next layer, which   x   represents the layer of the decision tree and   y   represents the node within the layer;
(3)
In the same layer, the recommendation degree   δ ( i ,   j )   of the left child node   N o d e ( x ,   y )   must be higher than that of the right child node   N o d e ( x ,   y   +   Δ ) ;
(4)
The total number of nodes in the decision tree is  k   +   h ( i )   +   1 , and the total number of layers is satisfied  log 2 ( k + h ( i ) + 1 ) .
According to the modeling principle and Definitions 12 and 13, construct the POI recommendation model based on the spatial decision forest algorithm in Algorithm 1. Figure 4 shows the process for constructing the decision tree Tree   C ( i ) and decision forest F o r e s t   C ( i ) .
Algorithm 1: The POI recommendation model based on the spatial decision forest algorithm
1:
Take i = 1 , construct a recommendation decision tree Tree   C ( 1 ) for tourism attribute classification C ( 1 ) . Collect all destination POIs P a ( i ) included in C ( 1 ) and determine the natural attribute classification G N ( t ) that each POI belongs to. The tourism attribute classification C ( 1 ) includes h ( 1 ) number of POIs, and suppose that each natural attribute classification G N ( t ) includes h ( 1 ,   t ) number of POIs.
2:
The root node is defined as C ( 1 ) , and the growth child node is defined as G N ( t ) , 0 < t k , t , k N . Calculate the recommendation degree   δ of the destination POIs P a ( i ) in C ( 1 ) . Randomly store the POI recommendation degrees   δ of each corresponding classification G N ( i ) for each row of the recommendation decision matrix T d ( i ) according to the element storage rules, denoted as δ ( i ,   j ) .
3:
For growth nodes G N ( t ) . Take t = 1 , which contains h ( 1 ,   1 ) number of POIs, corresponding to the first row of the matrix T d ( i ) . Thus, the binary tree derived from the growth node G N ( t ) contains h ( 1 ,   1 ) number of child nodes.
4:
Take all the elements δ ( 1 ,   x ) of the first row in the matrix T d ( i ) , 0 < x h ( 1 ,   1 ) , and derive the binary tree relating to G N ( 1 ) . Figure 4A shows the initial state of the root node C ( 1 ) and the growth child node G N ( t ) .
5:
Judge δ ( 1 ,   1 ) and δ ( 1 ,   2 ) :
6:
  If δ ( 1 ,   1 ) δ ( 1 ,   2 ) , store δ ( 1 ,   1 ) and δ ( 1 ,   2 ) into N o d e ( 1 ,   1 ) and N o d e ( 1 ,   2 ) ;
7:
  If δ ( 1 ,   1 ) < δ ( 1 ,   2 ) , store δ ( 1 ,   1 ) and δ ( 1 ,   2 ) into N o d e ( 1 ,   2 ) and N o d e ( 1 ,   1 ) .
8:
Add δ ( 1 ,   3 ) , compare δ ( 1 ,   1 ) , δ ( 1 ,   2 ) and δ ( 1 ,   3 ) :
9:
  If δ ( 1 ,   1 ) δ ( 1 ,   2 ) :
10:
    If δ ( 1 ,   1 ) δ ( 1 ,   2 ) > δ ( 1 ,   3 ) : store δ ( 1 ,   1 ) , δ ( 1 ,   2 ) and δ ( 1 ,   3 ) into N o d e ( 1 ,   1 ) ,
      N o d e ( 1 ,   2 ) and N o d e ( 2 ,   1 ) ;
11:
    If δ ( 1 ,   1 ) δ ( 1 ,   3 ) > δ ( 1 ,   2 ) : store δ ( 1 ,   1 ) , δ ( 1 ,   2 ) and δ ( 1 ,   3 ) into N o d e ( 1 ,   1 ) ,
      N o d e ( 2 ,   1 ) and N o d e ( 1 ,   2 ) ;
12:
    If δ ( 1 ,   3 ) δ ( 1 ,   1 ) > δ ( 1 ,   2 ) : store δ ( 1 ,   1 ) , δ ( 1 ,   2 ) and δ ( 1 ,   3 ) into N o d e ( 1 ,   2 ) ,
      N o d e ( 2 ,   1 ) and N o d e ( 1 ,   1 ) .
13:
  If δ ( 1 ,   1 ) < δ ( 1 ,   2 ) :
14:
    If δ ( 1 ,   3 ) < δ ( 1 ,   1 ) < δ ( 1 ,   2 ) : store δ ( 1 ,   1 ) , δ ( 1 ,   2 ) and δ ( 1 ,   3 ) into N o d e ( 1 ,   2 ) ,
      N o d e ( 1 ,   1 ) and N o d e ( 2 ,   1 ) ;
15:
    If δ ( 1 ,   1 ) < δ ( 1 ,   3 ) < δ ( 1 ,   2 ) : store δ ( 1 ,   1 ) , δ ( 1 ,   2 ) and δ ( 1 ,   3 ) into N o d e ( 2 ,   1 ) ,
      N o d e ( 1 ,   1 ) and N o d e ( 1 ,   2 ) ;
16:
    If δ ( 1 ,   1 ) < δ ( 1 ,   2 ) < δ ( 1 ,   3 ) : store δ ( 1 ,   1 ) , δ ( 1 ,   2 ) and δ ( 1 ,   3 ) into N o d e ( 2 ,   1 ) ,
      N o d e ( 1 ,   2 ) and N o d e ( 1 ,   1 ) .
17:
Add δ ( 1 ,   i ) , compare δ ( 1 ,   1 )   ~   δ ( 1 ,   i ) in line with the same algorithm in step 5 to step 16, and store POI recommendation degrees δ ( i ,   j ) by the binary tree descending sub-node storage rules.
18:
Traverse the h ( 1 ,   1 ) number of recommendation degrees δ ( 1 ,   i )   ~   i ( 0 ,   h ( 1 ,   1 ) ] of child nodes until a binary tree containing h ( 1 ,   1 ) number of child nodes is derived. The binary tree search algorithm for G N ( 1 ) ends, and a complete binary tree is generated, as shown in Figure 4B.
19:
Repeat the algorithm from step 3 to step 18, searching for the binary trees derived from other classifications G N ( t ) , traversing G N ( t )   ~   t ( 0 , k ] , with each binary tree G N ( t ) containing h ( 1 ,   t ) number of child nodes. When t = k , the traversal process is completed, a recommendation structure tree Tree   C ( 1 ) consisting of   k number of derived binary trees from   k number of   G N ( t ) is generated, as shown in Figure 4C.
20:
Repeat steps 1 through 19. For C ( i ) , take i = 2 , generate the recommendation structure tree Tree   C ( 2 ) for C ( 2 ) . In the same method, traverse C ( i )   ~   i ( 0 , w ] and generate   w number of recommendation structure trees Tree   C ( i ) .
21:
Generate a recommendation decision forest F o r e s t   C ( i ) by   w number of recommendation decision trees Tree   C ( i ) , as Figure 4D shows. In the decision forest, the sub-node N o d e ( 1 ,   1 ) of G N ( t ) in each Tree   C ( i ) store has the highest recommendation degree, and its related destination POI P a ( i ) is the optimal POI for G N ( t ) . The algorithm recommends the optimal POIs G N ( t ) as the tour route POIs, representing the natural attributes and tourism attributes with the highest tourist interests.

3.3. POI Tour Route Recommendation Model Based on the Spatial Decision Tree Algorithm

By using the POI recommendation model, the decision trees Tree   C ( i ) and decision forest F o r e s t   C ( i ) representing the tourism attribute classifications C ( i ) are obtained. The optimal POI recommendation is realized by selecting the sub-nodes with the best recommendation degrees δ from the natural attribute classifications G N ( t ) included in the decision tree. From a geospatial perspective, POIs are distributed in different geographical locations of tourism cities and have spatial attributes. Their spatial accessibility is constrained by various factors, such as geographical coordinates, road distances, accessibility time, and transportation tools. Therefore, constructing a tour route model based on the recommended POIs is an effective way to obtain the optimal travel route [18,19,20]. We integrate the urban geospatial constraints and construct a POI tour route recommendation model based on the spatial decision tree algorithm. The relevant definitions are given as follows:
Definition 14. 
Tour route vector T r o and vector element T r o ( i ) . Tourists start traveling from the starting point S in the city, and the traveling process forms a complete route by following a certain sequence of d number of POIs recommended by the decision forest algorithm. Extract the route as a 1 × d dimension vector, randomly store d number of POIs into the vector, and define this vector as a tour route vector, denoted as T r o . The elements in the vector are denoted as T r o ( i ) . According to the definition, the starting point S traverses the d number of elements T r o ( i ) of vector T r o to form a POI storage order, which represents a tour route, i   ~   i ( 0 , d ] , i , d N .
Definition 15. 
Route sub-interval s u b i n · T r o ( i ,   j ) and route interval i n · T r o . The path interval between arbitrary adjacent elements T r o ( i ) and T r o ( j ) in vector T r o is defined as a route sub-interval, denoted as s u b i n · T r o ( i ,   j ) . The whole traveling interval composed of the POI storage vector T r o is defined as the route interval, denoted as i n · T r o . sub-interval s u b i n · T r o ( i ,   j ) represents the moving and traveling space between POIs, while interval i n · T r o represents the tour route.
Definition 16. 
Sub-interval cost f s u b i n · T r o ( i ,   j ) and interval cost f i n · T r o . The distance between road nodes n o d e ( x ) and n o d e ( x   +   1 ) that form the sub-interval s u b i n · T r o ( i ,   j ) is D ( x , x   +   1 ) (Unit: km), then the spatial cost f s u b i n · T r o ( i ,   j ) for tourists to move from the node T r o ( x ) to the node T r o ( x   +   1 ) meets the Formula (20), k is the number of road nodes within the sub-interval. The spatial cost between the POI nodes T r o ( x ) and T r o ( x   +   1 ) that make up the tour route interval i n · T r o is f s u b i n · T r o ( i ,   j ) , then the spatial cost f i n · T r o ( x ,   x   +   1 ) generated by tourists traveling from the starting point S to the last POI element T r o ( i ) of the vector T r o meets Formulas (21) or (22), f s u b i n · T r o ( S ,   1 ) represents the spatial cost between the starting point S and the first POI element T r o ( 1 ) of the vector T r o .
f s u b i n · T r o ( i ,   j ) = x = 1 k D ( x , x + 1 )
f i n · T r o ( x ,   x   +   1 ) = f s u b i n · T r o ( S ,   1 ) + x = 1 d 1 f s u b i n · T r o ( i ,   j ) ( x )
f i n · T r o ( x , x   +   1 ) = f s u b i n · T r o ( S , 1 ) + x = 1 d 1 γ = 1 k D ( γ , γ + 1 )
Definition 17. 
Sub-interval decision tree s u b t r · T r o ( i ,   j ) and interval decision tree t r · T r o . Using the sub-interval costs f s u b i n · T r o ( i ,   j ) formed by multiple roads within the sub-interval s u b t r · T r o ( i ,   j ) as nodes, a cost-complete binary tree is constructed via a heap sorting algorithm, and the complete binary tree is defined as a sub-interval decision tree, denoted as s u b t r · T r o ( i ,   j ) . Using the interval costs f i n · T r o formed by the d number of POIs contained within the interval i n · T r o and the starting point S as nodes, a cost complete binary tree is constructed via the heap sorting algorithm, and the complete binary tree is defined as an interval decision tree, denoted as t r · T r o . According to the characteristics of a complete binary tree, both decision trees s u b t r · T r o ( i ,   j ) and t r · T r o meet the following modeling conditions:
(1) 
The root node stores the minimum cost  f s u b i n · T r o ( i ,   j )   or  f i n · T r o ;
(2) 
The cost  f s u b i n · T r o ( i ,   j )   or  f i n · T r o   of arbitrary child node in the previous layer must be smaller than the cost of arbitrary child node in the next layer;
(3) 
In the same layer, the cost  f s u b i n · T r o ( i ,   j )   or  f i n · T r o   of the left child node must be smaller than the cost of arbitrary right child node;
(4) 
When the total number of nodes is  n , the height of the tree’s layer meets  log 2 n .
Based on Definitions 14–17 and the POI tour route modeling approach, we construct a POI tour route recommendation model based on the spatial decision tree algorithm. The POI route algorithm consists of two parts: Algorithm 2 is to find out the optimal solution of the route sub-interval s u b i n · T r o ( i ,   j ) , and Algorithm 3 is to find out the optimal solution of the route interval i n · T r o .
Algorithm 2: Optimal solution algorithm of the route sub-interval s u b i n · T r o ( i ,   j )
1:
Establish a vector T r o and confirm the vector elements T r o ( i ) . Establish a sub-interval s u b i n · T r o ( i ,   j ) containing elements T r o ( i ) , T r o ( j ) and k number of road nodes n o d e ( x ) . Figure 5A shows an example process for constructing the sub-interval s u b i n · T r o ( i ,   j ) .
2:
Randomly search for the moving path P a t h ( 1 ) between T r o ( i ) and T r o ( j ) , and iteratively calculate the cost f s u b i n ·   T r o ( i ,   j ) ( 1 ) .
3:
  Connect T r o ( i ) and n o d e ( 1 ) , there is no closed loop; the moving path distance is
   D ( T r o ( i ) , 1 ) , as shown in Figure 5B;
4:
  Connect n o d e ( 1 ) and n o d e ( 2 ) , there is no closed loop; the moving path distance
  is D ( 1 , 2 ) , as shown in Figure 5C;
5:
  Connect n o d e ( 2 ) and n o d e ( 3 ) , there is no closed loop; the moving path distance
   is D ( 2 , 3 ) , as shown in Figure 5D;
6:
  Connect n o d e ( 3 ) and n o d e ( 1 ) , there is a closed loop, delete n o d e ( 1 ) ; connect n o d e ( 5 ) ,
  there is no closed loop, the moving path distance is D ( 3 , 5 ) , as shown in Figure 5E
7:
  Continue searching, connect n o d e ( 5 ) and n o d e ( 8 ) , n o d e ( 8 ) , T r o ( j ) , form a
  complete moving path P a t h ( 1 ) , as shown in Figure 5F. Formula (23) shows the
  interval cost example of f s u b i n · T r o ( i ,   j ) ( 1 ) .
8:
Randomly search for the moving path P a t h ( 2 ) between T r o ( i ) and T r o ( j ) , and iteratively calculate the cost f s u b i n · T r o ( i ,   j ) ( 2 ) .
9:
Continue the searching by using the method from step 2 to step 8. Traverse k number of road nodes n o d e ( x ) to form g number of paths P a t h ( i ) without a closed loop, i ( 0 , g ] , i , g N , corresponding to g number of sub-interval costs f s u b i n · T r o ( i ,   j ) ( i ) . The mark g relates to the g number of nodes in the sub-interval decision tree s u b t r · T r o ( i ,   j ) .
10:
Take the costs f s u b i n · T r o ( i ,   j ) ( i ) of the g number of paths P a t h ( i ) and construct a complete binary tree containing g number of nodes. Figure 6A shows the initial state of the binary tree.
11:
Judge f s u b i n · T r o ( i ,   j ) ( 1 )   ~   f s u b i n · ( 1 ) and f s u b i n · T r o ( i ,   j ) ( 2 )   ~   f s u b i n · ( 2 ) :
12:
  If f s u b i n · ( 1 ) f s u b i n · ( 2 ) , store f s u b i n · ( 1 ) and f s u b i n · ( 2 ) into c n o d e ( 1 ,   1 ) and f n o d e ;
13:
  If f s u b i n · ( 1 ) < f s u b i n · ( 2 ) , store f s u b i n · ( 1 ) and f s u b i n · ( 2 ) into f n o d e and c n o d e ( 1 , 1 ) .
14:
Add f s u b i n · T r o ( i ,   j ) ( 3 )   ~   f s u b i n · ( 3 ) , compare f s u b i n · ( 1 ) , f s u b i n · ( 2 ) and f s u b i n · ( 3 ) :
15:
  If f s u b i n · ( 1 ) f s u b i n · ( 2 ) :
16:
    If f s u b i n · ( 1 ) f s u b i n · ( 2 ) > f s u b i n · ( 3 ) : store f s u b i n · ( 1 ) , f s u b i n · ( 2 ) and
     f s u b i n · ( 3 ) into c n o d e ( 1 , 2 ) , c n o d e ( 1 , 1 ) and f n o d e ;
17:
    If f s u b i n · ( 1 ) f s u b i n · ( 3 ) > f s u b i n · ( 2 ) : store f s u b i n · ( 1 ) , f s u b i n · ( 2 ) and
     f s u b i n · ( 3 ) into c n o d e ( 1 , 2 ) , f n o d e and c n o d e ( 1 , 1 ) ;
18:
    If f s u b i n · ( 3 ) f s u b i n · ( 1 ) > f s u b i n · ( 2 ) : store f s u b i n · ( 1 ) , f s u b i n · ( 2 ) and
     f s u b i n · ( 3 ) into c n o d e ( 1 , 1 ) , f n o d e and c n o d e ( 1 , 2 ) .
19:
  If f s u b i n · ( 1 ) < f s u b i n · ( 2 ) :
20:
    If f s u b i n · ( 3 ) < f s u b i n · ( 1 ) < f s u b i n · ( 2 ) : store f s u b i n · ( 1 ) , f s u b i n · ( 2 ) and
     f s u b i n · ( 3 )   into   c n o d e ( 1 , 1 ) , c n o d e ( 1 , 2 ) , and f n o d e ;
21:
    If f s u b i n · ( 1 ) < f s u b i n · ( 3 ) < f s u b i n · ( 2 ) : store f s u b i n · ( 1 ) , f s u b i n · ( 2 ) and
     f s u b i n · ( 3 ) into f n o d e , c n o d e ( 1 , 2 ) and c n o d e ( 1 , 1 ) ;
22:
    If f s u b i n · ( 1 ) < f s u b i n · ( 2 ) < f s u b i n · ( 3 ) : store f s u b i n · ( 1 ) , f s u b i n · ( 2 ) and
     f s u b i n · ( 3 ) into f n o d e , c n o d e ( 1 , 1 ) and c n o d e ( 1 , 2 ) .
23:
Add f s u b i n · T r o ( i ,   j ) ( i )   ~   f s u b i n · ( i ) , compare f s u b i n · ( 1 ) ~ f s u b i n · ( i ) by the same algorithm from the step 11 to step 22, and store f s u b i n · ( i ) in a binary tree by an ascending heap sorting algorithm. After traversing the sub-nodes i = g , the cost f s u b i n · ( i ) algorithm ends, and the binary tree is output as shown in Figure 6B.
24:
The sub-interval cost f s u b i n · T r o ( i ,   j ) ( i ) stored by the root node f n o d e of the current binary tree represents the minimum cost, and the corresponding path P a t h ( i ) is the optimal path between T r o ( i ) and T r o ( j ) .
25:
In line with the method from step 1 to step 24, traverse to search for all the optimal solutions in sub-intervals   s u b i n · T r o ( i ,   j ) . The algorithm ends.
f s u b i n · T r o ( i ,   j ) ( 1 ) = D ( T r o ( i ) ,   1 ) + D ( 1 ,   2 ) + D ( 2 ,   3 ) + D ( 3 ,   5 ) + D ( 5 , 8 ) + D ( 8 ,   T r o ( j ) )
Figure 5. The example modeling process of constructing the sub-interval s u b i n · T r o ( i ,   j ) . (A) shows the initial status of the sub-interval. (B) shows the node n o d e ( 1 ) has been found. (C) shows the node n o d e ( 2 ) has been found. (D) shows the node n o d e ( 3 ) has been found. (E) shows the node n o d e ( 5 ) has been found. (F) shows the node n o d e ( 8 ) has been found, and finally the path has been found.
Figure 5. The example modeling process of constructing the sub-interval s u b i n · T r o ( i ,   j ) . (A) shows the initial status of the sub-interval. (B) shows the node n o d e ( 1 ) has been found. (C) shows the node n o d e ( 2 ) has been found. (D) shows the node n o d e ( 3 ) has been found. (E) shows the node n o d e ( 5 ) has been found. (F) shows the node n o d e ( 8 ) has been found, and finally the path has been found.
Symmetry 15 02168 g005
Figure 6. The constructed complete binary tree s u b t r · T r o ( i ,   j ) when searching for the optimal solution in the sub-interval s u b i n · T r o ( i ,   j ) . (A) shows the initial state of the binary tree. (B) shows the output binary tree.
Figure 6. The constructed complete binary tree s u b t r · T r o ( i ,   j ) when searching for the optimal solution in the sub-interval s u b i n · T r o ( i ,   j ) . (A) shows the initial state of the binary tree. (B) shows the output binary tree.
Symmetry 15 02168 g006
Algorithm 3: The t r · T r o algorithm to find out the optimal solution for interval i n · T r o .
1:
Establish a vector T r o and confirm the vector elements T r o ( i ) . Establish a route interval i n · T r o , containing the d number of recommended POIs.
2:
Construct the first tour route T o u r ( 1 ) , relating to the interval cost f i n · T r o . Randomly store the d number of POIs to T r o , output the optimal sub-interval cost f s u b i n · T r o ( S ,   1 ) between the starting point S and the current T r o ( 1 ) , and the optimal sub-interval cost f s u b i n · T r o ( i ,   j ) between arbitrary element T r o ( x ) and its adjacent element T r o ( x   +   1 ) .
3:
Bring costs into Formulas (21) or (22) of the interval cost model f i n · T r o ( x ,   x   +   1 ) to calculate the interval cost f i n · T r o ( x ,   x   +   1 ) ( 1 ) of the route T o u r ( 1 ) .
4:
Based on the vector T r o containing d number of POI elements T r o ( i ) , according to the tour sequence, randomly exchange the storage element   T r o ( i ) and output   A ( d ,   d ) kinds of routes T o u r ( i ) ,   i ( 0 , A ( d ,   d ) ] ,   i , A ( d ,   d ) N . Traverse A ( d ,   d ) ) kinds of routes T o u r ( i ) and output the route costs f i n · T r o ( x ,   x   +   1 ) ( i )   ~   f i n · ( i ) .
5:
Use the heap-ascending algorithm to store interval costs f i n · T r o ( x ,   x   +   1 ) ( i )   ~   f i n · ( i ) into a complete binary tree, constructing a complete binary tree containing A ( d ,   d ) number of nodes. Figure 7A shows the initial state of the binary tree, and Figure 7B shows the constructed heap-ascending complete binary tree.
6:
Traverse i   ~   ( 0 , A ( d ,   d ) ] , when the sub-node i = A ( d ,   d ) has been iterated, the interval i n · T r o search algorithm ends. The current binary tree root node f n o d e stores the minimum interval cost f i n · T r o ( x ,   x   +   1 ) ( i ) , and the corresponding route T o u r ( i ) is the route with the lowest cost from the starting point S to travel through the d number of recommended POIs.
Figure 7. The constructed complete binary tree t r · T r o when searching for the optimal solution in the interval i n · T r o . (A) shows the initial state of the binary tree. (B) shows the output binary tree.
Figure 7. The constructed complete binary tree t r · T r o when searching for the optimal solution in the interval i n · T r o . (A) shows the initial state of the binary tree. (B) shows the output binary tree.
Symmetry 15 02168 g007

3.4. The Construction of Tourist Satisfaction Evaluation Model

To verify the feasibility of the proposed POI route recommendation algorithm and the effectiveness of the tourist feedback satisfaction based on the recommendation results, we construct tourist satisfaction evaluation models for the recommended POIs and the searched routes based on the evaluation criteria of the POI and tour route recommendation. The models evaluate the proposed POI route recommendation algorithm and the comparative methods in terms of four satisfaction indexes: average precision, average recall, average deviation of attribute matching degree, and average deviation of route cost. The satisfaction evaluation models and evaluation methods are constructed as follows: Table 1 shows the tourist satisfaction evaluation indexes for POI and route recommendation; the mark “√” represents the index that is used to evaluate the related item.
(1)
Average precision and average recall for POI recommendation
Set one tourist sample as X ( i ) , and the interest data provided by the tourist sample are: ① historical visited POIs P b ( i ) ; ② the expected natural attributes G N ( i ) of POIs; ③ the number of expected POIs to visit. The algorithm recommends a N P O I number of destination POIs P a ( i ) based on the constraints provided by the tourist sample while outputting the natural attributes G N ( i ) , of tourism, such as “travel cost”, “travel time”, “POI A-Class”, “POI popularity”, and a detailed introduction to POIs for the tourist sample. The tourist sample makes a “satisfactory” (S) and “unsatisfactory” (NS) judgment on the N P O I number of recommended POIs P a ( i ) . For the N P O I number of recommended POIs, we set the number of POIs that make the tourist make a ‘dissatisfied’ (NS) judgment is N P O I ( i ) , and the number of POIs that make a ‘satisfied’ (S) judgment is N N P O I ( i ) . If the number of tourist samples participating in the satisfaction evaluation is k , the average precision evaluation model of the POI recommendation algorithm based on tourist sample X ( i ) is constructed as Formula (24). If the number of the total destination POIs P a ( i ) is m , and all POI samples P a ( i ) have been browsed by tourists, then the average recall evaluation model of the POI recommendation algorithm for the tourist sample is constructed as Formula (25). The k number of samples in the model can also represent k types or pieces of interest.
The average precision reflects the proportion of satisfied POIs selected from the recommended POIs by the overall sample of tourists to all the recommended POIs. The higher the average precision is, the higher the satisfaction of tourists with the recommended POIs will be, indicating that the precision of the recommendation algorithm is higher, and vice versa. The average recall reflects the proportion of the satisfied POIs selected from the recommended POIs by the overall sample of tourists to the total destination POIs in the research domain. The higher the average recall is, the higher the satisfaction of tourists will be, and the higher the precision of the algorithm in recommending satisfied POIs from all POIs will be.
P ¯ r e c i s i o n = 1 k × i = 1 k N P O I N P O I ( i ) N P O I
R ¯ e c a l l = 1 k × i = 1 k N P O I N P O I ( i ) m
(2)
Average precision and average recall of POI route
Set one tourist sample as X ( i ) . Based on the N P O I number of recommended POIs, the POI route algorithm searches for the optimal tour routes and recommends them to the tourist sample. When the recommended number of POIs is N P O I , the overall sample number of tour routes that meet the algorithm conditions is A ( N P O I , N P O I ) , and the optimal number of tour routes recommended by the algorithm for tourist sample under this condition is N R o u t e . At the same time, the system introduces information such as the direction, tendency, itinerary, and travel cost of each recommended route to the tourist, and the tourist sample makes “satisfied” (S) and “dissatisfied” (NS) judgments on the recommended routes. For the N R o u t e number of recommended routes, the number of POIs that make the sample tourist make a “dissatisfied” (NS) judgment is N R o u t e ( i ) , and the number of POIs that make a “satisfied” (S) judgment is N r o u t e N r o u t e ( i ) . If the number of tourist samples participating in the satisfaction evaluation is k , the average precision evaluation model of the POI route algorithm is constructed based on the tourist sample X ( i ) as shown in Formula (26). If tourists have browsed all the routes, the average recall evaluation model of the POI route recommendation algorithm is constructed based on the tourist sample X ( i ) , as shown in Formula (27). The k number of samples in the model can also represent k types or pieces of interest.
The average precision reflects the proportion of satisfied routes selected from the recommended routes by the overall sample of tourists for all the recommended routes. The higher the average precision, the higher the satisfaction of tourists with the recommended routes will be. It indicates that the accuracy of the recommended routes by the recommendation algorithm is higher, and vice versa. The average recall reflects the proportion of satisfied routes selected from the recommended routes by the overall sample of tourists among all the feasible routes in the research domain. The higher the average recall is, the higher the satisfaction of tourists will be, and the higher the accuracy of the algorithm in recommending satisfied routes from all the feasible routes will be.
P ¯ r e c i s i o n = 1 k × i = 1 k N R o u t e N R o u t e ( i ) N R o u t e
R ¯ e c a l l = 1 k × i = 1 k N R o u t e N R o u t e ( i ) A ( N P O I , N P O I )
(3)
The average deviation of attribute matching degree
As to the “satisfied” (S) and “dissatisfied” (NS) judgments on the recommended POIs made by the sample tourist X ( i ) , among which the N N P O I ( i ) number of “satisfied” POIs completely conforms to the tourist’s interests, while the N P O I ( i ) number of “dissatisfied” POIs does not conform to the tourist’s interests. For the dissatisfied POIs, the tourist selects other N P O I ( i ) number of satisfied POIs to replace. Note the POI sample recommended by the algorithm but marked “dissatisfied” as P N S ( u ) , with its tourism attribute recorded as λ T N S ( i ,   j ) . The POI that is replaced by the tourist sample is noted as P S e l ( u ) , and its tourism attribute is recorded as λ T S e l ( i ,   j ) . The mark i represents the POI number and j represents the tourism attribute number. Then the POI deviation model of attribute matching degree for one tourist sample X ( i ) is constructed in Formula (28), g is the number of tourism attributes. The average deviation model of attribute matching degree is shown in Formula (29).
The average deviation of attribute matching degree represents the spatial distance between the tourist’s satisfied POI and the recommended dissatisfied POI. The smaller the average deviation is, the higher the matching degree between the POI tourism attributes recommended by the algorithm and the tourist’s interests will be, and the stronger the algorithm’s ability to meet the tourist’s interests will be, and vice versa.
D e v POI ( u ) = i = 1 N P O I ( u ) j = 1 g λ T N S ( i , j ) λ T S e l ( i , j ) p 1 p
D ¯ e v POI = 1 k × u = 1 k D e v POI ( u )
(4)
Average deviation of route cost
For the “satisfied” (S) and “dissatisfied” (NS) judgments on the recommended routes, the N r o u t e N r o u t e ( i ) number of “satisfied” routes completely conforms to the tourist’s interests, while the N r o u t e ( i ) number of “dissatisfied” routes does not conform to the tourist’s interests. For the dissatisfied routes, the tourist selects satisfied ones to replace. Set the route recommended by the algorithm but “dissatisfied” by tourist is T N S ( u ) , and its travel cost is f T N S ( u , v ) , note the route that replace T N S ( u ) by tourist is T S e l ( u ) , and its travel cost is f T S e l ( u , v ) . The u is the tourist number and v is the route number, the deviation model of route cost for the tourist sample is constructed in Formula (30); and the average deviation of route cost is constructed in Formula (31).
The average deviation of route cost represents the overall cost difference between the routes that tourists are satisfied with and the recommended routes that they are not satisfied with. The smaller the average deviation is, the closer the recommended route cost of the algorithm to the tourists’ needs and budget will be, and the stronger the algorithm’s ability to meet tourists’ interests will be, and vice versa.
D e v Route ( u ) = v = 1 N r o u t e ( u ) f T N S ( u ,   v ) f T S e l ( u ,   v )
D ¯ e v Route = 1 k × u = 1 k D e v Route ( u )

4. Experiment and Results Analysis

4.1. Experimental Approach and Process

To verify the feasibility and advantages of the proposed algorithm, we designed the validation experiment and the comparative experiment. The specific experimental approaches and processes are as follows:
(1)
The first step is to use the constructed text mining algorithm to achieve destination POI natural attribute classification. Select the destination POIs of the tourism city, collect POI natural attribute labels λ N ( i ) and sub-labels λ N ( i ,   j ) , and construct the natural attribute vector λ N ( i ) and natural attribute matrix λ N ( i ,   j ) . By calculating the statistical label word frequency t f ( λ N ( i ) ) , inverse text frequency i d f ( λ N ( i ) ) and label weight t f i d f ( λ N ( i ) ) of each row for sub-labels in the matrix λ N ( i ,   j ) , a natural attribute structure tree Tree G N ( i ) of the POI is constructed to determine the natural classification of the POI.
(2)
The second step is to collect tourists’ once-visited POIs and confirm their tourism attributes and quantified intervals. Tourists determine their preferences for the tourism attributes and thus construct a training set for the Naive Bayes classification algorithm. By constructing the Naive Bayes classifier, the recommendation degrees of the destination POIs are calculated, and the tourism attribute classifications are obtained. By combining the natural attribute classification and tourism attribute classification, establish a destination POI decision tree and decision forest, and output the optimal POI recommendation.
(3)
The third step is to output sub-interval decision trees s u b t r · T r o ( i ,   j ) and an interval decision tree t r · T r o containing costs based on the recommended POIs and the urban geospatial constraints. Calculate the interval cost f i n · T r o of each tour route and ultimately output the optimal tour route.
(4)
The fourth step is to select the two most commonly used electronic maps for tourism route planning, GaoDe Map and 360 Map, as the control group method, while the proposed algorithm is set as the experimental group. Use the same experimental conditions and methods to output the optimal tour routes, make comparisons on the costs of the optimal routes from the three methods, and then get the relative results and conclusions.
(5)
To evaluate the satisfaction degree of sample tourists with recommended POIs and POI routes, we use the satisfaction evaluation models constructed in Section 3.4 to make comparisons between the proposed algorithm (PRA), the item-based collaborative filtering recommendation method (IBCF), and the user-based collaborative filtering recommendation method (UBCF) in terms of POI average precision, average recall, and average deviation of attribute matching degree. At the same time, make comparisons between the proposed algorithm (PRA) and the map-searching algorithms GDM and 360M in terms of average precision, average recall, and average deviation of route cost. According to the satisfaction evaluation models constructed in Section 3.4, each evaluation index could represent and reflect a tourist’s satisfaction degree.

4.2. Data Collection

The experiment collects the following data.
(1)
For the destination POI natural attribute classification, select an encyclopedia big data text containing 10,000 words, including natural attribute sub-label text. Select another 1000 texts about POI introduction as the POI text mining corpus. Divide the natural attribute category into the following labels: G N ( 1 ) “natural scenery”, G N ( 2 ) “cultural history”, G N ( 3 ) “leisure shopping” and G N ( 4 ) “amusement park and venue”. Each label contains 5 sub-labels λ N ( i ,   j ) , and the data table is shown in Table 2. When calculating, synonyms and related words related to the sub-labels are also included in the statistics. Calculate the t f i d f ( λ N ( i ) ) for each label λ N ( i ) , and construct the natural attribute structure tree of POI Tree G N ( i ) to output its natural attribute classification G N ( i ) .
The selected 15 destination POIs in the tourism city Chengdu are: P a ( 1 ) Jinsha Site; P a ( 2 ) Tazishan Park; P a ( 3 ) Kuanzhai Alley; P a ( 4 ) Jinniu Wanda; P a ( 5 ) Happy Valley; P a ( 6 ) Eastern Suburb Memory; P a ( 7 ) The People’s Park; P a ( 8 ) Raffles Plaza; P a ( 9 ) Sichuan Museum; P a ( 10 ) Du Fu Thatched Cottage; P a ( 11 ) San Sheng Hua Xiang; P a ( 12 ) Chunxi Road; P a ( 13 ) Guose Tianxiang Amusement Park; P a ( 14 ) Qinglong Lake; P a ( 15 ) Huanhuaxi Park.
Table 2. The collected labels and sub-labels for natural attribute classification.
Table 2. The collected labels and sub-labels for natural attribute classification.
Label   λ N ( i ) λ N ( 1 )
Natural Scenery
λ N ( 2 )
Cultural History
λ N ( 3 )
Leisure Shopping
λ N ( 4 )
Amusement Park and Venue
Sub-label λ N ( i ,   1 ) River viewHistorical siteCulinary experienceLeisure sports
λ N ( i ,   2 ) Lake and reservoirAncient town and cityShoppingSports and competitions
λ N ( i ,   3 ) Greenland and parkLandscape artTheater and movieAnime and animation
λ N ( i ,   4 ) Forest viewFolk cultureIndoor leisureAdventure experience
λ N ( i ,   5 ) Mountain viewRoyal MausoleumTheater performanceRides and shows
(2)
For the tourism attribute classification of destination POIs, suppose a sample tourist for the experiment. Use the tourist’s once-visited 15 POIs as well as their tourism attributes as the training set. The sample tourist determines the preference degree for each POI as: C ( 1 ) : most favorite; C ( 2 ) : favorite; C ( 3 ) : like. The tourism attributes are: λ T ( 1 ) “travel cost” (Unit: yuan), λ T ( 2 ) “travel time” (Unit: hour), λ T ( 3 ) “POI- A Class” and λ T ( 4 ) “POI popularity”. The quantified sub-interval λ T ( i ,   j ) for each attribute is determined as shown in Table 3. Use the proposed symmetry-based Naive Bayes classification algorithm to classify the tourism attributes of the destination POIs, introduce the tourist interest weight ε ( i ) and tourism attribute interest weight ω ( i ) , and construct the decision trees and the decision forest to obtain the optimal POI recommendation.
Table 3. The collected quantified tourism attribute data interval of POI.
Table 3. The collected quantified tourism attribute data interval of POI.
Label   λ T ( i ) Quantified   Data   Interval   of   the   Sub - Label   λ T ( i ,   j )
λ T ( 1 ) Travel cost 0 λ T ( 1 ,   1 ) 50 50 < λ T ( 1 ,   2 ) 100 100 < λ T ( 1 ,   3 ) 200 λ T ( 1 ,   4 ) > 200
λ T ( 2 ) Travel time 0 λ T ( 2 ,   1 ) 1 1 < λ T ( 2 ,   2 ) 2 2 < λ T ( 2 ,   3 ) 3 λ T ( 2 ,   4 ) > 3
λ T ( 3 ) POI- A Class 1 λ T ( 3 ,   1 ) 2 2 < λ T ( 3 ,   2 ) 3 3 < λ T ( 3 ,   3 ) 4 λ T ( 3 ,   4 ) > 4
λ T ( 4 ) POI popularity λ T ( 4 ,   1 ) 0.9 0.9 < λ T ( 4 ,   2 ) 0.93 0.93 < λ T ( 4 ,   3 ) 0.96 0.96 < λ T ( 4 ,   3 ) 1
The once-visited POIs and tourist interest weights collected in the experiment are: P b ( 1 ) West Lake in Hangzhou (0.5); P b ( 2 ) Henan Museum (0.8); P b ( 3 ) Suzhou Gardens (0.7); P b ( 4 ) Zhengzhou Zhongyuan Wanda Plaza (0.1); P b ( 5 ) Xi’an Yanta Square (0.7); P b ( 6 ) Pingyao Ancient City (0.6); P b ( 7 ) Qinghai Lake (0.4); P b ( 8 ) Wangfujing, Beijing (0.2); P b ( 9 ) Yu Garden in Shanghai (0.5); P b ( 10 ) The Taihu Lake (0.6); P b ( 11 ) Xi’an Vientiane City (0.2); P b ( 12 ) Huangguoshu Waterfall (0.6); P b ( 13 ) Beijing Beihai Park (0.5); P b ( 14 ) Guangzhou Changlong Resort (0.3); P b ( 15 ) Zhengzhou Fangte Happy World (0.3).
The tourism attribute interest weights ω ( i ) are set as: λ T ( 1 ) : Travel cost (0.3); λ T ( 2 ) : Travel time (0.3); λ T ( 3 ) : POI- A Class (0.2); λ T ( 4 ) : POI popularity (0.2).
(3)
Use the geospatial data of Chengdu as the constraint. Iteratively calculate the cost of each POI sub-interval f s u b i n · T r o ( i , j ) by using the tour route algorithm, and construct a sub-interval decision tree s u b t r · T r o ( i ,   j ) to output the optimal moving path for each sub-interval. Iteratively calculate the cost of each interval f i n · T r o based on the optimal cost of sub-intervals f s u b i n · T r o ( i , j ) and construct an interval decision tree t r · T r o . Output the optimal route of the interval, corresponding to the optimal tour route. As to the geographic information collection in Chengdu city, the experiment collects the moving distances between road nodes within each sub-interval.
(4)
The comparative experiment is conducted to select the most commonly used electronic maps for tourism route planning, including GaoDe Map and 360 Map. The experimental group is the proposed tour route algorithm. The experimental conditions for the three methods are the recommended POIs and the same geospatial constraints. The control group searches for POI sub-interval moving paths, outputs the travel cost, and finally iteratively outputs the relative optimal tour routes and corresponding costs. By comparing the optimal routes and cost output of the three methods, the advantages of the proposed algorithm are demonstrated.
(5)
The tourist satisfaction degree evaluation experiment determines the sample size of tourists as k = 60 and it evaluates satisfaction degrees in two aspects: Firstly, the recommended number of POIs is N P O I = 4 . The overall number of “dissatisfied” POIs that tourists may output meets 0 N P O I ( i ) 4 . According to the quantity of destination POIs in Chengdu, there is m = 15 . Tourists provide constraints based on their interests, and PRA, IBCF, and UBCF, respectively, output the recommended POIs N P O I = 4 . Tourists determine the POIs of “dissatisfied” (NS) and “satisfied” (S) in each group of recommended POIs, calculate the average precision, average recall, and average deviation of attribute matching degree for the three sets of algorithms, and then make comparisons. Secondly, the starting point of the route is Tianfu Square, and the recommended number of routes meets N R o u t e = 4 . The overall number of “dissatisfied” routes that tourists may output meets 0 N R o u t e 4 . Based on the recommended number of POIs N P O I = 4 , the overall sample of the route is A ( N P O I , N P O I ) = 24 . PRA, GDM, and 360M, respectively, output recommended routes N R o u t e = 4 , and tourists determine the “dissatisfied” (NS) and “satisfied” (S) routes in each group of recommended routes. The average precision, average recall, and average deviation of route cost of the three algorithms are calculated and compared.

4.3. Results and Analysis

4.3.1. The Results and Analysis on the POI Natural Attribute Classification

Calculate the label word frequency t f ( λ N ( i ) ) and inverse text frequency i d f ( λ N ( i ) ) of each destination POI corresponding to the natural attribute classification G N ( i ) through the constructed text mining algorithm and decision tree algorithm Tree G N ( i ) , and finally calculate the label weight t f i d f ( λ N ( i ) ) corresponding to each natural attribute classification G N ( i ) of the POI. Table 4 shows the calculated label weights t f i d f ( λ N ( i ) ) of the destination POIs by the proposed algorithm. The bold data in the table represents the corresponding natural attribute classification of the destination POI. Figure 8 shows the trend of POI label weights t f i d f ( λ N ( i ) ) under each natural attribute classification. Figure 8A shows a histogram of the weight distribution of each POI for each classification G N ( i ) , and Figure 8B shows the weight trend of each POI in each classification G N ( i ) . The natural attribute classification of the destination POI could be determined through the results in Table 4 and the trend chart in Figure 8.
The constructed text mining algorithm is used to classify the natural attributes of destination POIs, and then the natural attribute features of the POIs are obtained and incorporated into the recommendation decision tree algorithm. According to the Table 4 data, the destination POI label weights t f i d f ( λ N ( i ) ) have different output values for different classifications G N ( i ) . For arbitrary destination POI P a ( i ) , the highest label weight value in G N ( 1 ) ~ G N ( 4 ) relates to the natural attribute classification of P a ( i ) . The natural attribute calculation results are as follows:
(1)
POIs belonging to the classification of G N ( 1 ) “natural scenery” include: P a ( 2 ) Tazishan Park; P a ( 7 ) The People’s Park; P a ( 7 ) San Sheng Hua Xiang; P a ( 14 ) Qinglong Lake; P a ( 15 ) Huanhuaxi Park.
(2)
POIs belonging to the classification of G N ( 2 ) “culture history” include: P a ( 1 ) Jinsha Site; P a ( 3 ) Kuanzhai Alley; P a ( 6 ) Eastern Suburb Memory; P a ( 9 ) Sichuan Museum; P a ( 10 ) Du Fu Thatched Cottage.
(3)
POIs belonging to the classification of G N ( 3 ) “leisure shopping” include: P a ( 4 ) Jinniu Wanda; P a ( 8 ) Raffles Plaza; P a ( 12 ) Chunxi Road.
(4)
POIs belonging to the classification of G N ( 4 ) “amusement park and venue” include: P a ( 5 ) Happy Valley; P a ( 13 ) Guose Tianxiang Amusement Park.
The experiment proves that the proposed text mining algorithm can classify the natural attributes of the destination POIs, and the classification results are reasonable. Analyze the results in Figure 8. There is a significant difference in the label weight of the corresponding classification for each POI, in which the G N ( i ) weight that the POI belongs to is the highest. In the same natural attribute classification G N ( i ) , the weight of each POI shows a fluctuating trend. For arbitrary one trend curve of G N ( i ) , where there is a peak, the probability of the corresponding POI being included in the related category is higher, and vice versa. The POI corresponding to the maximum peak of one curve is the POI belonging to the classification of the curve.

4.3.2. The Results and Analysis on POI Tourism Attribute Classification and Recommendation Decision Tree

Quantify the tourism attribute labels λ T ( i ) and sub-labels λ T ( i ,   j ) of the once-visited POIs. The sample tourist determines the preferences to the classifications of the once-visited POIs: C ( 1 ) : “Most Favorite”, C ( 2 ) : “Favorite”, C ( 3 ) : “Like”. Construct the symmetry-based Naive Bayes classification algorithm for the destination POI P a ( i ) classification, and calculate the recommendation degrees δ N B of the destination POIs. Table 5 shows the recommendation degrees δ N B of the destination POIs P a ( i ) under the conditions of classifications C ( i ) output by the constructed symmetry-based Naive Bayes classification algorithm. The bold data in the table corresponds to the tourism attribute classifications of the destination POIs. Figure 9 shows the recommendation degree distribution of each target POI belonging to C ( i ) . In Figure 9A, the red curve represents the classification C ( 1 ) , In Figure 9B, the blue curve represents the classification C ( 2 ) , In Figure 9C, the green curve represents the classification C ( 3 ) . Figure 9D shows the comparison of the three types of curves. By the tourism attribute interest network model N e t · λ T ( i ) constructed by weight ε ( i ) and ω ( i ) , and the recommendation degree model δ N B , δ M A and δ ( i ) , the recommendation degrees δ ( i ) of destination POIs are calculated, and the results are shown in Table 6. According to the POI natural attribute classification results in Table 4 and the recommendation degrees δ ( i ) in Table 6, the recommendation degree decision tree and decision forest are output. Figure 10 shows the constructed destination POI recommendation degree decision tree Tree C ( i ) and decision forest F o r e s t   C ( i ) .
Analyze the output recommendation degree results in Table 5 and the recommendation degree distribution of each destination POI belonging to the classification C ( i ) in Figure 9. The proposed improved symmetry-based Naive Bayes classification algorithm classifies each destination POI into a tourism attribute classification C ( i ) . As to the classification problem for the POIs, the conditional probabilities of POIs belonging to different classifications are determined by the tourism attributes of the once-visited POIs and the classifications on tourists’ preferences. The higher the conditional probability value for C ( i ) is, the higher the tourists’ preference and recommendation degree for this type of POI will be. Figure 9A–C show the distributions of recommendation degrees for the destination POIs in the same classification C ( i ) . The results show that the recommendation degrees for POIs in an arbitrary classification C ( i ) show the fluctuating trend. The higher the data peak is, the higher the probability of the POI belonging to the related classification C ( i ) will be, while the lower the data peak is, the lower the probability of POI belonging to the related classification C ( i ) will be. Analyzing Figure 9D, the same POI has different recommendation degrees for different classifications C ( i ) and different data peaks. The classification C ( i ) with the highest recommendation degree corresponds to the classification of the POI. According to the results of Table 5 and Figure 9, the destination POI that belongs to the C ( 1 ) “Most Favorite” is: P a ( 5 ) Happy Valley; The POIs that belong to the C ( 2 ) : “Favorite” include: P a ( 1 ) Jinsha Site; P a ( 6 ) Eastern Suburb Memory; P a ( 10 ) Du Fu Thatched Cottage; P a ( 15 ) Huanhuaxi Park. The remaining POIs belong to the C ( 3 ) “like”. The system recommends POIs from the two categories: C ( 1 ) “Most Favorite” and C ( 2 ) : “Favorite”.
Analyzing the POI recommendation degree decision tree and decision forest constructed in Figure 10, from the perspective of visualization, it can be seen that each tourism attribute classification’s branching sub-nodes contain four natural attribute classifications G N ( i ) . The sub-nodes in each classification G N ( i ) are generated by the descending order of the recommendation degrees. Through the decision tree, the natural attribute classification, tourism attribute classification, and recommendation degrees of POIs could be easily found.
After analyzing the results in Table 6, the following conclusions can be obtained: (1) After introducing the tourism attribute interest network model N e t · λ T ( i ) into the Naive Bayes classification algorithm, the calculated recommendation degrees δ ( i ) show a fluctuating trend, with a smaller volatility compared to the recommendation degrees δ N B output by the Naive Bayes classification algorithm. (2) According to Table 5 and Table 6, the variance of the recommendation degrees δ N B for the POIs is calculated to be 0.0049. After introducing the interest network model N e t · λ T ( i ) , the recommendation degree variance is 0.0041, indicating that the improved Naive Bayes classification algorithm can integrate tourists’ interests in tourism attributes λ T ( i ) and the matching degree with POI tourism attributes, so that the recommendation results not only meet the classification criteria for tourists’ interests in historical visited POIs but also meet the matching criteria for tourists’ interests in POI tourism attributes. The recommendation results are more accurate. (3) The introduced tourism attribute interest network model N e t · λ T ( i ) utilizes the tourism attribute interest weight ω ( i ) to construct the spatial relationship between tourism attributes λ T ( i ) . It is an optimization of the Naive Bayes classification algorithm that must require independent attributes. It better conforms to the objective laws of tourists choosing POIs and determining tourism attributes in real-world travel scenarios. Therefore, the recommendation results of the Naive Bayes classification algorithm are better after introducing the model N e t · λ T ( i ) . (4) The introduced tourism attribute interest network model N e t · λ T ( i ) utilizes the historical visited POI interest weights ε ( i ) to construct a relationship model between tourists and the tourism attributes of historical visited POIs. It can more accurately output tourists’ interests in tourism attributes, making the recommended POIs by the improved Naive Bayes classification algorithm more accurate.

4.3.3. Results and Analysis on the Tour Route Recommendation

Experiment sets that the natural attributes of the expected POIs for the sample tourist as: G N ( 2 ) : “culture history” and G N ( 3 ) : “leisure shopping”. According to the data in Table 6 and the results in Figure 9 and Figure 10, the optimal POIs recommended for the sample tourist are P a ( 5 ) Happy Valley in C ( 1 ) , and P a ( 1 ) Jinsha Site, P a ( 6 ) Eastern Suburb Memory, P a ( 10 ) Du Fu Thatched Cottage, P a ( 15 ) Huanhua Creek Park in C ( 2 ) . The system recommends four POIs P a ( 1 ) Jinsha sites, P a ( 5 ) Happy Valley, P a ( 6 ) Eastern Suburb Memory, and P a ( 10 ) Du Fu Thatched Cottage, as the destinations for the tour. The starting point S is Tianfu Square. According to the constructed tour route recommendation algorithm, 10 route sub-intervals s u b i n · T r o ( i ,   j ) are formed between the starting point and POIs. The sub-interval costs are calculated through the sub-interval cost function f s u b i n · T r o ( i ,   j ) , and then the lowest cost of each sub-interval is output through the sub-interval decision tree s u b t r · T r o ( i ,   j ) , as shown in Table 7, in which “ A ,   B ” represents s u b i n · T r o ( A ,   B ) . All the sub-intervals s u b i n · T r o ( i ,   j ) constitute a route interval i n · T r o , and the cost f i n · T r o of each route interval is calculated and output as shown in Table 8. In the table, the “S,P-1,5,6,10” represents the tour route, which is formed from the starting point S and the sample tourist travels in the order of P a ( 1 ) , P a ( 5 ) , P a ( 6 ) and P a ( 10 ) . The symbol s u ( x ) in Table 8 represents the sub-interval s u b t r · T r o ( i ,   j ) ( x ) . Figure 11 shows the final output tour route interval cost decision tree t r · T r o .
The optimal POIs are confirmed by the recommendation algorithm, and the constructed tour route recommendation algorithm is used to output the costs f s u b i n · T r o ( i ,   j ) of sub-intervals in Table 7 and the costs f i n · T r o of route intervals in Table 8. Analyzing the data in Table 8, when the starting point and POIs are identical, different tour route intervals are formed by the route vector T r o . Each tour interval corresponds to a tour route, and the interval costs f i n · T r o of different tour routes are discrepant. The higher the interval cost is, the higher the travel cost of the tour route will be. The lower the interval cost is, the lower the travel cost of the tour route will be. According to the calculation results in Table 8, the travel cost of route “S,P-10,1,5,6” is the lowest, which is at 25.1, which means that the sample tourist will pay the lowest travel cost when traveling in the order of “ S Tianfu Square- P a ( 10 ) Du Fu Thatched Cottage- P a ( 1 ) Jinsha Site- P a ( 5 ) Happy Valley- P a ( 6 ) Eastern Suburb Memory”; the travel cost of route “S,P-6,10,1,5” takes second place at 26.4, which is “ S -Tianfu Square- P a ( 6 ) Eastern Suburb Memory- P a ( 10 ) Du Fu Thatched Cottage- P a ( 1 ) Jinsha Site- P a ( 5 ) Happy Valley”; The third place is the cost of route “S,P-6,5,1,10”, which is 27.5, the route is “ S -Tianfu Square- P a ( 6 ) Eastern Suburb Memory- P a ( 5 ) Happy Valley- P a ( 1 ) Jinsha Site- P a ( 10 ) Du Fu Thatched Cottage”. Figure 11 shows the constructed tour route interval cost decision tree t r · T r o , which is a complete binary tree with 24 nodes and meets the rule of ascending heap sorting. The sub-node of the tree stores the optimal tour route “S,P-10,1,5,6” with the lowest cost. The decision tree could be used to quickly find and recommend the optimal tour route and the suboptimal ones.

4.3.4. Results and Analysis on the Methods Comparison

In the comparative experiment, the commonly used GaoDe map and 360 map for tourism route planning are selected as the control group, and the proposed algorithm is set as the experimental group. The control group uses relative route search methods to conduct the lowest-cost route searching performance on the same POIs P a ( 1 ) Jinsha Site, P a ( 5 ) Happy Valley, P a ( 6 ) Eastern Suburb Memory, and P a ( 10 ) Du Fu Thatched Cottage. Table 9 shows the lowest cost of each sub-interval obtained by the control group methods. The control group outputs the interval cost decision trees based on the constructed interval decision tree algorithm and outputs three optimal tour routes. The sub-interval costs and interval costs are shown in Table 10. Table 11 takes the optimal route as an example to calculate the sub-interval cost differences Δ f s u b i n · T r o ( i ,   j ) , interval cost differences Δ f i n · T r o and route cost optimization rate R i m p on the experimental group compared to the control group. The symbol s u ( x ) in Table 10 and Table 11 represents the sub-interval.
Analyze the comparison results between the experimental group and the control group. Table 9 shows the costs f s u b i n · T r o ( i ,   j ) of sub-intervals searched by the control group methods, and the optimal routes and suboptimal routes of the control group are output based on the cost. Table 10 and Table 11 show the comparison of the optimal route and suboptimal route between the experimental group and the control group. The results show that the optimal route output by the experimental group and the control group is “S,P-10,1,5, 6”, and the suboptimal routes are “S,P-6,10,1,5” and “S,P-6,5,1,10”. It indicates that the experimental group and the control group have the same principle in searching for the optimal route. However, in terms of the output route cost, the control group methods produce a higher cost than that of the experimental group method, which is manifested in the sub-interval cost and the total route cost.
(1)
Comparison between GDM and the experimental group PRA: The sub-interval costs of the optimal route “S,P-10,1,5,6” are higher than those of PRA, and the route cost is 1.2 higher than that of PRA; the sub-interval costs of the suboptimal route “S,P-6,10,1,5” are all higher than those of PRA, and the route cost is 0.8 higher than that of PRA; the sub-interval costs of the suboptimal route “S,P-6,5,1,10” are all higher than those of PRA, and the route cost is 1.1 higher than that of PRA.
(2)
Comparison between 360M and the experimental group PRA: The sub-interval costs of the optimal route “S,P-10,1,5,6” are higher than those of PRA, and the route cost is 2.9 higher than that of PRA; the sub-interval costs of the suboptimal route “S,P-6,10,1,5” are all higher than those of PRA, and the route cost is 2.3 higher than that of PRA; the sub-interval costs of the suboptimal route “S,P-6,5,1,10” are all higher than those of PRA, and the route cost is 2.7 higher than that of PRA.
(3)
Comparing to the GDM, the PRA reduces the travel cost by 4.56% in terms of optimal route cost and 2.94% and 3.85% in terms of suboptimal route costs, respectively. Comparing to the 360M, the PRA reduces the travel cost by 10.36% in terms of optimal route cost, and 8.01% and 8.94% in terms of suboptimal route costs respectively. According to the comparison results, it can be concluded that the proposed tour route recommendation algorithm can effectively reduce the travel cost of the traditional routes and has obvious advantages over the traditional methods.

4.3.5. Evaluation and Analysis of Tourist Satisfaction

The data collection for the satisfaction evaluation experiment is conducted on a sample of 60 tourists, representing 60 kinds or pieces of tourists’ interests. And then the POIs are output through the proposed algorithm (PRA), the item-based collaborative filtering recommendation (IBCF), and the user-based collaborative filtering recommendation (UBCF). Then the satisfaction evaluations are conducted by sample tourists, and the average precision ( P ¯ r e c i s i o n ), average recall ( R ¯ e c a l l ), and average deviation of attribute matching degree ( D ¯ e v POI ) are calculated. The results are shown in Table 12. Based on the recommended POIs, the proposed algorithm (PRA), GaoDe Map Method (GDM), and 360 Map Method (360M) are used to recommend routes, and the satisfaction evaluations are conducted by sample tourists. Then the average precision ( P ¯ r e c i s i o n ), average recall ( R ¯ e c a l l ), and average cost deviation of the recommended routes ( D ¯ e v Route ) are calculated. The results are shown in Table 13.
Sample tourists evaluate the satisfaction of each algorithm’s recommended POIs and routes and make choices based on the recommendation results. By using the satisfaction evaluation model we have proposed, the results in Table 12 and Table 13 that reflect tourists’ satisfactions are obtained. Analyzing the data in Table 12, in terms of POI recommendation satisfaction evaluations, PRA has the highest average precision of 0.5357, which is higher than that of the control group’s recommendation method IBCF of 0.3929 and UBCF of 0.3214. It indicates that the overall sample of tourists has the highest satisfaction with the POIs recommended by our proposed algorithm, and it also indicates that the precision of the POIs recommended by the proposed algorithm is the highest. The average recall rate of PRA is the highest, at 0.1429, which is higher than that of IBCF in the control group, which is 0.1048, and UBCF, which is 0.0857. It indicates that the overall sample of tourists has the highest satisfaction with the POIs recommended by our proposed algorithm, and the algorithm has the highest precision in recommending satisfied POIs from all destination POIs. The average deviation of attribute matching degree for PRA is the lowest, at 1.0116, which is lower than that of the control group’s recommendation method, IBCF, which is 1.8235, and UBCF, which is 2.0720. It indicates that the proposed algorithm has the strongest ability to recommend POIs that meet tourists’ interests.
Analyzing the data in Table 13, in terms of satisfaction evaluation of tourist routes, PRA has the highest average precision of 0.7500, which is higher than that of the control group’s route method GDM of 0.5000 and 360M of 0.3214. It indicates that the overall sample of tourists has the highest satisfaction with the recommended routes by the proposed algorithm, and it also indicates that the precision of the recommended routes by the proposed algorithm is the highest. The average recall of PRA is the highest, at 0.1250, which is higher than that of GDM in the control group, which is 0.0833, and that of 360M, which is 0.0536. It indicates that the overall sample of tourists has the highest satisfaction with the recommended routes of the proposed algorithm, and the algorithm has the highest precision in recommending satisfied routes from all the searched routes. The average cost deviation of the PRA route is the lowest, at 6.2714, and it is lower than that of the GDM, which is 9.0000, and that of 360M, which is 10.7643, indicating that the proposed algorithm has the strongest ability to recommend routes that meet tourists’ interests while saving travel costs.
From the perspective of satisfaction evaluation, the comparison experiment proves that the proposed algorithm can better satisfy tourists than traditional recommendation algorithms and map route planning methods. It performs better in terms of each satisfaction index and ability than traditional methods. The recommended POIs are better than traditional recommendation algorithms IBCF and UBCF in terms of satisfaction degree, and the recommended routes are better than traditional map route planning methods GDM and 360M in terms of satisfaction degree.

5. Conclusions

On the basis of analyzing the current research background and existing problems of POI tour routes, this paper proposes and constructs a POI tour route recommendation model based on the improved symmetry-based Naive Bayes mining and spatial decision forest search. The POI natural attribute classification model is constructed based on text mining, and the destination POIs are classified into natural attribute classifications. Furthermore, an improved symmetry-based Naive Bayes classification algorithm for the destination POI tourism attribute classification is constructed through once-visited POIs, and the destination POIs are classified into different tourism attribute classifications. By constructing a spatial decision forest algorithm, the POIs with natural attributes under different tourism attribute classifications are sorted, and a recommendation model is established to output the optimal POIs. Based on the recommended POIs, a POI tour route recommendation model based on the spatial decision tree algorithm is established, which outputs the tour route with the lowest sub-interval costs and interval cost. Finally, the validation experiment and the comparative experiment are performed to output the optimal POIs and tour routes by using the proposed algorithm. Then the proposed algorithm is compared with the commonly used route planning methods, GDM and 360M, demonstrating the advantages of the proposed algorithm compared to the traditional route planning methods. The output POI tour routes can effectively reduce travel costs using the proposed algorithm.

Author Contributions

Conceptualization, X.Z., J.P., B.W. and M.S.; methodology, X.Z. and J.P.; formal analysis, X.Z., B.W. and M.S.; writing—original draft preparation, X.Z. and B.W.; writing—review and editing, X.Z., J.P., B.W. and M.S.; funding acquisition, X.Z., J.P. and B.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key R&D Program of Sichuan Province, China (No. 2022YFG0034, 2023YFG0115), the Cooperative Program of Sichuan University and Yibin (No. 2020CDYB-30), the National Natural Science Foundation of China (Grant No. 42101455), the Cooperative Program of Sichuan University and Zigong (2022CDZG-6), the Key Research Base of Region and Country of Sichuan Province, Center for Southeast Asian Economic and Culture Studies (No. DNY2301) and the Leshan Science and Technology Plan Project (No. 22ZRKX025).

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Darapisut, S.; Amphawan, K.; Leelathakul, N.; Rimcharoen, S. A Hybrid POI Recommendation System Combining Link Analysis and Collaborative Filtering Based on Various Visiting Behaviors. ISPRS Int. J. Geo-Inf. 2023, 12, 431. [Google Scholar] [CrossRef]
  2. Lee, J.; Kim, J. Developing a Convenience Store Product Recommendation System through Store-Based Collaborative Filtering. Appl. Sci. 2023, 13, 11231. [Google Scholar] [CrossRef]
  3. Aldayel, M.; Al-Nafjan, A.; Al-Nuwaiser, W.; Alrehaili, G.; Alyahya, G. Collaborative Filtering-Based Recommendation Systems for Touristic Businesses, Attractions, and Destinations. Electronics 2023, 12, 4047. [Google Scholar] [CrossRef]
  4. Alabduljabbar, R. Matrix Factorization Collaborative-Based Recommender System for Riyadh Restaurants: Leveraging Machine Learning to Enhance Consumer Choice. Appl. Sci. 2023, 13, 9574. [Google Scholar] [CrossRef]
  5. Lin, S. Implementation of Personalized Scenic Spot Recommendation Algorithm Based on Generalized Regression Neural Network for 5G Smart Tourism System. Comput. Intel. Neurosc. 2022, 2022, 3704494. [Google Scholar] [CrossRef] [PubMed]
  6. Bin, C.; Gu, T.; Jia, Z.; Zhu, G.; Xiao, C. A Neural Multi-context Modeling Framework for Personalized Attraction Recommendation. Multimed. Tools. Appl. 2020, 79, 14951. [Google Scholar] [CrossRef]
  7. Li, G.; Hua, J.; Yuan, T.; Wu, J.; Jiang, Z.; Zhang, H.; Li, T. Novel Recommendation System for Tourist Spots Based on Hierarchical Sampling Statistics and SVD++. Math. Probl. Eng. 2019, 2019, 2072375. [Google Scholar] [CrossRef]
  8. Mizutani, Y.; Yamamoto, K. A Sightseeing Spot Recommendation System That Takes into Account the Change in Circumstances of Users. ISPRS Int. J. Geo-Inf. 2017, 6, 303. [Google Scholar] [CrossRef]
  9. Huang, C.; Liu, M.; Gong, H.; Xu, F. Season-aware Attraction Recommendation Method with Dual-trust Enhancement. J. Intell. Fuzzy Syst. 2017, 33, 2437–2449. [Google Scholar] [CrossRef]
  10. Kesorn, K.; Juraphanthong, W.; Salaiwarakul, A. Personalized Attraction Recommendation System for Tourists Through Check-In Data. IEEE Access 2017, 5, 26703–26721. [Google Scholar] [CrossRef]
  11. Remigijus, P.; Linas, S.; Simona, S.; Dmitrij, K.; Ernestas, F. A Novel Greedy Genetic Algorithm-based Personalized Travel Recommendation System. Expert. Syst. Appl. 2023, 230, 120580. [Google Scholar]
  12. Zhang, Y.; Liu, S. A Picture-Based Approach to Tourism Recommendation System. Front. Soc. Sci. Technol. 2023, 5, 124–130. [Google Scholar]
  13. Liang, S.; Jin, J.; Ren, J.; Du, W.; Qu, S. An Improved Dual-Channel Deep Q-Network Model for Tourism Recommendation. Big Data 2023, 11, 268–281. [Google Scholar] [CrossRef] [PubMed]
  14. Han, S.; Liu, C.; Chen, K.; Gui, D.; Du, Q. A Tourist Attraction Recommendation Model Fusing Spatial, Temporal, and Visual Embeddings for Flickr-Geotagged Photos. ISPRS Int. J. Geo-Inf. 2021, 10, 20. [Google Scholar] [CrossRef]
  15. Mulia, W.; Chelsia, P.; Widi, B.; Rizqina, M. Discovering the Importance of Halal Tourism for Indonesian Muslim Travelers: Perceptions and Behaviors When Traveling to a Non-Muslim Destination. J. Islamic Mark. 2023, 14, 61–81. [Google Scholar]
  16. Wang, Y.; Huang, Y.; Yang, K.; Chen, Z.; Luo, C. Generator Fault Classification Method Based on Multi-Source Information Fusion Naive Bayes Classification Algorithm. Energies 2022, 15, 9635. [Google Scholar] [CrossRef]
  17. Wang, Q.; Wang, F.; Li, Z.; Jiang, P.; Ren, F.; Nie, F. Efficient Random Subspace Decision Forests with a Simple Probability Dimensionality Setting Scheme. Inform. Sci. 2023, 638, 118993. [Google Scholar] [CrossRef]
  18. Li, L.; Gao, Q. Researching Tourism Space in China’s Great Bay Area: Spatial Pattern, Driving Forces and Its Coupling with Economy and Population. Land 2023, 12, 1878. [Google Scholar] [CrossRef]
  19. Wang, H.; Chen, X.; Ge, J.; Yan, Z.; He, X.; Song, Y.; Zhou, Q. Research on the Spatiotemporal Distribution and Cultural Tourism Strategy of Modern Educational Architectural Heritage in Nanjing. Sustainability 2023, 15, 14392. [Google Scholar] [CrossRef]
  20. Rabbiosi, C.; Meneghello, S. Questioning Walking Tourism from a Phenomenological Perspective: Epistemological and Methodological Innovations. Humanities 2023, 12, 65. [Google Scholar] [CrossRef]
Figure 1. The overall framework and flowchart of the research content.
Figure 1. The overall framework and flowchart of the research content.
Symmetry 15 02168 g001
Figure 2. The constructed POI natural attribute classification structure tree Tree   G N ( i ) .
Figure 2. The constructed POI natural attribute classification structure tree Tree   G N ( i ) .
Symmetry 15 02168 g002
Figure 3. The constructed tourism attribute interest network N e t · λ T ( i ) .
Figure 3. The constructed tourism attribute interest network N e t · λ T ( i ) .
Symmetry 15 02168 g003
Figure 4. The modeling process to generate the decision tree Tree C ( i ) and decision forest F o r e s t C ( i ) . (A) shows the initial state of the root node C ( 1 ) and the growth child node G N ( t ) . (B) shows a complete binary tree. (C) shows a recommendation structure tree Tree C ( 1 ) consisting of k number of derived binary trees from k number of G N ( t ) . (D) shows a recommendation decision forest F o r e s t C ( i ) by w number of recommendation decision trees Tree C ( i ) .
Figure 4. The modeling process to generate the decision tree Tree C ( i ) and decision forest F o r e s t C ( i ) . (A) shows the initial state of the root node C ( 1 ) and the growth child node G N ( t ) . (B) shows a complete binary tree. (C) shows a recommendation structure tree Tree C ( 1 ) consisting of k number of derived binary trees from k number of G N ( t ) . (D) shows a recommendation decision forest F o r e s t C ( i ) by w number of recommendation decision trees Tree C ( i ) .
Symmetry 15 02168 g004
Figure 8. The trend of POI label weights t f i d f ( λ N ( i ) ) under each natural attribute classification. (A) shows a histogram of the weight distribution of each POI for each classification G N ( i ) , and (B) shows the weight trend of each POI in each classification G N ( i ) .
Figure 8. The trend of POI label weights t f i d f ( λ N ( i ) ) under each natural attribute classification. (A) shows a histogram of the weight distribution of each POI for each classification G N ( i ) , and (B) shows the weight trend of each POI in each classification G N ( i ) .
Symmetry 15 02168 g008
Figure 9. The recommendation degree distribution of each target POI belonging to C ( i ) . In (A), the red curve represents the classification C ( 1 ) . In (B), the blue curve represents the classification C ( 2 ) . In (C), the green curve represents the classification C ( 3 ) . (D) shows the comparison of the three types of curves.
Figure 9. The recommendation degree distribution of each target POI belonging to C ( i ) . In (A), the red curve represents the classification C ( 1 ) . In (B), the blue curve represents the classification C ( 2 ) . In (C), the green curve represents the classification C ( 3 ) . (D) shows the comparison of the three types of curves.
Symmetry 15 02168 g009
Figure 10. The constructed destination POI recommendation degree decision tree Tree C ( i ) and decision forest F o r e s t   C ( i ) .
Figure 10. The constructed destination POI recommendation degree decision tree Tree C ( i ) and decision forest F o r e s t   C ( i ) .
Symmetry 15 02168 g010
Figure 11. The final output tour route interval cost decision tree t r · T r o .
Figure 11. The final output tour route interval cost decision tree t r · T r o .
Symmetry 15 02168 g011
Table 1. Tourist satisfaction evaluation indexes for POI and route recommendation.
Table 1. Tourist satisfaction evaluation indexes for POI and route recommendation.
Average
Precision
Average
Recall
Average Deviation of
Attribute Matching Degree
Average Deviation of Route Cost
POI
recommendation
Route
recommendation
Table 4. The calculated natural attribute weights of the destination POIs under each natural attribute classification.
Table 4. The calculated natural attribute weights of the destination POIs under each natural attribute classification.
G N ( 1 ) G N ( 2 ) G N ( 3 ) G N ( 4 ) G N ( 1 ) G N ( 2 ) G N ( 3 ) G N ( 4 )
P a ( 1 ) 0.00490.02670.00180.0012 P a ( 9 ) 0.00550.02960.00310.0012
P a ( 2 ) 0.02820.01530.00610.0102 P a ( 10 ) 0.01620.02480.00630.0012
P a ( 3 ) 0.00330.02410.01080.0051 P a ( 11 ) 0.02630.01310.01110.0130
P a ( 4 ) 0.00120.00830.02510.0128 P a ( 12 ) 0.00220.01120.02650.0118
P a ( 5 ) 0.00520.00710.01420.0318 P a ( 13 ) 0.00730.00540.01240.0282
P a ( 6 ) 0.00480.02230.01080.0081 P a ( 14 ) 0.02660.00960.00620.0094
P a ( 7 ) 0.02510.01350.01030.0111 P a ( 15 ) 0.02870.01200.00780.0090
P a ( 8 ) 0.00130.00580.02430.0114
Table 5. The recommendation degrees δ N B of the destination POIs P a ( i ) under the conditions of classifications C ( i ) .
Table 5. The recommendation degrees δ N B of the destination POIs P a ( i ) under the conditions of classifications C ( i ) .
P a ( 1 ) P a ( 2 ) P a ( 3 ) P a ( 4 ) P a ( 5 ) P a ( 6 ) P a ( 7 ) P a ( 8 )
C ( 1 ) 0.00370.01670.01670.01670.01110.00560.03330.0028
C ( 2 ) 0.04000.01070.01070.01070.00320.04000.01600.0266
C ( 3 ) 0.00210.20030.20020.20020.00830.01250.05010.0501
P a ( 9 ) P a ( 10 ) P a ( 11 ) P a ( 12 ) P a ( 13 ) P a ( 14 ) P a ( 15 )
C ( 1 ) 0.01670.00560.01110.01670.00370.01670.0028
C ( 2 ) 0.02130.07990.00800.02130.00050.02130.1332
C ( 3 ) 0.13350.00830.02500.13350.00630.13350.0334
Table 6. The POI recommendation degree δ ( i ) calculated by introducing the tourism attribute interest network model N e t · λ T ( i ) .
Table 6. The POI recommendation degree δ ( i ) calculated by introducing the tourism attribute interest network model N e t · λ T ( i ) .
P a ( 1 ) P a ( 2 ) P a ( 3 ) P a ( 4 ) P a ( 5 ) P a ( 6 ) P a ( 7 ) P a ( 8 )
δ ( i ) 0.03890.18380.18370.18370.00940.03670.04610.0459
P a ( 9 ) P a ( 10 ) P a ( 11 ) P a ( 12 ) P a ( 13 ) P a ( 14 ) P a ( 15 )
δ ( i ) 0.12250.07700.02300.12250.00620.12250.1220
Table 7. The lowest cost of the sub-interval output by the sub-interval cost function f s u b i n · T r o ( i ,   j ) and decision tree s u b t r · T r o ( i ,   j ) .
Table 7. The lowest cost of the sub-interval output by the sub-interval cost function f s u b i n · T r o ( i ,   j ) and decision tree s u b t r · T r o ( i ,   j ) .
S , P a ( 1 ) S , P a ( 5 ) S , P a ( 6 ) S , P a ( 10 ) P a ( 1 ) , P a ( 5 )
f s u b i n · T r o ( i ,   j ) 6.08.16.64.25.2
P a ( 1 ) , P a ( 6 ) P a ( 1 ) , P a ( 10 ) P a ( 5 ) , P a ( 6 ) P a ( 5 ) , P a ( 10 ) P a ( 6 ) , P a ( 10 )
f s u b i n · T r o ( i ,   j ) 12.13.811.97.810.8
Table 8. The tour route interval i n · T r o cost output by the interval cost function f i n · T r o and decision tree t r · T r o .
Table 8. The tour route interval i n · T r o cost output by the interval cost function f i n · T r o and decision tree t r · T r o .
i n · T r o s u ( 1 ) s u ( 2 ) s u ( 3 ) s u ( 4 ) f i n · T r o i n · T r o s u ( 1 ) s u ( 2 ) s u ( 3 ) s u ( 4 ) f i n · T r o
S,P-1,5,6,106.05.211.910.833.9S,P-6,1,5,106.612.15.27.831.7
S,P-1,5,10,66.05.27.810.829.8S,P-6,1,10,56.612.13.87.830.3
S,P-1,6,5,106.012.111.97.837.8S,P-6,5,1,106.611.95.23.827.5
S,P-1,6,10,56.012.110.87.836.7S,P-6,5,10,16.611.97.83.830.1
S,P-1,10,5,66.03.87.811.929.5S,P-6,10,1,56.610.83.85.226.4
S,P-1,10,6,56.03.810.811.932.5S,P-6,10,5,16.610.87.85.230.4
S,P-5,1,6,108.15.212.110.836.2S,P-10,1,5,64.23.85.211.925.1
S,P-5,1,10,68.15.23.810.827.9S,P-10,1,6,54.23.812.111.932.0
S,P-5,6,1,108.111.912.13.835.9S,P-10,5,1,64.27.85.212.129.3
S,P-5,6,10,18.111.910.83.834.6S,P-10,5,6,14.27.811.912.136.0
S,P-5,10,1,68.17.83.812.131.8S,P-10,6,1,54.210.812.15.232.3
S,P-5,10,6,18.17.810.812.138.8S,P-10,6,5,14.210.811.95.232.1
Table 9. The lowest cost f s u b i n · T r o ( i ,   j ) of sub-intervals output by the control group.
Table 9. The lowest cost f s u b i n · T r o ( i ,   j ) of sub-intervals output by the control group.
GDM S , P a ( 1 ) S , P a ( 5 ) S , P a ( 6 ) S , P a ( 10 ) P a ( 1 ) , P a ( 5 )
6.28.66.74.45.7
P a ( 1 ) , P a ( 6 ) P a ( 1 ) , P a ( 10 ) P a ( 5 ) , P a ( 6 ) P a ( 5 ) , P a ( 10 ) P a ( 6 ) , P a ( 10 )
12.13.912.38.710.9
360M S , P a ( 1 ) S , P a ( 5 ) S , P a ( 6 ) S , P a ( 10 ) P a ( 1 ) , P a ( 5 )
6.59.16.84.66.6
P a ( 1 ) , P a ( 6 ) P a ( 1 ) , P a ( 10 ) P a ( 5 ) , P a ( 6 ) P a ( 5 ) , P a ( 10 ) P a ( 6 ) , P a ( 10 )
12.74.012.89.411.3
Table 10. The output optimal tour routes by the experimental group method and the control group methods.
Table 10. The output optimal tour routes by the experimental group method and the control group methods.
PRA i n · T r o s u ( 1 ) s u ( 2 ) s u ( 3 ) s u ( 4 ) f i n · T r o
S,P-10,1,5,64.23.85.211.925.1
S,P-6,10,1,56.610.83.85.226.4
S,P-6,5,1,106.611.95.23.827.5
GDM i n · T r o s u ( 1 ) s u ( 2 ) s u ( 3 ) s u ( 4 ) f i n · T r o
S,P-10,1,5,64.43.95.712.326.3
S,P-6,10,1,56.710.93.95.727.2
S,P-6,5,1,106.712.35.73.928.6
360M i n · T r o s u ( 1 ) s u ( 2 ) s u ( 3 ) s u ( 4 ) f i n · T r o
S,P-10,1,5,64.646.612.828
S,P-6,10,1,56.811.346.628.7
S,P-6,5,1,106.812.86.6430.2
Table 11. The sub-interval cost differences Δ f s u b i n · T r o ( i ,   j ) , interval cost differences Δ f i n · T r o and route cost optimization rate R i m p on the experimental group comparing to the control group.
Table 11. The sub-interval cost differences Δ f s u b i n · T r o ( i ,   j ) , interval cost differences Δ f i n · T r o and route cost optimization rate R i m p on the experimental group comparing to the control group.
i n · T r o Δ s u ( 1 ) Δ s u ( 2 ) Δ s u ( 3 ) Δ s u ( 4 ) Δ f i n · T r o R i m p
S,P-10,1,5,6GDM-PRA0.20.10.50.41.24.56%
360M-PRA0.40.21.40.92.910.36%
i n · T r o Δ s u ( 1 ) Δ s u ( 2 ) Δ s u ( 3 ) Δ s u ( 4 ) Δ f i n · T r o R i m p
S,P-6,10,1,5GDM-PRA0.10.10.10.50.82.94%
360M-PRA0.20.50.21.42.38.01%
i n · T r o Δ s u ( 1 ) Δ s u ( 2 ) Δ s u ( 3 ) Δ s u ( 4 ) Δ f i n · T r o R i m p
S,P-6,5,1,10GDM-PRA0.10.40.50.11.13.85%
360M-PRA0.20.91.40.22.78.94%
Table 12. The average precision ( P ¯ r e c i s i o n ), average recall ( R ¯ e c a l l ), and average deviation of attribute matching degree ( D ¯ e v POI ) of the recommended POIs.
Table 12. The average precision ( P ¯ r e c i s i o n ), average recall ( R ¯ e c a l l ), and average deviation of attribute matching degree ( D ¯ e v POI ) of the recommended POIs.
P ¯ r e c i s i o n R ¯ e c a l l D ¯ e v POI
PRA0.53570.14291.0116
IBCF0.39290.10481.8235
UBCF0.32140.08572.0720
Table 13. The average precision ( P ¯ r e c i s i o n ), average recall ( R ¯ e c a l l ), and average cost deviation of the recommended routes ( D ¯ e v Route ) of the recommended routes.
Table 13. The average precision ( P ¯ r e c i s i o n ), average recall ( R ¯ e c a l l ), and average cost deviation of the recommended routes ( D ¯ e v Route ) of the recommended routes.
P ¯ r e c i s i o n R ¯ e c a l l D ¯ e v Route
PRA0.75000.12506.2714
GDM0.50000.08339.0000
360M0.32140.053610.7643
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhou, X.; Peng, J.; Wen, B.; Su, M. Tour Route Recommendation Model by the Improved Symmetry-Based Naive Bayes Mining and Spatial Decision Forest Search. Symmetry 2023, 15, 2168. https://doi.org/10.3390/sym15122168

AMA Style

Zhou X, Peng J, Wen B, Su M. Tour Route Recommendation Model by the Improved Symmetry-Based Naive Bayes Mining and Spatial Decision Forest Search. Symmetry. 2023; 15(12):2168. https://doi.org/10.3390/sym15122168

Chicago/Turabian Style

Zhou, Xiao, Jian Peng, Bowei Wen, and Mingzhan Su. 2023. "Tour Route Recommendation Model by the Improved Symmetry-Based Naive Bayes Mining and Spatial Decision Forest Search" Symmetry 15, no. 12: 2168. https://doi.org/10.3390/sym15122168

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop