Exploring New Vista of Intelligent Recommendation Framework for Tourism Industries: An Itinerary through Big Data Paradigm

: Big Data is changing how organizations conduct operations. Data are assembled from multiple points of view through online quests, investigation of purchaser purchasing conduct, and then some, and industries utilize it to improve their net revenue and give an overall better experience to clients. Each of these organizations must ﬁgure out how to improve the general client experience and meet every client’s novel necessities, and big data helps with this cycle. Through the utilization and reviews of Big Data, travel industry organizations can study the inclinations of more modest portions of their intended interest group or even about people in some cases. In this paper, a Crow Search Optimization-based Hybrid Recommendation Model is proposed to get accurate suggestions based on clients’ preferences. The hybrid recommendation is performed by combining collaborative ﬁltering and content-based ﬁltering. As a result, the advantages of collaborative ﬁltering and content-based ﬁltering are utilized. Moreover, the intelligent behavior of Crows’ assists the proper selection of neighbors, rating prediction, and in-depth analysis of the contents. Accordingly, an optimized recommendation is always provided to the target users. Finally, performance of the proposed model is tested using the TripAdvisor dataset. The experimental results reveal that the model provides 58%, 58.5%, 27%, 24.5%, and 25.5% better Mean Absolute Error, Root Mean Square Error, Precision, Recall, and F-Measure, respectively, compared to similar algorithms.


Introduction
Recommender Systems (RSs) function as a specialist for people who cannot decide on the conceivably overwhelming number of choices accessible on the World Wide Web. Neighborhood-based calculations are conventional methodologies for synergistic proposals and are mainstream because of their straightforwardness and proficiency.
Nowadays, people of different age groups prefer online purchasing. The instant availability of various alternatives, feedback of common people on items, and easy accessibility from home, is the major reason for such popularity. As a result, the inconsistencies during offline purchasing have been resolved substantially. However, host sites often face "information overload" problems to keep up the enriched services. Neighborhoodbased recommender frameworks utilize various sorts of likeness measures between clients or things to accomplish assorted objectives for planning an RS, for example, exactness, curiosity, and variety. recommenders used by consumers in everyday life. Collaborative filtering considers user ratings to perform filtering and as such, the recommendations are based on users' ratings who have a similar preference level as that of the target user. On the other hand, Content-based Filtering is made with the item information (features) and user's preference (ratings) to those items. The recommended items are similar to the ones the user has already purchased. Demographic filtering is quite similar to collaborative filtering.
All the recommendation techniques can be deployed based on the previous data. Today, all tourism industries deploy recommendation techniques based on the concept of the Big Data model. In this paper, the concept of Big Data in the domain of tourism industries is considered. The tourism data set is analyzed first, and a structured form is made for further prediction. Crow search-based, hybrid recommendation techniques are applied to implement the proposed model. The proposed model combines collaborative and content-based filtering techniques to achieve more effective recommendations for clients in a real scenario. Finally, a comparison is performed with two different existing related works. The performance of the proposed model is also evaluated. The remaining parts of the article are organized as follows. Related Works are described in Section 2. Section 3 illustrates the relation of Big Data with Tourism Industries. Section 4 explores the proposed model and algorithm followed by data preparation in Section 5. Section 6 explores results and simulations. Furthermore, a complete discussion is given in Section 7. Finally, Section 8 depicts the conclusions of the research.

Related Works
Recommendation Systems (RSs) area software application that minimizes the information overload problem, filters the required information, and automatically suggests ideal items to the user. Moreover, the recommendations are displayed according to the priority. This is also known as the "Top-N" recommendation. In other words, RS deals with the prediction of users' preferences. Isinkaye et al. [7], in their research, described the basic concepts of the recommendation system. As their paper indicates, the Internet is known as the greatest source of information for all users. However, with the regular increase in the amount of information, the number of options is also increasing. Thus, to handle this much volume it becomes necessary to filter out the required information for every user. Katarya et al. [8] described the recommendation systems work based on rating predictions on several items. It becomes equally important to focus on sequential information. To test this novel approach, an MNSBC data set was used, which contained 5000 entries per user. On the same line, to enhance the existing systems, another research was implemented on movie recommendations using K-Means and the Cukoo Search Optimization algorithm by Katarya et al. [9]. The final result was compared with the existing approaches and was found to be more precise. However, to enhance its quality, bio-inspired algae algorithms have also been used. To find the similarities between any two users' multilevel, the Pearson Correlation Coefficient (PCC) was used [10]. Meanwhile, most music recommendation systems focus more on contextual information, such asmusic ratings and user-user interaction dynamics. A new recommendation system was proposed based on Depth First Search (DFS) and Bellmon Ford is used to recommend new suggestions based on all the factors mentioned above [11]. Another proposed movie recommendation model worked on the based similarities of two users on the same online platform. K-Means clustering was applied with bio-inspired artificial bee colony (ABC) techniques, known as the ABC-KM recommendation model [12]. The current recommendation system suffers from sparsity, coldstart, and scalability issues. Post understanding all these issues, a new model was proposed based on matrix factorization and topical-based collaborative filtering. This model was observed to be more accurate, and due to the hybrid use of techniques, it was termed the HYBRTyco model [13]. Martı'nez et al. [14] introduced a matrix factorization-based hybrid recommendation algorithm consisting of collaborative filtering and content-based filtering. Shamri et al. [15] proposed a hybrid filtering technique using the the Fuzzy-Genetic algorithm. The algorithm provides less Mean Absolute Error and more Coverage. Lee et al. [16] proposed a neural network-based optimized hybrid recommendation algorithm on the MovieLens dataset to improvise Mean Absolute Error and Scalability. Campos et al. [17] introduced a hybrid recommendation algorithm with the help of the Bayesian Network using MovieLens and IMDB datasets, providing few Mean Absolute Error. Shinde et al. [18] proposed a hybrid recommendation framework, applying the Centering-Bunching-based Clustering algorithm using the Jester dataset, resulting in less Mean Absolute Error and higher Precision, Recall, and F-Measure. Jung et al. [19] performed user behavior analysis to accomplish a hybrid recommendation. The framework was successfully validated with the EachMovie dataset's help, providing top-n recommendations and less Mean Absolute Error. Christakou et al. [20] introduced a neural network-based two-way hybrid filtering technique. The proposed algorithm was validated using the MovieLens dataset, resulting in higher Precision and Recall. Geetha et al. [21] developed a K-Means clustering-based hybrid filtering technique. The experimental results show an accurate and personalized recommendation. Yang et al. [22] proposed a job recommendation system based on hybrid filtering. The system is validated using User-Skill, User-City, Most Recent Company, and Most Recent Job Title features. Iaquinta et al. [23] proposed a word net-based hybrid recommendation technique to optimize the electronic support system's performance by using Each Movie dataset. The results reveal the proposed algorithm results in low Mean Absolute Error compared to the popular Pearson coefficient. Recent research by Ashami et al. [24] introduced a clustering-based approach to finding the best neighbor of the target user. The clustering is accomplished using the hybridization of the Crow Search and Uniform Crossover Algorithm and the Jester dataset is used as a test-bed for providing significantly less Mean Absolute Error. Recently, Tesfaye et al. [25] proposed a hybrid algorithm to accomplish intelligent Collaborative Filtering. The hybridization is achieved using the Crow Search Algorithm and K-Means Clustering. The MovieLens dataset is used as a test-bed and the results are promising (small Mean Absolute Error) compared to similar techniques. A new proposed model was read that used Newton's third law of gravitation and was applied to the data set taken from the user-item matrix [26].

Tourism Industries: Perspectives of Big Data
The movement business is tremendous; the World Tourism Organization evaluates that it makes up about 10% of the world's GDP and makes 1in every 11 positions. Moreover, it is an industry on the rise, with enormous future potential for overall wealth and business. Metropolitan people groups have a staggering plenitude of information accessible to them that can change the movement business' universe. Nonetheless, having information is not adequate to have the alternative to use it enough. Metropolitan territories and the movement business players need induction to gadgets that will engage them to make significant deductions from the data. Countless finishes will help them by benefitting visitors, and, additionally, close by tenants as well. The impact of Big Data on tourism industries and the visualization of the proposed model is shown in Figure 1.
The travel industry sheets and associations in the movement business zone make a profit through information from various viewpoints. This unites analytical advancing endeavors, offering packs to re-establish for visitors' apparent likely benefits and picking which countries to focus in on winning customers. These experiences can assist with the powerful cycle to improve business capacities. Huge parts in the development of businesses would now have the choice to settle on trained choices concerning assessment and number-driven data. Organizers are focused on expected customer gatherings at every stage in the trip organizing measure. Organizers can also broaden the ability and possibility of associations. Large amounts of information can even be utilized to predict which new things may work well in their market. They can achieve enhanced encounters that depend on the clients and their requirements. Big Data can be used as a farsighted device to figure out future models and to envision and react to voyagers' necessities right away.

Future of Big Data in the Tourism Industry: Challenges and Scope
The movement business is presently part of the mission of different ventures that utilize information investigation in their daily tasks. Online exercises of individuals create a ton of information, such as occasion objections or purchasing rucksacks on the web.

1.
While individuals produce a ton of information on the web intentionally or inadvertently, there are a few other elective information sources that the movement business can use to settle on better choices. This information helps the movement business in investigation-driven choices and could be very beneficial.

2.
BigData investigation is being utilized by movement and transportation organizations all around the globe. For example, carrier administrators use data analysisto comprehend traveler's buying and travel designs about specific socioeconomics.

3.
Organizations in the travel industry area can profit from Big Data from numerous points of view that incorporates showcasing efforts, offering bundles arranged to the guest's advantages, and to zero in on getting clients to travel.
Big Data is effectively utilized in a variety of areas in tourism. Today, travel bots are intelligent chat bots that give either computerized client administrations on sites of movement organizations or work through informing stages, such as Facebook Messenger, through associating with voyagers and help them in their appointments. These experiences can be of extraordinary assistance in the emotional cycle and change how the travel industry works. They can distinguish their expected clients and likewise expand productivity and administrations' nature at each stage in the arranging cycle. Big Data can likewise be used as a proactive apparatus in figuring out how a new product could function well in the market.

Proposed Model
Today, big data is useful in the domain of tourism industries to better serve the clients in a more realistic way. In this research paper, an Intelligent Tourist Recommender System (ITRS) is proposed to guide tourists based on their preferences. The tourism industries can deploy the model to recommend the tourists regarding their entire plans. This proposed ITRS is beneficial for both tourists as well as tourism industries. To implement the ITRS model, a Big Data tourism database is used. The tourism dataset supports the '3V' properties of Big Data [27,28], i.e., volume, variety, and veracity. A tourism dataset is statistically analyzed [29,30] and a balanced, structured matrix is formed to perform the prediction and recommendation. Sequential analysis is performed as a pilot survey to generate sub-matrices where all the sub-matrices are orthogonal.

Statistical Analysis
To validate the proposed model, data analysis is required. Real-time data is used as a test-bed for this model. The data is analyzed by statistical approach. Using the tourism data set as a test-bed, it is analyzed based on the user's choice. The dataset is extremely large, with unstructured data patterns and integrity issues to deal with.
Assuming that all the records of the dataset have n number of features with their different parameters' value as {u 1 , u 2 , . . . . . . . ., u n }, the dataset is continuous then determines the density function ϕ(u 1 , u 2 , . . . . ., u n ) for selecting the data sample randomly. A small sample of data, ∆v, belongs around the data points {u 1 , u 2 , . . . . . . . ., u n }. In the dataset, the values of users' choices are categorized and near each other.
To get the whole dataset to equilibrium, the users with a high frequency of traveling are distributed with possible equidistribution over the whole dataset with kinematic restriction. The users with a high frequency of traveling will influence their co-centric neighbors. The first target is to distinguish the users who travel very frequently through any third-party travel agencies and determine the users with similar choices. The proposed model is validated using the tourism dataset as a test-bed. The tourism dataset is unstructured and contains noisy data. Thus, the first thing is to filter the dataset and structure through an analysis. The data sample of frequent users of position {u 1 , u 2 , . . . . . . . ., u n } have E(u 1 , u 2 , . . . . u m ). The mean of frequent users per sample is determined as where, dv indicates the unit volume of the sample dataset and d denotes the entire dataset. ϕ is determined and evaluates the degree of similarities for each sample distribution using a uniform distribution. Since the pattern of a dataset of tourism industries is Big Data, a high degree of similarities is required to find more similar users. To determine the entropy of the sample, Equation (2) is used.
Then, the solution of Equation (2) is maximized using Equation (1). The optimum selection of data sample U, for which − U log u dv is maximized and subjected to EU dv = d, is determined using Equation (3).
where β is selected such that U dv = 1.
Using the inequality the sample data, ∆v, are selected randomly from the whole dataset for every iteration. For any two alternative densities, the inequality in Equation (4) will be defined as where U denotes the data samples.
The variable U will have unity if ∀ incident of Equation (5) is applied for Equation (1). The data samples of frequent users are considered for the tourism industries. The degree of travel frequency is different for the different data samples. The sampling distribution is deployed to transform the degree of travel frequency for different samples, as per If B is considered a non-singular matrix, then DY DU = |B| is a positive sign (6) As per the relational equation, the relation between the users is defined as Equation (7): As per Equation (6), The dissimilarity between frequent users and rear users is firmly determined and a threshold value is evaluated by using a partitioning matrix.
The partitioning matrix in Equation (10) is partitioned into kth sub-matrices where all the sub-matrices are orthogonal to each other but are not orthogonal by themselves. This is a pilot statistical analysis for the data. After statistical analysis, the dataset is visualized in Figure 2. The visualization of the dataset in Figure 2 represents the distributions of the data points. Data analysis is performed by using Algorithm 1.

Algorithm 1: Data analysis for grouping similar data Begin
Step 1: Initialize n data sample with density function ϕ, and initialize a small sample of data, ∆v, which belongs around the n number of features Step 2: Set n data samples of frequent users of position {u 1 , u 2 , . . . . . . . ., u n } haveE(u 1 , u 2 , . . . . u m ) Step 3: Determine the mean of frequent users per sample as EU dv = d Step 4: Determine the entropy of the sample data as − U log u dv Step 5: At the maximum value of − U log u dv, determine the optimum selection of data samples U as U = e φE+µ = βe φE Step 6: Randomly select the data sample to determine any two alternative densities by applying Step 7: Sampling distribution is deployed to transform the degree of travel frequency for different samples Step 8: The interrelated differential elements are defined as dy 1 dy 2 . . . . . . . . . dy n = |B| du 1 du 2 . . . . . . du n Step 9: Determine dissimilarity between frequent users and rear users as dY = |B|dU ∀ B is an orthogonal matrix and |B = 1|,

Computational Analysis
As per the concept of Big Data, the tourism dataset has a large number of dimension values as a matrix form with some known coefficients. If Y and α denote the column vectors of the variables, y i and the parameter α j can be represented as a matrix notation.
where D denotes the dispersion, and I stands for the unit matrix of order n. The objective of this analysis is to estimate the unknown parameter α j based on the observations of y i . In this research, the user's choice is estimated based on their other profile data. Now, established correlations among all observations of the different sample can be deduced by Equation (12): where, T is supposed to be a known matrix and both α and σ 2 are unknown.

Proposed Collaborative Filtering Approach
Once the statistical analysis is completed, a new Collaborating Filtering [31][32][33][34][35][36][37] approach is proposed, which is used for the purpose of recommendation. In fact, a memory-based collaborative filtering approach is applied to determine the users who are similar to the target user. The proposed collaborative filtering helps to effectively predict user feedback based on the feedback of like-minded users. Therefore, it performs in-depth analysis of user profiles. As a result, sparsity and scalability problems are substantially minimized with respect to the dataset chosen in this research. Technically, the proposed approach consists of finding a new similarity measure, a threshold to determine the similar users, and the rating prediction technique. The proposed approach is as follows: Assuming the total number of users is defined as n and the total number of hotels is m where n, m > 0, the dimension of the data set is n × m.
Let us consider that the user gives a rating for t number of hotels among m. The rating of each user is determined by R in Equation (13): where r and t indicate the rating of each hotel and number of hotels, respectively. Similarly, it computes the rating for n number of users as R 1 , R 2 , . . . . . ., R n . Rating R is used to determine each user's similarity index with respect to the remaining users in the dataset. The similarity index R sim is evaluated using Equation (14).
A hotel dataset is used with n number of users giving a rating. If the user does not visit any hotel, they do not provide any rating for that hotel.
Suppose the user rates for t number of hotels among m number of hotels. Thus, t ≤ m is always true. Among the t number of ratings, the minimum and maximum rating values are provided by the particular user. The users rate the hotel within a certain range of rating values. If the rating values are out of the range, then these values will be discarded. The threshold value R Th is evaluated by Equation (15): where m ≥ t. L R and M R denote the lowest rating value and highest rating value respectively. Now, compare the similarity index with the threshold value to predict the future's rating if the user does not provide it priorly. The predicted rating is determined by Equation (16).

Proposed Content-Based Filtering Approach
The filtering process becomes more effective when the preferences of users are considered. The recommendation technique, which considers the preferences of a user, is known as content-based filtering [38][39][40][41]. In this paper, a new content-based filtering approach is proposed. The proposed approach helps to discern user preferences in a novel way. As a result, more personalized, content-based filtering is performed. The proposed approach considers the items recommended by the collaborative filtering algorithm. Successively, the modified Jaccard similarity coefficient is employed for the content-based filtering. The proposed approach is as follows: Suppose there are n numbers of users, and the similarity index of k number of users are nearest to a threshold value. Using Equation (16), the value of the predicted Rating R Predict is evaluated.
Assuming k ≤ n and R Predict ≥ R Th , it is not confirmed that the corresponding hotel will be recommended to that user who did not rate the particular hotel previously. The user, who did not visit a particular hotel H i previously, did not rate the hotel H i . The proposed model will decide that the hotel H i will be recommended to that user for future use. The recommendation will be made not only based on the values of R sim , R Th , and R Predict but also the features of the hotel. Therefore, the recommendation pattern is a hybrid model of collaborative and content-based filtering.
Let there be z numbers of features.
In Equation (17), the Jaccard similarity coefficient is applied to find out the recommen- The unrated hotel will be recommended using Equation (18).
The proposed model will make the expert decision that either the unrated hotel will be recommended or not based on Equation (18).

Crow Search Algorithm-based Hybrid Recommendation Model
The Crow Search Algorithm (CSA) is a population-based, bio-inspired algorithm proposed by Askarzadeh [42]. It describes the movement of crows in search for food.
In this paper, a CSA-based recommendation model is proposed to perform optimized hybrid recommendation. As a result, the problems related to traditional collaborativecontent hybrids are minimized substantially. On the other hand, the proposed model is the basis for ITRS. Herein, the intelligent food searching (exploration and exploitation) of crows assists to identify the relevant parameters of recommendation. Moreover, the movements of crows ensure optimized recommendation. To provide the recommendations, an analogy between CSA and parameters of the model is considered. Successively, the position update rule of CSA is applied to find the recommendation threshold. The analogy between CSA and the parameters of the model is shown in Table 1.

Proposed Hybrid Recommendation Algorithm
The CSA-based hybrid recommendation model is defined in Algorithm 2: Algorithm 2: CSA based Hybrid Recommendation

Begin
Step 1: Normalize the rating of target user using Equation (13) Step 2: Find the similar users with respect to the target user using Equation (14) Step 3: Designate a user as neighbor whose similarity is greater than the threshold of similarity Step 4: Compute the first threshold for recommendation using Equation (15) Step 5: Find the second threshold for recommendation using the position update rule of CSA Step 6: Predict the rating of target user using Equation (16) Step 7: If the predicted rating is ≥ the first threshold and second threshold, then go to step 8 Step 8: Predict the rating of target user (content-based filtering) using Equation (17) Step 9: Recommend to the target user using Equation (18) End

Data Preparation
The TripAdvisor [43] dataset has been used to accomplish experimentation for the proposed model. The dataset is comprised of 29,799 reviews of 21,851 unique customers. The reviews were collected from September 2007 to September 2009, including every hotel from all regions of Ireland. In addition to the survey text, each audit accompanies a lodging identifier, a general rating, and discretionary viewpoint explicit appraisals for the accompanying seven angles: Rooms, Cleanliness, Value, Service, Location, Check-in, and Business. All review-level appraisals are on a discrete ordinal scale from 1 to 5 (with 1 demonstrating that a viewpoint's explicit rating was not given by the commentator).
The proposed model is also validated by two more datasets as well. The Datafiniti's Business dataset [44] has been used for validation purposes. It consists of 10,000 reviews of users to the 2000 hotels in the United States. The dataset is made up of several attributes, such as hotel id, name, address rating, reviews, and username. The user reviews basically contain textual descriptions describing the user experience. On the other hand, user ratings are numeric values in the range between 1 and 5. Moreover, TripAdvisor hotel review datasets contributed by Barkha Bansal has also been considered for validation purposes [45]. It consists of 20,491 reviews as well as ratings of users. The user reviews are textual information depicting user experience, whereas, user ratings are numeric values between 1 and 5. Table 2 contains a sample of 15 customer ratings from the TripAdvisor dataset. The value of customer ratings lies between 1 and 5. Moreover, the customer preferences towards other amenities, e.g., rooms and cleanliness, are also mentioned in the dataset. The collection of ratings helps to accurately learn individual profiles. In other words, the proposed model could be trained well. As a result, the overfitting and the underfitting problems are avoided.

Results and Simulations
The tourism dataset is used as a test-bed to validate the proposed model. The data set is very large and has an imbalanced structure. As such, statistical analysis is performed to fit the dataset of the proposed model, and the simulation results are shown in Figure 2.
The original data set, before the statistical analysis shown in Figure 2a, has data patterns that are unstructured and have some missing data fields. After the statistical analysis, the dataset is filtered and freed from noise data. As per the requirement, the data is filtered by the analysis. The filtered and structured dataset is shown in Figure 2b. The required data values are all in a single group other than noise data. In Figure 2b, the required data points are shown in the high-density section. After data analysis, a collaborative filtering technique is applied to find out the users' similarities based on their preferences. Figure 3 shows the grouping of the users as per their preferences. In this research, the applied data set is divided into three different subgroups by the collaborative filtering technique. Every user has a similarity index concerning other remaining users in a group. Figure 3a shows three different groups derived from the applied structured test-bed data set. A threshold value is derived from selecting the group whose similarity index is very near to the threshold value. The threshold value is shown in Figure 3b. Every group has its similarity index value, which is the mean of the group members' all similarity index. Based on the similarity index, the item will be recommended to the neighbors of the same group. Figure 4 shows that the similarity index value is gradually decreased outward from the center value. The users who belong near the center have similar preferences. After each iteration, the error is determined. The Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) are determined from the dataset after deploying the proposed methodology. Tables 2 and 3 show the MAE and RMSE respectively, based on the rating of the users.   Table 3 depicts MAE's related to the proposed model. To begin with, 500, 1000, 2000, 3000, 4000, and 5000 user ratings are considered for validation purposes. Consecutively, the target user's rating on unseen items is predicted. The experimental result demonstrates that the proposed model provides small MAE on large data samples. Moreover, after a few iterations, MAE becomes unchanged. Table 4 exhibits the RMSE's obtained from the proposed model. At first, 500, 1000, 2000, 3000, 4000, and 5000 user ratings were considered for experimentation. Successively, target user preference is predicted on unrated items. The experimental results reveal that the proposed model provides less RMSE for the large data samples. On the other hand, RMSE rapidly decreased as the number of user ratings increased. Therefore, the proposed model is practically deployable whenever numerous ratings are involved. After the minimization of errors, a similarity index of all users in a group is evaluated and shown in Figure 4. The simulation in Figure 4 describes the distribution of the similarity index for a particular group. Based on this similarity index, a new item will be recommended in the future for users in the same group. But, in a real scenario, collaborative filtering is not always sufficient for recommendations to users. The proposed model is designed based on hybrid recommendation systems. Collaborative filtering is visualized in Figure 4. To achieve a higher level of accuracy, content-based filtering is also applied. After deploying content-based filtering, an effective result is achieved for a recommendation. Figure 5 describes the effective result of a recommendation. A new item will not be recommended for all the same groups of users with the same similarity index. The new item is only recommended for the users who have the same similarity index and have prior similar content choices. After determining the effective optimized recommendation, performance metrics of the model are measured with various sampling data. The result of the performance metrics [46] is shown in Figure 5. .5%, 62%, 63.5%, and 69%, respectively. Therefore, the experimental results reveal that the proposed model is better than other models in terms of precision.    Table 6 shows a comparison of recall values between the proposed model and popular recommendation models. The size of recommendation sets is varied up to 50 recommendations in order to make a correct decision. Subsequently, the true positive, and false negative recommendations in those models have been identified. The experimental results presented in the table shows that the proposed model provides an average recall of 94.5%. On the other hand, recent recommendation models, such as HCSUC and Tesfaye et al. provides an average recall of 77.3% and 62.3%. Traditional recommendation models, such as collaborative filtering, content-based filtering, collaborative-content hybrid, and PSO-based hybrid filtering provides an average recall of 62.3%, 63.8%, 65.6%, and 63.6%. Therefore, it is evident from the experimental results that the proposed model is very accurate in providing personalized recommendation.       Table 8 shows a comparison of precision between the proposed model and other related models. The experimental results demonstrate the proposed model provides a precision of 93.5%. Meanwhile, recent research works on HCSUC and Tesfaye et al. achieve a precision of 60.3% and 57.8%. On the other hand, popular recommendation models, such as collaborative filtering, content-based filtering, collaborative-content hybrid, and PSObased hybrid filtering, provide precision levels of 60.8%, 61.3%, 66%, and 71% respectively. Therefore, results show that the proposed model is more accurate in identifying true positive and false positive recommendations compared to the other models. Table 9 represents a comparison of recall between the proposed model, recent recommendation models, and traditional recommendation models. The experimental results reveal the proposed model provides a recall of 96.6%. Meanwhile, HCSUC and Tesfaye et al., provide an average recall of 60% and 71% respectively. On the other hand, collaborative filtering, content-based filtering, collaborative-content hybrid, and PSO-based hybrid filtering provides an average recall of 61.6%, 67.6%, 74.1%, and 76.6%, respectively. Therefore, it is evident from the results that the proposed model is more correct in identifying true positive and false negative recommendations.  Table 10 represents a comparison of F-Measure values between the proposed model, and different recommendation models considering the testing sets. The experimental results reveal that the proposed model provides an F-Measure of 94.3%. Meanwhile, recommendation models, namely HCSUC and Tesfaye et al., provide an average F-Measure of 59.5% and 61.1%, respectively. Traditional recommendation models, such as collaborative filtering, content-based filtering, collaborative-content hybrid, and PSO-based hybrid filtering provide F-Measures of 60%, 63.6%, 69.3%, and 73.1%, respectively. Therefore, the experimental results demonstrate that the proposed model provides better recommendations compared to other models.  Table 11 compares the precision of the proposed models, HCSUC and Tesfaye et al., collaborative filtering, content-based filtering, collaborative-content hybrid filtering, and particle swarm optimization (PSO)-based collaborative-content hybrid filtering when the recommendation set has various sizes. The experiment is carried out with a recommendation set size of 5, 10, 20, 30, 40, and 50, respectively. The testing sets are used to verify seven recommendation models at first. Following that, suggestions for genuine positives and false positives are calculated. Finally, the models' accuracy is determined by counting the number of true positive and false positive suggestions. The suggested model has an average accuracy of 96.3%, according to the experimental findings. HCSUC, on the other hand, achieves an average accuracy of 76.1%. The Tesfaye et al. recommendation model, collaborative filtering, content-based filtering, collaborative-content hybrid, and PSO-based hybrid filtering, on the other hand, has an average precision of 72.3%, 61.1%, 67.3%, 80.3%, and 64.1%. As a consequence of the experimental data, it is obvious that the suggested model outperforms similar models in terms of precision.  Table 12 compares recall values for the proposed models, HCSUC, Tesfaye et al., collaborative filtering, content-based filtering, collaborative-content hybrid filtering, and particle swarm optimization (PSO)-based collaborative-content hybrid filtering with different recommendation set sizes. The experiment is performed with the recommended size set of 5, 10, 20, 30, 40, and 50, correspondingly. To begin, the testing sets are used to validate seven recommendation models. The real positive and false negative suggestions are reviewed in order. Finally, the recall of models is assessed by considering true positive and false negative suggestions. According to the testing results, the suggested model has an average recall of 94.5%. HCSUC, on the other hand, has an average recall of 69.6%. Similarly, on the other hand, the recommendation model of Tesfaye et al., collaborative filtering, contentbased filtering, collaborative-content hybrid, and PSO-based hybrid filtering results an average recall of 73.5%, 66.3%, 69.5%, 83.1%, and 71%, respectively. Therefore, it is evident from the experiment that the proposed model outperforms other models in terms of recall.  Table 13 compares F-Measure values for collaborative filtering, content-based filtering, collaborative-content hybrid filtering, and particle swarm optimization (PSO)-based collaborative-content hybrid filtering with respect to different sizes of the recommendation set between the proposed model, HCSUC, and Tesfaye et al., collaborative filtering, content-based filtering, collaborative-content hybrid filtering, and particle swarm optimization (PSO)-based collaborative-content hybrid filter. The experiment is performed with the size of the suggestion set to 5, 10, 20, 30, 40, and 50. To begin, the testing sets are used to verify seven recommendation models. Then the models' accuracy and recall are determined. Finally, the models' F-Measure is calculated by combining accuracy and recall. The suggested model has an average F-Measure of 94.5 percent, according to the experimental findings. The average F-Measure for the HCSUC recommendation model is 72.1%. The Tesfaye et al. recommendation model, collaborative filtering, content-based filtering, collaborative-content hybrid, and PSO-based hybrid filtering, on the other hand, achieves an average F-Measure of 72.5%, 63.0%, 67.8%, 81.1%, and 65%, respectively. As a consequence of the experimental findings, it is clear that the suggested model outperforms current methods in terms of F-Measure.

Performance Analysis
To evaluate the proposed model's efficiency, performance analysis is derived based on the different parameters of the confusion matrix. In this research paper, a four-fold cross-validation technique is applied to the given data sets. The ratio of the training and testing dataset are selected as various ratios for four iterations. The precision and recall values for dataset 1, dataset 2, and dataset 3 are shown in Tables 14-16 respectively. after every iteration, the error is corrected to enhance the model more accurately. The proposed model is applied for a good recommendation for users based of their preferences.  The data sets are validated by the proposed model through four-fold cross-validation techniques. The values of the performance metrics are different for four different strategies. The mean of the performance index for four different strategies is graphically shown in Figure 9 for three different datasets.

Analytical Comparison
To measure the proposed model's efficiency, a comparison is performed with two recent research studies that deal with the crow search-based hybrid recommendation algorithm. The comparisons are shown in Tables 17 and 18. The evaluation of efficiency is carried out using standard performance metrics, that is, Mean Absolute Error (MAE) and Root Mean Square Error (RMSE).  Table 17 shows a comparison between the proposed model, HCSUC, Tesfaye et al., collaborative filtering, content-based filtering, collaborative-content hybrid filtering, and particle swarm optimization (PSO)-based collaborative-content hybrid filtering with respect to MAE. To begin, the dataset's 500,1000,2000,3000, 4000, and 5000 user ratings are considered for rating prediction. Then, on unseen objects, the users' ratings are anticipated. Finally, MAE is calculated by subtracting the difference between the expected and actual ratings. The suggested model yields an average MAE of 0.23, according to the experimental results. HCSUC and TESFAYE et al., on the other hand, has MAEs of 0.75 and 0.86, respectively. Collaborative filtering, content-based filtering, collaborative-content hybrid filtering, and PSO-based hybrid filtering, on the other hand, had average MAEs of 0.63, 0.54, 0.53, and 0.58, respectively. As a consequence, the experimental findings show that the suggested model outperforms the existing alternatives. Table 18 compares collaborative filtering, content-based filtering, collaborative-content hybrid filtering, and particle swarm optimization (PSO)-based collaborative-content hybrid filtering using the proposed model, HCSUC, and Tesfaye et al. RMSE is more accurate than MAE for detecting prediction mistakes. To begin, user ratings of 500, 1000, 2000, 3000, 4000, and 5000 are picked from the dataset. The users' ratings on unrated things are consistently anticipated. Finally, the RMSE is calculated based on the normalized difference between the expected and actual ratings. The suggested model has an average RMSE of 0.33, according to the experimental results. HCSUC and TESFAYE et al., on the other hand, report average RMSE as0.86 and 0.97, respectively. On the other hand, collaborative filtering, contentbased filtering, collaborative-content hybrid, and PSO-based hybrid filtering results in an average RMSE of 0.64, 0.57, 0.61, and 0.71, respectively. As a consequence, the preceding data show that the suggested model is more efficient than similar techniques.

Discussion
In this paper, a personalized recommendation system is introduced for tourists. Tourism recommendation is one of the areas in which little research has been done so far. However, the number of online tourism recommenders and users' interest towards such recommenders has increased exponentially over the years. To overcome this, the proposed model provides more accurate recommendations in less time and helps increase the popularity of recommender sites. Moreover, the objective of this research is not only to deliver recommendations according to user profiles, but to also suggest high quality recommendations at any given point in time. On the other hand, the model effectively deals with new users providing accurate recommendations based on minimal information from users. In order to keep user interest intact, the model ensures diversified recommendations within a short span of time. The model suggested in this paper is easily deployable in mobile devices, group recommenders, distributed systems, contextual recommenders, and Internet of Things devices. The proposed idea can be solicited in social recommenders as well. Social recommenders help to get the features from social networking sites and recommendation systems. The features ensure proper execution of filtering process and to overcome the problems in recommendation systems. The benefit of having a social recommender is bi-focal. Firstly, proposed collaborative filtering helps to identify the most similar users of the target user using the data available in social sites. Thereafter, proposed content-based filtering envisages relevant content. Moreover, the model is capable of identifying communities from social recommenders and matching communal interests with the user profile. The recommenders consisting of such intelligent capabilities is very useful for industrial applications and researchers. The rating prediction technique proposed in this paper also minimizes the data sparsity and cold start problem. Technically, the proposed system uses a Big Data tourism dataset to analyze the customers' choices. A sequential statistical analysis is performed, and the Big Data tourism data set is structured to fit the requirement of the model. The proposed model is an intelligent recommender system that consists of collaborative and content-based filtering to achieve an accurate result. Due to this, ITRS encompasses the advantages of constituent algorithms. As a result, drawbacks of the constituent algorithm are substantially minimized. Therefore, the system could be deployed, providing suggestions of various products. From the above discussion, it is evident that the proposed model is not only made up of a flawless framework but also offers various services. However, the research work has some drawbacks that are detrimental to the performance. Some of the major drawbacks of the proposed model are proper aggregation function, filtering schemes containing user contexts, dealing with change of user preference, handling sentiment analysis of users, trivial analysis of user contents, recommendations based on numeric preferences, systematic security protocols, plenty of historical ratings, and numerous redundant features for content-based filtering.

Conclusions
The rapid development of the web and its applications has been significant for recommender systems. Applied in different spaces, recommender frameworks were intended to create proposals, for example, for things or administrations depending on client interests. A big data tourism dataset is used for filtering the users to whom tourist industries recommend in the future. Statistical analysis is performed by a sequential pilot survey that divides the data into three different groups. The groups are formed based on users' preferences and the content of their desired items. Crow search-based hybrid recommendation techniques, combined with collaborative and content-based filtering, is deployed to achieve the desired result. Hybrid filtering techniques are implemented within the proposed model to achieve recommendation accuracy. Three different real datasets are used to validate the proposed model. The proposed model is also compared with two different existing studies. The performance of the proposed model is determined by applying a four-fold cross-validation system. The overall Precision and Recall of the proposed model are 95.23% and 94.38%, respectively. To enhance the proposed model's performance and achieve a higher level of recommendation accuracy, a context-based recommendation will be considered in future research.

Conflicts of Interest:
The authors declare no conflict of interest.