Customer Clustering and Marketing Optimization in Hospitality: A Hybrid Data Mining and Decision-Making Approach from an Emerging Economy
Abstract
:1. Introduction
2. Literature Review
2.1. Data Mining in the Hospitality Industry
2.2. Customer Segmentation Techniques
2.3. RFM vs. RMD: The Need for an Enhanced Segmentation Model
- R (Recency): Defined as the number of days between the latest presence of a hotel’s customer and the date of analysis. It determines the most recent customers’ presence. The customers’ data are bounded between the dates 18 August 2017 and 18 August 2018 for RMD analysis; beginning with data from 18 August 2017. Each of the dates is numbered, up to 366 for 18 August 2018.
- M (Monetary): The amount of money each customer spends.
- D (Duration): The number of days each individual stays at the hotel.
3. Basic Concepts
3.1. K-Means
3.2. Association Rules and Customer Behavior Analysis
3.3. Multi-Criteria Decision-Making (MCDM) Approaches for Customer Prioritization
3.3.1. Shannon Entropy
3.3.2. TOPSIS
3.3.3. BWM
- 1.
- A set of decision criteria is chosen.
- 2.
- Focus groups or experts determine the best and worst criteria. Moreover, no comparison is applied to them.
- 3.
- Focus groups or experts select their preference for the best criteria over other criteria based on numbers between 1 and 9 ( (,,…)).
- 4.
- Focus groups or experts select the worst criteria compared to the other criteria based on numbers between 1 and 9 ( (,,…)).
- 5.
- The proper weights are found by solving the nonlinear (NLP) model using Formula (22):
- 6.
- In this section, the compatibility rate (CR) of the comparisons is computed using Equation (23). In this paper, CRs less than 0.2 are reasonable.
3.3.4. Customer Lifetime Value
4. Research Methodology
- 1.
- K determination, after data preparation.
- 2.
- Customer classification and rule extraction.
- 3.
- Clusters evaluations based on CLV and decision-making methods (TOPSIS and BWM).
- 1.
- The clusters are evaluated based on cluster quality indices including silhouette analysis (Equations (2)–(4)), Calinski–Harabasz (Equation (5)), and Calinski–Harabasz (Equations (6)–(10)).
- 2.
- These indices are considered as decision criteria, and their weights are extracted based on Shannon entropy (Equations (11)–(14)).
- 3.
- Different values of K are considered as decision alternatives, which are prioritized by TOPSIS (Equations (15)–(21)).
- 1.
- Customers are clustered using the K-means algorithm based on RMD attributes. The optimal obtained number of phase I cases is considered as the number of clusters. To measure the distance, Euclidean distance is used (Equation (1)).
- 2.
- A priori as a method of an association rule is employed to extract the rules.
- 3.
- Tailored strategies are developed for each cluster based on its characteristics.
- 1.
- RMD attributes are considered as the criteria to assess clusters, and the weights are determined by BWM (Equations (21) and (22)).
- 2.
- Clusters are prioritized based on TOPSIS (Equations (15)–(21)).
- 3.
- CLV is computed for each cluster (Equation (23)).
5. Case Study and Results
5.1. Clustering Model
5.2. Clustering Analysis
5.3. Association Rule Results
5.4. Comparison and Evaluation of Clusters
6. Discussion and Implications
6.1. Theoretical Contributions
6.2. Practical Implications
6.3. Contextual Insights
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Abirami, M., & Pattabiraman, V. (2016). Data mining approach for intelligent customer behavior analysis for a retail store. In The 3rd international symposium on big data and cloud computing challenges (pp. 283–291). Springer. [Google Scholar]
- Ahmadi, H., Emdadi Mahdimahalleh, S., Farahat, A., & Saffari, B. (2025). Unsupervised time-series signal analysis with autoencoders and vision transformers: A review of architectures and applications. Engineering Applications of Artificial Intelligence, 127, 107629. [Google Scholar]
- Akter, J., Roy, A., Rahman, S., Mohona, S., & Ara, J. (2025). Artificial Intelligence-Driven Customer Lifetime Value (CLV) forecasting: Integrating RFM analysis with machine learning for strategic customer retention. Journal of Computer Science and Technology Studies, 7(1), 249–257. [Google Scholar] [CrossRef]
- Alao, M. A., Ayodele, T. R., Ogunjuyigbe, A. S. O., & Popoola, O. M. (2020). Multi-criteria decision-based waste to energy technology selection using entropy-weighted TOPSIS technique: The case study of Lagos, Nigeria. Energy, 201, 117675. [Google Scholar] [CrossRef]
- Ansari, A., & Riasi, A. (2016). Customer clustering using a combination of fuzzy c-means and genetic algorithms. International Journal of Business and Management, 11(7), 59. [Google Scholar] [CrossRef]
- Baecke, P., & Poel, D. (2011). Data augmentation by predicting spending pleasure using commercially available external data. Journal of Intelligent Information Systems, 36(3), 367–383. [Google Scholar] [CrossRef]
- Calinski, T., & Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics—Theory and Methods, 3(1), 1–27. [Google Scholar] [CrossRef]
- Chang, H. H., Wang, Y. H., & Yang, W. Y. (2009). The impact of e-service quality, customer satisfaction and loyalty on e-marketing: Moderating effect of perceived value. Total Quality Management & Business Excellence, 20(4), 423. [Google Scholar]
- Chen, D., Sain, S. L., & Guo, K. (2012). Data mining for the online retail industry: A case study of RFM model-based customer segmentation using data mining. Journal of Database Marketing & Customer Strategy Management, 19(3), 197–208. [Google Scholar] [CrossRef]
- Cheng, C. H., & Chen, Y. S. (2009). Classifying the segmentation of customer value by the RFM model and RS theory. Expert Systems with Applications, 36(3), 4176–4184. [Google Scholar] [CrossRef]
- Çalık, A., Sain, S. L., & Guo, K. (2020). Evaluation of social media platforms using Best Worst Method and Fuzzy VIKOR Methods: A case study of travel agency. Iranian Journal of Management Studies, 19(3), 197–208. [Google Scholar] [CrossRef]
- Davies, D., & Bouldin, D. W. (1979). A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1(2), 224–227. [Google Scholar] [CrossRef] [PubMed]
- Deldadehasl, M., Jafari, M., & Sayeh, M. R. (2025). Dynamic classification using the adaptive competitive algorithm for breast cancer detection. Journal of Data Analysis and Information Processing, 13(2), 101–115. [Google Scholar] [CrossRef]
- Dimitrovski, D., & Todorovic, A. (2015). Clustering wellness tourists in spa environment. Tourism Management Perspectives, 16, 259–265. Available online: https://www.sciencedirect.com/science/article/pii/S2211973615300040 (accessed on 2 March 2025). [CrossRef]
- Doğan, O., Ayçin, E., & Bulut, Z. A. (2018). Customer segmentation by using RFM model and clustering methods: A case study in retail industry. International Journal of Contemporary Economics and Administrative Sciences, 8(1), 1–19. [Google Scholar]
- Dursun, A., & Caber, M. (2016). Using data mining techniques for profiling profitable hotel customers: An application of RFM analysis. Tourism Management Perspectives, 18, 153–160. [Google Scholar] [CrossRef]
- Erevelles, S., Fukawa, N., & Swayne, L. (2016). Big data consumer analytics and the transformation of marketing. Journal of Business Research, 69(2), 897–904. [Google Scholar] [CrossRef]
- Ganjali, M., & Teimourpour, B. (2016). Identify valuable customers of Taavon Insurance in field of life insurance with data mining approach. UCT Journal of Research in Science, Engineering and Technology, 4(1), 1–10. [Google Scholar] [CrossRef]
- George, G., Haas, M. R., & Pentland, A. (2014). Big data and management. Academy of Management Journal, 57(2), 321–326. [Google Scholar] [CrossRef]
- Guo, S., & Zhao, H. (2017). Fuzzy best-worst multi-criteria decision-making method and its applications. Knowledge-Based Systems, 121, 23–31. [Google Scholar] [CrossRef]
- Gurau, C., & Ranchhod, A. (2002). How to calculate the value of a customer—Measuring customer satisfaction: A platform for calculating, predicting and increasing customer profitability. Journal of Targeting, Measurement & Analysis for Marketing, 10(3), 203. [Google Scholar]
- Hanafizadeh, P., & Mirzazadeh, M. (2011). Visualizing market segmentation using self-organizing maps and the Fuzzy Delphi method–ADSL market of a telecommunication company. Expert Systems with Applications, 38(1), 198–205. [Google Scholar] [CrossRef]
- Ho, C.-I., & Lee, Y.-L. (2007). The development of an e-travel service quality scale. Tourism Management, 28(6), 1434–1449. [Google Scholar] [CrossRef]
- Hosseini, S. M., Maleki, A., & Gholamian, M. R. (2010). Cluster analysis using a data mining approach to develop CRM methodology to assess customer loyalty. Expert Systems with Applications, 37, 5259–5264. [Google Scholar] [CrossRef]
- Hu, Y. H., & Yeh, T. W. (2014). Discovering valuable frequent patterns based on RFM analysis without customer identification information. Knowledge-Based Systems, 61, 76–88. [Google Scholar] [CrossRef]
- Irani, H., & Metsis, V. (2024). Enhancing time-series prediction with temporal context modeling: A Bayesian and deep learning synergy. The International FLAIRS Conference Proceedings, 37(1). [Google Scholar] [CrossRef]
- Jain, A. K. (2010). Data clustering: 50 years beyond K-means. Pattern Recognition Letters, 31(8), 651–666. Available online: https://www.sciencedirect.com/science/article/pii/S0167865509002323 (accessed on 24 February 2025). [CrossRef]
- Kahan, R. (1998). Using database marketing techniques to enhance your one-to-one marketing initiatives. Journal of Consumer Marketing, 15(5), 491–493. [Google Scholar] [CrossRef]
- Kahreh, Z., Shirmohammadi, A., & Kahreh, M. (2017). Explanatory study towards analysis of the relationship between total quality management and knowledge management. Procedia—Social and Behavioral Sciences, 109, 600–604. [Google Scholar] [CrossRef]
- Kasprova, A. (2020). Customer lifetime value for retail based on transactional and loyalty card data. Ukrainian Catholic Institution. [Google Scholar]
- Khajvand, M., & Tarokh, M. J. (2011). Estimating customer future value of different customer segments based on adapted RFM model in retail banking context. Procedia Computer Science, 3, 1327–1332. [Google Scholar] [CrossRef]
- Khalilzadeh, M., Katoueizadeh, L., & Zavadskas, E. K. (2020). Risk identification and prioritization in banking projects of payment service provider companies: An empirical study. Frontiers of Business Research in China, 14(1), 1–27. [Google Scholar] [CrossRef]
- Kotler, P. (1973). Atmospherics as a marketing tool. Journal of Retailing, 49(4), 48–64. [Google Scholar]
- Kumar, V., Bhagwat, Y., & Zhang, X. (2015). Regaining ‘lost’ customers: The predictive power of first-lifetime behavior, the reason for defection, and the nature of the win-back offer. Journal of Marketing, 79(4), 34–55. [Google Scholar] [CrossRef]
- Laursen, G. H. (2011). Business analytics for sales and marketing managers: How to compete in the information age. John Wiley & Sons. [Google Scholar]
- Law, R., Qi, S., & Buhalis, D. (2010). Progress in tourism management: A review of website evaluation in tourism research. Tourism Management, 31(3), 297–313. [Google Scholar] [CrossRef]
- Liao, S. H., Chen, Y. J., & Deng, M. Y. (2010). Mining customer knowledge for tourism new product development and customer relationship management. Expert Systems with Applications, 37(6), 4212–4223. [Google Scholar] [CrossRef]
- Loh, W. Y., & Shih, Y. S. (1997). Split selection methods for classification trees. Statistica Sinica, 7(4), 815–840. [Google Scholar]
- Mahdiraji, H. A., Zavadskas, E. K., Kazeminia, A., & Kamardi, A. A. (2019). Marketing strategies evaluation based on big data analysis: A CLUSTERING-MCDM approach. Economic Research-Ekonomska Istraživanja, 32(1), 2882–2898. [Google Scholar] [CrossRef]
- Matz, A., & Hermawan, A. T. (2020). Customer loyalty clustering model using K-Means algorithm with LRIFMQ parameters. Inform, 5(2), 54–61. [Google Scholar]
- McCarty, J. A., & Hastak, M. (2007). Segmentation approaches in data-mining: A comparison of RFM, CHAID, and logistic regression. Journal of Business Research, 60(6), 656–662. [Google Scholar] [CrossRef]
- Mesforoush, A., & Tarokh, M. J. (2013). Customer profitability segmentation for SMEs case study: Network equipment company. International Journal of Research in Industrial Engineering, 2(1), 30–44. [Google Scholar]
- Miglautsch, J. (2000). Thoughts on RFM scoring. Journal of Database Marketing & Customer Strategy Management, 8(1), 67–72. [Google Scholar]
- Mohammadian, M., & Makhani, I. (2016). RFM-Based customer segmentation as an elaborative analytical tool for enriching the creation of sales and trade marketing strategies. International Academic Journal of Accounting and Financial Management, 3(6), 21–35. [Google Scholar] [CrossRef]
- Mohammadrezapour, O., Kisi, O., & Pourahmad, F. (2020). Fuzzy c-means and K-means clustering with genetic algorithm for identification of homogeneous regions of groundwater quality. Neural Computing and Applications, 32(8), 3763–3775. [Google Scholar] [CrossRef]
- Mosavi, A. B., & Afsar, A. (2018). Customer value analysis in banks using data mining and fuzzy analytic hierarchy processes. International Journal of Information Technology & Decision Making, 17(3), 819–840. [Google Scholar]
- Ostovare, M., & Shahraki, M. R. (2019). Evaluation of hotel websites using the multicriteria analysis of PROMETHEE and GAIA: Evidence from the five-star hotels of Mashhad. Tourism Management Perspectives, 30, 107–116. [Google Scholar] [CrossRef]
- Peker, S., Kocyigit, A., & Eren, P. E. (2017). LRFMP model for customer segmentation in the grocery retail industry: A case study. Marketing Intelligence & Planning, 35(4), 544–559. [Google Scholar] [CrossRef]
- Rajput, L., & Singh, S. N. (2023, January 19–20). Customer segmentation of e-commerce data using K-Means clustering algorithm. 2023 13th international conference on cloud computing, data science & engineering (confluence) (pp. 659–664), Noida, India. [Google Scholar] [CrossRef]
- Ramasamy, S., & Nirmala, K. (2020). Disease prediction in data mining using association rule mining and keyword-based clustering algorithms. International Journal of Computers and Applications, 42(1), 1–8. [Google Scholar] [CrossRef]
- Reichheld, F. F., & Sasser, J. (1996). Zero defections: Quality comes to services. Harvard Business Review, 68(5), 105–111. [Google Scholar]
- Rezaei, J. (2015). Best-worst multi-criteria decision-making method. Omega (Westport), 53, 49–57. [Google Scholar] [CrossRef]
- Ristoski, P., & Paulheim, H. (2016). Semantic Web in data mining and knowledge discovery: A comprehensive survey. Journal of Web Semantics, 36, 1–22. Available online: https://www.sciencedirect.com/science/article/pii/S1570826816000020 (accessed on 27 February 2025). [CrossRef]
- Rousseeuw, P. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20, 53–65. Available online: https://www.sciencedirect.com/science/article/pii/0377042787901257 (accessed on 27 February 2025). [CrossRef]
- Samarasinghe, G. D., & Samarasinghe, D. S. R. (2013). Green decisions: Consumers’ environmental beliefs and green purchasing behaviour in Sri Lankan context. International Journal of Innovation and Sustainable Development, 7(2), 172–184. [Google Scholar] [CrossRef]
- Sarvari, P. A., Ustundag, A., & Takci, H. (2016). Performance evaluation of different customer segmentation approaches based on RFM and demographics analysis. Kybernetes, 45(7), 1129–1157. [Google Scholar] [CrossRef]
- Sharifi, S. (2025). Enhancing kidney transplantation through multi-agent kidney exchange programs: A comprehensive review and optimization models. arXiv, arXiv:2502.07819. [Google Scholar] [CrossRef]
- Srihadi, T. F., Sukandar, D., & Soehadi, A. W. (2016). Segmentation of the tourism market for Jakarta: Classification of foreign visitors’ lifestyle typologies. Tourism Management Perspectives, 19, 32–39. [Google Scholar] [CrossRef]
- Syakur, M. A., Khotimah, B. K., Rochman, E. M. S., & Satoto, B. D. (2018). Integration K-means clustering method and elbow method for identification of the best customer profile cluster. IOP Conference Series: Materials Science and Engineering, 336(1), 012017. [Google Scholar] [CrossRef]
- Tabianan, K., Velu, S., & Ravi, V. (2022). K-Means clustering approach for intelligent customer segmentation using customer purchase behavior data. Sustainability, 14(12), 7243. [Google Scholar] [CrossRef]
- Tu, Y., Chen, K., Wang, H., & Li, Z. (2020). Regional water resources security evaluation based on a hybrid fuzzy BWM-TOPSIS method. International Journal of Environmental Research and Public Health, 17(14), 4987. [Google Scholar] [CrossRef]
- Wang, S., Sun, L., & Yu, Y. (2024). A dynamic customer segmentation approach by combining LRFMS and multivariate time series clustering. Scientific Reports, 14, 17491. [Google Scholar] [CrossRef]
- Wang, Y., Wen, Z., & Li, H. (2020). Symbiotic technology assessment in iron and steel industry based on entropy TOPSIS method. Journal of Cleaner Production, 260, 120900. [Google Scholar] [CrossRef]
- Wei, J. T., Lee, M. C., Chen, H. K., & Wu, H. H. (2013). Customer relationship management in the hairdressing industry: An application of data mining techniques. Expert Systems with Applications, 40(18), 7513–7518. [Google Scholar] [CrossRef]
- Wei, J. T., Lin, S. Y., & Wu, H. H. (2010). A review of the application of the RFM model. African Journal of Business Management, 4(19), 4199. [Google Scholar]
- Yoon, P., & Hwang, C.-L. (1995). Multiple attributes decision making: An introduction. Sage Publications. [Google Scholar]
- You, Z., Si, Y.-W., Zhang, D., Zeng, X., Leung, S., & Li, T. (2015). A decision-making framework for precision marketing. Expert Systems with Applications, 42(7), 3357–3367. [Google Scholar] [CrossRef]
- Zhang, X., Gong, W., & Kawamura, Y. (2004). Customer behavior pattern discovering with web mining. In Asia-Pacific web conference (pp. 844–853). Springer. [Google Scholar]
- Zhao, H. H., Luo, X. C., Ma, R., & Lu, X. (2021). An extended regularized K-means clustering approach for high-dimensional customer segmentation with correlated variables. IEEE Access, 9, 48405–48412. [Google Scholar] [CrossRef]
Researcher(s)/Year | Target(s) | Tool(s) | Result(s) |
---|---|---|---|
(Akter et al., 2025) | CLV forecasting and segmentation for retention strategy | RFM, ML (regression, decision trees, neural nets), K-means++ | Improved CLV prediction accuracy; enabled personalized retention with AI-driven clustering |
(Wang et al., 2024) | Dynamic customer segmentation in auto parts industry | LRFMS model, DTW-D, SBD, CID, AP, SP, k-medoids | Outperformed traditional RFM; identified meaningful segments using multivariate time series clustering |
(Rajput & Singh, 2023) | Customer segmentation to guide platform focus (website vs. app) | K-means, elbow | Identified key clusters; recommended boosting app use and memberships |
(Tabianan et al., 2022) | Customer segmentation in e-commerce | K-means | Behavior-based clustering for profitable customer segmentation |
(Zhao et al., 2021) | Combining K-means and L1-norm | K-means, L-1 norm, RFM | Outperforms K-means with fewer errors |
(Mohammadrezapour et al., 2020) | Comparing two clustering methods | K-means, C-means | C-means yielded higher accuracy than K-means |
(Matz & Hermawan, 2020) | Proposing a model for a cluster of a loyal customer | LRIFMQ, CLV, AHP, K-means | Customers were grouped into six clusters |
(Mahdiraji et al., 2019) | Clustering and ranking bank customers using RFM | RFM modeling, BWM, COPRAS | Classified customers into six clusters and selected two groups as influential ones |
(Syakur et al., 2018) | Determining the best number of clusters | K-means, elbow method | Defining an appropriate number of clusters using the elbow method |
(Doğan et al., 2018) | Clustering retail customers | RFM modeling, K-means, two-step | Comparing two types of clustering results |
(Mosavi & Afsar, 2018) | Analyzing bank customers’ value | FAHP, K-means, random forest classification | Presenting the model according to the applied attributes |
(Peker et al., 2017) | Developing services and increasing profits | LRFMP, K-means, Calinski–Harabasz, Davies–Bouldin, silhouette | Clustering customers into five groups |
(Dursun & Caber, 2016) | Clustering hotel customers | RFM modeling, K-means | Offering proper strategies to each group |
(Ansari & Riasi, 2016) | Combining data mining methods to cluster steel industries’ customers | LRFM modeling, two-step, genetic algorithm, C-means | Classifying customers into two groups, rendering tailored strategies |
(Ganjali & Teimourpour, 2016) | Clustering insurance customers | K-means, CLV, association rule, decision tree, Davies–Bouldin | Classifying customers into five clusters |
(Sarvari et al., 2016) | Clustering fast-food customers | Associated rules, RFML modeling, K-means | Having proper groups is critical to forming strong associations |
(Abirami & Pattabiraman, 2016) | Clustering customers | RFM modeling, K-means, association rules | Predicting customers’ behavior, improving customer satisfaction |
(Srihadi et al., 2016) | Clustering foreign customers | K-means | Identifying groups, proposing proper strategies |
(Chang et al., 2009) | Finding important variables influenced by customer loyalty | Decision tree analysis | Exploring customer behavior |
(Mohammadian & Makhani, 2016) | Analyzing data to identify customer intentions | RFM modeling, CLV | Grouping customers into eight clusters to understand customers |
(You et al., 2015) | Clustering customers | RFM modeling, K-means, CHAID decision trees, Pareto values | Offering precision marketing strategies |
(Dimitrovski & Todorovic, 2015) | Understanding customer behavior | K-means, chi-square test, hierarchical method | Understanding visitor intentions, presenting appropriate promotions |
(Wei et al., 2013) | Clustering hairdressing industry customers | K-means, RFM modeling | Identifying customers, offering proper strategies |
(Chen et al., 2012) | Understanding retail customers | K-means, RFM modeling, decision tree | Classifying customers into five clusters |
(Liao et al., 2010) | Finding hidden patterns in data | K-means, a priori algorithm | Exploring group-buying customer behavior |
(Hosseini et al., 2010) | Clustering SAPCO customers | K-means, WRFM, CLV | Assessing customers, proposing an effective model for understanding customers |
Category | Group | Percentage (%) |
---|---|---|
Gender | Female | 60.4 |
Male | 34.9 | |
Age group | 31–40 | 26.7 |
41–60 | 19.4 | |
Nationality | Iraqi | 22.89 |
Chinese | 7.33 | |
Travel type | Alone | 61.12 |
With others | 31.55 |
Number of K | Silhouette | Davies–Bouldin | Calinski–Harabasz |
---|---|---|---|
Weight of validity indices | 0.34 | 0.36 | 0.28 |
2 | 0.74 | 0.63 | 831.2 |
3 | 0.82 | 0.62 | 1096.65 |
4 | 0.78 | 0.66 | 1287.77 |
5 | 0.70 | 0.63 | 1347.64 |
6 | 0.73 | 0.61 | 1602.60 |
7 | 0.70 | 0.67 | 1484.12 |
8 | 0.74 | 0.70 | 1654.36 |
9 | 0.73 | 0.68 | 1613.19 |
10 | 0.73 | 0.71 | 1652.56 |
RMD Indices | Minimum | Maximum | St. dev. | |
---|---|---|---|---|
R (Recency) | 10 | 365 | 126.8 | 97.0 |
M (Monetary) | 667 | 4724 | 1252.3 | 501.6 |
D (Duration) | 1 | 12 | 3.4 | 1.6 |
Clusters | N | RMD Value | |||
---|---|---|---|---|---|
1 | 579 | 1.00 | 414.10 | 211.60 | R↑M↓D↓ |
2 | 24 | 8.45 | 3221.50 | 202.58 | R↑M↑D↑ |
3 | 81 | 3.66 | 1214.61 | 185.40 | R↑M↓D↑ |
4 | 26 | 2.76 | 1314.88 | 14.00 | R↓M↑D↓ |
5 | 315 | 3.57 | 806.17 | 114.01 | R↓M↓D↑ |
6 | 81 | 1.32 | 542.76 | 33.75 | R↓M↓D↓ |
Total | 1107 | 3.46 | 1252.34 | 126.84 |
Attributes | NC | LC | CBC | PC | BC | LoC |
---|---|---|---|---|---|---|
RMD scores | R↑M↓D↓ | R↑M↑D↑ | R↑M↓D↑ | R↓M↑D↓ | R↓M↓D↑ | R↓M↓D↓ |
N | 579 (52.3%) | 24 (2.16%) | 81 (7.31%) | 26 (2.34%) | 315 (28.45%) | 81 (7.31%) |
Gender | Male (68%) | Male & Female (50–50%) | Male (69%) | Male & Female (50–50%) | Male (65%) | Male (66%) |
Age group | 21–30 (26%) | 41–50 (41%) | 31–40 (28%) | 31–40 (38%) | 31–40 (28%) | 21–30 (27%) |
Nationality | Iraqi (24%) | Iraqi (45%) | Chinese (17.3%) | Iraqi & Chinese (15.3–15.3%) | Iraqi (23%) | Iraqi (18.5%) |
Travel companion | Alone (68.22%) | Two people (20%) | Two (38.2%) | Alone (38%) | Alone (65.7%) | 1 (51.1%) |
Job | Freelance (64.7%) | Freelance (41.6%) | Freelance (39.5%) | Employee (38%) | Freelance (62.2%) | Tourist (72.8%) |
Travel intentions | Tourism (58.5%) | Tourism (43%) | Tourism (49.38%) | Office work (34.6%) | Office work (34.9%) | Tourism (50.6%) |
Duration (days) | 1 (100%) | 7 (33.33%) | 4 (44%) | 1 (50%) | 2 (74.3%) | 1 (76.5%) |
Clusters | Rule | Confidence | Support |
---|---|---|---|
New customers | [male → Iraqi, freelance] | 94.5% | 16.5% |
[tourism → tourist] | 93.5% | 11.3% | |
Loyal customers | [freelance → Iraqi, men] | 100% | 12.5% |
[tourism → tourist] | 100% | 12.5% | |
Collective buying customers | [men → freelance, 41–50] | 100% | 11.11% |
[tourism → Chinese, 31–40] | 100% | 11.11% | |
Potential customers | [men → Chinese] | 100% | 15.3% |
[employee → women, 31–40, office work] | 83.87% | 23.4% | |
Business customers | [men → Iraqi, freelance] | 94.11% | 10.7% |
[men → office work, freelance] | 100% | 12.5% | |
Lost customers | [tourism → 61–90, tourist] | 88% | 12.3% |
Clusters | Cluster Ranking By TOPSIS | N | D | M | R | CLV | CLV Ranking |
---|---|---|---|---|---|---|---|
C1 | 0 | 52.3 | 0.009 | 0.012 | 0.33 | 0.09 | CLV4 |
C2 | 0.86 | 2.16 | 0.66 | 0.7 | 0.3 | 0.59 | CLV1 |
C3 | 0.13 | 7.3 | 0.12 | 0.1 | 0.25 | 0.13 | CLV2 |
C4 | 0.21 | 2.34 | 0.07 | 0.11 | 0.001 | 0.07 | CLV3 |
C5 | 0.12 | 28.45 | 0.11 | 0.04 | 0.097 | 0.05 | CLV5 |
C6 | 0.14 | 7.31 | 0.01 | 0.02 | 0.008 | 0.01 | CLV6 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Deldadehasl, M.; Karahroodi, H.H.; Haddadian Nekah, P. Customer Clustering and Marketing Optimization in Hospitality: A Hybrid Data Mining and Decision-Making Approach from an Emerging Economy. Tour. Hosp. 2025, 6, 80. https://doi.org/10.3390/tourhosp6020080
Deldadehasl M, Karahroodi HH, Haddadian Nekah P. Customer Clustering and Marketing Optimization in Hospitality: A Hybrid Data Mining and Decision-Making Approach from an Emerging Economy. Tourism and Hospitality. 2025; 6(2):80. https://doi.org/10.3390/tourhosp6020080
Chicago/Turabian StyleDeldadehasl, Maryam, Houra Hajian Karahroodi, and Pouya Haddadian Nekah. 2025. "Customer Clustering and Marketing Optimization in Hospitality: A Hybrid Data Mining and Decision-Making Approach from an Emerging Economy" Tourism and Hospitality 6, no. 2: 80. https://doi.org/10.3390/tourhosp6020080
APA StyleDeldadehasl, M., Karahroodi, H. H., & Haddadian Nekah, P. (2025). Customer Clustering and Marketing Optimization in Hospitality: A Hybrid Data Mining and Decision-Making Approach from an Emerging Economy. Tourism and Hospitality, 6(2), 80. https://doi.org/10.3390/tourhosp6020080