Uncovering Insights for New Car Recommendations with Sequence Pattern Mining on Mobile Applications
Abstract
:1. Introduction
- How can sequential pattern mining effectively recommend new products to users of a car info app, especially for small app vendors with limited financial resources?
- What are the browsing behaviors of app users interested in car information, and how can these behaviors be analyzed to provide more intelligent and precise marketing recommendations?
- How can automobile companies use insights from analyzing user behavior and preferences to make sound decisions about creating new products?
2. Literature Review
2.1. Sequential Pattern Mining
2.2. Recent Extensions of Sequential Pattern Mining
2.3. Algorithm Selection for Sequential Pattern Mining
3. Method
3.1. Data
3.2. Method
3.3. Advantages of Method in Car Information App Analysis and Recommendations
3.4. Measures
4. Result
5. Discussion
5.1. Insights for New Product Recommendation
5.2. Theoretical Implications
5.3. Managerial Implications
5.4. Limitations and Future Research
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Nie, L.; Said, K.S.; Ma, L.; Zheng, Y.; Zhao, Y. A systematic mapping study for graphical user interface testing on mobile apps. IET Softw. 2023, 1–19. [Google Scholar] [CrossRef]
- Park, D.H.; Kim, H.K.; Choi, I.Y.; Kim, J.K. A literature review and classification of recommender systems research. Expert Syst. Appl. 2012, 39, 10059–10072. [Google Scholar] [CrossRef]
- Demiriz, A. WebSPADE: A parallel sequence mining algorithm to analyze web log data. In Proceedings of the 2002 IEEE International Conference on Data Mining, Maebashi City, Japan, 9–12 December 2002; IEEE: Maebashi City, Japan, 2002; pp. 755–758. [Google Scholar] [CrossRef]
- Fournier-Viger, P.; Lin, J.C.W.; Kiran, R.U.; Koh, Y.S.; Thomas, R. A survey of sequential pattern mining. Data Sci. Pattern Recognit. 2017, 1, 54–77. [Google Scholar]
- Chen, G.; Li, Z. A new method combining pattern prediction and preference prediction for next basket recommendation. Entropy 2021, 23, 1430. [Google Scholar] [CrossRef] [PubMed]
- Gouda, K.; Hassaan, M.; Zaki, M.J. Prism: An effective approach for frequent sequence mining via prime-block encoding. J. Comput. Syst. Sci. 2010, 76, 88–102. [Google Scholar] [CrossRef]
- Jalilvand, A.; Christino, L.; Paulovich, F.V. Serviz: A Visual Analytics System for the Analysis of Sequential Rules and Its Application to Airport Ground Handling Operations. 2023. Available online: https://doi.org/10.2139/ssrn.4341080 (accessed on 19 May 2023).
- Mabroukeh, N.R.; Ezeife, C.I. A taxonomy of sequential pattern mining algorithms. ACM Comput. Surv. 2010, 43, 1–41. [Google Scholar] [CrossRef]
- Pei, J.; Han, J.; Pinto, H.; Chen, Q.; Dayal, U.; Hsu, M.C. PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth. In Proceedings of the 17th International Conference on Data Engineering, Heidelberg, Germany, 2–6 April 2001; pp. 215–224. [Google Scholar]
- Pei, J.; Han, J.; Mortazavi-Asl, B.; Wang, J.; Pinto, H.; Chen, Q.; Dayal, U.; Hsu, M.-C. Mining sequential patterns by pattern-growth: The PrefixSpan approach. IEEE Trans. Knowl. Data Eng. 2004, 16, 1424–1440. [Google Scholar]
- Srikant, R.; Agrawal, R. Mining sequential patterns: Generalizations and performance improvements. In Proceedings of the Advances in Database Technology—EDBT’96: 5th International Conference on Extending Database Technology, Avignon, France, 25–29 March 1996; Springer: Berlin/Heidelberg, Germany, 1996; Volume 5, pp. 1–17. [Google Scholar]
- Zaki, M.J. SPADE: An efficient algorithm for mining frequent sequences. Mach. Learn. 2001, 42, 31–60. [Google Scholar] [CrossRef]
- Agrawal, R.; Srikant, R. Fast algorithms for mining association rules. In Proceedings of the 20th International Conference on Very Large Data Bases, Santiago, Chile, 12–15 September 1994; VLDB 94; Volume 1215, pp. 487–499. Available online: https://dl.acm.org/doi/10.5555/645920.672836 (accessed on 19 May 2023).
- Agrawal, R.; Srikant, R. Mining sequential patterns. In Proceedings of the Eleventh International Conference on Data Engineering, Washington, DC, USA,, 9–10 March 1995; IEEE: Taipei, Taiwan, 1995; pp. 3–14. Available online: https://10.1109/ICDE.1995.380415 (accessed on 19 May 2023).
- Fournier-Viger, P.; Gomariz, A.; Campos, M.; Thomas, R. Fast vertical mining of sequential patterns using co-occurrence information. In Proceedings of the Advances in Knowledge Discovery and Data Mining: 18th Pacific-Asia Conference, Tainan, Taiwan, 13–16 May 2014; PAKDD 2014. Springer: Tainan, Taiwan, 2014; pp. 40–52. [Google Scholar] [CrossRef]
- Ayres, J.; Flannick, J.; Gehrke, J.; Yiu, T. Sequential pattern mining using a bitmap representation. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’02, Edmonton, AB, Canada, 23–26 July 2002; Association for Computing Machinery: New York, NY, USA, 2002; pp. 429–435. [Google Scholar] [CrossRef]
- Ghani, R.; Probst, K.; Liu, Y.; Krema, M.; Fano, A. Text mining for product attribute extraction. ACM SIGKDD Explor. Newsl. 2006, 8, 41–48. [Google Scholar] [CrossRef]
- Kumar, S.; Kar, A.K.; Ilavarasan, P.V. Applications of text mining in services management: A systematic literature review. Int. J. Inf. Manag. Data Insights 2021, 1, 100008. [Google Scholar] [CrossRef]
- Srinivas, S.; Ramachandiran, S. Passenger intelligence as a competitive opportunity: Unsupervised text analytics for discovering airline-specific insights from online reviews. Ann. Oper. Res. 2023, 1–34. [Google Scholar] [CrossRef]
- Dargan, S.; Kumar, M.; Ayyagari, M.R.; Kumar, G. A survey of deep learning and its applications: A new paradigm to machine learning. Arch. Comput. Methods Eng. 2020, 27, 1071–1092. [Google Scholar] [CrossRef]
- Dong, S.; Wang, P.; Abbas, K. A survey on deep learning and its applications. Comput. Sci. Rev. 2021, 40, 100379. [Google Scholar] [CrossRef]
- Wei, J.; He, J.; Chen, K.; Zhou, Y.; Tang, Z. Collaborative filtering and deep learning based recommendation system for cold start items. Expert Syst. Appl. 2017, 69, 29–39. [Google Scholar] [CrossRef]
- Senturk, I.; Gursoy, N.K.; Oner, T.; Gursoy, A. A novel algorithmic construction for deductions of categorical polysyllogisms by Carroll’s diagrams. Inf. Sci. 2021, 578, 236–256. [Google Scholar] [CrossRef]
Algorithm | Description of Key Features |
---|---|
AprioriAll [14] | AprioriAll is one of the most well-known strategies for sequence pattern mining, and it uses a hash tree as the primary storage structure to generate candidate sequences. |
GSP [11] | GSP carries out multiple passes over the sequence database and uses a depth-first search strategy to generate candidate sequences. |
SPADE/cSPADE [12] | SPADE employs a vertical data representation to illustrate the sequence database and allows for efficient support counting based on equivalence classes. In addition, the SPADE algorithm is enhanced by cSPADE through the incorporation of novel constraints. |
PrefixSpan [10] | PrefixSpan utilizes a depth-first search strategy and prefix-projection technique to reduce the search cost of the database. |
SPAM [11] | SPAM utilizes a vertical data format for transaction databases, which leads to decreased memory usage and computational time. This approach is both efficient and effective. |
CM-Spam [15] | The CM-Spam uses a pattern-growth approach to generate candidate patterns and mine frequent sequential patterns by recursively appending new items to existing patterns. |
CM-Spade [15] | The CM-Spade uses a vertical representation of the sequence database to mine frequent sequential patterns. |
SPAP [5] | SPAP combines association rules, sequential analysis, and periodic patterns to capture complex customer purchase behavior. |
SeRViz [7] | The SeRViz was developed to visualize sequence rules and aid in the exploratory analysis of airport operations. |
Settings | Threshold of Min Support % | Min Support Count | Sequential Rules |
---|---|---|---|
1 | Support ≥ 4.00 | 2640 | 61 Rules |
2 | Support ≥ 3.00 | 1980 | 173 Rules |
3 | Support ≥ 2.00 | 1320 | 691 Rules |
4 | Support ≥ 1.00 | 660 | 4449 Rules |
5 | Support ≥ 0.90 | 594 | 5696 Rules |
6 | Support ≥ 0.80 | 528 | 7460 Rules |
7 | Support ≥ 0.70 | 462 | 10,240 Rules |
8 | Support ≥ 0.60 | 396 | 14,578 Rules |
9 | Support ≥ 0.50 | 330 | 22,077 Rules |
10 | Support ≥ 0.40 | 264 | 37,345 Rules |
11 | Support ≥ 0.30 | 198 | 72,951 Rules |
ID | Sequential Rule | Support | Count | Confidence | Lift |
---|---|---|---|---|---|
1 | {3},{9},{4},{9} | 0.90% | 591 | 51.30% | 2.46 |
2 | {492},{582} | 0.73% | 482 | 21.06% | 2.74 |
3 | {53},{432},{434} | 0.85% | 563 | 25.30% | 2.11 |
4 | {645},{18},{629} | 0.65% | 429 | 23.46% | 2.46 |
5 | {634},{594} | 0.65% | 432 | 21.82% | 1.23 |
6 | {645},{120},{82} | 0.61% | 403 | 26.20% | 2.51 |
7 | {65},{455},{432} | 0.69% | 457 | 30.09% | 2.44 |
8 | {432},{455},{455} | 1.30% | 855 | 41.20% | 3.72 |
9 | {108},{486},{106} | 0.74% | 486 | 46.78% | 6.94 |
10 | {645},{4},{9} | 1.00% | 658 | 43.29% | 2.08 |
11 | {645},{340},{82} | 0.70% | 462 | 39.66% | 3.79 |
12 | {106},{66} | 1.45% | 959 | 21.56% | 1.51 |
13 | {8},{22},{8} | 0.79% | 521 | 30.38% | 2.72 |
14 | {98},{53},{29} | 0.84% | 556 | 38.69% | 3.09 |
15 | {432},{432},{648},{432} | 1.00% | 659 | 64.73% | 5.25 |
16 | {98},{594},{401} | 0.61% | 403 | 21.32% | 1.65 |
17 | {120},{645} | 1.83% | 1206 | 29.07% | 2.49 |
18 | {342},{82} | 4.03% | 2658 | 29.25% | 2.80 |
19 | {4},{645},{323} | 0.80% | 529 | 28.53% | 1.48 |
20 | {496},{494},{491},{493} | 0.98% | 646 | 55.45% | 17.15 |
21 | {98},{3},{9} | 1.96% | 1293 | 47.50% | 2.28 |
22 | {8},{9},{3},{4} | 0.73% | 479 | 31.72% | 2.86 |
23 | {644},{401},{3} | 0.53% | 352 | 25.21% | 1.15 |
24 | {434},{448} | 2.84% | 1876 | 23.65% | 1.94 |
25 | {17},{18} | 2.56% | 1691 | 40.46% | 6.39 |
26 | {431},{53} | 1.79% | 1179 | 24.14% | 1.60 |
27 | {401},{323},{445} | 0.52% | 342 | 22.02% | 1.64 |
28 | {323},{629},{82} | 1.21% | 801 | 40.01% | 3.83 |
29 | {674},{432},{431} | 0.52% | 344 | 20.81% | 2.81 |
30 | {494},{9} | 1.23% | 813 | 27.89% | 1.34 |
ID | Sequential Rule | Support | Count | Confidence | Lift |
---|---|---|---|---|---|
1 | {111},{111} | 2.62% | 1732 | 37.97% | 5.50 |
2 | {458},{111} | 2.61% | 1725 | 35.77% | 5.18 |
3 | {665},{111} | 0.56% | 372 | 24.88% | 3.60 |
4 | {66},{458},{111} | 0.69% | 453 | 39.49% | 5.72 |
5 | {66},{111},{111} | 0.68% | 451 | 41.30% | 5.98 |
6 | {458},{458},{111} | 1.03% | 678 | 37.17% | 5.38 |
7 | {448},{458},{111} | 0.74% | 490 | 39.64% | 5.74 |
8 | {434},{458},{111} | 0.83% | 549 | 36.53% | 5.29 |
9 | {36},{458},{111} | 0.68% | 447 | 39.35% | 5.69 |
10 | {458},{111},{111} | 1.03% | 681 | 39.48% | 5.71 |
11 | {111},{458},{111} | 1.05% | 695 | 44.61% | 6.46 |
12 | {458},{458},{458},{111} | 0.52% | 344 | 39.31% | 5.69 |
13 | {458},{111},{458},{111} | 0.50% | 332 | 46.30% | 6.70 |
14 | {111},{111},{458},{111} | 0.53% | 348 | 56.86% | 8.23 |
15 | {448},{434},{111} | 0.57% | 378 | 20.36% | 2.95 |
16 | {36},{448},{111} | 0.51% | 339 | 20.50% | 2.97 |
17 | {448},{111},{111} | 0.68% | 447 | 39.59% | 5.73 |
18 | {434},{111},{111} | 0.69% | 458 | 39.38% | 5.70 |
19 | {111},{111},{111} | 1.30% | 855 | 49.36% | 7.14 |
20 | {458},{111},{111},{111} | 0.52% | 344 | 50.51% | 7.31 |
21 | {111},{458},{111},{111} | 0.55% | 360 | 51.80% | 7.50 |
22 | {111},{111},{111},{111} | 0.73% | 485 | 56.73% | 8.21 |
ID | Sequential Rule | Support | Count | Confidence | Lift |
---|---|---|---|---|---|
1 | {466},{466} | 1.77% | 1168 | 24.79% | 3.47 |
2 | {66},{466} | 4.26% | 2810 | 29.87% | 4.18 |
3 | {98},{66},{466} | 0.60% | 399 | 31.39% | 4.40 |
4 | {90},{66},{466} | 0.83% | 547 | 32.27% | 4.52 |
5 | {65},{66},{466} | 1.11% | 730 | 29.89% | 4.19 |
6 | {644},{66},{466} | 0.69% | 454 | 33.48% | 4.69 |
7 | {553},{66},{466} | 0.52% | 344 | 29.91% | 4.19 |
8 | {521},{66},{466} | 0.60% | 396 | 37.75% | 5.29 |
9 | {466},{66},{466} | 1.04% | 688 | 30.78% | 4.31 |
10 | {463},{66},{466} | 0.72% | 476 | 32.47% | 4.55 |
11 | {458},{66},{466} | 0.66% | 434 | 32.15% | 4.50 |
12 | {455},{66},{466} | 0.63% | 419 | 34.43% | 4.82 |
13 | {434},{66},{466} | 0.99% | 652 | 31.35% | 4.39 |
14 | {432},{66},{466} | 0.65% | 426 | 31.25% | 4.38 |
15 | {431},{66},{466} | 0.51% | 336 | 35.71% | 5.00 |
16 | {36},{66},{466} | 0.51% | 336 | 31.01% | 4.34 |
17 | {342},{66},{466} | 0.51% | 336 | 31.88% | 4.47 |
18 | {339},{66},{466} | 0.51% | 336 | 30.26% | 4.24 |
19 | {29},{66},{466} | 0.51% | 336 | 30.39% | 4.26 |
20 | {126},{66},{466} | 0.51% | 336 | 30.31% | 4.25 |
21 | {111},{66},{466} | 0.59% | 392 | 32.50% | 4.55 |
22 | {66},{466},{66},{466} | 0.63% | 416 | 31.16% | 4.36 |
23 | {466},{66},{66},{466} | 0.53% | 352 | 34.17% | 4.79 |
24 | {466},{466},{466} | 0.55% | 360 | 30.82% | 4.32 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Liu, H.-W.; Wu, J.-Z.; Wang, Y.-H. Uncovering Insights for New Car Recommendations with Sequence Pattern Mining on Mobile Applications. Appl. Sci. 2023, 13, 6386. https://doi.org/10.3390/app13116386
Liu H-W, Wu J-Z, Wang Y-H. Uncovering Insights for New Car Recommendations with Sequence Pattern Mining on Mobile Applications. Applied Sciences. 2023; 13(11):6386. https://doi.org/10.3390/app13116386
Chicago/Turabian StyleLiu, Hsiu-Wen, Jei-Zheng Wu, and Ying-Hsuan Wang. 2023. "Uncovering Insights for New Car Recommendations with Sequence Pattern Mining on Mobile Applications" Applied Sciences 13, no. 11: 6386. https://doi.org/10.3390/app13116386
APA StyleLiu, H.-W., Wu, J.-Z., & Wang, Y.-H. (2023). Uncovering Insights for New Car Recommendations with Sequence Pattern Mining on Mobile Applications. Applied Sciences, 13(11), 6386. https://doi.org/10.3390/app13116386