End-to-End Personalization via Unifying LLM Agents and Graph Attention Networks for Entertainment Recommendation †
Abstract
1. Introduction
Contributions
- A Generalizable End-to-End Personalization Architecture: We propose a unified recommendation paradigm that extends hybrid graph–semantic modeling from a single-domain setting to a multi-domain entertainment environment. This generalization enables the system to capture richer behavioral signals while improving robustness across heterogeneous item spaces and enabling cross-domain preference modeling.
- Agent-Oriented Semantic Infrastructure for Representation Learning: We introduce a modular agentic design that restructures the preprocessing stage into an intelligent semantic infrastructure capable of transforming raw, heterogeneous metadata into structured representations. This design enhances scalability, promotes architectural flexibility, and supports reproducible feature construction for graph-based learning.
- Refined Feature Initialization for Attention-Based Collaborative Filtering: We develop an improved feature construction methodology that strengthens the alignment between semantic representations and relational graph structure, thereby improving the capacity of the GAT to learn meaningful preference patterns under sparsity and cold-start conditions.
- Post-Processing Intelligence through Confidence-Aware Agentic Reranking: We design a structured post-processing framework in which LLM-driven agents perform confidence-aware reranking over candidate recommendations. This mechanism introduces an additional reasoning layer that consistently improves recommendation quality while mitigating early-stage ranking noise.
- Explainable Recommendation through Natural-Language Justification Agents: To address the growing demand for transparent AI systems, we integrate an explanation agent that generates context-sensitive natural-language rationales, thereby enhancing interpretability without sacrificing predictive performance.
- Comprehensive Empirical Validation Across Domains and Datasets: We conduct extensive experiments spanning multiple datasets and entertainment modalities, in general cases, cold start scenarios, and warm cases, demonstrating consistent gains in ranking metrics and particularly strong improvements in cold-start scenarios. These findings highlight the framework’s capacity for reliable personalization in data-constrained environments.
2. Literature Review
2.1. LLMs for Feature Engineering
2.2. LLMs for Ranking and Interpretability
2.3. Graph Attention Networks in Recommendation Systems
3. Methodology
3.1. Item Profile Preprocessing
3.1.1. Item Profile Generation
3.1.2. Item Profile: Across Content Domains
3.2. User Profile Preprocessing
3.2.1. Input and Output Data Representation
3.2.2. User Profile Generation Strategy
3.2.3. Preference and Dislike Modeling
3.2.4. Implementation and Validation
3.3. Model Training
3.3.1. Semantic Embedding Initialization
3.3.2. Interaction Graph Construction
3.3.3. Graph Attention Architecture
3.3.4. Training Objective and Hybrid Optimization
3.3.5. Hyperparameter Configuration
3.4. Post-Processing Procedure
3.4.1. Reranker Agent
3.4.2. Rank Fusion and Explanation Agent
3.5. Computational Considerations and Scalability
4. Experimental Results
4.1. Data Preprocessing
4.1.1. Interaction Normalization and Aggregation
4.1.2. Sampling Strategy
4.1.3. Train–Test Splitting Strategy
4.2. Baseline Recommendation Algorithms
4.3. Results and Discussion
4.3.1. Results of the Main Proposed Methodology
4.3.2. Cold-Start Users: Robustness Under Sparse Signals
4.3.3. Warm-Start Users: Where Collaborative Filtering Recovers
4.3.4. Results of the Proposed Post-Processing Methodology
4.4. Evaluation of Recommendation Explanations
4.4.1. Evaluation Setup for Explanations
4.4.2. Results of Explanations Analysis
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Zhao, Z.; Fan, W.; Li, J.; Liu, Y.; Mei, X.; Wang, Y.; Wen, Z.; Wang, F.; Zhao, X.; Tang, J.; et al. Recommender Systems in the Era of Large Language Models (LLMs). IEEE Trans. Knowl. Data Eng. 2024, 36, 6889–6907. [Google Scholar] [CrossRef]
- Wu, L.; Zheng, Z.; Qiu, Z.; Wang, H.; Gu, H.; Shen, T.; Qin, C.; Zhu, C.; Zhu, H.; Liu, Q.; et al. A survey on large language models for recommendation. World Wide Web 2024, 27, 60. [Google Scholar] [CrossRef]
- Lin, J.; Dai, X.; Xi, Y.; Liu, W.; Chen, B.; Zhang, H.; Liu, Y.; Wu, C.; Li, X.; Zhu, C.; et al. How Can Recommender Systems Benefit from Large Language Models: A Survey. ACM Trans. Inf. Syst. 2023, 43, 28. [Google Scholar] [CrossRef]
- Rentfrow, P.J.; Gosling, S.D. The do re mi’s of everyday life: The structure and personality correlates of music preferences. J. Pers. Soc. Psychol. 2003, 84, 1236–1256. [Google Scholar] [CrossRef]
- Ebrat, D.; Aminian, T.; Ahmadian, S.; Rueda, L. End-to-End Personalization: Unifying Recommender Systems with Large Language Models. arXiv 2025, arXiv:2508.01514. [Google Scholar]
- Ebrat, D.; Ahmadian, S.; Rueda, L. Vectorized Context-Aware Embeddings for GAT-Based Collaborative Filtering. arXiv 2025, arXiv:2510.264612025. [Google Scholar] [CrossRef]
- Xi, Y.; Liu, W.; Lin, J.; Cai, X.; Zhu, H.; Zhu, J.; Chen, B.; Tang, R.; Zhang, W.; Yu, Y. Towards Open-World Recommendation with Knowledge Augmentation from Large Language Models. In Proceedings of the 18th ACM Conference on Recommender Systems; ACM: New York, NY, USA, 2024; pp. 12–22. [Google Scholar] [CrossRef]
- Liu, F.; Liu, Y.; Chen, H.; Cheng, Z.; Nie, L.; Kankanhalli, M. Understanding Before Recommendation: Semantic Aspect-Aware Review Exploitation via Large Language Models. ACM Trans. Inf. Syst. 2023, 43, 44. [Google Scholar] [CrossRef]
- Torbati, G.H.; Tigunova, A.; Yates, A.; Weikum, G. Recommendations by Concise User Profiles from Review Text. arXiv 2023, arXiv:2311.01314. [Google Scholar] [CrossRef]
- Moon, J.; Park, S.; Lee, J. LLM-Enhanced Linear Autoencoders for Recommendation; ACM: New York, NY, USA, 2025. [Google Scholar] [CrossRef]
- Suzumura, T.; Ikari, H.; Kanezashi, H.; Rahman, M.M.; Hirate, Y. SymCERE: Symmetric Contrastive Learning for Robust Review-Enhanced Recommendation. arXiv 2025, arXiv:2504.02195. [Google Scholar]
- Spillo, G.; Musto, C.; Mannavola, M.; de Gemmis, M.; Lops, P.; Semeraro, G. GAL-KARS: Exploiting LLMs for Graph Augmentation in Knowledge-Aware Recommender Systems. In Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization; ACM: New York, NY, USA, 2025; pp. 73–82. [Google Scholar] [CrossRef]
- Shi, K.; Sun, X.; Wang, D.; Fu, Y.; Xu, G.; Li, Q. LLaMA-E: Empowering E-Commerce Authoring with Object-Interleaved Instruction Following; Association for Computational Linguistics: Stroudsburg, PA, USA, 2023. [Google Scholar]
- Li, Y.; Ma, S.; Wang, X.; Huang, S.; Jiang, C.; Zheng, H.-T.; Xie, P.; Huang, F.; Jiang, Y. EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce. Proc. AAAI Conf. Artif. Intell. 2024, 38, 18582–18590. [Google Scholar] [CrossRef]
- Yang, T.; Ren, B.; Gu, C.; Xu, F.; Ma, B.; Konomi, S. Enhancing Course Recommendation with LLM-Generated Concepts: A Unified Framework for Side Information Integration. Big Data Cogn. Comput. 2025, 9, 311. [Google Scholar] [CrossRef]
- Chen, J.; Ma, L.; Li, X.; Thakurdesai, N.; Xu, J.; Cho, J.H.; Nag, K.; Korpeoglu, E.; Kumar, S.; Achan, K. Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs. arXiv 2023, arXiv:2305.09858. [Google Scholar]
- Chu, Z.; Wang, Y.; Cui, Q.; Li, L.; Chen, W.; Qin, Z.; Ren, K. LLM-Guided Multi-View Hypergraph Learning for Human-Centric Explainable Recommendation. arXiv 2024, arXiv:2401.08217. [Google Scholar]
- Du, Y.; Luo, D.; Yan, R.; Wang, X.; Liu, H.; Zhu, H.; Song, Y.; Zhang, J. Enhancing Job Recommendation through LLM-Based Generative Adversarial Networks. Proc. AAAI Conf. Artif. Intell. 2024, 38, 8363–8371. [Google Scholar] [CrossRef]
- Christakopoulou, K.; Lalama, A.; Adams, C.; Qu, I.; Amir, Y.; Chucri, S.; Vollucci, P.; Soldo, F.; Bseiso, D.; Scodel, S.; et al. Large Language Models for User Interest Journeys. arXiv 2023, arXiv:2305.15498. [Google Scholar] [CrossRef]
- Zheng, Z.; Zhu, Y.; Liu, H.; Ju, M.; Zhao, T.; Shah, N.; Li, J. MI4Rec: Pretrained Language Model based Cold-Start Recommendation with Meta-Item Embeddings. In Proceedings of the 34th ACM International Conference on Information and Knowledge Management; ACM: New York, NY, USA, 2025; pp. 4455–4465. [Google Scholar] [CrossRef]
- He, X.; Liao, L.; Zhang, H.; Nie, L.; Hu, X.; Chua, T.-S. Neural Collaborative Filtering. In Proceedings of the 26th International Conference on World Wide Web (WWW 2017); Barrett, R., Cummings, R., Agichtein, E., Gabrilovich, E., Eds.; International World Wide Web Conferences Steering Committee: Perth, Australia, 2017; pp. 173–182. [Google Scholar] [CrossRef]
- Cheng, M.; Liu, Q.; Zhang, W.; Liu, Z.; Zhao, H.; Chen, E. A general tail item representation enhancement framework for sequential recommendation. Front. Comput. Sci. 2024, 18, 186333. [Google Scholar] [CrossRef]
- Wang, H.; Zhang, F.; Wang, J.; Zhao, M.; Li, W.; Xie, X.; Guo, M. RippleNet. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management; ACM: New York, NY, USA, 2018; pp. 417–426. [Google Scholar] [CrossRef]
- Su, X.; Khoshgoftaar, T.M. A Survey of Collaborative Filtering Techniques. Adv. Artif. Intell. 2009, 2009, 421425. [Google Scholar] [CrossRef]
- Lin, J.; Chen, B.; Wang, H.; Xi, Y.; Qu, Y.; Dai, X.; Zhang, K.; Tang, R.; Yu, Y.; Zhang, W. ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction. In Proceedings of the ACM Web Conference 2024; ACM: New York, NY, USA, 2024; pp. 3319–3330. [Google Scholar] [CrossRef]
- Yang, S.; Wang, C.; Liu, Y.; Xu, K.; Ma, W.; Liu, Y.; Zhang, M.; Zeng, H.; Feng, J.; Deng, C. Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation. In Proceedings of the 2023 IEEE International Conference on Data Mining (ICDM); IEEE: New York, NY, USA, 2023; pp. 728–737. [Google Scholar] [CrossRef]
- Bao, K.; Zhang, J.; Zhang, Y.; Wang, W.; Feng, F.; He, X. TALLRec: An Effective and Efficient Tuning Framework to Align Large Language Model with Recommendation. In Proceedings of the 17th ACM Conference on Recommender Systems; ACM: New York, NY, USA, 2023; pp. 1007–1014. [Google Scholar] [CrossRef]
- Di Palma, D.; Biancofiore, G.M.; Anelli, V.W.; Narducci, F.; Di Noia, T.; Di Sciascio, E. Evaluating ChatGPT as a Recommender System: A Rigorous Approach. arXiv 2024, arXiv:2309.03613. [Google Scholar]
- Yue, Z.; Rabhi, S.; de Wang, D.S.G.; Oldridge, E. LlamaRec: Two-Stage Recommendation using Large Language Models for Ranking. arXiv 2023, arXiv:2311.02089. [Google Scholar]
- Wang, Y.; Liu, Z.; Zhang, J.; Yao, W.; Heinecke, S.; Yu, P.S. DRDT: Dynamic Reflection with Divergent Thinking for LLM-based Sequential Recommendation. arXiv 2023, arXiv:2312.11336. [Google Scholar] [CrossRef]
- Hou, Y.; Zhang, J.; Lin, Z.; Lu, H.; Xie, R.; McAuley, J.; Zhao, W.X. Large Language Models are Zero-Shot Rankers for Recommender Systems. In Proceedings of the European Conference on Information Retrieval; Springer Nature: Cham, Switzerland, 2024. [Google Scholar]
- Golub, G.H. Least squares, singular values and matrix approximations. Appl. Math. 1968, 13, 44–51. [Google Scholar] [CrossRef]
- Gori, M.; Pucci, A. ItemRank: A random-walk based scoring algorithm for recommender engines. In Proceedings of the 20th International Joint Conference on Artifical Intelligence; In IJCAI’07; Morgan Kaufmann Publishers Inc.: San Francisco, CA, USA, 2007; pp. 2766–2771. [Google Scholar]
- He, X.; Gao, M.; Kan, M.-Y.; Wang, D. BiRank: Towards Ranking on Bipartite Graphs. IEEE Trans. Knowl. Data Eng. 2017, 29, 57–71. [Google Scholar] [CrossRef]
- Yang, Q.; Zhang, S.; Dong, S.; Xu, L.; Dong, W.; Li, X.; Sun, P.; Jiang, F.; Zhang, X.; Luo, G. Graph Convolutional Network with Neural Inductive Matrix Completion for Predicting Disease-Related LncRNA Genes. In Proceedings of the 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM); IEEE: New York, NY, USA, 2023; pp. 3595–3601. [Google Scholar] [CrossRef]
- Ying, R.; He, R.; Chen, K.; Eksombatchai, P.; Hamilton, W.L.; Leskovec, J. Graph Convolutional Neural Networks for Web-Scale Recommender Systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining; ACM: New York, NY, USA, 2018; pp. 974–983. [Google Scholar] [CrossRef]
- He, X.; Deng, K.; Wang, X.; Li, Y.; Zhang, Y.; Wang, M. LightGCN. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval; ACM: New York, NY, USA, 2020; pp. 639–648. [Google Scholar] [CrossRef]
- Chernov, A.; Wahab, H.; Novitskij, O. Leveraging Language Semantics for Collaborative Filtering with TextGCN and TextGCN-MLP: Zero-Shot vs In-Domain Performance. arXiv 2025, arXiv:2510.12461. [Google Scholar]
- Elahi, E.; Anwar, S.; Al-Kfairy, M.; Rodrigues, J.J.; Ngueilbaye, A.; Halim, Z.; Waqas, M. Graph attention-based neural collaborative filtering for item-specific recommendation system using knowledge graph. Expert Syst. Appl. 2025, 266, 126133. [Google Scholar] [CrossRef]
- Zhang, S.; Li, Z.; Wang, X.; Chen, Z.; Guo, W. TKGAT: Temporal Knowledge Graph Representation Learning Using Attention Network; Springer Nature: Cham, Switzerland, 2023; pp. 46–61. [Google Scholar] [CrossRef]
- Ren, X.; Wei, W.; Xia, L.; Su, L.; Cheng, S.; Wang, J.; Yin, D.; Huang, C. Representation Learning with Large Language Models for Recommendation. In Proceedings of the ACM Web Conference 2024; ACM: New York, NY, USA, 2024; pp. 3464–3475. [Google Scholar] [CrossRef]
- Harper, F.M.; Konstan, J.A. The MovieLens Datasets: History and Context. 2015. Available online: http://grouplens.org/datasets/movielens (accessed on 1 January 2026).
- Zajac, Z. Goodbooks-10k: A New Dataset for Book Recommendations; FastML: London, UK, 2017. [Google Scholar]
- Santana, I.A.P.; Pinhelli, F.; Donini, J.; Catharin, L.; Mangolin, R.B.; Feltrim, V.D.; Domingues, M.A. Music4All: A New Music Database and Its Applications. In Proceedings of the 2020 International Conference on Systems, Signals and Image Processing (IWSSIP), Niterói, Brazil, 1–3 July 2020; pp. 399–404. [Google Scholar]
- Cormack, G.V.; Clarke, C.L.A.; Buettcher, S. Reciprocal rank fusion outperforms condorcet and individual rank learning methods. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval; In SIGIR ’09; Association for Computing Machinery: New York, NY, USA, 2009; pp. 758–759. [Google Scholar] [CrossRef]








| Dataset | Segment | N Users | Min | Max | Mean | Median | Overall Mean |
|---|---|---|---|---|---|---|---|
| Goodbook | Cold (5%) | 50 | 20 | 56 | 44.24 | 48.0 | 89.44 |
| Warm (5%) | 50 | 128 | 146 | 134.68 | 133.5 | 89.44 | |
| Music4all | Cold (5%) | 50 | 6 | 48 | 31.24 | 32.0 | 147.27 |
| Warm (5%) | 50 | 250 | 338 | 278.30 | 270.5 | 147.27 | |
| ML-100k | Cold (5%) | 47 | 4 | 13 | 10.72 | 11.0 | 84.84 |
| Warm (5%) | 47 | 260 | 685 | 335.53 | 315.0 | 84.84 | |
| ML-1M | Cold (5%) | 302 | 11 | 18 | 16.48 | 17.0 | 132.48 |
| Warm (5%) | 302 | 449 | 1867 | 635.05 | 590.0 | 132.48 |
| Dataset | Goodbook | Music4All | ML-100K | ML-1M | |||||
|---|---|---|---|---|---|---|---|---|---|
| Metric | MAP | NDCG | MAP | NDCG | MAP | NDCG | MAP | NDCG | |
| Method | |||||||||
| SVD | 0.0848 ± 0.0088 | 0.1575 ± 0.0116 | 0.0447 ± 0.0065 | 0.0544 ± 0.0074 | 0.2240 ± 0.0209 | 0.3727 ± 0.0239 | 0.0812 ± 0.0035 | 0.1629 ± 0.0045 | |
| NGCF | 0.0754 ± 0.0077 | 0.1453 ± 0.0110 | 0.0438 ± 0.0074 | 0.0494 ± 0.0075 | 0.2669 ± 0.0249 | 0.4110 ± 0.0245 | 0.2076 ± 0.0060 | 0.3208 ± 0.0069 | |
| LightGCN | 0.0870 ± 0.0087 | 0.1619 ± 0.0122 | 0.1019 ± 0.0135 | 0.1037 ± 0.0123 | 0.1850 ± 0.0196 | 0.3134 ± 0.0219 | 0.1751 ± 0.0058 | 0.2683 ± 0.0066 | |
| NCF | 0.0349 ± 0.0050 | 0.0736 ± 0.0079 | 0.0100 ± 0.0028 | 0.0146 ± 0.0034 | 0.1972 ± 0.0219 | 0.3319 ± 0.0226 | 0.1063 ± 0.0044 | 0.1962 ± 0.0053 | |
| GAT | 0.0435 ± 0.0059 | 0.0878 ± 0.0086 | 0.0188 ± 0.0042 | 0.0248 ± 0.0052 | 0.2165 ± 0.0244 | 0.3242 ± 0.0259 | 0.1766 ± 0.0062 | 0.2605 ± 0.0070 | |
| RLMRec | 0.0491 ± 0.0068 | 0.0946 ± 0.0095 | 0.0179 ± 0.0046 | 0.0198 ± 0.0043 | 0.1661 ± 0.0229 | 0.2585 ± 0.0251 | 0.1569 ± 0.0059 | 0.2357 ± 0.0067 | |
| Our Method | 0.1145 ± 0.0100 | 0.1972 ± 0.0138 | 0.1261 ± 0.0171 | 0.1287 ± 0.0151 | 0.3448 ± 0.0286 | 0.4713 ± 0.0261 | 0.2429 ± 0.0070 | 0.3410 ± 0.0073 | |
| +31.6% | +21.8% | +23.7% | +24.1% | +29.1% | +14.6% | +17% | +6.2% | ||
| Scenario | Dataset | Goodbook | Music4All | ML-100K | ML-1M | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Metric | MAP | NDCG | MAP | NDCG | MAP | NDCG | MAP | NDCG | ||
| Method | ||||||||||
| Cold | SVD | 0.1400 ± 0.0900 | 0.1640 ± 0.0900 | 0.1200 ± 0.0900 | 0.0932 ± 0.0729 | 0.1915 ± 0.1170 | 0.2404 ± 0.1032 | 0.0861 ± 0.0315 | 0.0925 ± 0.0298 | |
| NGCF | 0.0800 ± 0.0700 | 0.0800 ± 0.0661 | 0.1200 ± 0.0900 | 0.0603 ± 0.0557 | 0.2553 ± 0.1170 | 0.3032 ± 0.1208 | 0.1093 ± 0.0348 | 0.1183 ± 0.0343 | ||
| LightGCN | 0.1400 ± 0.1000 | 0.1600 ± 0.0940 | 0.0800 ± 0.0700 | 0.0534 ± 0.0567 | 0.1702 ± 0.1064 | 0.2660 ± 0.1032 | 0.1358 ± 0.0381 | 0.1442 ± 0.0391 | ||
| NCF | 0.1200 ± 0.0900 | 0.1240 ± 0.0800 | 0.0400 ± 0.0500 | 0.0151 ± 0.0216 | 0.1277 ± 0.0957 | 0.1617 ± 0.0957 | 0.1060 ± 0.0348 | 0.1123 ± 0.0349 | ||
| GAT | 0.1000 ± 0.0800 | 0.1000 ± 0.0900 | 0.0000 ± 0.0000 | 0.0003 ± 0.0004 | 0.1064 ± 0.0957 | 0.1223 ± 0.0809 | 0.0662 ± 0.0281 | 0.0603 ± 0.0255 | ||
| RLMRec | 0.0200 ± 0.0300 | 0.0200 ± 0.0300 | 0.0200 ± 0.0300 | 0.0003 ± 0.0004 | 0.0000 ± 0.0000 | 0.0170 ± 0.0213 | 0.0298 ± 0.0182 | 0.0278 ± 0.0172 | ||
| Our Method | 0.2600 ± 0.1300 | 0.2760 ± 0.1140 | 0.1800 ± 0.1100 | 0.1145 ± 0.0783 | 0.2766 ± 0.1277 | 0.3553 ± 0.1197 | 0.1457 ± 0.0414 | 0.1498 ± 0.0382 | ||
| Warm | SVD | 0.2600 ± 0.1200 | 0.2800 ± 0.1100 | 0.0200 ± 0.0300 | 0.0250 ± 0.0325 | 0.4286 ± 0.2500 | 0.5286 ± 0.1857 | 0.2583 ± 0.0480 | 0.3205 ± 0.0427 | |
| NGCF | 0.1800 ± 0.1000 | 0.2000 ± 0.1000 | 0.0800 ± 0.0700 | 0.0565 ± 0.0513 | 0.6429 ± 0.2500 | 0.6571 ± 0.1714 | 0.6093 ± 0.0547 | 0.6589 ± 0.0384 | ||
| LightGCN | 0.2200 ± 0.1200 | 0.2240 ± 0.1060 | 0.0400 ± 0.0500 | 0.0500 ± 0.0465 | 0.7143 ± 0.2500 | 0.6429 ± 0.2071 | 0.6788 ± 0.0513 | 0.7139 ± 0.0371 | ||
| NCF | 0.0400 ± 0.0500 | 0.0400 ± 0.0500 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.1429 ± 0.1786 | 0.4000 ± 0.1643 | 0.3510 ± 0.0530 | 0.4338 ± 0.0440 | ||
| GAT | 0.1200 ± 0.0900 | 0.1480 ± 0.0860 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.7857 ± 0.2143 | 0.7429 ± 0.1571 | 0.7483 ± 0.0480 | 0.7583 ± 0.0374 | ||
| RLMRec | 0.1600 ± 0.1000 | 0.1600 ± 0.0920 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.5000 ± 0.2857 | 0.5000 ± 0.2286 | 0.7185 ± 0.0514 | 0.7364 ± 0.0404 | ||
| Our Method | 0.2200 ± 0.1100 | 0.2240 ± 0.1061 | 0.0400 ± 0.0500 | 0.0371 ± 0.0450 | 0.9286 ± 0.1071 | 0.8571 ± 0.1286 | 0.8344 ± 0.0414 | 0.8285 ± 0.0318 | ||
| Scenario | Dataset | Goodbook | Music4All | ML-100K | ML-1M | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Metric | MAP | NDCG | MAP | NDCG | MAP | NDCG | MAP | NDCG | ||
| Variant | ||||||||||
| Cold | Base CF | 0.2200 ± 0.1100 | 0.2280 ± 0.1060 | 0.4000 ± 0.4000 | 0.2186 ± 0.3093 | 0.3684 ± 0.1579 | 0.4079 ± 0.1375 | 0.1800 ± 0.1100 | 0.1920 ± 0.1020 | |
| LLM Reranked | 0.1600 ± 0.1100 | 0.1840 ± 0.0981 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.1316 ± 0.1184 | 0.1671 ± 0.0993 | 0.0800 ± 0.0700 | 0.1040 ± 0.0781 | ||
| RRF Balanced | 0.2200 ± 0.1000 | 0.2520 ± 0.1080 | 0.4000 ± 0.4000 | 0.0699 ± 0.0862 | 0.3158 ± 0.1319 | 0.3461 ± 0.1316 | 0.2200 ± 0.1100 | 0.2000 ± 0.1041 | ||
| Warm | Base CF | 0.2600 ± 0.1200 | 0.2560 ± 0.1180 | 0.2000 ± 0.3000 | 0.0400 ± 0.0600 | 0.8182 ± 0.2273 | 0.8182 ± 0.1727 | 0.6200 ± 0.1300 | 0.6700 ± 0.1080 | |
| LLM Reranked | 0.2400 ± 0.1100 | 0.2360 ± 0.1180 | 0.0000 ± 0.0000 | 0.0000 ± 0.0000 | 0.8182 ± 0.2273 | 0.8727 ± 0.0909 | 0.4800 ± 0.1400 | 0.4640 ± 0.1240 | ||
| RRF Balanced | 0.2800 ± 0.1300 | 0.2800 ± 0.1160 | 0.2000 ± 0.3000 | 0.0400 ± 0.0600 | 0.9091 ± 0.1364 | 0.9091 ± 0.0727 | 0.5600 ± 0.1400 | 0.6140 ± 0.1200 | ||
| Dataset | Relevance | Faithfulness | Personalization | Coherence | Average |
|---|---|---|---|---|---|
| Goodbook | 4.30 ± 0.52 | 4.17 ± 0.53 | 3.91 ± 0.62 | 4.37 ± 0.51 | 4.19 ± 0.55 |
| Music4all | 4.17 ± 0.54 | 4.00 ± 0.60 | 3.86 ± 0.68 | 4.42 ± 0.50 | 4.11 ± 0.58 |
| ML-100k | 4.31 ± 0.47 | 4.08 ± 0.58 | 3.92 ± 0.60 | 4.46 ± 0.49 | 4.19 ± 0.54 |
| ML-1M | 4.33 ± 0.50 | 4.15 ± 0.58 | 4.06 ± 0.65 | 4.53 ± 0.42 | 4.27 ± 0.55 |
| Overall | 4.28 ± 0.51 | 4.10 ± 0.58 | 3.94 ± 0.64 | 4.44 ± 0.48 | 4.19 ± 0.56 |
| Dataset | SAS_User | SAS_Item | SAS_Combined |
|---|---|---|---|
| Goodbook | 0.538 ± 0.068 | 0.705 ± 0.067 | 0.621 ± 0.043 |
| Music4all | 0.625 ± 0.068 | 0.749 ± 0.081 | 0.687 ± 0.047 |
| ML-100k | 0.563 ± 0.066 | 0.707 ± 0.085 | 0.635 ± 0.045 |
| ML-1M | 0.592 ± 0.071 | 0.720 ± 0.071 | 0.656 ± 0.040 |
| Overall | 0.579 ± 0.075 | 0.720 ± 0.078 | 0.650 ± 0.051 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Ebrat, D.; Ahmadian, S.; Rueda, L. End-to-End Personalization via Unifying LLM Agents and Graph Attention Networks for Entertainment Recommendation. Information 2026, 17, 344. https://doi.org/10.3390/info17040344
Ebrat D, Ahmadian S, Rueda L. End-to-End Personalization via Unifying LLM Agents and Graph Attention Networks for Entertainment Recommendation. Information. 2026; 17(4):344. https://doi.org/10.3390/info17040344
Chicago/Turabian StyleEbrat, Danial, Sepideh Ahmadian, and Luis Rueda. 2026. "End-to-End Personalization via Unifying LLM Agents and Graph Attention Networks for Entertainment Recommendation" Information 17, no. 4: 344. https://doi.org/10.3390/info17040344
APA StyleEbrat, D., Ahmadian, S., & Rueda, L. (2026). End-to-End Personalization via Unifying LLM Agents and Graph Attention Networks for Entertainment Recommendation. Information, 17(4), 344. https://doi.org/10.3390/info17040344

