1. Introduction
The explosion of online information and digital content today has greatly exacerbated the information overload problem faced by users. This trend highlights the importance of recommender systems in filtering and pushing relevant content, which becomes a key tool for guiding users’ attention, increasing interaction frequency, and prolonging platform dwell time. Nowadays, recommender systems optimize the user experience through personalized content recommendations [
1]. However, despite the remarkable achievements in the industry, we still lack a deep and systematic theoretical understanding of how recommended content diffuses in the user community and how users interact with each other to facilitate clicking or consuming behaviors. Previous studies have focused on improving recommendation accuracy [
2], optimizing sorting strategies [
3], or introducing contextual information to enhance user satisfaction, but these approaches are often based on static evaluation metrics, ignoring the temporal characteristics of recommended content spreading in dynamic user networks and the changing process of user engagement. In fact, a recommendation algorithm is not only a content matching process [
4], but also a complex information diffusion process with obvious propagation and evolution [
5]. Therefore, it is of great theoretical and practical significance to understand how recommendation strategies diffuse content across user networks from a macro perspective, as this sheds light on the collective dynamics of user engagement, platform influence, and viral propagation.
The motivation for this study stems from bridging the gap between recommender systems and information diffusion modeling by proposing an interdisciplinary analytical framework. We consider the recommendation process as analogous to the spread of an infectious disease in a social network [
6]. This perspective not only captures the dynamic process of user reception and dissemination of content, but also quantifies the performance of different recommendation algorithms in terms of propagation speed, reach, and user coverage. Specifically, we explore the path of integrating three classical recommendation strategies: popularity-based [
7], collaborative filtering [
8], and content-based recommendation [
9] with an epidemic network model [
10]. With the help of real user interaction data, we construct a scale-free network reflecting the real social structure [
11], and introduce a time-varying propagation rate function to simulate the phenomenon of user engagement decay over time, which makes the model closer to the user behavioral patterns in online platforms. This design can more realistically reflect the evolutionary trend of the recommendation effect under a long running time, and also provides an interpretable theoretical basis for the study of user churn and cold start problems [
12]. Based on this methodological framework, we not only systematically compare the differences in content dissemination efficiency between different recommendation strategies, but also explore the interaction mechanism between algorithm design and user network structure. The experimental results reveal the rapid diffusion ability of collaborative filtering in the initial propagation, but also point out its weak propagation sustainability; in contrast, the popularity-based approach maintains high propagation stability over a long period of time despite its limited initial influence. These findings provide empirical references for the dynamic scheduling of recommendation algorithms and the design of hybrid strategies.
The main contributions of this paper are as follows. First, this paper introduces the SI model in epidemiology into the study of recommender systems, and constructs an analytical framework that can dynamically simulate the process of content diffusion, so as to be closer to the real-life changes in user behavior. Second, in a scale-free network environment constructed based on real clickstream data, the system compares the performance of three types of classical recommendation strategies: popularity-based, collaborative filtering, and content-based approaches in terms of propagation efficiency. Third, this paper introduces time-varying propagation rate to characterize the phenomenon of user engagement decay over time, providing a more nuanced perspective for understanding the evolution of recommendation effectiveness. Finally, this interdisciplinary study not only enriches the theoretical connection between recommender systems and information diffusion mechanisms, but also provides insights with practical value for optimizing recommendation algorithms to maintain user engagement.
The remainder of this paper is organized as follows.
Section 2 briefly reviews the related literature. In
Section 3, we clarify our research motivation and present research questions.
Section 4 defines the problem of modeling recommendation diffusion and outlines key assumptions.
Section 5 introduces our proposed approach, which integrates classical recommendation algorithms with a network-based SI model incorporating time-varying infection rates. In
Section 6, we verify the model’s effectiveness and feasibility by testing its performance under various scenarios.
Section 7 discusses the limitations of the current model and suggests that more sophisticated propagation mechanisms could be introduced in the future to enhance the realistic applicability of the model. Finally,
Section 8 summarizes the key contributions of the article and outlines potential avenues for future research.
6. Experiments
This paper designs and implements a series of simulation experiments to test the proposed recommendation propagation framework based on epidemic modeling in real-world environments. Real-world user click data serves as the foundation for our experiments while we systematically assess propagation efficiency and dynamic evolution of three classical recommendation algorithms using a scale-free user network that reflects real-world network structure characteristics. The next section details the dataset and network construction method while explaining the recommendation strategy settings and propagation simulation process used in experiments before presenting and analyzing key experimental results.
6.6. Comparative Analysis of Propagation Performance Across Recommendation Algorithms
In
Figure 7 we observe the result of the performance propagation of three recommendation algorithms, namely Content-Based Filtering algorithm, Collaborative Filtering algorithm and a popularity-based algorithm (Popularity-Based) on the three dimensions of user attributes (language, city and type). In each simulation, we run all three algorithms in the same simulation settings, with the nine subfigures according to different algorithms and user attributes.
In all of the subgraphs, we can see the content adoption rate of users increases rapidly in the first two time steps, and then the rate of growth slows down and levels off, showing a trend of obvious decreasing marginal effect. This is the typical information diffusion in the scale-free network: fast propagation at the beginning, but as it approaches saturation, the propagation capacity will be significantly reduced.
In all of the three algorithms, the collaborative filtering algorithm (subfigures d, e and f) has the best performance in propagation speed and coverage. The algorithm fully plays the role of the similarity of interests between users; it could deliver the content to users who are more interested faster by taking advantage of the network proximity structure, thus effectively triggering the network effect.
The overall spreading performance of the content-based recommendation algorithm (subfigures a, b and c) is in the middle. Its spreading rate is relatively slow, the main reason is that the algorithm depends on the feature information of items rather than the behavioral relationship between users, and thus has limited propagation ability in the network structure. We could also find that the algorithm’s performance is slightly different in different user attribute dimensions, the spreading effect based on the “content type” dimension is relatively stable.
The popularity-based recommendation model (subgraphs g, h, i) introduces a time-decaying mechanism for the propagation probability, and exhibits more diverse diffusion characteristics. Although the method shows a strong spreading ability in the initial stage, its curve stabilizes earlier and fluctuates more among different user groups. This phenomenon is particularly pronounced in language or genre subgroups with smaller user bases, suggesting that the decay mechanism has a greater impact on long-term diffusion in smaller populations.
It is interesting to note that the spread gap between different user groups (e.g., language, city, type) is most significant in the popularity-based model. This suggests that a single strategy relying on popularity may inadvertently exacerbate the imbalance in content exposure, allowing groups that are already dominant to receive more referral resources.
Combining collaborative filtering algorithms with epidemic propagation modeling methods provides new perspectives for understanding user behavior and network interaction mechanisms. On the one hand, the method highlights the important influence of initial recommendation strength on the overall propagation path; on the other hand, by introducing the time-decaying propagation rate, it more realistically simulates the behavioral characteristics of users’ decreasing interest over time, such as information fatigue or interest transfer.
In addition, this study also emphasizes the influence of network structure and user attribute differences on recommendation propagation efficiency. In a network structure with high connectivity and “hub” users, it is easier for recommendation content to achieve rapid diffusion, while in a network with sparse structure or more edge users, recommendation diffusion may be limited, and more targeted strategies need to be designed to improve the effect.
7. Discussion
In fact, although the modeling method of recommendation propagation dynamics in this paper based on epidemiology provides some new ideas, there are also many shortcomings. Here we summarize several obvious deficiencies to be improved in this work, hoping to provide some direction for follow-up work.
Firstly, this paper employs the classical SI (susceptible-infected) model, where a user who has seen and clicked on the recommended content is considered “infected” and will not recover. It is assumed that the infected individuals do not lose their interest in the item and will be eternally infected. However, user behavior in real life is generally more complicated and also time-aware. In the real propagation process, there may be two kinds of effects of time on users. On the one hand, in the short term, users may quickly forget the recommended contents and lose interest in it; on the other hand, due to the intervention of external forces (such as information overload or deletion of platform authority), users may temporarily or permanently interrupt the propagation. But these processes are impossible to reflect in an SI model.
In order to depict the change of user behavior more realistically, it is also possible to introduce a more complex model of user communication in future research. For instance, in the SIR model, users can transition to the “recovered” state after a certain period of “infection” [
31], which is suitable for the situation where the content attraction is declining or users are gradually cooling down. In the SEIR model, users have to go through the “exposed” state before becoming infected [
32], and it is used to model the propagation behavior of users in the latent period after receiving the recommendation. The extended models mentioned above are expected to further enrich the description of the diffusion of recommended content and the time characteristics of user response.
In addition, the scale-free network used for propagation simulation in this paper was generated by the Barabasi–Albert model, which can better reflect the “central node” and “long-tailed distribution” structural characteristics of the real social network, but it is still an idealized modeling. Real networks have many complex attributes, such as community structure, homogeneous connection [
33], asymmetric influence, etc. Therefore, future research can construct a more representative network structure based on real social platforms or content dissemination data, so as to improve the external validity and predictive power of the model.
From the perspective of propagation performance of recommendation algorithms, it is observed that collaborative filtering algorithms have more balanced and efficient propagation ability across different user groups, which fully leverages the potential of user preference similarity and network structure. In comparison, the performance of popularity-based recommendation methods decreases more rapidly over time after an impressive initial propagation stage, while their performance also varies across different user groups more. This gap indicates the possibility that a popularity-driven recommendation mechanism might further amplify the imbalance of content exposure, leading to higher exposure in mainstream groups and further marginalization of marginalized groups.
These results provide crucial guidance for developing practical recommender systems. Recently, as more attention has been paid to platform user stickiness, fairness, content distribution, and other issues, how to improve recommendation efficiency and distribute content more scientifically has become an urgent problem to be solved. Our experiments have found that the recommendation algorithm can not only improve the individual recommendation experience, but also affect the overall content diffusion mode at the network level. Therefore, when designing more socially responsible recommendation strategies, platforms need to consider not only the dissemination mechanism itself, but also the characteristics of the user structure. Methodologically, by connecting collaborative filtering algorithms with epidemic propagation models, we have found a new approach for studying user behavior mechanisms and network interaction. On the one hand, the method emphasizes the impact of the initial recommendation strength on the entire propagation path; on the other hand, it introduces the time-varying propagation rate to more realistically reflect the behavioral characteristics of users’ gradually waning interest, such as fatigue or change of interest.
Author Contributions
Conceptualization, P.H., X.Q.; methodology, P.H. and X.Q.; software, P.H.; validation, P.H.; formal analysis, P.H.; investigation, P.H., L.S., X.G., Y.Z. and X.Q.; resources, P.H., Y.Z. and X.Q.; data curation, P.H. and X.Q.; writing—original draft, P.H.; writing—review and editing, L.S., X.G., Y.Z. and X.Q.; visualization, P.H., L.S., X.G., Y.Z. and X.Q.; supervision, X.Q.; project administration, P.H., Y.Z. and X.Q. All authors have read and agreed to the published version of the manuscript.
Funding
Xiao Qin’s work is supported by the U.S. NSF (Grant DUE-2424934), NASA (Grant 80NSSC20M0044), NHTSA (Grant 451861-19158), Alabama Research and Development Enhancement Fund (Grant 1ARDEF25 02), and Wright Media, LLC (Grants 240250 and 240311).
Data Availability Statement
The original contributions presented in this study are included in the article material. Further inquiries can be directed to the corresponding authors.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Liang, T.P.; Lai, H.J.; Ku, Y.C. Personalized content recommendation and user satisfaction: Theoretical synthesis and empirical findings. J. Manag. Inf. Syst. 2006, 23, 45–70. [Google Scholar] [CrossRef]
- Gedikli, F.; Jannach, D. Improving recommendation accuracy based on item-specific tag preferences. ACM Trans. Intell. Syst. Technol. 2013, 4, 1–19. [Google Scholar] [CrossRef]
- Vasto-Terrientes, L.D.; Valls, A.; Zielniewicz, P.; Borras, J. A hierarchical multi-criteria sorting approach for recommender systems. J. Intell. Inf. Syst. 2016, 46, 313–346. [Google Scholar] [CrossRef]
- Mao, M.; Lu, J.; Zhang, G.; Zhang, J. A fuzzy content matching-based e-commerce recommendation approach. In Proceedings of the 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), Istanbul, Turkey, 2–5 August 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 1–8. [Google Scholar]
- Wan, S.; Niu, Z. A hybrid e-learning recommendation approach based on learners’ influence propagation. IEEE Trans. Knowl. Data Eng. 2019, 32, 827–840. [Google Scholar] [CrossRef]
- Sooknanan, J.; Comissiong, D. Trending on social media: Integrating social media into infectious disease dynamics. Bull. Math. Biol. 2020, 82, 86. [Google Scholar] [CrossRef] [PubMed]
- Cañamares, R.; Castells, P. Should I follow the crowd? A probabilistic analysis of the effectiveness of popularity in recommender systems. In Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, Ann Arbor, MI, USA, 8–12 July 2018; pp. 415–424. [Google Scholar]
- Goldberg, D.; Nichols, D.; Oki, B.M.; Terry, D. Using collaborative filtering to weave an information tapestry. Commun. ACM 1992, 35, 61–70. [Google Scholar] [CrossRef]
- Di Noia, T.; Mirizzi, R.; Ostuni, V.C.; Romito, D.; Zanker, M. Linked open data to support content-based recommender systems. In Proceedings of the 8th International Conference on Semantic Systems, Graz, Austria, 5–7 September 2012; pp. 1–8. [Google Scholar]
- Van Mieghem, P. The N-intertwined SIS epidemic network model. Computing 2011, 93, 147–169. [Google Scholar] [CrossRef]
- Barabási, A.L. Scale-free networks: A decade and beyond. Science 2009, 325, 412–413. [Google Scholar] [CrossRef] [PubMed]
- Wei, Y.; Wang, X.; Li, Q.; Nie, L.; Li, Y.; Li, X.; Chua, T.S. Contrastive learning for cold-start recommendation. In Proceedings of the 29th ACM International Conference on Multimedia, Virtual, 20–24 October 2021; pp. 5382–5390. [Google Scholar]
- Ko, H.; Lee, S.; Park, Y.; Choi, A. A survey of recommendation systems: Recommendation models, techniques, and application fields. Electronics 2022, 11, 141. [Google Scholar] [CrossRef]
- Mnih, A.; Salakhutdinov, R.R. Probabilistic matrix factorization. Adv. Neural Inf. Process. Syst. 2007, 20, 1257–1264. [Google Scholar]
- Basilico, J.; Hofmann, T. Unifying collaborative and content-based filtering. In Proceedings of the Twenty-First International Conference on Machine Learning, Banff, AB, Canada, 4–8 July 2004; p. 9. [Google Scholar]
- Keeling, M.J.; Eames, K.T. Networks and epidemic models. J. R. Soc. Interface 2005, 2, 295–307. [Google Scholar] [CrossRef]
- Jacquez, J.A.; Simon, C.P. The stochastic SI model with recruitment and deaths I. Comparison with the closed SIS model. Math. Biosci. 1993, 117, 77–125. [Google Scholar] [CrossRef] [PubMed]
- Volz, E.; Meyers, L.A. Susceptible–infected–recovered epidemics in dynamic contact networks. Proc. R. Soc. B Biol. Sci. 2007, 274, 2925–2934. [Google Scholar] [CrossRef]
- Viguerie, A.; Lorenzo, G.; Auricchio, F.; Baroli, D.; Hughes, T.J.; Patton, A.; Reali, A.; Yankeelov, T.E.; Veneziani, A. Simulating the spread of COVID-19 via a spatially-resolved susceptible–exposed–infected–recovered–deceased (SEIRD) model with heterogeneous diffusion. Appl. Math. Lett. 2021, 111, 106617. [Google Scholar] [CrossRef]
- Luarn, P.; Yang, J.C.; Chiu, Y.P. The network effect on information dissemination on social network sites. Comput. Hum. Behav. 2014, 37, 1–8. [Google Scholar] [CrossRef]
- Wang, C.; Chen, W.; Wang, Y. Scalable influence maximization for independent cascade model in large-scale social networks. Data Min. Knowl. Discov. 2012, 25, 545–576. [Google Scholar] [CrossRef]
- Chen, W.; Yuan, Y.; Zhang, L. Scalable influence maximization in social networks under the linear threshold model. In Proceedings of the 2010 IEEE International Conference on Data Mining, Sydney, NSW, Australia, 13–17 December 2010; IEEE: Piscataway, NJ, USA, 2010; pp. 88–97. [Google Scholar]
- Li, M.; Wang, X.; Gao, K.; Zhang, S. A survey on information diffusion in online social networks: Models and methods. Information 2017, 8, 118. [Google Scholar] [CrossRef]
- Park, J.; Barabási, A.L. Distribution of node characteristics in complex networks. Proc. Natl. Acad. Sci. USA 2007, 104, 17916–17920. [Google Scholar] [CrossRef] [PubMed]
- Bujlow, T.; Carela-Español, V.; Sole-Pareta, J.; Barlet-Ros, P. A survey on web tracking: Mechanisms, implications, and defenses. Proc. IEEE 2017, 105, 1476–1510. [Google Scholar] [CrossRef]
- Strogatz, S.H. Exploring complex networks. Nature 2001, 410, 268–276. [Google Scholar] [CrossRef] [PubMed]
- Zhou, J.; Cui, G.; Hu, S.; Zhang, Z.; Yang, C.; Liu, Z.; Wang, L.; Li, C.; Sun, M. Graph neural networks: A review of methods and applications. AI Open 2020, 1, 57–81. [Google Scholar] [CrossRef]
- Albert, R.; Barabási, A.L. Statistical mechanics of complex networks. Rev. Mod. Phys. 2002, 74, 47. [Google Scholar] [CrossRef]
- Koren, Y.; Rendle, S.; Bell, R. Advances in collaborative filtering. In Recommender Systems Handbook; Springer: Boston, MA, USA, 2021; pp. 91–142. [Google Scholar]
- Pazzani, M.J.; Billsus, D. Content-based recommendation systems. In The Adaptive Web: Methods and Strategies of Web Personalization; Springer: Berlin/Heidelberg, Germany, 2007; pp. 325–341. [Google Scholar]
- Cooper, I.; Mondal, A.; Antonopoulos, C.G. A SIR model assumption for the spread of COVID-19 in different communities. Chaos Solitons Fractals 2020, 139, 110057. [Google Scholar] [CrossRef] [PubMed]
- He, S.; Peng, Y.; Sun, K. SEIR modeling of the COVID-19 and its dynamics. Nonlinear Dyn. 2020, 101, 1667–1680. [Google Scholar] [CrossRef] [PubMed]
- Macpherson, D. A survey of homogeneous structures. Discret. Math. 2011, 311, 1599–1634. [Google Scholar] [CrossRef]
- Ye, Z.; Zhang, L.; Xiao, K.; Zhou, W.; Ge, Y.; Deng, Y. Multi-user mobile sequential recommendation: An efficient parallel computing paradigm. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 2624–2633. [Google Scholar]
- Zhang, M.; Wu, S.; Yu, X.; Liu, Q.; Wang, L. Dynamic graph neural networks for sequential recommendation. IEEE Trans. Knowl. Data Eng. 2022, 35, 4741–4753. [Google Scholar] [CrossRef]
- Farseev, A.; Samborskii, I.; Filchenkov, A.; Chua, T.S. Cross-domain recommendation via clustering on multi-layer graphs. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Tokyo Japan, 7–11 August 2017; pp. 195–204. [Google Scholar]
- Yin, J.; Jia, H.; Zhou, B.; Tang, T.; Ying, L.; Ye, S.; Peng, T.Q.; Wu, Y. Blowing Seeds across Gardens: Visualizing Implicit Propagation of Cross-Platform Social Media Posts. IEEE Trans. Vis. Comput. Graph. 2024, 31, 185–195. [Google Scholar] [CrossRef]
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).