CSRLoan: Cold Start Loan Recommendation with Semantic-Enhanced Neural Matrix Factorization
Abstract
:1. Introduction
- To the best of our knowledge, this is one of the pioneer works that model the semantics of statements with pre-training techniques and utilize them for loan recommendations. The intrinsic characters of statements are suitable for solving the cold start recommendation problem.
- We propose CSRLoan, a dual neural matrix factorization model for cold start loan recommendation. It first learns the representations of statements. Then, the loan projects and applicants are embedded in a semantic space for better recommendation.
- We conduct extensive experiments on a real-world dataset. The results show the superiority of CSRLoan compared with all baselines.
2. Related Work
2.1. Loan Recommendation
2.2. Cold Start Recommendation
2.3. Pre-Training for Semantic Modeling
3. Preliminaries
3.1. Problem Formulation
3.2. Overview of CSRLoan
4. Methodology
4.1. Statement Encoding
4.2. Dual Neural Matrix Factorization
4.3. Mixture Learning Target
5. Experiment
- RQ1: Does CSRLoan outperform other general recommendation methods and loan recommendation methods?
- RQ2: What is the capability of the proposed pre-training techniques and dual NMF?
- RQ3: What are the influences of different hyper-parameter settings?
5.1. Experimental Settings
5.1.1. Datasets
5.1.2. Evaluation Metrics
5.1.3. Compared Baselines
- MF-BPR [37]: A Bayesian personalized ranking optimized MF model with a pairwise ranking loss. It is tailored toward recommendations with implicit feedback data.
- CML [2]: A recently proposed algorithm that minimizes the distance between each user–loan interactions in Euclidean space.
- MBR [14]: A motivation-based recommendation method to utilize the unstructured data. This is the state-of-the-art method for loan recommendations.
- FAR [3]: A fairness-aware recommendation method based on one-class collaborative-filtering techniques.
- NMF: Traditional NMF that removes the pre-training and default risk estimation module.
- CSRLoan : Directly train CSRLoan with randomly initialized statement encoder.
- CSRLoan : A variant of CSRLoan which removes the default risk estimation module.
5.1.4. Parameter Settings
5.2. Experimental Results
5.3. Ablation Studies
5.4. Hyper-Parameter Studies
6. Conclusions
Author Contributions
Funding
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Rendle, S.; Freudenthaler, C.; Gantner, Z.; Schmidt-Thieme, L. BPR: Bayesian personalized ranking from implicit feedback. arXiv 2012, arXiv:1205.2618. [Google Scholar]
- Hsieh, C.; Yang, L.; Cui, Y.; Lin, T.; Belongie, S.J.; Estrin, D. Collaborative Metric Learning. In Proceedings of the WWW ’17: 26th International World Wide Web Conference, Perth, Australia, 3–7 April 2017; pp. 193–201. [Google Scholar]
- Lee, E.L.; Lou, J.K.; Chen, W.M.; Chen, Y.C.; Lin, S.D.; Chiang, Y.S.; Chen, K.T. Fairness-aware loan recommendation for microfinance services. In Proceedings of the 2014 International Conference on Social Computing, Beijing, China, 4–7 August 2014; pp. 1–4. [Google Scholar]
- Zhao, K.; Zhang, Y.; Yin, H.; Wang, J.; Zheng, K.; Zhou, X.; Xing, C. Discovering Subsequence Patterns for Next POI Recommendation. In Proceedings of the IJCAI, Yokohama, Japan, 7–15 January 2020; pp. 3216–3222. [Google Scholar]
- Liu, Y.; Ao, X.; Dong, L.; Zhang, C.; Wang, J.; He, Q. Spatiotemporal Activity Modeling via Hierarchical Cross-Modal Embedding. IEEE Trans. Knowl. Data Eng. 2022, 34, 462–474. [Google Scholar] [CrossRef]
- Tan, H.; Yao, D.; Bi, J. Deep Transfer Learning for Successive POI Recommendation. In Proceedings of the International Conference on Spatial Data and Intelligence, Hangzhou, China, 22–24 April 2021; Springer: Berlin/Heidelberg, Germany, 2021; pp. 134–140. [Google Scholar]
- Yao, D.; Zhang, C.; Huang, J.; Bi, J. SERM: A Recurrent Model for Next Location Prediction in Semantic Trajectories. In Proceedings of the CIKM ’17: ACM Conference on Information and Knowledge Management, Singapore, 6–10 November 2017; pp. 2411–2414. [Google Scholar]
- Yin, H.; Cui, B.; Zhou, X.; Wang, W.; Huang, Z.; Sadiq, S.W. Joint Modeling of User Check-in Behaviors for Real-time Point-of-Interest Recommendation. ACM Trans. Inf. Syst. 2016, 35, 11:1–11:44. [Google Scholar] [CrossRef]
- Li, D.; Gong, Z.; Zhang, D. A Common Topic Transfer Learning Model for Crossing City POI Recommendations. IEEE Trans. Cybern. 2019, 49, 4282–4295. [Google Scholar] [CrossRef]
- Wang, C.; Xiao, Z. A Deep Learning Approach for Credit Scoring Using Feature Embedded Transformer. Appl. Sci. 2022, 12, 10995. [Google Scholar] [CrossRef]
- Munkhdalai, L.; Ryu, K.H.; Namsrai, O.E.; Theera-Umpon, N. A partially interpretable adaptive Softmax regression for credit scoring. Appl. Sci. 2021, 11, 3227. [Google Scholar] [CrossRef]
- Petrides, G.; Moldovan, D.; Coenen, L.; Guns, T.; Verbeke, W. Cost-sensitive learning for profit-driven credit scoring. J. Oper. Res. Soc. 2022, 73, 338–350. [Google Scholar] [CrossRef]
- Liu, W.; Fan, H.; Xia, M. Credit scoring based on tree-enhanced gradient boosting decision trees. Expert Syst. Appl. 2022, 189, 116034. [Google Scholar] [CrossRef]
- Yan, J.; Wang, K.; Liu, Y.; Xu, K.; Kang, L.; Chen, X.; Zhu, H. Mining social lending motivations for loan project recommendations. Expert Syst. Appl. 2018, 111, 100–106. [Google Scholar] [CrossRef]
- Wang, X.; Zhang, D.; Zeng, X.; Wu, X. A Bayesian investment model for online P2P lending. In Frontiers in Internet Technologies; Springer: Berlin/Heidelberg, Germany, 2013; pp. 21–30. [Google Scholar]
- Liu, Y.; Ma, H.; Jiang, Y.; Li, Z. Modelling risk and return awareness for p2p lending recommendation with graph convolutional networks. Appl. Intell. 2022, 52, 4999–5014. [Google Scholar] [CrossRef]
- Zhang, H.; Zhao, H.; Liu, Q.; Xu, T.; Chen, E.; Huang, X. Finding potential lenders in P2P lending: A hybrid random walk approach. Inf. Sci. 2018, 432, 376–391. [Google Scholar] [CrossRef]
- Zhao, H.; Liu, Q.; Wang, G.; Ge, Y.; Chen, E. Portfolio selections in P2P lending: A multi-objective perspective. In Proceedings of the 22nd ACM SIGKDD International Conference On Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 2075–2084. [Google Scholar]
- Gope, J.; Jain, S.K. A survey on solving cold start problem in recommender systems. In Proceedings of the 2017 International Conference on Computing, Communication and Automation (ICCCA), Greater Noida, India, 5–6 May 2017; pp. 133–138. [Google Scholar]
- Braunhofer, M.; Elahi, M.; Ge, M.; Ricci, F. Context dependent preference acquisition with personality-based active learning in mobile recommender systems. In Proceedings of the International Conference on Learning and Collaboration Technologies, Heraklion, Greece, 22–27 June 2014; Springer: Berlin/Heidelberg, Germany, 2014; pp. 105–116. [Google Scholar]
- Sun, M.; Li, F.; Lee, J.; Zhou, K.; Lebanon, G.; Zha, H. Learning multiple-question decision trees for cold-start recommendation. In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, Rome, Italy, 4–8 February 2013; pp. 445–454. [Google Scholar]
- Rubens, N.; Elahi, M.; Sugiyama, M.; Kaplan, D. Active learning in recommender systems. In Recommender Systems Handbook; Springer: Berlin/Heidelberg, Germany, 2015; pp. 809–846. [Google Scholar]
- Elahi, M.; Ricci, F.; Rubens, N. Active learning in collaborative filtering recommender systems. In Proceedings of the International Conference on Electronic Commerce and Web Technologies, Munich, Germany, 1–4 September 2014; Springer: Berlin/Heidelberg, Germany, 2014; pp. 113–124. [Google Scholar]
- Bobadilla, J.; Ortega, F.; Hernando, A.; Bernal, J. A collaborative filtering approach to mitigate the new user cold start problem. Knowl. Based Syst. 2012, 26, 225–238. [Google Scholar] [CrossRef] [Green Version]
- Shapira, B.; Rokach, L.; Freilikhman, S. Facebook single and cross domain data for recommendation systems. User Model. User Adapt. Interact. 2013, 23, 211–247. [Google Scholar] [CrossRef]
- Lika, B.; Kolomvatsos, K.; Hadjiefthymiades, S. Facing the cold start problem in recommender systems. Expert Syst. Appl. 2014, 41, 2065–2073. [Google Scholar] [CrossRef]
- Zhang, M.; Tang, J.; Zhang, X.; Xue, X. Addressing cold start in recommender systems: A semi-supervised co-training algorithm. In Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, Gold Coast, QLD, Australia, 6–11 July 2014; pp. 73–82. [Google Scholar]
- Tan, H.; Yao, D.; Huang, T.; Wang, B.; Jing, Q.; Bi, J. Meta-Learning Enhanced Neural ODE for Citywide Next POI Recommendation. In Proceedings of the 2021 22nd IEEE International Conference on Mobile Data Management (MDM), Toronto, ON, Canada, 15–18 June 2021; pp. 89–98. [Google Scholar] [CrossRef]
- Liu, S.; Ounis, I.; Macdonald, C.; Meng, Z. A heterogeneous graph neural model for cold-start recommendation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual, 25–30 July 2020; pp. 2029–2032. [Google Scholar]
- Han, P.; Wang, J.; Yao, D.; Shang, S.; Zhang, X. A Graph-based Approach for Trajectory Similarity Computation in Spatial Networks. In Proceedings of the KDD, Virtual, 14–18 August 2021; pp. 556–564. [Google Scholar]
- Pennington, J.; Socher, R.; Manning, C.D. Glove: Global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 25–29 October 2014; pp. 1532–1543. [Google Scholar]
- He, X.; Liao, L.; Zhang, H.; Nie, L.; Hu, X.; Chua, T.S. Neural collaborative filtering. In Proceedings of the 26th International Conference on World Wide Web, Perth, Australia, 3–7 April 2017; pp. 173–182. [Google Scholar]
- Guthrie, D.; Allison, B.; Liu, W.; Guthrie, L.; Wilks, Y. A closer look at skip-gram modelling. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06), Genoa, Italy, 22–28 May 2006; Volume 6, pp. 1222–1225. [Google Scholar]
- Floridi, L.; Chiriatti, M. GPT-3: Its nature, scope, limits, and consequences. Minds Mach. 2020, 30, 681–694. [Google Scholar] [CrossRef]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; Volume 30. [Google Scholar]
- Rendle, S.; Freudenthaler, C.; Gantner, Z.; Schmidt-Thieme, L. BPR: Bayesian Personalized Ranking from Implicit Feedback. In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009), Montreal, QC, Canada, 18–21 June 2009; AUAI Press: Vancouver, BC, Canada, 2009; pp. 452–461. [Google Scholar]
Notations | Descriptions |
---|---|
Applicant user, loan project, and the statement | |
User set and loan project set | |
A historical record indicates u apply v with s | |
Historical loan records of user u | |
Historical data of all loan records | |
Word embedding matrix | |
d | Embedding dimension |
All parameters in CSRLoan |
Dataset | # of Samples | # of Users | # of Default Samples | Average Length of s |
---|---|---|---|---|
Train | 634,144 | 501,813 | 50,900 | 13.7 |
Validation | 53,703 | 48,083 | 3833 | 11.6 |
Test | 388,405 | 302,401 | 6210 | 12.8 |
ID | Amount | Categories | Statement | Country | Region | Ages | Income | Gender | Repayment | Activity Label |
---|---|---|---|---|---|---|---|---|---|---|
1 | 575 | Transportation | to repair and maintain the auto rickshaw used in their business | Pakistan | Lahore | 30 | 14,000 | female | irregular | default |
2 | 150 | Transportation | to repair their old cycle-van and buy another one to rent out | India | Maynaguri | 22 | 6000 | female | bullet | completed |
3 | 200 | Arts | to purchase an embroidery machine and new materials | Pakistan | Lahore | 25 | 8000 | female | irregular | completed |
4 | 250 | Services | purchase leather for my business using ksh 20000 | Kenya | 23 | 6000 | female | irregular | completed | |
5 | 200 | Agriculture | to purchase a dairy cow and start a milk products business | India | Maynaguri | 25 | 8000 | male | bullet | overdue |
6 | 400 | Services | to buy more hair and skin care products | Pakistan | Ellahabad | 30 | 8000 | female | monthly | completed |
7 | 475 | Manufacturing | to purchase leather, plastic soles and heels in different sizes | Pakistan | Lahore | 46 | 19,000 | female | monthly | completed |
8 | 625 | Food | to buy a stall, gram flour, ketchup, and coal for selling ladoo | Pakistan | Lahore | 35 | 24,000 | male | irregular | default |
ID | Loan Theme | Require Partner | Duration | Amount |
---|---|---|---|---|
638631 | General | Yes | 2 years | 8000 |
640322 | General | Yes | 0.5 years | 12,000 |
641006 | Higher Education | Yes | 3 years | 40,000 |
641019 | Higher Education | No | 2 years | 2000 |
641594 | Subsistence Agriculture | Yes | 2 years | 10,000 |
642256 | Extreme Poverty | Yes | 1 years | 20,000 |
Type | Method | HR@5 | NDCG@5 | HR@10 | NDCG@10 |
---|---|---|---|---|---|
General | MF-BPR | 0.02134 | 0.01841 | 0.05263 | 0.03991 |
CML | 0.04630 | 0.03877 | 0.1518 | 0.1247 | |
Loan Rec | FAR | 0.04261 | 0.02974 | 0.0993 | 0.04971 |
MBR | 0.05216 | 0.03234 | 0.1377 | 0.0860 | |
Ablations | NMF | 0.0841 | 0.0710 | 0.2432 | 0.1221 |
CSRLoan | 0.0922 | 0.07082 | 0.2310 | 0.1778 | |
CSRLoan | 0.1021 | 0.08113 | 0.2530 | 0.1809 | |
Our | CSRLoan | 0.1150 | 0.0970 | 0.2233 | 0.1928 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhuang, K.; Wu, S.; Liu, S. CSRLoan: Cold Start Loan Recommendation with Semantic-Enhanced Neural Matrix Factorization. Appl. Sci. 2022, 12, 13001. https://doi.org/10.3390/app122413001
Zhuang K, Wu S, Liu S. CSRLoan: Cold Start Loan Recommendation with Semantic-Enhanced Neural Matrix Factorization. Applied Sciences. 2022; 12(24):13001. https://doi.org/10.3390/app122413001
Chicago/Turabian StyleZhuang, Kai, Sen Wu, and Shuaiqi Liu. 2022. "CSRLoan: Cold Start Loan Recommendation with Semantic-Enhanced Neural Matrix Factorization" Applied Sciences 12, no. 24: 13001. https://doi.org/10.3390/app122413001
APA StyleZhuang, K., Wu, S., & Liu, S. (2022). CSRLoan: Cold Start Loan Recommendation with Semantic-Enhanced Neural Matrix Factorization. Applied Sciences, 12(24), 13001. https://doi.org/10.3390/app122413001