Credit Scoring Using Machine Learning by Combing Social Network Information: Evidence from Peer-to-Peer Lending

: Financial institutions use credit scoring to evaluate potential loan default risks. However, insu ﬃ cient credit information limits the peer-to-peer (P2P) lending platform’s capacity to build e ﬀ ective credit scoring. In recent years, many types of data are used for credit scoring to compensate for the lack of credit history data. Whether social network information can be used to strengthen ﬁnancial institutions’ predictive power has received much attention in the industry and academia. The aim of this study is to test the reliability of social network information in predicting loan default. We extract borrowers’ social network information from mobile phones and then use logistic regression to test the relationship between social network information and loan default. Three machine learning algorithms—random forest, AdaBoost, and LightGBM—were constructed to demonstrate the predictive performance of social network information. The logistic regression results show that there is a statistically signiﬁcant correlation between social network information and loan default. The machine learning algorithm results show that social network information can improve loan default prediction performance signiﬁcantly. The experiment results suggest that social network information is valuable for credit scoring. information in predicting credit default risk. This study demonstrates the value of social network information and proves the connection between raw social network data and credit scoring.


Introduction
When a consumer attempts to obtain a credit card or auto loan, lenders usually use credit scores to determine whether to approve it. Credit scoring is a statistical analysis performed by financial institutions or credit bureaus to evaluate a borrower's creditworthiness and is based on credit history, demographic data, and credit behavior. In general, borrowers with high credit scores are more likely to obtain the loan and be charged a lower interest rate. Credit scoring is important to financial institutions not only because it can measure default risk but also because any small improvement would produce great profits [1,2]. However, many customers do not have a credit history and many developing countries have an imperfect credit reporting system [3]. Insufficient credit information limits financial institutions in building an effective credit scoring system to distinguish high-risk borrowers from the target population, especially in peer-to-peer (P2P) lending [4]. In addition, customers will be excluded from credit because they lack credit data. With the rapid development of P2P lending, payday loans, and online microlending markets in developing countries, financial institutions should seek more reliable methods to better evaluate a borrower's default risk.
One way to toughen predictive power is to search for an alternative source of credit information. In the past few years, a considerable amount of literature has studied variables related to personality and socioeconomic status that can be used to predict the loan default probability. Jiang et al. [5] use the Latent Dirichlet Allocation topic model to subdivide the loan statement texts and quantify the text information. The empirical result shows that loan statement text can be used to improve classification accuracy. On some P2P lending platforms, loan applications that include a picture and loan period affect the default probability [6]. Meanwhile, as the role of social networks has received much attention, some researchers have explored the correlation between online social network information and loan default [7,8]. Extending this stream of research, our study is to test the reliability of social network information in predicting loan default.
Mobile phones have become an indispensable part of daily life in modern society. People rely heavily on mobile phones to store a large amount of personal information. Mobile phone data and data collected through mobile phones have been widely used in personal behavior research, poverty identification, and other studies [9,10]. Mobile phones also store personal social network information, so each person's social network information can be obtained from his/her mobile phone for personal credit risk assessment.
In this paper, we extract social network data from mobile phones to test whether social network information can be used for credit scoring. A total of 21,036 P2P loan samples were collected from mobile network operators and a Chinese P2P lending platform. Since almost every person in modern society owns a mobile phone, the social network information extracted from mobile phones ensures the method's versatility. The suggested method is thus expected to reduce the problem of insufficient credit records of borrowers. Additionally, it can enable more people to obtain financial services, thus reducing financial exclusion. At the same time, the results could be beneficial to P2P lending platforms' attempt to design more effective default prediction algorithms. Financial institutions can more accurately evaluate institutional risk. In this way, the P2P lending industry can develop well.
The remainder of the paper is organized as follows. Section 2 presents the literature review. Section 3 provides the hypothesis development. Section 4 is the empirical study. Section 5 discusses the results, offering limitations and future directions.

Literature Review
Credit scoring has played a key role in the significant growth of consumer finance over the past 60 years. Many statistical models are used to credit scoring, such as linear or logistic regression, linear discriminant analysis, probit analysis, and naïve Bayes [11,12]. However, these methods often perform poorly when dealing with nonlinear relationships. It is difficult for them to meet the given statistical assumptions in practical application. Therefore, many machine learning and artificial intelligence methods have been applied to credit scoring, and these algorithms have worked better than statistical analysis. These methods include support vector machine (SVM) [13], artificial neural networks (ANN) [14], and random forest [15]. In recent years, ensemble methods such as random forest, AdaBoost (Adaptive Boosting), and GBDT (gradient boosted decision trees) have received much attention in credit scoring [16]. Compared with different algorithms for credit scoring, ensemble methods are typically more advantageous [5], and these algorithms are now regarded as mainstream in credit scoring.
The financial institutions rely on three kinds of data to build credit scoring: demographic information, customer's transactional history data, and credit history data [17]. In recent years, the boom in P2P lending has provided a large number of research materials for credit scoring. With the dramatic changes in credit scoring data sources, a lot of research sought to test the new data sources. Empirical research on Lending Club shows debt information and FICO scores play an important role in loan default prediction [18]. Gao and Lin [19] demonstrated that borrowing reasons text features can impact loan default. For borrowers with strong readability and more objective description, the default probability is lower. Ma et al. [4] extracted information from meta-level phone usage data to build a default prediction method. The results show that phone usage data can improve credit scoring model accuracy.
The social media boom has generated a huge amount of social network data. The data gathered from social media may be considered an important source of information. Hill et al. [20] applied social network data to empirical research. They observed social network data from calling behavior to predict product/service adoption. In order to reduce information asymmetry, many P2P lending platforms encourage borrowers to build online groups. The empirical results of research by Everett [21] reveal that the closer the location of the borrowers in the group, the lower the default probability. In addition, if they have a certain connection in real life, joining the group will significantly reduce the borrower default probability. According to social media exposure by the borrower, borrowers with more friends are more likely to get loans, and the default probability is lower [22]. Based on the assumption that people are more likely to form social relationships with people like themselves, Wei et al. [8] proposed a social network theoretical framework. Their framework showed that social networks can improve the prediction accuracy of borrowers' defaulting. De Cnudde et al. [23] used online social network information extracted from Facebook accounts to build a credit scoring model. Their results show that social network information extracted from Facebook have predictive value. The developing country market has also explored the relationships between individuals' social networks and loan performance. For example, Li et al.'s [24] empirical results reveal that the higher the credit grade of friends, the less likely the borrower is to default. Zhang et al. [25] found that online social network data could predict loan default. Ge et al. [7] tested the predictive effect of self-disclosed online social network information. Guo et al. [26] extracted social network data from Weibo, a Chinese microblogging website. The empirical results show that credit scoring with online social network data outperforms traditional credit scoring methods.
In sum, there has been considerable interest in loan default prediction, and many algorithms and data have been tested. Ensemble algorithms have better performance. In order to improve the performance of the classification model, related research verified "hard information" such as income, age, FICO score, and debt information, and they have also focused on the "soft information" of borrowers. In this aspect, social network information has received much attention. Some scholars have explored the role of social network information, but most only explored the correlation between online social network information and loan default. Although De Cnudde et al. [23] extracted social network data from Facebook to increase the predictive ability, this method cannot be applied in many countries. For example, Facebook is blocked in some countries. In addition, many people are reluctant to provide online social network information or do not have social media accounts, and some people provide invalid data and their social network information cannot be obtained, which leads to the failure of the method and limits the improvement of its risk prediction ability.
Although some people do not have online social network data, nearly every person contacts others by mobile phone, which saves the borrower's social network information. This paper extracts three types of social network information-social network quality, social network stability, and social network exposure-from mobile phones to build credit scoring.
Theoretically, this research enriches the research on social networks and provides a theoretical basis for social network application in credit scoring. In addition, using social network data provided by mobile phones to predict the default probability of the borrower has a huge advantage in practical applications. First, this method improves financial institutions' risk control capabilities. There is a lack of usable credit history for a large number of consumers; therefore, the commonly used variables of loan repayment history and debt ratio in credit risk assessment cannot be used, which greatly reduces the risk control capability of the financial institution. As a tool used by everyone in modern society, social network data extracted from mobile phones ensures the validity and breadth of credit risk assessment. Second, this method has greatly expanded the scope of financial services, enabling more people to access financial services without restrictions.

Hypothesis Development
We define the social network data collected from the mobile phone as the following three variables: social network quality, social network stability, and social network exposure. Social network quality counts the number of default borrowers in a borrower's mobile address book. When the borrower applies for a loan, the platform will ask the borrower for permission to access the address book. After the platform reads the address book, the platform will obtain the borrower's real social network information. The platform compares social network information with the platform's database and can get the total number of borrowing defaults within the borrower's social network. The social network exposure information comes from the number of contacts that the borrower fills in on the app or website when applying for a loan, including home phone, work phone, and emergency contact phone number. The borrower must fill in his or her own phone number when registering on the platform. In addition, the borrower can also fill in his home phone number, work phone number, and emergency contact phone number, so that the platform receives the social network exposure information. The social network stability means the length of time the borrower's mobile phone number has been used. The telecom operator knows the customer's mobile phone number usage duration, and the platform can obtain the social network stability information of the borrower through cooperation with the telecom operator. Due to the large volume of social network information, we used MapReduce to carry out the extraction process.
In the process of forming a social network, individuals prefer to build social relationships with people like them [27]. This makes closely connected individuals in a social network not only highly correlated but also highly similar. This similarity is not only reflected in the fact that the group accept the same information, form similar beliefs, and have similar behaviors [28]. Wei et al.'s [8] theoretical framework shows that the credit scoring of individuals in the same social network is relatively close. Compared with the online social network data that the previous research focused on, social relationships in real life can reflect the characteristics of individuals more realistically and accurately [24]. Based on these works of literature, we propose the following hypothesis:

Hypothesis 1 (H1). Social network quality contains predictability of loan default.
In China, borrowers know that if they default, the telephone collection will have a great impact on their life. Unless the loan is paid off, the impact will continue. The consequences of defaulting will make the borrower want to pay the loan as on time as possible. If the borrower wants to change their phone number so that the collector cannot contact them, it will cost a lot. The longer the mobile phone number is used, the more people the borrower can contact and the closer relationship there will be in the network. Once he/she leaves the social network, it takes a lot of effort to rebuild. The replacement of the mobile number means that the reconstruction of the social network will cause great inconvenience to the borrower, which is also a shock to the borrower's default. If the borrower's mobile phone number is only used for a short period of time, the borrower's social conversion cost will be lower, and the default will have less impact on the borrower's life.
In addition, the length of time that the mobile phone number is used can reflect his/her economic situation, and this is positively related to the economic conditions of the individual [29]. Economic status is an important indicator of P2P loan default prediction [30]. Therefore, we propose the following hypothesis:

Hypothesis 2 (H2). Social network stability contains predictability of loan default.
When borrowers apply for loans, they can choose to disclose many kinds of contact information to the platform. In addition to their mobile phone number, they can also provide home phone number, work phone number, and emergency phone number. The more contact information the borrower provides, the more social information the platform can control. In the event of default, the platform will have more channels to contact collection, and the information that the borrower cannot pay the loan on time will be communicated to his social network. Whether a person is honest and trustworthy is of great significance to the maintenance of his social relationship [31]. If the loan is defaulted on, the borrower's family and colleagues can perceive the borrower as a dishonest and untrustworthy person, which impacts his social relationships. Compared to the online social network, the borrower's relatives and friends can be directly contacted by telephone, and the social relationship is more effective and binding. Therefore, the borrowers think the number of social relationships they disclose can demonstrate the borrower's willingness to repay his/her debts.
In China's P2P lending market, due to the imperfect credit system, only a small number of people have credit information, and the borrowers lack objective credit data evaluation, which may lead to adverse selection problems in the market. Borrowers with better credit have more ways to raise loans, so they are not willing to over-disclose their social network information. A large number of P2P lending borrowers are high-risk groups identified by other channels and have difficulty obtaining loans, but they also have certain credit needs. Because of the lack of financing channels, borrowers with poor credit are more inclined to adopt a strategy of actively disclosing social network information, revealing more social network verifiable information to make themselves look more credible, and thus attract more lender investment and achieve the purpose of successful borrowing. Herzenstein et al.'s [32] empirical research found that customers with a higher risk of default will disclose more information. Therefore, we propose the following hypothesis: Hypothesis 3 (H3). Social network exposure contains predictability of loan default.

Empirical Study
In this section, we show the variables and use a logistic regression model to identify the relationship between borrowers' social network information and loan default. Previous studies have shown that the ensemble methods have better performance for credit scoring [5]. We employed the random forest, AdaBoost, and LightGBM algorithms to build default prediction models. In order to test the discrimination performance of social network data, we applied these algorithms with social network information and without. Then, we showed the feature importance of three algorithms with social network variables.
The experiment was coded using Python 3 and stata 14. The empirical evaluation was carried out using one core of an Intel Core i3-4170 based Windows 10 operating system PC with 8GB RAM.

Data and Variables
We collected the dataset from a Chinese P2P lending platform. All data were encrypted to protect privacy. A total of 21,036 loan samples were collected, of which 9025 are default loans and 12,011 are good (not default) loans. Every loan contains 21 variables. These variables include the borrower's personal information variables, registration information variables, loan information variables, and social network information variables. All variables are listed in Table 1. The personal information is the borrower's demographic information and financial information, including the borrower's age (Age), gender (Gender), marital status (Marriage), number of children (Children), family member counts (Family), education status (Education), income level (Income), car ownership (Car), income category (Income_type), job title (Jobtitle), house ownership (House), and days of work (Work_days). Registration information contains minutes of registration (Registration_time) and minutes before the borrower changed the document with which he applied for the loan (Document_change). In addition, loan information including the loan amount (Amount), interest rate (Rate), when the borrower applies for a loan during the day (Time) and repayment period (Period). The social network variables are social stability (Social_stability), social exposure (Social_exposure), and social quality (Social_quality). The exploratory analysis of the social network variables is shown in Table 2. The mean, the median, and the standard deviation of defaulting and non-defaulting borrowers are shown in Table 2. From the table, we can see that the average value of the social network quality variable of the defaulting borrower is 0.190, whereas for the non-defaulting borrowers it is 0.141. The results show that the defaulting borrower's social network quality is even worse. In their social network, they have more defaulting friends than non-defaulting borrowers. The defaulting borrower's social network is less stable than the non-defaulting borrowers. The social stable variable's mean value for a defaulting borrower is 824.67, which is less than the non-defaulting borrowers' mean value of 998.47. The defaulting borrower will show more contact information than non-defaulting borrowers. On average, defaulting borrowers expose 2.366 contact information, which is bigger than that of non-defaulting borrowers. The t-test was used to test whether there is a statistically significant difference between the two groups. The last columns display the significant results of the t-test. As we can see, defaulting borrowers and non-defaulting borrowers' differences are significant in all the social network information variables, which indicates that social network information may differ between non-defaulting and defaulting borrowers. Based on the exploratory analysis, the social network information hypotheses can be partially supported. Social network information variables contain predictability of loan default.

Statistical Significance of Social Network Information
We used a logistic regression model to test the influence of the social network information on loan default probability. The default variable is the dependent variable. If the borrower defaulted the default variable is 1 and 0 if the borrower did not default. In accordance with P2P lending industrial inertia, we define a borrower default as when the borrower fails to pay for more than 30 days. The independent variables are all shown in Table 1, which contains social network variables and control variables. Control variables are borrowers' demographic information, registration information, and loan information. At the same time, in order to test the impact of social network information on the default of the borrower, two models were constructed. Model 1 only contains control variables. Model 2 not only has control variables but also has social network information: Model 1: Model 2: The term α is the coefficient of control variables, β is the coefficient of social network variables, and ε is disturbance.

Default Prediction Models
Credit scoring is regarded as a classification problem [30]. After the borrower obtains a loan from the P2P lending platform, two final statuses will arise: default or not. Defaulting borrowers are classified as bad and assigned 1, while non-defaulting ones are classified as good and assigned 0. Credit scoring predicts the borrower's final repayment status through the relevant characteristics of the borrower. For a borrower set D = {(x 1 ,y 1 ),(x 2 ,y 2 ), . . . (x n ,y n )}, x i is the relevant characteristics of the borrower, and y i is the final state variable. Ensemble algorithms perform better in credit scoring compared to other algorithms [5]. They are prevalent in credit scoring. Random forest, AdaBoost and GBDT are the most popular ensemble methods. In order to verify the effect of social network information in loan prediction, we selected random forest, AdaBoost, and LightGBM models to examine the changes in predictive ability before and after fusing social network information to credit scoring. The LightGBM algorithm is a framework proposed to implement the GBDT algorithm. In order to show the importance of social network variables in the classification model, we show the feature importance results of random forest, AdaBoost, and LightGBM.

Random Forest
Random forest is an ensemble learning algorithm based on the decision tree. It can be used for both regression and classification tasks and is easy to implement. Random forest uses bootstrap to obtain samples from the original data. Every tree gives a classification, and the forest chooses the classification with the most votes in all the trees. The parameter m is the number of decision trees and determines the degree of randomness. In the random forest, the borrower is assumed to have d attributes. In general, we set m = log 2 d. The CART classification tree uses the Gini index to create split points. If we have n classes, p i is the probability that an object belongs to class i. The Gini index would be: The Gini index is used to compute the impurity of data sets. The smaller the Gini index, the higher the purity of the data sets. The Gini index of classification problem:

AdaBoost
AdaBoost is a powerful classification algorithm that has high accuracy and low generalization error. We can use CART, C4.5, and SVM for AdaBoost as weak learner. AdaBoost aims to convert a set of weak classifiers into a strong one. AdaBoost reduces the error rate of classification by multi-step iteration. After each iteration, the weak classifier weight with small classification error rate is reduced, and the weak classifier weight with a large classification error rate is increased until the predetermined error rate or iteration round is reached. We choose the decision tree as weak learners. The number of weak learners h t (x) is T. The weight of every learner is where t represents the classification error rate in every weak learner. The weight for each data point is updated as where ω t is the weight of the data point, and Z t is a normalization factor that ensures the sum of instance weights is equal to 1. The weak learner is ensembled to a strong classifier. The linear combination of the basic classifiers is defined as The final prediction sums up the weighted prediction of each classifier:

LightGBM
The LightGBM algorithm is a framework proposed by Microsoft in 2017 to implement the GBDT algorithm. It can be used for classification, regression, and many other machine learning tasks. LightGBM is one of the most popular methods for data scientists and the machine learning online community Kaggle. Compared with the xgboost algorithm, which has shined within data science competition in previous years, LightGBM guarantees accuracy while having faster speed and less memory consumption and supports distributed computing, which can process massive data quickly.
The GBDT algorithm procedure is performed as Algorithm 1. f (x) is the decision tree, R J J 1 are the parameters of the decision tree, b j J 1 is the decision tree's output function value, J is the number of leaf nodes, and F is a collection of decision trees. D is training data, L is target function, and K is the number of iterations.
GBDT continuously fits the residuals and adds them to F. In this process, the residuals become smaller and smaller. GBDT algorithms use a gradient descent method to optimize the loss function minf (x) and minL(F). Different from many other boosting methods that use pre-sort-based algorithms, LightGBM uses histogram-based algorithms, which bucket continuous variables into discrete bins. This speeds up training and reduces memory usage.
GBDT algorithms are based on decision tree algorithms. Most of the GBDT algorithms split the tree by depth or level rather than by leaf. This can control the complexity of the model and reduce overfitting. However, the information gain of some leaf nodes is lower, and the computation is increased. LightGBM splits the tree with leaf-wise algorithms. The leaf-wise algorithm can reduce loss more than other algorithms. Level-wise splits the leaf nodes with the largest information gain. Level-wise can reduce the error and obtain higher precision than leaf-wise.

Parameter Selection
Hyperparameters are parameters that cannot directly be estimated from training data. Hyperparameters are important because they can control the performance of algorithms. In order to have a good performance for every model, we used the gird search to find the optimal hyperparameters. We use the area under the ROC curve (AUC) to measure how effective a model is. In order to prevent overfitting and identify the validity of the three models, these methods are optimized using a 5-fold cross-validated grid search over a hyperparameter selection. The data set is randomly partitioned into five roughly equal subsets. The algorithm is training on four folds, and the other fold is the test set.
The n_estimator is the number of trees in the random forest algorithms. A parameter grid on [10,150] is applied to select the parameters. The best n_estimator is 100. A parameter grid on [10, 1000] × [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1] is applied to select the best parameters of forest size and learning rate in AdaBoost algorithms. The result shows that when forest size is 400, and learning_rate is 0.2, and the performance reaches its best. In LightGBM algorithms, max_depth is the maximum depth of tree, min_data_in_leaf is the minimum number of the records a leaf may have, feature_fraction determines how many parameters LightGBM will select in each iteration for building trees, and bagging_fraction specifies the fraction of data to be used for each iteration. We get the optimal parameter from the grid search: max_depth = 6, min_data_in_leaf = 35, feature_fraction = 0.8, bagging_fraction = 0.5.

Results
The logistic regression results are shown in Table 3. Model 1 only contains control variables; Model 2 not only has control variables but also has social network information. As can be seen, the variables that are statistically significantly correlated in Model 1 are also significant in Model 2, but some variables differ significantly in their levels.
The coefficients of all the social network information variables are statistically significant at the p < 0.01 level. Among the social network information features, social network quality and social network exposure are positively correlated with loan default; the social stable feature is negatively correlated. This indicates that when the borrower has more loan defaults in the social network or exposes more contact information, the more likely the borrower is to default. In contrast, the longer the mobile phone number has been used, the less likely the borrower is to default.
The results show that the social network information variables extracted from the mobile phone have certain relationships with P2P lending loan default. The social network variables can be used to estimate the loan's probability of default.

Default Prediction Models and Discussion
In order to verify the improvement of social network information on credit risk assessment, we chose three classification models: random forest, AdaBoost, and LightGBM to see the change of prediction ability of each model before and after adding social network information.
We conducted 50 experiments and every experiment adopted 5-fold cross-validation to ensure the validity of the results. We used AUC, F1 score and the prediction accuracy to evaluate the prediction results. Table 4 summarizes the mean of accuracy, F1 score and AUC results on 50 repeated 5-fold cross-validation experiments for the three models. Every algorithm has two results: contain and not contain social network information. From Table 4, we can see that the LightGBM algorithm has the best classification effect, regardless of the accuracy of the model or the F1 score and AUC. In the absence of social network information, the AUC value of the original model is 0.692, and the prediction accuracy of the model is 65.50%. After combing social network information, its AUC increased to 0.711, and the prediction accuracy also increased to 66.22%. The F1 score of the benchmark model is 0.649, and the F1 score of the model with social network information is higher at 0.659. The rest of the models are inferior in accuracy, F1 score, and AUC compared to LightGBM, but the final results show that after combing the social network variables, both the prediction accuracy, F1 score, and AUC have a certain degree of improvement. After combing social network information, the AUC of the three algorithms increased by 0.015, 0.016 and 0.019. In the credit scoring industry, it is a significant upgrade [33]. In addition, hyperparameters are searched in the dataset that are not combined with social network information. If we do the grid search on the dataset with social network information, the model's predictive power will be higher.
In order to test whether the AUC of the model with social network variables is not only larger than that of the model not containing social network information but also statistically away from it, we used the nonparametric approach developed by DeLong et al. [34] and implemented the stata routine "roccomp". Table 5 shows the results. The statistical test of random forest and AdaBoost are statistically significant at the p < 0.05 level, LightGBM is statistically significant at the p < 0.01 level. The results proved that the AUC value of the model with social network information is not only larger than when not combined but also statistically away from it. Regardless of which algorithm, all the AUC values with social network information are better than the results without social network information. Through the comparison of the model AUC, F1 score and accuracy, we find that combing social network information can significantly improve the accuracy of the prediction and has a good application prospect. Further tests carried out with the nonparametric approach [34] corroborated our initial findings.

Feature Importance
There are 21 features in the classification model. Figures 1-3 show the ordering of the 14 most important variables of the random forest, AdaBoost, and LightGBM. These variables are plotted in decreasing order of the importance value.
As can be seen, although the importance order of the three models is not the same, the period variable is ranked in the top one in all three models. The social network stability ranks 5th, 9th, and 4th, respectively, in the three models; the social network quality variables rank 14th, 13th, and 13th, respectively, in the three models. The social network exposure variable ranked 13th, 12th, and 14th, respectively, in the three models. The results show that social network information plays an important role in the prediction of loan default.  Through the comparison of the model AUC, F1 score and accuracy, we find that combing social network information can significantly improve the accuracy of the prediction and has a good application prospect. Further tests carried out with the nonparametric approach [34] corroborated our initial findings.

Feature Importance
There are 21 features in the classification model. Figures 1-3 show the ordering of the 14 most important variables of the random forest, AdaBoost, and LightGBM. These variables are plotted in decreasing order of the importance value.
As can be seen, although the importance order of the three models is not the same, the period variable is ranked in the top one in all three models. The social network stability ranks 5th, 9th, and 4th, respectively, in the three models; the social network quality variables rank 14th, 13th, and 13th, respectively, in the three models. The social network exposure variable ranked 13th, 12th, and 14th, respectively, in the three models. The results show that social network information plays an important role in the prediction of loan default.
The above results show that social network information can be used in credit scoring, and the predictive ability of the model has been improved. The social network information hypotheses can be supported: the three social network information variables contain predictability of loan default.
Many research and financial projects use different kinds of information and machine learning algorithms to predict loan default [3]. Our research results show the reliability of social network information in predicting credit default risk. This study demonstrates the value of social network information and proves the connection between raw social network data and credit scoring.

Conclusions
Loan default prediction is important to financial institutions, which can use credit scoring to distinguish customer default or not. In developing countries, many P2P lending borrowers lack credit history data, making it difficult to predict whether the borrower will default or not. Financial institutions need a more robust method for credit scoring. Modern society is highly interconnected, and social network data can show the credit status of borrowers. Mobile phones are accessible to almost everyone and hold a lot of social network information. This paper extracted social network data from mobile phones to predict borrowers' behavior. The empirical results show the social network variables extracted from mobile phones could be used to improve loan prediction accuracy. The research enhances our understanding of the social network. The platform can also obtain a better judgment of the credit risk of the borrower, improve the industry's ability to resist risks, and facilitate the smooth development of the P2P lending industry.

Conclusions
Loan default prediction is important to financial institutions, which can use credit scoring to distinguish customer default or not. In developing countries, many P2P lending borrowers lack credit history data, making it difficult to predict whether the borrower will default or not. Financial institutions need a more robust method for credit scoring. Modern society is highly interconnected, and social network data can show the credit status of borrowers. Mobile phones are accessible to almost everyone and hold a lot of social network information. This paper extracted social network data from mobile phones to predict borrowers' behavior. The empirical results show the social network variables extracted from mobile phones could be used to improve loan prediction accuracy. The research enhances our understanding of the social network. The platform can also obtain a better judgment of the credit risk of the borrower, improve the industry's ability to resist risks, and facilitate the smooth development of the P2P lending industry. The above results show that social network information can be used in credit scoring, and the predictive ability of the model has been improved. The social network information hypotheses can be supported: the three social network information variables contain predictability of loan default.
Many research and financial projects use different kinds of information and machine learning algorithms to predict loan default [3]. Our research results show the reliability of social network information in predicting credit default risk. This study demonstrates the value of social network information and proves the connection between raw social network data and credit scoring.

Conclusions
Loan default prediction is important to financial institutions, which can use credit scoring to distinguish customer default or not. In developing countries, many P2P lending borrowers lack credit history data, making it difficult to predict whether the borrower will default or not. Financial institutions need a more robust method for credit scoring. Modern society is highly interconnected, and social network data can show the credit status of borrowers. Mobile phones are accessible to almost everyone and hold a lot of social network information. This paper extracted social network data from mobile phones to predict borrowers' behavior. The empirical results show the social network variables extracted from mobile phones could be used to improve loan prediction accuracy. The research enhances our understanding of the social network. The platform can also obtain a better judgment of the credit risk of the borrower, improve the industry's ability to resist risks, and facilitate the smooth development of the P2P lending industry.
However, some limitations are worth noting. We did not collect other social network data, such as the frequency of calls, whether they are incoming or outgoing and the strength of social network ties. This limits the research regarding the role of social networks in personal credit risk assessment. In the future, other offline social network data may be collected and used for credit scoring.