You are currently viewing a new version of our website. To view the old version click .
Risks
  • Article
  • Open Access

5 December 2025

Gender as a Risk Factor: A Test of Gender-Neutral Pricing in Lithuania’s P2P Market

and
Faculty of Business, Kaunas Kolegija Higher Education Institution, Pramonės av. 22, 50387 Kaunas, Lithuania
*
Author to whom correspondence should be addressed.

Abstract

European Union legislation, particularly Council Directive 2004/113/EC, mandates gender neutrality in credit scoring to prevent discrimination. However, this creates a regulatory paradox if gender is a statistically relevant predictor of default risk. This study investigates this “fairness-through-unawareness” approach by empirically testing for systematic mispricing. We employ a twofold econometric analysis on a dataset of consumer loans from a Lithuanian peer-to-peer platform. After data preparation for the regression, the sample consists of 9707 loans. First, logistic regression is used to model actual default risk, controlling for credit rating, age, loan amount, and education. Second, Ordinary Least Squares (OLS) regression is used to model the interest rate set by the platform. The Logit model finds that gender is a highly significant predictor of default (p < 0.001), with male borrowers associated with a higher probability of default. Conversely, the OLS model finds that gender is not a statistically significant factor in loan pricing (p = 0.263), confirming the platform’s compliance with EU law. The findings empirically demonstrate the regulatory paradox: the legally compliant, gender-blind pricing model fails to account for a significant risk differential. This leads to systematic risk mispricing and an implicit cross-subsidy from lower-risk female borrowers to higher-risk male counterparts, highlighting a critical tension between regulatory intent and outcome fairness. The analysis is limited to observed loan-level characteristics; it does not incorporate household composition or the internal structure of the platform’s proprietary scoring model.

1. Introduction

The financial landscape across the European Union (EU) is undergoing significant transformation driven by sophisticated algorithmic credit scoring systems. Employed across traditional banking and P2P lending, these systems promise to broaden financial inclusion Shi et al. (2022). By leveraging vast datasets, lenders aim to predict borrower default risk with greater accuracy. However, this technological advancement is not without peril; these algorithms can inadvertently embed or amplify existing societal biases, leading to discriminatory outcomes Bartlett et al. (2022).
At the heart of this challenge lies a significant conflict within the EU context. On one hand, a robust legal framework, anchored in Article 21 of the Charter of Fundamental Rights, explicitly prohibits discrimination based on sex European Union (2000). This principle is operationalized through Council Directive 2004/113/EC The Council of the European Union (2004). The interpretation of this directive by the Court of Justice of the European Union (CJEU) in the pivotal Test-Achats case cemented a mandate for gender neutrality, forbidding the use of gender as a determining factor in risk assessment and pricing for insurance and related financial services Skouris et al. (2010).
On the other hand, a growing body of empirical evidence challenges the assumption that gender is irrelevant to credit risk. Studies analyzing P2P lending platforms consistently indicate that gender can be a statistically significant predictor of creditworthiness. Specifically, multiple analyses suggest female borrowers, on average, exhibit lower default rates compared to male counterparts Aliano et al. (2023); Chen et al. (2019); Pope and Sydnor (2011).
This juxtaposition creates a regulatory paradox. The legal imperative for “fairness-through-unawareness”—that is, achieving fairness by simply omitting a variable—may inadvertently generate unintended negative consequences Hardt et al. (2016). If gender is a statistically relevant risk factor, prohibiting its use compels lenders to pool inherently different risk groups. This can lead to a systematic mispricing of risk, where the lower-risk group (e.g., women) is effectively overcharged to subsidize the higher-risk group (e.g., men) Caire and Fernandez Vidal (2024). This phenomenon raises profound questions about whether the current regulatory focus on input variables adequately addresses the complexity of achieving equitable outcomes Bartlett et al. (2022); Corbett-Davies et al. (2018).
While previous studies have descriptively highlighted these gender-based differences, many have lacked the robust econometric analysis required to separate the effect of gender from other confounding variables (like age, income, or existing credit score) and to empirically test the “paradox” itself. This paper aims to fill this gap by moving beyond descriptive statistics to econometrically model and quantify this mispricing.
Using a dataset from a Lithuanian P2P platform, we test two formal hypotheses:
H1: 
Gender is a statistically significant predictor of actual default risk, with male borrowers exhibiting a higher probability of default even when controlling for standardized risk factors (e.g., credit rating, age, and loan amount).
H2: 
The legally mandated exclusion of gender from the platform’s credit scoring model leads to a systematic mispricing of risk, where the platform’s final interest rates do not reflect the actual, higher risk associated with male borrowers.
By testing H1, we establish the statistical reality of gendered risk. By testing H2, we confirm the platform’s compliance with the law. Demonstrating both simultaneously provides a clear, empirical validation of the regulatory paradox and its economic consequences.

2. Literature Review

2.1. The EU Legal Framework Prohibiting Gender Discrimination

The European Union’s commitment to equality forms a cornerstone of its legal identity. This is firmly rooted in EU primary law, particularly Article 21 of the Charter of Fundamental Rights of the European Union, which explicitly states that “Any discrimination based on any ground such as sex… shall be prohibited” European Union (2000). This article establishes broad protection and is binding on EU institutions and Member States.
Recognizing that gender equality needed to extend beyond the workplace, the EU adopted Council Directive 2004/113/EC (the “Goods and Services Directive”) The Council of the European Union (2004). Article 5(1) of this Directive established the core “unisex rule,” stating that “the use of sex as a factor in the calculation of premiums and benefits for the purposes of insurance and related financial services shall not result in differences in individuals’ premiums and benefits”. While the Directive initially allowed derogations, this provision was declared invalid by the CJEU in the landmark Test-Achats ruling (Case C-236/09) Skouris et al. (2010). The Court reasoned that allowing an indefinite exemption undermined the Directive’s objective. This ruling effectively eliminated the legal basis for using gender directly in risk calculation for new contracts across the EU. Lithuania has transposed these principles into national law, notably through the Law on Equal Opportunities for Women and Men Law of the Republic of Lithuania (2017).

2.2. Empirical Evidence on Gender and Creditworthiness

Despite the clear legal prohibition, a significant body of empirical research indicates that gender often emerges as a statistically significant variable when predicting credit default risk Aliano et al. (2023); Liu and Mona (2025). This evidence is particularly prominent in the P2P lending sector. Studies utilizing data from major European P2P platforms like Bondora consistently find that female borrowers tend to exhibit a lower probability of defaulting compared to male borrowers, holding other factors constant Aliano et al. (2023); Liu and Mona (2025).
Similar findings have been observed in other P2P markets, such as China Chen et al. (2019). However, these findings are not universal; research on the German P2P platform Smava found no significant gender effect on funding success Barasinska and Schäfer (2014); Polena and Regner (2018), and a study on a Chinese platform found no significant gender effect on default probability Lingnan (2019), underscoring potential context dependency Barasinska and Schäfer (2010).
Research specifically examining Lithuanian P2P platforms found gender was not a statistically significant factor in determining interest rates charged, though this relates to pricing, not default risk itself Gaigalienė and Česnys (2018); Golovkina (2021). Beyond P2P lending, academic studies also point towards gender’s statistical relevance. A key study analyzing European car loans explicitly found gender to be statistically significant in predicting default within a standard credit scoring model Andreeva and Matuszyk (2019). Other research indicates female loan officers may achieve lower loan delinquency rates Moltalvo and Reynal-Querol (2020) and suggests potential links between female leadership/boardroom diversity and decreased firm-level credit risk Aguir et al. (2023); Kinateder et al. (2021), although evidence on individual risk aversion is mixed and context-dependent Aliano et al. (2023); Lobão (2024).
Recent scholarship on machine learning and fairness in financial models supports this concern. In Nwafor et al. (2024), it was argued that proxy variables often replicate hidden gender effects, while it was shown in He and Tao (2025) that, even when protected attributes are excluded, bias can persist unless explicitly constrained.

2.3. Limitations of Age and Income as Explanatory Variables

Credit risk modeling has long relied on demographic and financial variables such as age and income as proxies for borrower stability Crook et al. (2007). It was shown in Jammalamadaka and Itapu (2023) that machine learning credit models often mask structural biases under traditional risk factors like income. Similarly, in De Andrés et al. (2021), by analyzing Spanish bank loan data, it was found that income-related risk assessments systematically undervalue female borrowers.
Furthermore, in a systematic review Dastile and Celik (2024), it was concluded that older borrowers are generally less risky. The authors in He and Tao (2025) likewise observed that credit scoring algorithms often learn that older borrowers are less likely to default. These findings suggest that age and income do not fully explain gender-based variation in credit risk.

2.4. Challenges: Proxy Discrimination and Alternative Data

The legal mandate to exclude gender encounters significant practical challenges due to “proxy discrimination” Schwarcz and Prince (2020). Even when gender is explicitly removed, its influence can persist through other legally permissible variables correlated with it (e.g., occupation, income type, spending patterns) Morais Maceira (2017). Algorithms trained on large datasets can identify these correlations, potentially perpetuating historical biases and circumventing the intended goal of equal treatment Weerts et al. (2024).
The financial industry is increasingly exploring alternative data sources (e.g., utility payments, digital footprints) Alliance for Financial Inclusion (2025); Wang (2024). Proponents argue that this can promote financial inclusion. However, the European Banking Authority (EBA) notes challenges regarding data reliability, explainability, and governance. Furthermore, alternative data introduces ethical concerns, including privacy risks and the potential amplification of societal biases.

2.5. Algorithmic Fairness Concepts

The term “fairness” in the context of algorithms lacks a single, universally accepted definition Bono et al. (2021); Das et al. (2022); Liu (2024). Instead, a variety of mathematical formulations exist, often reflecting different ethical priorities and sometimes proving mutually incompatible Wu (2023). Common group fairness metrics include Demographic Parity, Equal Opportunity, and Predictive Parity.
Recent systematic reviews underscore the limitations of “Fairness Through Unawareness”—the assumption that excluding sensitive attributes like gender inherently ensures impartiality. A comprehensive synthesis by de Castro Vieira et al. (2025) observes that this approach is rarely used in academic fairness research, reinforcing the consensus that structural bias persists even when protected variables are omitted. This study contributes to filling a research gap by employing a real-world, proprietary dataset from a Lithuanian P2P platform to test this “unawareness” approach.

2.6. Ethical Considerations and Societal Impact

The debate surrounding gender inclusion extends into ethics and societal impact. The primary ethical argument against using gender aligns with non-discrimination law, emphasizing assessment based on individual merit, not group characteristics Andreeva and Matuszyk (2019). Incorporating gender risks reinforcing harmful stereotypes and perpetuating historical discrimination Morais Maceira (2017).
Conversely, arguments for controlled inclusion stem from a focus on actual outcomes and statistical fairness. Excluding gender when it is a statistically significant differentiator can lead to an unfair cross-subsidy, where the lower-risk group (females) effectively subsidizes the higher-risk group (males) Caire and Fernandez Vidal (2024). From an outcome-oriented perspective, this result can be framed as unfair. The decision impacts financial inclusion European Institute for Gender Equality (2023) and public trust in credit systems Bono et al. (2021).

3. Results

3.1. Descriptive Statistics

Table 1 presents the summary statistics for all variables used in the final regression analyses, based on the cleaned sample of 9707 observations. The mean default rate in this sample is 7.9%. Male borrowers (Gender_Numerical = 1) represent 55.1% of the sample. The average borrower is 37 years old with an average credit rating score of 10.7.
Table 1. Summary statistics for model variables (N = 9707).

3.2. Exploratory Data Analysis (EDA)

Before proceeding to the formal econometric analysis, an exploratory data analysis (EDA) of the full dataset (N = 10,893) provides preliminary, non-parametric evidence for our hypotheses.

3.2.1. EDA for H1: Gender and Default Risk

To find preliminary support for H1 (that gender is a predictor of risk), we first examine the overall default rates, which are based on 4872 loans to females and 6018 loans to males. As shown in Table 2, a clear disparity exists.
Table 2. Default rate by gender (full dataset, N = 10,893).
The data indicates that male borrowers default approximately 1.7 times more frequently than female borrowers. However, this could be confounded by other variables, such as age. The average age for female borrowers was 40.2 years, while for male borrowers it was 34.7 years. Given that older clients tend to be less risky Dastile and Celik (2024); He and Tao (2025), this age difference could explain the lower default rate for women.
To test this, we stratified the data by age group, as shown in Table 3.
Table 3. Default rates by age and gender (%) (full dataset, N = 10,893).
Contrary to the confounding hypothesis, the female default rate was significantly lower in almost all age groups, and the gap widened in older cohorts. This supports H1, suggesting gender is a factor independent of age.
Next, we examine if the disparity persists within the platform’s own standardized credit rating categories. If the rating system were perfectly capturing all risk, default rates should be similar for men and women within the same rating band.
Table 4 reveals a consistent pattern: female borrowers default less frequently than male borrowers in nearly every rating category. The gap widens substantially in the higher-risk bands (e.g., C3–D3). This provides strong preliminary evidence for H1, indicating that the gender-neutral rating system is not fully capturing systematic, gender-related differences in risk Bartlett et al. (2022); Garcia et al. (2024).
Table 4. Default rates by credit rating and gender (%) (full dataset, N = 10,893).
Finally, we assess the explanatory power of income. To account for inflationary effects over the 12-year period, net salaries were adjusted to their 2025 equivalent values. Table 5 compares default rates within these normalized income brackets.
Table 5. Default rates by adjusted net salary and gender (%) (full dataset, N = 10,893).
Despite having lower average net salaries (801 EUR for females vs. 994 EUR for males), female borrowers exhibited lower default rates in every income segment. This challenges findings that income-related risk assessments systematically undervalue female borrowers De Andrés et al. (2021). This analysis confirms that neither age nor income fully explains the observed gender difference in default rates, providing strong preliminary support for H1.

3.2.2. EDA for H2: Gender and Loan Pricing

Having established preliminary evidence for H1 (gender as a risk factor), we now conduct a preliminary test for H2 (gender-neutral pricing). The most direct way to observe the paradox is to look at the risk–price relationship on the platform. Table 6 shows the actual default rates for men and women who were grouped into the same interest rate bracket.
Table 6. Default rates by interest rate and gender (%).
Table 6 provides the clearest preliminary evidence for H2. In almost every single interest rate bracket offered by the platform, the default rate for female borrowers is lower—often substantially—than the default rate for male borrowers. For example, in the 11% bracket, men default at over 3.6 times the rate of women. In the 23% bracket, women default at 0% while men default at 14%. This demonstrates that men and women are being pooled together and offered the same price, despite exhibiting different levels of risk.
This directly suggests a systematic mispricing. To confirm that this pricing is indeed neutral, we also examine the average interest rates charged, stratified by age, as shown in Table 7.
Table 7. Interest rates by age and gender (full dataset, N = 10,893).
As Table 7 shows, the interest rates are nearly identical across all age groups for both genders. This confirms the platform is applying a gender-neutral pricing policy. When combined with Table 6, this is the crux of the regulatory paradox: the platform correctly follows the law by offering gender-neutral prices, but in doing so, it ignores a significant, observable difference in risk, leading to a cross-subsidy from lower-risk female borrowers to higher-risk male borrowers.
This EDA, while insightful, relies on stratified tables. To formally test these hypotheses while controlling for all variables simultaneously, we now turn to multivariate regression.

3.3. Econometric Analysis

The following analysis is performed on the cleaned, regression-ready sample of N = 9707 observations.

3.3.1. Econometric H1 Results: Gender as a Predictor of Actual Default

Table 8 presents the results of the Logit regression model predicting loan default (Loan_status). The model is statistically significant as a whole (LLR p-value: 4.321  ×   10 109 ), indicating it has strong predictive power.
Table 8. Logit regression results for loan default (H1) (N = 9707).
The results strongly support H1. The variable Gender_Numerical has a positive coefficient (0.3145) and is highly statistically significant (p = 0.000).
  • Interpretation: Since Gender_Numerical is coded as 1 for Male and 0 for Female, this positive coefficient indicates that male borrowers have a significantly higher likelihood of defaulting than female borrowers. This is true even when controlling forall other variables in the model simultaneously.
  • Odds Ratio: The odds ratio for gender is e 0.3145 = 1.37 . This means that the odds of a male borrower defaulting are approximately 37% higher than for a female borrower with an identical profile.
Connecting H1 to H2
The confirmation of H1, both in the EDA and the formal regression, is the first half of the regulatory paradox. We have established that gender is a statistically significant and economically meaningful predictor of default risk.
This finding necessitates the test of H2. If the platform’s pricing model were purely risk-based and unconstrained by law, we would expect it to account for this 37% higher default risk associated with male borrowers, likely by charging them a higher interest rate. The EDA (Table 6 and Table 7) already suggested this is not happening. H2 now formally tests this within the multivariate regression framework.

3.3.2. Econometric H2 Results: Gender as a Factor in Loan Pricing

Table 9 presents the results of the OLS regression model predicting the Loan_Interest rate. This model explains a large portion of the variance in interest rates, with an R-squared of 0.735.
Table 9. OLS regression results for loan interest rate (H2) (N = 9707).
The results strongly support H2. The variable Gender_Numerical has a p-value of 0.263, which is well above the 0.05 threshold for statistical significance.
  • Interpretation: This finding confirms what the EDA suggested: gender has no statistically significant effect on the interest rate charged by the platform. The platform’s pricing model is, in effect, gender-blind.
  • Model Compliance: This result is the expected outcome for a lender that is compliantwith EU non-discrimination law (Directive 2004/113/EC).

4. Discussion

This study’s findings, supported first by exploratory data analysis and confirmed with a robust econometric framework, empirically validate the regulatory paradox. Our Logit model (H1) confirms a statistical reality: gender is a significant predictor of default risk in this dataset, with male borrowers being approximately 1.37 times more likely to default than their female counterparts, even after controlling for the platform’s own credit rating (Table 8).
Simultaneously, our OLS model (H2) confirms a legal reality: the platform’s pricing model is compliant with EU law, as gender is not a statistically significant factor in determining interest rates (Table 9).
The combination of these two findings is the critical contribution of this paper. The analysis of the platform’s historical loan data shows that gender is a statistically significant risk factor, whereas the platform’s pricing remains gender-neutral in line with legal requirements. Under such a regulatory constraint, any pricing model that excludes gender will, by construction, pool borrowers with different risk profiles, which in turn produces a systematic mispricing of risk. Because the platform’s legally constrained, gender-neutral pricing effectively pools higher-risk (male) and lower-risk (female) borrowers, the observed outcome is an average price applied to groups with different default behaviour. In this setting, lower-risk female borrowers appear to pay an interest rate that is high relative to their observed default probability, while higher-risk male borrowers pay a rate that is low relative to theirs. This pattern is consistent with an implicit cross-subsidy between the two groups, although we do not quantify the exact monetary magnitude of this effect in the present study. This outcome, while legally compliant, is arguably “unfair” from an outcome-oriented perspective.

Limitations and Future Research

The conclusions of this study must be viewed within the context of its limitations:
  • Scope: The analysis is based on data from a single P2P platform in a single EU member state (Lithuania). The findings, while robust for this dataset, may not be generalizable to all P2P platforms or traditional banking sectors across the EU.
  • Variable Omission: The model does not control for all possible risk factors.
  • Household structure and shared liabilities: The dataset does not contain information on household composition (e.g., dependants, shared income, partner income, or household-level liabilities). As a result, part of the observed gender difference in default risk may be attributable to unobserved differences in household structure rather than gender alone.
  • Proxies: This study did not test for proxy discrimination. While the platform does not use gender directly, its models may be using other variables (e.g., occupation, income source) that are highly correlated with gender, potentially perpetuating bias Morais Maceira (2017); Weerts et al. (2024).
This study’s findings establish a clear, static mispricing of risk. Future research will build directly on this by pursuing two advanced lines of inquiry. First, to quantify the precise economic cost of this regulatory paradox, we will develop a counterfactual risk-pricing model. This involves building separate, gender-aware risk models to estimate the “fair” interest rates that would have been assigned to female borrowers based on their actual, lower default risk. By comparing these counterfactual rates to the actual rates paid, we will calculate the aggregate overpayment in Euros, thereby quantifying the total monetary value of the implicit cross-subsidy.
Second, we will move beyond the static, binary (default/no default) analysis of the Logit model to a dynamic analysis of default timing. We will employ survival analysis, such as a Cox proportional hazards model, to investigate whether gender is a significant covariate in determining the time-to-default. This will reveal if gender not only affects if a loan defaults, but also how quickly it defaults, adding a critical temporal dimension to the risk profile.

5. Data and Methodology

5.1. Data and Variables

This study utilizes a proprietary, anonymized dataset provided by a major Lithuanian peer-to-peer (P2P) lending platform. The platform granted access to this dataset for research purposes only. The initial data comprised detailed records of 10,893 consumer loans issued between 2014 and 2025. After listwise deletion of observations with missing data for any of the model variables, the final sample used for regression analysis consists of 9707 loans.
The platform’s credit ratings (A1 to E3) follow a standardized scoring framework, as described in Taujanskaitė and Milčius (2022). In line with reviewer recommendations, Table 10 provides the mapping of these categories to their numerical and linguistic values.
Table 10. Frequency distribution and description of credit rating.
All variables used in the regression models are described in Table 11. These variables were included in the original dataset provided by the lending platform.
Table 11. Description of variables used in regression models (N = 9707).

5.2. Software and Data Transformations

The data analysis and econometric modeling for this study were conducted using the Python programming language (version 3.12). The following open-source libraries were essential to the analysis:
  • pandas: Used for data loading from an Excel file, data manipulation, filtering, and analysis.
  • numpy: Employed for numerical operations, particularly for checking for infinite or missing (NaN) values during data cleaning.
  • statsmodels.api: The primary library used for fitting the logistic (Logit) and linear (OLS) regression models and generating statistical summaries.
To prepare the raw data for the regression models (N = 9707), several transformations were performed:
  • Data Loading and Filtering: Rows with zero or NaN values in “Individual _monthly_ income” or “Duration_of_Income_Numerical” were removed to ensure the integrity of the regression sample.
  • Binary Encoding (Dependent Variable): The “Loan_status” column was converted into a binary numerical variable (0 for FULLY_PAID, 1 for DEFAULT).
  • Numerical Encoding (Features): The “Gender” column was mapped to a single numerical variable (0 for Female, 1 for Male). The categorical “Duration_of_Income” column was similarly mapped to its numerical equivalent (“Duration_of_Income_Numerical”).
  • One-Hot Encoding: To include education level in the model, the categorical “Education” column was converted into dummy variables. The resulting boolean columns were then converted to integers (0 and 1s) to be used as regressors.
Note that the “Equalized_Individual_monthly_income” variable, calculated using a linear regression formula based on the “Metai” (Year) column, was used only for the stratified tables in the Exploratory Data Analysis (Section 3.2) and not in the final regression models.

5.3. Econometric Framework

To empirically test the regulatory paradox, we employ a two-part econometric framework. This formal modeling approach provides the mathematical formulation for our hypotheses, addressing key reviewer feedback.

5.3.1. Testing H1: Gender and Actual Default Risk

To test H1—that gender is a statistically significant predictor of actual default risk—we use a logistic (Logit) regression model. This is a standard approach for binary dependent variables like loan default Crook et al. (2007). The model estimates the probability of default for a borrower i. This approach aligns with the methodology used in Cozarenco and Szafarz (2022) which detects structural biases.
P ( Loan _ status i = 1 | X i ) = Λ ( β 0 + β 1 · G e n d e r i + β k · C o n t r o l s i + ϵ i )
where
  • P ( Loan _ status i = 1 ) is the probability of default;
  • Λ is the logistic function;
  • G e n d e r i is our key explanatory variable (Gender_Numerical);
  • C o n t r o l s i is a vector of control variables (Credit Rating, Age, etc.).
Our hypothesis H1 is supported if the coefficient β 1 is statistically significant (p < 0.05) and its sign indicates a higher default probability for males.

5.3.2. Testing H2: Gender and Loan Pricing

To test H2—that the platform’s pricing model is gender-blind—we use Ordinary Least Squares (OLS) regression to model the loan’s final interest rate.
Loan _ Interest i = β 0 + β 1 · G e n d e r i + β k · C o n t r o l s i + ϵ i
where
  • Loan _ Interest i is the interest rate charged;
  • G e n d e r i is our key explanatory variable;
  • C o n t r o l s i is the same vector of control variables.
Our hypothesis H2 is supported if the coefficient β 1 is not statistically significant (p > 0.05), which would demonstrate that gender is not a factor in the platform’s pricing, even if H1 is true.

6. Conclusions

This study moves beyond a single-method analysis to provide robust, multi-stage evidence of a regulatory paradox in credit risk modeling. First, an exploratory data analysis on 10,893 loans revealed consistent, non-parametric evidence that female borrowers have lower default rates than males across nearly all age, income, and credit rating groups, yet are charged identical interest rates.
Second, a formal econometric analysis on a cleaned sample of 9,707 loans confirmed these findings. By applying a Logit model to actual defaults (H1) and an OLS model to loan pricing (H2), we demonstrated a clear and statistically significant disconnect: gender predicts risk (p < 0.001), but it does not influence price (p = 0.263). These conclusions are based solely on observed loan-level variables and do not account for unobserved factors such as household structure or the full internal specification and calibration of the platform’s proprietary scoring model.
This legally mandated, gender-blind approach, while well-intentioned, creates a quantifiable economic distortion. It leads to the systematic mispricing of risk and an implicit subsidy from lower-risk borrowers to their higher-risk counterparts. The findings highlight a critical need for policymakers and regulators to re-evaluate whether the current “fairness-through-unawareness” doctrine in EU law achieves the goal of gender equality in financial access, or whether, in some contexts, it inadvertently disadvantages the very groups it aims to protect.

Author Contributions

Conceptualization, M.J. and A.L.; methodology, M.J. and A.L.; software, M.J. and A.L.; validation, M.J. and A.L.; formal analysis, M.J. and A.L.; investigation, M.J. and A.L.; resources, M.J.; data curation, M.J.; writing—original draft preparation, M.J. and A.L.; writing—review and editing, M.J. and A.L.; visualization, M.J.; supervision, M.J. and A.L.; project administration, M.J. and A.L.; funding acquisition, M.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data can be provided on request by email to the corresponding author.

Acknowledgments

During the preparation of this manuscript/study, the author(s) used pandas, statsmodels.api, and other libraries of Python (version 3.12) for the purposes of data analysis and econometric modeling. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Aguir, Iness, Narjess Boubakri, Miriam Marra, and Lu Zhu. 2023. Gender Diversity in Leadership: Empirical Evidence on Firm Credit Risk. Journal of Financial Stability 69: 101185. [Google Scholar] [CrossRef]
  2. Aliano, Mauro, Khalil Alnabulsi, Greta Cestari, and Stefania Ragni. 2023. Peer-to-Peer (P2P) Lending in Europe: Evaluating the Default Risk of Borrowers in the Context of Gender and Education. European Scientific Journal 19: 60–78. [Google Scholar] [CrossRef]
  3. Alliance for Financial Inclusion. 2025. Alternative Data for Credit Scoring: Special Report. Available online: https://www.afi-global.org/wp-content/uploads/2025/02/Alternative-Data-for-Credit-Scoring.pdf (accessed on 8 November 2025).
  4. Andreeva, Galina, and Anna Matuszyk. 2019. The Law of Equal Opportunities or Unintended Consequences?: The Effect of Unisex Risk Assessment in Consumer Credit. Journal of the Royal Statistical Society Series A 182: 1287–311. [Google Scholar] [CrossRef]
  5. Barasinska, Nataliya, and Dorothea Schäfer. 2010. Does Gender Affect Funding Success at the Peer-to-Peer Credit Markets? Evidence from the Largest German Lending Platform. DIW Discussion Papers. Available online: https://www.econstor.eu/handle/10419/52541 (accessed on 8 November 2025).
  6. Barasinska, Nataliya, and Dorothea Schäfer. 2014. Is Crowdfunding Different? Evidence on the Relation between Gender and Funding Success from a German Peer-to-Peer Lending Platform. German Economic Review 15: 436–52. [Google Scholar] [CrossRef]
  7. Bartlett, Robert, Adair Morse, Richard Stanton, and Nancy Wallace. 2022. Consumer-Lending Discrimination in the FinTech Era. Journal of Financial Economics 143: 30–56. [Google Scholar] [CrossRef]
  8. Bono, Teresa, Karen Croxson, and Adam Giles. 2021. Algorithmic Fairness in Credit Scoring. Oxford Review of Economic Policy 37: 585–617. [Google Scholar] [CrossRef]
  9. Caire, Dean, and Maria Fernandez Vidal. 2024. Gender-Intentional Credit Scoring Acknowledgments. Available online: https://www.cgap.org/research/publication/gender-intentional-credit-scoring (accessed on 8 November 2025).
  10. Chen, Xiao, Bihong Huang, and Dezhu Ye. 2019. Gender Gap in Peer-to-Peer Lending: Evidence from China. Journal of Banking & Finance 108: 105633. [Google Scholar]
  11. Corbett-Davies, Sam, Johann D. Gaebler, Hamed Nilforoshan, Ravi Shroff, and Sharad Goel. 2018. The Measure and Mismeasure of Fairness. arXiv arXiv:1808.00023. [Google Scholar]
  12. Cozarenco, Anastasia, and Ariane Szafarz. 2022. Intersectional Bias in Prosocial Lending: Methods and Application to Microcredit. Available online: https://www.researchgate.net/profile/Ariane-Szafarz/publication/358690498_Intersectional_Bias_in_Prosocial_Lending_Methods_and_Application_to_Microcredit/links/621b69bb2542ea3cacb5e728/Intersectional-Bias-in-Prosocial-Lending-Methods-and-Application-to-Microcredit.pdf (accessed on 8 November 2025).
  13. Crook, Jonathan N., David B. Edelman, and Lyn C. Thomas. 2007. Recent developments in consumer credit risk assessment. European Journal of Operational Research 183: 1447–65. [Google Scholar] [CrossRef]
  14. Das, Sanjiv, Richard Stanton, and Nancy Wallace. 2022. Algorithmic Fairness. Annual Review of Financial Economics 14: 19–39. [Google Scholar] [CrossRef]
  15. Dastile, Xolani, and Turgay Celik. 2024. Counterfactual explanations with multiple properties in credit scoring. IEEE Access 12: 55215–28. [Google Scholar] [CrossRef]
  16. De Andrés, Pablo, Ricardo Gimeno, and Ruth Mateos de Cabo. 2021. The gender gap in bank credit access. Journal of Corporate Finance 71: 101782. [Google Scholar] [CrossRef]
  17. de Castro Vieira, José Rômulo, Flavio Barboza, Daniel Cajueiro, and Herbert Kimura. 2025. Towards Fair AI: Mitigating Bias in Credit Decisions—A Systematic Literature Review. Journal of Risk and Financial Management 18: 228. [Google Scholar] [CrossRef]
  18. European Institute for Gender Equality. 2023. Financial Independence and Gender Equality: Joining the Dots Between Income, Wealth and Power. Available online: https://eige.europa.eu/sites/default/files/documents/financial-independence-and-gender-equality.pdf (accessed on 8 November 2025).
  19. European Union. 2000. Charter of Fundamental Rights of the European Union. Official Journal of the European Communities C 364: 1–22. [Google Scholar]
  20. Gaigalienė, Asta, and Dovydas Česnys. 2018. Determinants of Default in Lithuanian Peer-To-Peer Platforms. Management Organizational: Systems Research 80: 19–36. [Google Scholar] [CrossRef]
  21. Garcia, Ana Cristina Bicharra, Marcio Gomes Pinto Garcia, and Roberto Rigobon. 2024. Algorithmic discrimination in the credit domain: What do we know about it? AI & Society 39: 2059–98. [Google Scholar]
  22. Golovkina, Ana. 2021. Factors Affecting Consumer Loans Interest Rate in P2P Market. Bachelor’s thesis, ISM University of Management and Economics, Vilnius, Lithuania. [Google Scholar]
  23. Hardt, Moritz, Eric Price, and Nati Srebro. 2016. Equality of Opportunity in Supervised Learning. In Advances in Neural Information Processing Systems. vol. 29, pp. 3323–31. [Google Scholar]
  24. He, Fengxiang, and Dacheng Tao. 2025. Algorithmic Fairness. In Foundations of Deep Learning. Singapore: Springer, pp. 263–68. [Google Scholar]
  25. Jammalamadaka, Krishna Ravali, and Srikanth Itapu. 2023. Responsible AI in automated credit scoring systems. AI and Ethics 3: 485–95. [Google Scholar] [CrossRef]
  26. Kinateder, Harald, Tonmoy Choudhury, Rashid Zaman, Simone D. Scagnelli, and Nurul Sohel. 2021. Does Boardroom Gender Diversity Decrease Credit Risk in the Financial Sector? Worldwide Evidence. Journal of International Financial Markets, Institutions and Money 73: 101347. [Google Scholar] [CrossRef]
  27. Law of the Republic of Lithuania on Equal Opportunities for Women and Men. 2017. Available online: https://e-seimas.lrs.lt/portal/legalAct/lt/TAD/3486ee123cfa11eabd71c05e81f09716 (accessed on 8 November 2025).
  28. Lin, Lingnan. 2019. Gender Effect on the Default Risk in Peer-to-Peer Lending Markets: The Case of the Largest Chinese Platform. Risk Governance & Control: Financial Markets & Institutions 9: 8–22. [Google Scholar]
  29. Liu, Xuemeng. 2024. Fairness in Credit Risk Prediction Using Machine Learning. Master’s thesis, Tilburg University, Tilburg, The Netherlands. [Google Scholar]
  30. Liu, Yajing, and Kyoko Mona. 2025. Characteristics of High-Risk Defaulters: An Empirical Study on the Bondora P2P Lending Platform. Available online: https://hdl.handle.net/10125/109670 (accessed on 8 November 2025).
  31. Lobão, Júlio. 2024. The Influence of Gender on Individuals’ Ability to Predict Their Own Risk Tolerance: Evidence from a European Country. Administrative Sciences 14: 56. [Google Scholar] [CrossRef]
  32. Moltalvo, Jose G., and Marta Reynal-Querol. 2020. Gender and Credit Risk: A View From the Loan Officer’s Desk. Available online: https://bw.bse.eu/wp-content/uploads/2019/03/1076-file.pdf (accessed on 8 November 2025).
  33. Morais Maceira, Helena. 2017. Economic Benefits of Gender Equality in the EU. Intereconomics 52: 178–83. [Google Scholar] [CrossRef]
  34. Nwafor, Chioma Ngozi, Obumneme Nwafor, and Sanjukta Brahma. 2024. Enhancing transparency and fairness in automated credit decisions: An explainable novel hybrid machine learning approach. Scientific Reports 14: 25174. [Google Scholar] [CrossRef]
  35. Polena, Michal, and Tobias Regner. 2018. Determinants of Borrowers’ Default in P2P Lending under Consideration of the Loan Risk Class. Games 9: 82. [Google Scholar] [CrossRef]
  36. Pope, Devin G., and Justin R. Sydnor. 2011. What’s in a Picture? Evidence of Discrimination from Prosper.Com. Journal of Human Resources 46: 53–92. [Google Scholar]
  37. Schwarcz, Daniel, and Anya E. R. Prince. 2020. Proxy Discrimination in the Age of Artificial Intelligence and Big Data. Iowa Law Review 105: 1257–318. [Google Scholar]
  38. Shi, Si, Rita Tse, Wuman Luo, Stefano D’Addona, and Giovanni Pau. 2022. Machine Learning-Driven Credit Risk: A Systemic Review. Neural Computing and Applications 34: 14327–39. [Google Scholar] [CrossRef]
  39. Skouris, Vassilios, Antonio Tizzano, José Narciso Cunha Rodrigues, Koen Lenaerts, Jean-Claude Bonichot, Alexander Arabadjiev, E. Juhász, G. Arestis, A. Borg Barthet, and M. Ilešič. 2010. Judgment of the Court (Grand Chamber) of 1 March 2011. Association Belge Des Consommateurs Test-Achats ASBL and Others v Conseil Des Ministres. Case C-236/09. Available online: https://curia.europa.eu/juris/document/document.jsf?docid=80619&doclang=FR (accessed on 15 February 2025).
  40. Taujanskaitė, Kamilė, and Eugenijus Milčius. 2022. Accelerated growth of Peer-to-Peer lending and its impact on the consumer credit market: Evidence from Lithuania. Economies 10: 210. [Google Scholar] [CrossRef]
  41. The Council of the European Union. 2004. Council Directive 2004/113/EC of 13 December 2004 the Implementing the Principle of Equal Treatment Between Men and Women in the Access to and Supply of Goods and Services. Official Journal of the European Union, L 373/37–43. Available online: https://eur-lex.europa.eu/eli/dir/2004/113/oj/eng (accessed on 8 November 2025).
  42. Wang, Zhiqin. 2024. Artificial Intelligence and Machine Learning in Credit Risk Assessment: Enhancing Accuracy and Ensuring Fairness. Open Journal of Social Sciences 12: 19–34. [Google Scholar] [CrossRef]
  43. Weerts, Hilde, Aislinn Kelly-Lyth, Reuben Binns, and Jeremias Adams-Prassl. 2024. Unlawful Proxy Discrimination: A Framework for Challenging Inherently Discriminatory Algorithms. Paper presented at 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT 2024), Rio de Janeiro, Brazil, June 3–6; pp. 1850–60. [Google Scholar]
  44. Wu, Siyuan. 2023. Towards Fair Credit Risk Assessment: Developing Mathematical Models with Equity and Accuracy. Advances in Economics Management and Political Sciences 44: 151–65. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Article metric data becomes available approximately 24 hours after publication online.