Predicting Default Situations in the P2P Lending
Based on Machine Learning
- DOI
- 10.2991/978-94-6463-246-0_73How to use a DOI?
- Keywords
- P2P; Logistic regression model; default; Machine learning; Gradient Boosting Decision Tree Model; T-test; Chi-square test
- Abstract
As a flexible and efficient new financial format, P2P lending suffers from breach of contract and lack of trust due to the uneven credit, income, and region of borrowers. Therefore, we plan to use machine learning algorithms to predict the default situation in the P2P market in the future, and compare the prediction accuracy of various models to find the optimal default prediction model. The research data in this article includes P2P lending data from 33,105 users in 50 states in the United States. It includes variables such as investment income loss percentage, borrower income, and loan term. To simplify subsequent analysis, missing values were cleaned and data on borrower state and loan date were classified and simplified. T-test and chi-square test were used to preliminarily analyze data-type data and categorical-type data, and the results showed that all relevant variables are statistically significant and need to be considered in subsequent research. To further determine the significance of each variable in the default situation, a logistic regression model was introduced, which has practical significance for lending platforms in user selection. Finally, four types of models were used for constructing default prediction models, which are logistic regression, decision trees, random forests, and GBDT. The ACC and AUC values of different models on the training and testing sets were compared. The conclusion is that the GBDT model has the highest prediction accuracy and a high AUC value, which can serve as a prediction model for future lending platforms to predict user default situations.
- Copyright
- © 2024 The Author(s)
- Open Access
- Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
Cite this article
TY - CONF AU - Chenzhou Mo PY - 2023 DA - 2023/09/26 TI - Predicting Default Situations in the P2P Lending BT - Proceedings of the 3rd International Conference on Economic Development and Business Culture (ICEDBC 2023) PB - Atlantis Press SP - 607 EP - 613 SN - 2352-5428 UR - https://doi.org/10.2991/978-94-6463-246-0_73 DO - 10.2991/978-94-6463-246-0_73 ID - Mo2023 ER -