Proceedings of the 3rd International Conference on Economic Development and Business Culture (ICEDBC 2023)

Predicting Default Situations in the P2P Lending

Based on Machine Learning

Authors
Chenzhou Mo1, *
1Beijing Normal University, Zhuhai, 510630, Guangdong, China
*Corresponding author. Email: 1349696871@qq.com
Corresponding Author
Chenzhou Mo
Available Online 26 September 2023.
DOI
10.2991/978-94-6463-246-0_73How to use a DOI?
Keywords
P2P; Logistic regression model; default; Machine learning; Gradient Boosting Decision Tree Model; T-test; Chi-square test
Abstract

As a flexible and efficient new financial format, P2P lending suffers from breach of contract and lack of trust due to the uneven credit, income, and region of borrowers. Therefore, we plan to use machine learning algorithms to predict the default situation in the P2P market in the future, and compare the prediction accuracy of various models to find the optimal default prediction model. The research data in this article includes P2P lending data from 33,105 users in 50 states in the United States. It includes variables such as investment income loss percentage, borrower income, and loan term. To simplify subsequent analysis, missing values were cleaned and data on borrower state and loan date were classified and simplified. T-test and chi-square test were used to preliminarily analyze data-type data and categorical-type data, and the results showed that all relevant variables are statistically significant and need to be considered in subsequent research. To further determine the significance of each variable in the default situation, a logistic regression model was introduced, which has practical significance for lending platforms in user selection. Finally, four types of models were used for constructing default prediction models, which are logistic regression, decision trees, random forests, and GBDT. The ACC and AUC values of different models on the training and testing sets were compared. The conclusion is that the GBDT model has the highest prediction accuracy and a high AUC value, which can serve as a prediction model for future lending platforms to predict user default situations.

Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the 3rd International Conference on Economic Development and Business Culture (ICEDBC 2023)
Series
Advances in Economics, Business and Management Research
Publication Date
26 September 2023
ISBN
978-94-6463-246-0
ISSN
2352-5428
DOI
10.2991/978-94-6463-246-0_73How to use a DOI?
Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Chenzhou Mo
PY  - 2023
DA  - 2023/09/26
TI  - Predicting Default Situations in the P2P Lending
BT  - Proceedings of the 3rd International Conference on Economic Development and Business Culture (ICEDBC 2023)
PB  - Atlantis Press
SP  - 607
EP  - 613
SN  - 2352-5428
UR  - https://doi.org/10.2991/978-94-6463-246-0_73
DO  - 10.2991/978-94-6463-246-0_73
ID  - Mo2023
ER  -