Feature Selection on Credit Risk Prediction for Peer-to-Peer Lending

  • Conference paper
New Frontiers in Artificial Intelligence (JSAI-isAI 2018)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11717)

Abstract

Lending has played a key role in the economy since early civilization. One of the most important issues in the lending business is measuring the risk that a borrower will default or delay loan payments. This is called credit risk. After the Lehman shock of 2008–2009, big banks tightened verification in their lending operations to reduce risk. As borrowing from established financial institutions becomes harder, social lending, also called Peer-to-Peer (P2P) lending, is becoming a popular trend. Because the client information available in P2P lending is less complete than in the traditional financial system, big data and machine learning have become the default methods for analyzing credit risk. However, the cost of computation and the problem of training a classifier on imbalanced data affect the quality of the results. This paper proposes a machine learning model with feature selection to measure the credit risk of individual borrowers in P2P lending. Our experimental results show that credit risk prediction for P2P lending can be improved by using Logistic Regression together with proper feature selection.
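The abstract names three ingredients (feature selection, Logistic Regression, and imbalanced training data) without giving the procedure. The following is a minimal sketch of how such a pipeline could look, not the authors' method. Everything in it is an assumption made for illustration: the data is synthetic, the feature filter is a simple absolute-correlation ranking, the imbalance is handled by inverse-frequency class weights, and all function names (`make_data`, `select_features`, `fit_logistic`, `recall`) are hypothetical.

```python
import math
import random


def make_data(n, seed):
    """Synthetic, imbalanced sample: features 0-1 carry signal, 2-4 are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        label = 1 if rng.random() < 0.2 else 0  # roughly 20% "default" cases
        row = [rng.gauss(1.5 * label, 1.0), rng.gauss(-1.5 * label, 1.0)]
        row += [rng.gauss(0.0, 1.0) for _ in range(3)]
        X.append(row)
        y.append(label)
    return X, y


def abs_correlation(col, y):
    """Absolute Pearson correlation between one feature column and the label."""
    n = len(y)
    mc, my = sum(col) / n, sum(y) / n
    cov = sum((c - mc) * (t - my) for c, t in zip(col, y))
    vc = sum((c - mc) ** 2 for c in col)
    vy = sum((t - my) ** 2 for t in y)
    return abs(cov) / math.sqrt(vc * vy) if vc > 0 and vy > 0 else 0.0


def select_features(X, y, k):
    """Filter-style selection (assumed): keep the k highest-scoring columns."""
    scores = [abs_correlation([r[j] for r in X], y) for j in range(len(X[0]))]
    ranked = sorted(range(len(scores)), key=lambda j: -scores[j])
    return sorted(ranked[:k])


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(X, y, epochs=200, lr=0.1):
    """Batch gradient descent on class-weighted log loss."""
    n, d = len(X), len(X[0])
    pos = sum(y)
    # Inverse-frequency weights so the rare class is not ignored.
    w_pos, w_neg = n / (2.0 * pos), n / (2.0 * (n - pos))
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * d, 0.0
        for row, t in zip(X, y):
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, row)) + b)
            err = (p - t) * (w_pos if t == 1 else w_neg)
            for j in range(d):
                gw[j] += err * row[j]
            gb += err
        w = [wi - lr * g / n for wi, g in zip(w, gw)]
        b -= lr * gb / n
    return w, b


def recall(X, y, w, b):
    """Share of true positives (label 1) that the model flags."""
    hits = sum(1 for row, t in zip(X, y)
               if t == 1 and sum(wi * xi for wi, xi in zip(w, row)) + b > 0)
    return hits / sum(y)


def project(X, cols):
    """Keep only the selected columns."""
    return [[row[j] for j in cols] for row in X]


X_train, y_train = make_data(400, seed=0)
X_test, y_test = make_data(400, seed=1)
selected = select_features(X_train, y_train, k=2)
w, b = fit_logistic(project(X_train, selected), y_train)
rec = recall(project(X_test, selected), y_test, w, b)
print("selected feature indices:", selected)
print("recall on the minority class:", round(rec, 3))
```

Recall on the minority class is reported here because plain accuracy can look high on imbalanced data even when most defaults are missed; the metrics used in the paper itself are not stated in the abstract.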

Author information

Corresponding author

Correspondence to Shin-Fu Chen.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Chen, SF., Chakraborty, G., Li, LH. (2019). Feature Selection on Credit Risk Prediction for Peer-to-Peer Lending. In: Kojima, K., Sakamoto, M., Mineshima, K., Satoh, K. (eds) New Frontiers in Artificial Intelligence. JSAI-isAI 2018. Lecture Notes in Computer Science, vol 11717. Springer, Cham. https://doi.org/10.1007/978-3-030-31605-1_1

  • DOI: https://doi.org/10.1007/978-3-030-31605-1_1

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-31604-4

  • Online ISBN: 978-3-030-31605-1

  • eBook Packages: Computer Science, Computer Science (R0)
