Identification of the bank’s default clients by machine learning methods on the basis of binning

Yurii Kleban, Nataliia Horoshko


Introduction. In the current global crisis, the problem of the quality of banks’ loan portfolios is a topical issue. Among the methods of effective credit risk management is the assessment of the borrower’s creditworthiness. Improving the quality of analysis of the strengths and weaknesses of the counterparty will reduce the occurrence of unforeseen risks in the process of conducting credit operations. Given the importance of the role of creditworthiness assessment for decision-making, there is a need to improve and choose a methodology that will ensure the most accurate classification of the bank’s clients.

Purpose. The aim of the work is to choose the best method for predicting the probability of default of commercial bank customers based on the analysis of approaches and testing of the built models.

Method (methodology). The paper considers methodological approaches to modeling the insolvency of bank customers and determining the probability of repayment of loans based on binning indicators. Also, the credit risk assessment models based on the use of logit and probit regressions, the algorithm of extreme gradient boosting and artificial neural networks are constructed. The comparative analysis of the efficiency of the application of the used approaches is carried out.

Results. The obtained results demonstrated the high accuracy of the models and their ability to identify non-creditworthy customers. The findings of the study and evaluation of mathematical approaches can be implemented in the work of banking structures and other credit institutions to spread the amount of problem fees in their loan portfolios.


model; binning; logit regression; neural network; gradient boosting; creditworthiness; individual borrower

Full Text:



The National Bank of Ukraine. (2004). Metodychni vkazivky z inspektuvannia bankiv «Systema otsinky ryzykiv»: Postanova Pravlinnia Natsionalnoho banku Ukrainy vid 15.03.2004 № 104. Retrieved from [in Ukrainian].

Velykoivanenko, G. I., Trokoz, L. O. (2013). Modelyuvannia kredytospromozhnosti pozychalnykiv komertsiinoho banku [Modeling of the creditworthiness of commercial bank borrowers]. Naukovyi zhurnal «Naukovi zapysky Natsionalnoho universytetu«Ostrozka akademiia seriia Ekonomika [Scientific journal Scientific Notes of the National University of Ostroh Academy, series Economics], 22, 137-141. [in Ukrainian].

Matviichuk, A. V. (2011). Shtuchnyi intelekt v ekonomitsi: neironni merezhi, nechitka lohika. [Artificial intelligence in economics: neural networks, fuzzy logic: a monograph]. Кyiv, KNEU. [in Ukrainian].

Smoleva, Т. (2014). Suchasni metody otsinky kredytospromozhnosti pozychalnykiv bankamy Ukrayiny [Modern methods of assessing the creditworthiness of borrowers by Ukrainian banks.]. Finansy, oblik, banky [Finance, accounting, banks], 1(20), 241-245. [in Ukrainian].

Thomas, L.C. (2009). Modelling the credit risk for portfolios of consumer loans: analogies with corporate loan models. Math. Comput. Simulat, 79 (8), 2525–2534.

Zenzerović, R. (2011). Credit scoring models in estimating the creditworthiness of small and medium and big enterprises. Croatian Operational Research Review, 2(1).

Abdou, H., Pointon, J. (2011). Credit scoring, statistical techniques and evaluation criteria: a review of the literature. Intell. Syst. Account., Finance Manage, 18 (2-3), 59-88.

Akkoc, S. (2012). An empirical comparison of conventional techniques, neural networks and the three stage hybrid Adaptive Neuro Fuzzy Inference System (AN- FIS) model for credit scoring analysis: the case of Turkish credit card data, Eur. J. Oper. Res., 222 (1), 168-178.

Lee, T., Chen, I. (2005). A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression spines, Expert Syst. Appl., 28 (4), 743–752.

Crook, J., Banasik, J. (2012). Forecasting and explaining aggregate consumer credit delinquency behaviour, Int. J. Forecasting, 28 (1), 145–160.

Novoseletskyy, O.M., Yakubets O.V. (2014) Modeliuvannia kredytospromozhnosti yurydychnykh osib na osovi dyskryminantnoho analisu ta neironnykh merezh [Legal entities creditworthiness modeling using discriminant analysis and neural networks]. Neyronechitki tekhnolohii modelyuvannia v ekonomitsi. Naukovo-analitychnyi zhurnal [Neuroreflective modeling technologies in economics. Scientific and analytical journal], 3, 120-151. [in Ukrainian].

Kleban, Y. V. (2019). Doslidzhennia sposobiv transformatsii danykh v konteksti pidvyshchennia efektyvnosti modeley kredytnoho skorynhu [Research of ways of data transformation in the context of increasing the efficiency of credit scoring models]. Neyronechitki tekhnolohii modelyuvannia v ekonomitsi. Naukovo-analitychnyi zhurnal [Neuroreflective modeling technologies in economics. Scientific and analytical journal], 8, 94-123. [in Ukrainian].

Hryhorovych, O.V. (2019). Zastosuvannia bagatosharovysh perseptroniv dlya klasyfikacii posychalnykiv-yurydychnykh osib [Application of multilayer perceptrons to legal entities borrowers classification]. Neyronechitki tekhnolohii modelyuvannia v ekonomitsi. Naukovo-analitychnyi zhurnal [Neuroreflective modeling technologies in economics. Scientific and analytical journal], 8, 48-64. [in Ukrainian].

Nielsen, D. (2016). Tree Boosting with XGBoost. Norwegian University of Science and Technology. Retrieved from . [in English].

Troyanskaya, O., Cantor, M., Sherlock and otherth (2001). Missing value estimation methods for DNA microarrays. Bioinfomatics, 17 (6), 520-525.

Trueck, S., Rachev, S.T. (2009). Rating Based Modeling of Credit Risk. Academic Press, 11-30.

Matviichuk, A. V., Ben, V.P. (2015). The use of logit and probit regressions in borrower’s creditworthiness assessment. The Visnyk of the National Bank of Ukraine, 5, 37-41.

Vitlinskii, V.V. (2012) Shtuchnyi intellect u systemi pryiniattia upravlinskykh rishen [Artificial intelligence in the management decision-making system]. Neyronechitki tekhnolohii modelyuvannia v ekonomitsi. Naukovo-analitychnyi zhurnal [Neuroreflective modeling technologies in economics. Scientific and analytical journal],1, 97-118.

Haykin, S. (2006). Neural Networks: A Comprehensive Foundation (2nd ed.). Moscow: Williams.

Fawcett, T. (2011). An Introduction to ROC Analysis. Pattern Recognition Letters, 27 (8), 861–874.

Berrar, D. (2018). Cross-Validation. Reference Module in Life Sciences. Retrieved from


Article Metrics

Metrics Loading ...

Metrics powered by PLOS ALM


  • There are currently no refbacks.

Copyright (c) 2021

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.


Articles are distributed under Creative Commons Attribution  International 4.0 (CC-BY-NC 4.0) 

Science Works Journal "Ekonomichnyy analiz"

ISSN 1993-0259 (Print)  ISSN 2219-4649 (Online) DOI: 10.35774/econa

© West Ukrainian National University

© «Ekonomichnyy analiz», 2007-2023