2019
Conference article  Closed Access

An empirical comparison of classification algorithms for imbalanced credit scoring datasets

Soares De Melo Junior L., Nardini F. M., Renso C., Fernandes De Macedo J. A.

Imbalanced datasets  Classification  Benchmarking  Credit scoring

The profitability of banks is highly dependent on credit scoring models, which support the decision to approve a loan to a customer. State-of-the-art credit scoring models are based on learning methods. These methods need to cope with the problem of imbalanced classes, since credit scoring datasets usually contain mostly paid loans and few defaults (unpaid ones). Recently, new imbalanced learning techniques have been proposed in the literature, and they can improve credit scoring results. Motivated by this scenario, we evaluate several classification approaches to credit scoring. We also assess some preprocessing methods for handling skewed datasets. To this end, we use three public real-world credit scoring datasets. In our experiments, we progressively increase the class imbalance in each of these datasets by randomly undersampling the minority class of defaulters, to identify how the predictive power is affected. The results indicate that random forest and extreme gradient boosting perform very well at all imbalance levels. We also find that a complete grid search step can increase the predictive power of classification approaches on highly imbalanced datasets.
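
The undersampling protocol described in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name `undersample_minority`, its parameters, and the convention that label 1 marks a defaulter are all assumptions made for illustration, and the abstract does not state which imbalance levels were used.

```python
import random


def undersample_minority(rows, labels, minority_fraction, minority_label=1, seed=0):
    """Randomly drop minority-class rows until they make up roughly
    `minority_fraction` of the returned data. All majority rows are kept.

    Hypothetical helper: the name, signature, and label convention are
    assumptions, not taken from the paper.
    """
    if not 0.0 < minority_fraction < 1.0:
        raise ValueError("minority_fraction must lie strictly between 0 and 1")

    rng = random.Random(seed)  # fixed seed so the subsample is reproducible
    minority_idx = [i for i, lab in enumerate(labels) if lab == minority_label]
    majority_idx = [i for i, lab in enumerate(labels) if lab != minority_label]

    # Solve n_min / (n_min + n_maj) = f for n_min, capped at what is available.
    target = round(minority_fraction * len(majority_idx) / (1.0 - minority_fraction))
    target = min(target, len(minority_idx))

    # Keep every majority row plus a random subset of minority rows,
    # preserving the original row order.
    kept = sorted(majority_idx + rng.sample(minority_idx, target))
    return [rows[i] for i in kept], [labels[i] for i in kept]
```

Calling the function repeatedly with decreasing values of `minority_fraction` on the same dataset yields a series of progressively more imbalanced copies, on which each classifier can then be trained and scored. If the requested fraction exceeds the share of defaulters already present, the data is returned unchanged.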

Source: ICMLA 2019 - 18th IEEE International Conference on Machine Learning and Applications, pp. 747–754, Boca Raton; United States, 16-19 December, 2019


BibTeX entry
@inproceedings{oai:it.cnr:prodotti:424010,
	title = {An empirical comparison of classification algorithms for imbalanced credit scoring datasets},
	author = {Soares De Melo Junior L. and Nardini F. M. and Renso C. and Fernandes De Macedo J. A.},
	doi = {10.1109/icmla.2019.00133},
	booktitle = {ICMLA 2019 - 18th IEEE International Conference on Machine Learning and Applications},
	pages = {747--754},
	address = {Boca Raton, United States},
	year = {2019}
}

BigDataGrapes
Big Data to Enable Global Disruption of the Grapevine-powered Industries

