2019
Conference article  Open Access

On combining dynamic selection, sampling, and pool generators for credit scoring

Melo Junior L., Nardini F. M., Renso C., Fernandes De Macedo J. A.

credit scoring  imbalanced datasets  dynamic classification  ensemble pool generators 

The profitability of the banks highly depends on the models used to decide on the customer's loans. State of the art credit scoring models are based on machine learning methods. These methods need to cope with the problem of imbalanced classes since credit scoring datasets usually contain many paid loans and few not paid ones (defaults). Recently, dynamic selection approaches combined with pre-processing techniques have been evaluated for imbalanced datasets. However, previous works only evaluate oversampling techniques combined with bagging pool generator ensembles. For this reason, we propose to combine different dynamic selection, preprocessing and pool generation techniques. We assess the prediction performance by using four public real-world credit scoring datasets with different levels of imbalanced ratio and four evaluation measures. Experimental results show that KNORA-Union dynamic selection technique combined with Balanced Random Forest improves the classification performance concerning the static ensemble for all levels of imbalance ratio.

Source: Machine Learning and Data Mining in Pattern Recognition, 15th International Conference on Machine Learning and Data Mining, MLDM, pp. 443–457, New York, USA, 18/07/2019, 23/07/2019



Back to previous page
BibTeX entry
@inproceedings{oai:it.cnr:prodotti:415726,
	title = {On combining dynamic selection, sampling, and pool generators for credit scoring},
	author = {Melo Junior L. and Nardini F. M. and Renso C. and Fernandes De Macedo J. A.},
	booktitle = {Machine Learning and Data Mining in Pattern Recognition, 15th International Conference on Machine Learning and Data Mining, MLDM, pp. 443–457, New York, USA, 18/07/2019, 23/07/2019},
	year = {2019}
}