2020
Journal article  Open Access

A novel approach to define the local region of dynamic selection techniques in imbalanced credit scoring problems

Melo Junior L., Nardini F. M., Renso C., Trani R., Macedo J. A.

Imbalanced learning  Computer Science Applications  Artificial Intelligence  Dynamic Selection Classification  Credit scoring  General Engineering 

Lenders, such as banks and credit card companies, use credit scoring models to evaluate the potential risk posed by lending money to customers, and therefore to mitigate losses due to bad credit. The profitability of banks thus depends heavily on the models used to decide on customers' loans. State-of-the-art credit scoring models are based on machine learning and statistical methods. One of the major problems in this field is that lenders often deal with imbalanced datasets that usually contain many paid loans but very few unpaid ones (called defaults). Recently, dynamic selection methods combined with ensemble methods and preprocessing techniques have been evaluated to improve classification models on imbalanced datasets, showing advantages over static machine learning methods. In a dynamic selection technique, samples in the neighborhood of each query sample are used to compute the local competence of each base classifier. Then, the technique selects only competent classifiers to predict the query sample. In this paper, we evaluate the suitability of dynamic selection techniques for credit scoring problems, and we present Reduced Minority k-Nearest Neighbors (RMkNN), an approach that improves on the state of the art in defining the local region of dynamic selection techniques for imbalanced credit scoring datasets. The proposed technique achieves superior prediction performance on imbalanced credit scoring datasets compared to the state of the art. Furthermore, RMkNN does not need any preprocessing or sampling method to generate the dynamic selection dataset (called DSEL). Additionally, we observe an equivalence between dynamic selection and static selection classification. We conduct a comprehensive evaluation of the proposed technique against state-of-the-art competitors on six real-world public datasets and one private one. Experiments show that RMkNN improves classification performance on the evaluated datasets with respect to AUC, balanced accuracy, H-measure, G-mean, F-measure, and Recall.
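
The generic dynamic selection step described in the abstract (find the neighborhood of a query in DSEL, score each base classifier's local competence, keep only the competent ones) can be sketched roughly as follows. This is a minimal illustration of plain k-nearest-neighbor local-accuracy selection, not the RMkNN method itself, whose details are not given here. The toy data, the rule-based classifiers, the function names, the label convention (1 = default), and the tie-breaking rule are all assumptions made for the example.

```python
from math import dist

def k_nearest(query, dsel_X, k):
    """Indices of the k DSEL samples closest to the query (Euclidean distance)."""
    order = sorted(range(len(dsel_X)), key=lambda i: dist(query, dsel_X[i]))
    return order[:k]

def local_accuracy(clf, dsel_X, dsel_y, region):
    """Fraction of samples in the local region that clf labels correctly."""
    hits = sum(1 for i in region if clf(dsel_X[i]) == dsel_y[i])
    return hits / len(region)

def dynamic_predict(query, classifiers, dsel_X, dsel_y, k=3):
    """Predict with only the classifiers that are most competent near the query."""
    region = k_nearest(query, dsel_X, k)
    scores = [local_accuracy(c, dsel_X, dsel_y, region) for c in classifiers]
    best = max(scores)
    selected = [c for c, s in zip(classifiers, scores) if s == best]
    votes = [c(query) for c in selected]
    # Majority vote over binary labels; a tie goes to 1 (assumed: flag as default).
    return int(2 * sum(votes) >= len(votes))

# Hypothetical DSEL: two features per loan, label 1 = default, 0 = paid.
dsel_X = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (1.0, 1.0), (0.9, 1.1), (1.1, 0.9)]
dsel_y = [0, 0, 0, 1, 1, 1]

# Hypothetical base classifiers, written as simple rules for illustration.
def threshold_rule(x):
    return 1 if x[0] > 0.5 else 0

def always_paid(x):
    return 0  # mimics a model biased toward the majority class

pool = [threshold_rule, always_paid]

print(dynamic_predict((0.95, 1.0), pool, dsel_X, dsel_y))
print(dynamic_predict((0.05, 0.1), pool, dsel_X, dsel_y))
```

In this sketch, a classifier that always predicts the majority class is dropped for queries whose neighborhood contains defaults, because its local accuracy there is low. How the local region itself is defined is exactly the part the paper's RMkNN approach changes.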

Source: Expert systems with applications 152 (2020). doi:10.1016/j.eswa.2020.113351

Publisher: Pergamon, Oxford, United Kingdom


BibTeX entry
@article{oai:it.cnr:prodotti:440239,
	title = {A novel approach to define the local region of dynamic selection techniques in imbalanced credit scoring problems},
	author = {Melo Junior L. and Nardini F. M.  Renso C. and Trani R. and Macedo J. A.},
	publisher = {Pergamon,, Oxford , Regno Unito},
	doi = {10.1016/j.eswa.2020.113351},
	journal = {Expert systems with applications},
	volume = {152},
	year = {2020}
}

BigDataGrapes
Big Data to Enable Global Disruption of the Grapevine-powered Industries

MASTER
Multiple ASpects TrajEctoRy management and analysis


OpenAIRE