A stacking ensemble model with SMOTE for improved imbalanced classification on credit data

Nur Alamsyah, Budiman Budiman, Titan Parama Yoga, R. Yadi Rakhman Alamsyah

Abstract


This research is based on a significant problem in credit risk analysis in the banking sector caused by class imbalance. We face the problem of the model’s inability to accurately identify risks in the ‘‘Charged Off’’ class. As a solution, we propose a stacked ensemble approach that utilizes synthetic minority over-sampling technique (SMOTE) to balance the class distribution. Experiments were conducted by applying SMOTE to the training data before training the credit model using gradient boosting (XGBoost) and random forest (RF) algorithms in a single ensemble. The results show significant improvements in precision, recall, and F1-score after applying SMOTE on the unbalanced classes. The updated model achieved a striking accuracy rate of 0,97 on resampled training data. This re-search clearly identifies the problem of class imbalance as a major challenge in credit risk analysis. The application of SMOTE in a stacked ensemble was found to be effective in improving model performance, making a valuable contribution to the development of more reliable credit models for better risk management and revenue generation in financial institutions.


Keywords


banking data; classification; credit risk; stacking ensemble; synthetic minority over-sampling technique;

Full Text:

PDF


DOI: http://doi.org/10.12928/telkomnika.v22i3.25921

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

TELKOMNIKA Telecommunication, Computing, Electronics and Control
ISSN: 1693-6930, e-ISSN: 2302-9293
Universitas Ahmad Dahlan, 4th Campus
Jl. Ringroad Selatan, Kragilan, Tamanan, Banguntapan, Bantul, Yogyakarta, Indonesia 55191
Phone: +62 (274) 563515, 511830, 379418, 371120
Fax: +62 274 564604

View TELKOMNIKA Stats