Loan classification using random forest algorithm and comparative analysis with other classifiers
https://doi.org/10.29235/1561-8323-2025-69-2-101-108
Abstract
The study aims to analyze the application of the random forest algorithm in addressing the loan classification issue. Furthermore, it intends to perform a comparative analysis by juxtaposing the outcomes with those derived from logistic regression, feedforward neural network, and deep feedforward neural network models. The research determined the ideal maximum number of input indicators and the ideal number of trees in the ensemble when utilizing the random forest algorithm. Additionally, it explored the impact of alternative data partitioning into training and test sets on the accuracy of model forecasting with the random forest algorithm. In conclusion, a strategy for addressing the loan classification issue using the classifiers studied has been proposed.
About the Authors
U. I. BehunkouBelarus
Behunkou Uladzimir I. – Master of Sciences (Engineering)
6, Surganov Str., 220012, Minsk
M. Y. Kovalyov
Belarus
Kovalyov Mikhail Y. – Corresponding Member, D. Sc. (Physics and Mathematics), Professor
6, Surganov Str., 220012, Minsk
References
1. Behunkou U. I., Kovalyov M. Y. Loan classification using logistic regression. Informatics, 2023, vol. 20, no. 1, pp. 55–74 (in Russian). https://doi.org/10.37661/1816-0301-2023-20-1-55-74
2. Behunkou U. I. Loan classification using a feed-forward neural network. Informatics, 2024, vol. 21, no. 1, pp. 83–104 (in Russian). https://doi.org/10.37661/1816-0301-2024-21-1-83-104
3. Lessmann S., Baesens B., Seow H.-V., Thomas L. C. Benchmarking state-of-the-art classification algorithms for credit scoring: an update of research. European Journal of Operational Research, 2015, vol. 247, no. 1, pp. 124–136. https://doi.org/10.1016/j.ejor.2015.05.030
4. Quinlan J. R. Induction of Decision Trees. Machine Learning, 1986, vol. 1, pp. 81–106. https://doi.org/10.1007/bf00116251
5. Breiman L., Friedman J. H., Olshen R. A., Stone C. J. Classification and Regression Trees. New York, 1984. 368 p. https://doi.org/10.1201/9781315139470
6. Hastie T., Tibshirani R., Friedman J. The elements of statistical learning: Data mining, inference, and prediction. 3d ed. New York, 2009, pp. 308–310. https://doi.org/10.1007/978-0-387-84858-7
7. Breiman L. Random Forests. Machine Learning, 2001, vol. 45, pp. 5–32. https://doi.org/10.1023/a:1010933404324