Dyakonova, Ludmila and Konstantinov, Alexey (2024): Approaches to risk analysis in the financial sector based on machine learning and artificial intelligence methods.
PDF
MPRA_paper_122941.pdf Download (263kB) |
Abstract
The article studies approaches to improving the forecasting quality of machine learning models in finance. An overview of studies devoted to the application of machine learning models and artificial intelligence in the banking sector is given, both from the point of view of risk management and considering in more detail the applied methods of credit scoring and fraud detection. Aspects of applying explainable artificial intelligence (XAI) methods in financial organizations are considered. To identify the most effective machine learning models, the authors conducted experiments to compare 8 classification models used in the financial sector. The gradient boosting model CatboostClassifier was chosen as the base model. A comparison was carried out for the results obtained on the CatboostClassifier model with the characteristics of the other models: IsolationForest, feature ranking model using Recursive Feature Elimination (RFE), XAI Shapley values method, positive class weight increase models wrapper model. All models were applied to 5 open financial data sets. 1 dataset contains transaction data of credit card transactions, 3 datasets contain data on retail lending, and 1 dataset contains data on consumer lending. Our calculations revealed slight improvement for the models IsolationForest and wrapper model in comparison with the base CatboostClassifier model in terms of ROC_AUC for loan defaults data.
Item Type: | MPRA Paper |
---|---|
Original Title: | Approaches to risk analysis in the financial sector based on machine learning and artificial intelligence methods |
English Title: | Approaches to risk analysis in the financial sector based on machine learning and artificial intelligence methods |
Language: | English |
Keywords: | financial risks, credit scoring, fraud detection, machine learning, explainable artificial intelligence methods, Catboost, SHAP |
Subjects: | C - Mathematical and Quantitative Methods > C6 - Mathematical Methods ; Programming Models ; Mathematical and Simulation Modeling > C63 - Computational Techniques ; Simulation Modeling |
Item ID: | 122941 |
Depositing User: | Ms Ludmila Dyakonova |
Date Deposited: | 16 Dec 2024 14:22 |
Last Modified: | 16 Dec 2024 14:22 |
References: | Abbasi, A., Albrecht, C., Vance, A., Hansen, J. MetaFraud: A metalearning framework for detecting financial fraud. // Mis Quart., vol. 36, no. 4, pp. 12931327, 2012. Adadi A. and Berrada, M. Peeking inside the black-box: A survey on explainable artificial intelligence (XAI). // IEEE Access, vol. 6, pp. 52138-52160, 2018. Aziz, S. and Dowling, M. Machine Learning and AI for Risk Management, in T. Lynn, G. Mooney, P. Rosati, and M. Cummins (eds.) // Disrupting Finance: FinTech and Strategy in the 21st Century, Palgrave, pp 33-50. 2019. BaFin. Machine learning in risk models – Characteristics and supervisory priorities. Consultation paper. // Bundesanatalt für Finanzdienstleistungsaufsicht. 2021. https://www.skadden.com/-/media/files/publications/2023/12/regulation-of-ai-in-financial-services-an-international-update/the-german-regulator-bafin-has-stated.pdf Bahnsen, A.C., Aouada, D., Stojanovic, A., Ottersten, B. Feature engineering strategies for credit card fraud detection // Expert Syst. Appl., vol. 51, pp. 134-142, Jun. 2016. Bello, O.A. Machine Learning Algorithms for Credit Risk Assessment: An Economic and Financial Analysis, International Journal of Management Technology, Vol.10, No 1, pp.109-133. 2023. https://eajournals.org/ijmt/wp-content/uploads/sites/69/2024/06/Machine-Learning-Algorithms.pdf Blázquez-García, A., Conde, A., Mori, U., Lozano, J. A. A review on outlier/anomaly detection in time series data. 2020. https://arxiv.org/abs/2002.04236 Bracke, P., Datta, A., Jung, C., Sen, S. Machine learning explainability in Fnance: An application to default risk analysis. // Bank England, London, U.K., Working Paper 816, 2019. Breiman, L. Bagging predictors. // Machine Learning, 24(2), 123-140, 1996. https://sci2s.ugr.es/keel/pdf/algorithm/articulo/1996-ML-Breiman-Bagging%20Predictors.pdf Di Biasi, P., Gnutti, R., Resti, A., Vergari, D. Machine Learning for Credit risk: three successful Case Histories. 25 Aug 2022. Risk management magazine. Vol. 17, Iss: 2, pp 5-18. https://www.aifirm.it/wp-content/uploads/2022/08/RMM-2022-02-Excerpt-2.pdf Goldstein, A., Kapelner, A., Bleich, J., Pitkin, E. Peeking Inside the Black Box: Visualizing Statistical Learning with Plots of Individual Conditional Expectation . 2014. https://arxiv.org/abs/1309.6392 Guidotti, R., Monreale, A., Ruggieri, S., Turini, F., Giannotti, F., Pedreschi, D. A survey of methods for explaining black box models. // ACM Comput. Surv., vol. 51, no. 5, pp. 1-42, Jan. 2019. Ileberi, E., Sun, Y., Wang, Z. A machine learning based credit card frauddetection using the GA algorithm for feature. Journal of Big Data (2022) 9:24. https://doi.org/10.1186/s40537-022-00573-8 Liu, F. T., Ting, K. M., Zhou, Z.-H. IsolatioForest. 2009. https://www.researchgate.net/publication/224384174_Isolation_Forest Lundberg, S. M. and Lee, S.-I. A Unified Approach to Interpreting Model Predictions 2017. https://arxiv.org/abs/1705.07874 Mashrur, A., Luo, W., Zaidi, N. A., Robles-Kelly, A. Machine Learning for Financial Risk Management: A Survey. 2020. // IEEE Access, 8, 203203–203223 doi:10.1109/access.2020.3036322 10.1109/access.2020.3036322 Molnar, Ch. Interpretable Machine Learning. 2024. https://christophm.github.io/interpretable-ml-book Ribeiro, M. T., Singh, S., Guestrin C. Why Should I Trust You?: Explaining the Predictions of Any Classifier //2016. https://arxiv.org/abs/1602.04938. https://doi.org/10.48550/arXiv.1602.04938 Thennakoon, A., et al. Real-time credit card fraud detection using machine learning. // 2019 9th international conference on cloud computing, data science & engineering (Confluence). IEEE; 2019. Verikas, A., Kalsyte, Z., Bacauskiene M., Gelzinis A. Hybrid and ensemble-based soft computing techniques in bankruptcy prediction: A survey. // Soft Comput., vol. 14, no. 9, pp. 995-1010, Jul. 2010. https://www.researchgate.net/publication/220176584_Hybrid_and_ensemble-based_soft_computing_techniques_in_bankruptcy_prediction_A_survey Wang, G., Hao, J, Ma, J., Huang, L., Xu, K. Two credit scoring models based on dual strategy ensemble trees. // Knowl.-Based Syst., vol. 26, pp. 61-68, Feb. 2012 Wang, G., Hao, J, Ma, J., Jiang, H. A comparative assessment of ensemble learning for credit scoring. // Expert Syst. Appl., vol. 38, no. 1, pp. 223-230, Jan. 2011. doi:10.1016/j.eswa.2010.06.048. Wang, H., Liang, Q., Hancock, J.T. et al. Feature selection strategies: a comparative analysis of SHAP-value and importance-based methods. // J Big Data 11, 44 (2024). https://doi.org/10.1186/s40537-024-00905-w |
URI: | https://mpra.ub.uni-muenchen.de/id/eprint/122941 |