Accounting and Auditing Research

Accounting and Auditing Research

Application of XGBoost to Predict Financial Distress of the Listed Companies on Tehran Stock Exchange (TSE) and Iran Fara Burse (IFB)

Document Type : Original Article

Authors
1 PhD Student in Accounting, Department of Accounting, Mazandaran University, Babolsar, Iran
2 Distinguished Professor, Department of Accounting, Mazandaran University, Babolsar, Iran
10.22034/iaar.2024.206034
Abstract
The purpose of this article is to predict the potential financial distress of the listed companies on Tehran Stock Exchange (TSE) and Iran Fara Bourse (IFB). To do so, a wide range of features including accrual accounting variables, cash-based accounting variables, market-based variables, corporate governance mechanisms, and macroeconomic indicators have been identified to prospectively predict the financial distress in the companies.
The final sample includes 421 firms leading to 3,670 firm-year observations. The prepared data, was then split into a train and test data set using a 70/30 ratio.
In this research, various data pre-processing machine learning techniques i.e., Z-score standardization, one-hot encoding, stratified K-fold validation combined with feature engineering are applied to improve classifier performance. Stratified K-fold cross validation method, (with k = 5) was used for estimation of model prediction performance during training phase. During the training phase, hyper-parameter tuning of a model was carried out using a grid-search. Furthermore, SMOTE technique in conjunction with the proposed imbalance-oriented metric i.e., F1 score were used to overcome the extreme class imbalance issue.
Based on the experimental results, the tuned XGBoost model achieved a f1-score, MCC, recall and precision of respectively, 90%, 90%, 100% and 82% on the training set. Finally, the proposed model was tested on the hold-out test set which resulted in a f1-score, MCC, recall and precision of 52%, 52%, 73% and 41%, respectively. This information provides a powerful tool for predicting the financial distress of companies.
Keywords

  1. احمدپور، احمد و حبیبه میرزایی اسرمی (1392). مقایسه مدل تحلیل تمایزی چندگانه با مدل شبکه‌های عصبی در پیش‌بینی ورشکستگی شرکت‌های پذیرفته شده در بازار بورس اوراق بهادار تهران. فصلنامه تحقیقات حسابداری و حسابرسی، سال پنجم، شماره 18، 22-4.
  2. کاتبی، حسینقلی (1380). حقوق تجارت. چاپ هفتم، تهران: انتشارات گنج دانش
  3. . Aktan, S. (2011). Early warning system for bankruptcy: Bankruptcy prediction (Doctoral dissertation, Karlsruhe Institute of Technology, KIT). Retrieved from https://d-nb.info/1019790032/34.
  4. Altman, E. I. (1984). “The success of business failure prediction models: An international survey”, Journal of Banking and Finance, 8(2), 171–198.
  5. Altman, E. I., & Narayanan, P. (1997). “An international survey of business failure classification models”, Financial Markets, Institutions & Instruments, 6(2), 1–57.
  6. Barboza, F., Kimura, H., & Altman, E. (2017). “Machine learning models and bankruptcy prediction”, Expert Systems with Applications, 83, 405–417.
  7. Beaver, W. H., Correia, M., & McNichols M. F. (2010). “Financial statement analysis and the prediction of financial distress”, Foundations and Trends in Accounting, 5(2), 99–173.
  8. Beaver, W. H., McNichols, M. F., & Rhie, J. W. (2005). “Have financial statements become less informative? Evidence from the ability of financial ratios to predict bankruptcy”, Review of Accounting Studies, 10(1), 93–122.
  9. Belli, G. (2009). Nonexperimental quantitative research. In S. D. Lapan & M. T. Quartaroli (Eds.), Research essentials: An introduction to designs and practices. (pp. 59-77). Jossey-Bass Publications.
  10. Bonnes, K. (2017). Predicting mortgage demand using machine learning techniques (Master Thesis, University of Twente). Retrieved from https://essay.utwente.nl/ 73640/7/Bonnes_MA_EEMCS.pdf.
  11. Chancharat, N. (2008). An empirical analysis of financially distressed Australian companies: The application of survival analysis (Doctoral dissertation, University of Wollongong). Retrieved from https://ro.uow. au/theses/401/.
  12. Charalambakis, E. C., & Garrett, (2018). “On corporate financial distress prediction: What can we learn from private firms in a developing economy? Evidence from Greece”, Review of Quantitative Finance and Accounting, 52(2), 467–491.
  13. Cho, S., Kim, J., & Bae, J. K. (2009). “An integrative model with subject weight based on neural network learning for bankruptcy prediction”, Expert Systems with Applications, 36(1), 403–410.
  14. Etemadi, H., Anvary Rostamy, A. A., & Farajzadeh Dehkordi, H. (2009). “A genetic programming model for bankruptcy prediction: Empirical evidence from Iran”, Expert Systems with Applications, 36(2), 3199–3207.
  15. Fan, X. (2016). An adaptive and diversity-based ensemble method for binary classification (Master Thesis, Carleton University). Retrieved from https://curve.carleton.ca/system/files/etd/e2e72ad2-2da5-47b8-ab6a-28810d 5eb197/etd_pdf/81f038dc409086dafcc1cced7b46be03/fan-anadaptiveanddiv ersitybasedensemblemethod.pdf.
  16. Jaikengkit, A. (2004). Corporate governance and financial distress: an empirical analysis - the case of Thai financial institutions (Doctoral dissertation, Case Western Reserve University). Retrieved from ProQuest Dissertations and Theses A&I. (AAT 3118137).
  17. Jantadej, P. (2006). Using the combinations of cash flow components to predict financial distress (Doctoral dissertation, University of Nebraska). Retrieved from ProQuest Dissertations and Theses A&I. (AAT 3216429).
  18. Johnson, R. B., & Christensen, L. (2014). Educational research: quantitative, qualitative, and mixed approaches (5th ed.). London: SAGE Publications, Inc.
  19. Kothari, C. R. (2004). Research methodology, methods and techniques. New Delhi: New Age International (P) Ltd. Publishers.
  20. Leano, H. J. (2004). Discriminant analysis, factor analysis and regression analysis to classify financially distressed firms and predict bankruptcy using financial ratios and macroeconomic predictors: Model application to selected M&A (Master Thesis, Lamar University). Retrieved from ProQuest Dissertations and Theses A&I (AAT 1426146).
  21. Lee, K., Booth, D., & Alam, P. (2005). “A comparison of supervised and unsupervised neural networks in predicting bankruptcy of Korean firms”, Expert Systems with Applications, 29(1), 1–16.
  22. Lee, T. S., Yeh, Y. H., & Liu, R. T. (2003). Can corporate governance variables enhance the prediction power of accounting-based financial distress prediction models? (Center for Economic Institutions Working Paper Series, No. 2003-14). Retrieved from https://core.ac.uk/download/pdf/7063883.pdf.
  23. Lensberg, T., Eilifsen, A., & McKee, T. E. (2006). “Bankruptcy theory development and classification via genetic programming”, European Journal of Operational Research, 169(2), 677–697.
  24. Li, M. Y. L., & Miu, P. (2010). “A hybrid bankruptcy prediction model with dynamic loadings on accounting-ratio-based and market-based information: A binary quantile regression approach”, Journal of Empirical Finance, 17(4), 818–833.
  25. Lin, R. H., Wang, Y. T., Wu, C. H., & Chuang, C. L. (2009). “Developing a business failure prediction model via RST, GRA and CBR”, Expert Systems with Applications, 36(2), 1593–1600.
  26. McKee, T. E., & Lensberg, T. (2002). “Genetic programming and rough sets: A hybrid approach to bankruptcy classification”, European Journal of Operational Research, 138(2), 436–451.
  27. Nobre, J., & Neves R. (2019). “Combining Principal Component Analysis, Discrete Wavelet Transform and XGBoost to trade in the financial markets”, Expert Systems with Applications, 125, 181–194.
  28. Ohlson, J. A. (1980). “Financial ratios and the probabilistic prediction of bankruptcy”, Journal of Accounting Research, 18(1), 109 –131.
  29. Oleksy, T. A. (2017). Machine learning methods for mood disorder decision support (Master Thesis, University of Bergen). Retrieved from http://bora.uib.no/bitstream/handle/1956/16259/actigraphdataformoods.pdf?sequence=4&isAllowed=y.
  30. Pajkossy, K. (2013). Studying feature selection methods applied to classification tasks in natural language processing (Master Thesis, Eotvos Lorand University of Sciences). Retrieved from https://web.cs.elte.hu/ blobs/diplomamunkak/msc_alkmat/2013/pajkossy_katalin.pdf
  31. Pendharkar, P. C. (2005). “A threshold-varying artificial neural network approach for classification and its application to bankruptcy prediction problem”, Computers & Operations Research, 32(10), 2561–2582.
  32. Raschka, S. (2015). Python machine learning. Birmingham: Packt Publishing Ltd..
  33. Schellenger, M., & Cross, J. N. (1994). “FASB 95, Cash flow and bankruptcy”, Journal of Economics and Finance, 18(3), 261–274.
  34. Suntraruk, Phassawan (2009). Predicting Financial Distress: Evidence from Thailand. http://www. efmaefm. org/0EFMAMEETINGS/EFMA%20ANNU AL%20MEETINGS/2009-milan/phd/phassawan. pdf
  35. Uguroglu, S. (2013). Robust Learning with Highly Skewed Category Distributions (Doctoral dissertation, Carnegie Mellon University). Retrieved from https://pdfs.semanticscholar.org/c792/d83d78ff10b7a944d2c4e534c2c 55bdf59b8.pdf
  36. Wu, W. W. (2010). “Beyond business failure prediction”, Expert Systems with Applications, 37(3), 2371–2376.
  37. Wu, Y., Gaunt, C., & Gray, S. (2010). “A comparison of alternative bankruptcy prediction models”, Journal of Contemporary Accounting & Economics, 6(1), 34–45.
  38. Zhou, L., Lai, K. K., & Yen, J. (2012). “Empirical models based on features ranking techniques for corporate financial distress prediction”, Computers and Mathematics with Applications, 64(8), 2484–2496.