Comparing performances and effectiveness of machine learning classifiers in detecting financial accounting fraud for Turkish SMEs
- DOI
- 10.2991/ijcis.d.210203.007How to use a DOI?
- Keywords
- Financial accounting fraud; SMEs; Machine learning classifiers; Sampling methods; SMOTE; Feature selection
- Abstract
Turkish small- and medium-sized enterprises (SMEs) are exposed to fraud risks and creditor banks are facing big challenges to deal with financial accounting fraud. This study explores effectiveness of machine learning classifiers in detecting financial accounting fraud assessing financial statements of 341 Turkish SMEs from 2013 to 2017. The data are obtained from one of the leading creditor banks of Turkey. Highly imbalanced classes of 1384 nonfraudulent cases and 321 fraudulent cases (by 122 firms) are detected thus sampling techniques are used to mitigate class imbalance problem. Research methodology consists of two stages. First stage is data preprocessing wherein financial ratio calculation, feature selection methods for defining financial ratios with the greatest impact on fraudulent financial statements and two sampling methods of Synthetic Minority Oversampling Technique (SMOTE) as oversampling and undersampling are performed, respectively. Second stage is performance evaluation and comparison of classifiers wherein seven different classifiers (support vector machine, Naive Bayes, artificial neural network, K-nearest neighbor, random forest, logistic regression, and bagging) are executed and compared by using performance metrics. Classifiers are also compared without using any feature selection and/or sampling techniques. Results reveal that random forest-without feature selection-oversampling model outperforms all other models.
- Copyright
- © 2021 The Authors. Published by Atlantis Press B.V.
- Open Access
- This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).
Download article (PDF)
View full text (HTML)
Cite this article
TY - JOUR AU - Serhan Hamal AU - Ozlem Senvar PY - 2021 DA - 2021/02/10 TI - Comparing performances and effectiveness of machine learning classifiers in detecting financial accounting fraud for Turkish SMEs JO - International Journal of Computational Intelligence Systems SP - 769 EP - 782 VL - 14 IS - 1 SN - 1875-6883 UR - https://doi.org/10.2991/ijcis.d.210203.007 DO - 10.2991/ijcis.d.210203.007 ID - Hamal2021 ER -