Detection of fraudulent credit card transactions: A comparative analysis of data sampling and classification techniques
Document Type
Conference Proceeding
Publication Title
Journal of Physics: Conference Series
Abstract
Losses from fraudulent credit card transactions increase every year. Recent work has focused on using machine learning algorithms to identify fraudulent transactions. The ratio of fraud cases to non-fraud transactions is very low, which produces skewed, or imbalanced, data and poses a challenge for training machine learning models. Public datasets for this research problem are scarce; the dataset used in this work was obtained from Kaggle. In this paper, we explore different sampling techniques, such as under-sampling, Synthetic Minority Oversampling Technique (SMOTE) and SMOTE-Tomek, to handle the imbalanced data. Classification models, such as k-Nearest Neighbour (KNN), logistic regression, random forest and Support Vector Machine (SVM), are trained on the sampled data to detect fraudulent credit card transactions. The performance of the various machine learning approaches is evaluated in terms of precision, recall and F1-score. The classification results obtained are promising, and the approaches can be used for credit card fraud detection.
DOI
10.1088/1742-6596/2161/1/012072
Publication Date
1-11-2022
Recommended Citation
Mahesh, Konduri Praveen; Afrouz, Shaik Ashar; and Areeckal, Anu Shaju, "Detection of fraudulent credit card transactions: A comparative analysis of data sampling and classification techniques" (2022). Open Access archive. 4678.
https://impressions.manipal.edu/open-access-archive/4678