Cost-Sensitive Fraud Detection with Reliability Calibration: A Practical Pipeline with XGBoost and Focal-Proxy Reweighting
DOI:
https://doi.org/10.24246/ijiteb.812025.13-24Keywords:
Class Imbalance, Cost-Sensitive Learning, Fraud Detection, Probability Calibration, XgboostAbstract
Fraud detection on payment transactions is an extremely imbalanced, high-stakes classification task in which deployment decisions depend not only on ranking quality but also on reliable probability estimates. We study credit card fraud detection on a standard real-transaction benchmark (284,807 transactions; 492 frauds) and target two deployment requirements: cost-sensitive thresholding under asymmetric error costs and reliability calibration so model outputs can be interpreted as stable risk scores. We benchmark logistic regression and XGBoost and propose a focal-proxy reweighting scheme for boosted trees via iterative weight updates inspired by focal loss. Probabilities are calibrated on validation using Platt scaling, temperature scaling, and isotonic-style monotone calibration; the best calibrator is selected by minimum validation Brier score. For decision-making, we choose the operating threshold that minimizes expected cost, Cost(t) = 10 · FN(t) + 1 · FP(t), on validation, then evaluate on a held-out test set. On the benchmark split (train 199,364; validation 42,721; test 42,722), the calibrated XGBoost baseline achieves AUROC 0.973, AUPRC 0.812, fraud-class F1 0.767, and expected cost 154 with very low calibration error (ECE = 1.1 × 10⁻⁴). Overall, calibration reduces ECE and improves or maintains the Brier score, while cost-aware thresholding makes the FN/FP trade-off explicit via decision curves.
Downloads
References
J. Davis and M. Goadrich, “The Relationship Between Precision-Recall and ROC Curves,” in Proceedings of the 23rd International Conference on Machine Learning (ICML), 2006.
M. Carvalho, A. J. Pinho, and S. Brás, “Resampling Approaches to Handle Class Imbalance: A Review from a Data Perspective,” J. Big Data, vol. 12, no. 1, p. 71, 2025.
G. Haixiang, L. Yijing, S. Jin, G. Mingyun, H. Yuanyue, and G. Bing, “Learning from Class-Imbalanced Data: Review of Methods and Applications,” Expert Syst. Appl., vol. 73, pp. 220–239, 2017.
J. M. Johnson and T. M. Khoshgoftaar, “Survey on Deep Learning with Class Imbalance,” J. Big Data, vol. 6, no. 1, p. 27, 2019.
A. D. Pozzolo, O. Caelen, R. A. Johnson, and G. Bontempi, “Calibrating Probability with Undersampling for Unbalanced Classification,” in 2015 IEEE Symposium Series on Computational Intelligence, 2015.
F. Carcillo, A. D. Pozzolo, Y.-A. Le Borgne, O. Caelen, Y. Mazzer, and G. Bontempi, “Stream Based Active Learning for Real-Life Credit Card Fraud Detection: A Comparative Study,” Data Min. Knowl. Discov., vol. 35, pp. 1–29, 2021.
T. Chen and C. Guestrin, “XGBoost: A Scalable Tree Boosting System,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2016, pp. 785–794.
G. Ke et al., “LightGBM: A Highly Efficient Gradient Boosting Decision Tree,” in Advances in Neural Information Processing Systems (NeurIPS), 2017.
L. Prokhorenkova, G. Gusev, A. Vorobev, A. V. Dorogush, and A. Gulin, “CatBoost: Unbiased Boosting with Categorical Features,” in Advances in Neural Information Processing Systems (NeurIPS), 2018.
S. O. Arik and T. Pfister, “TabNet: Attentive Interpretable Tabular Learning,” Proc. AAAI Conf. Artif. Intell., 2021.
X. Huang, A. Khetan, M. Cvitkovic, and Z. Karnin, “TabTransformer: Tabular Data Modeling Using Contextual Embeddings,” arXiv Prepr., 2020.
G. Somepalli, M. Goldblum, A. Schwarzschild, C. Bruss, and T. Goldstein, “SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training,” arXiv Prepr., 2021.
Y. Gorishniy, I. Rubachev, V. Khrulkov, and A. Babenko, “Revisiting Deep Learning Models for Tabular Data,” in Advances in Neural Information Processing Systems (NeurIPS), 2021.
C. Guo, G. Pleiss, Y. Sun, and K. Q. Weinberger, “On Calibration of Modern Neural Networks,” in Proceedings of the 34th International Conference on Machine Learning (ICML), 2017.
D. Widmann, F. Lindsten, J. Roll, and T. B. Schön, “Evaluating Classifier Calibration,” in Advances in Neural Information Processing Systems (NeurIPS), 2019.
A. Kumar, P. Liang, and T. Ma, “Verified Uncertainty Calibration,” in Advances in Neural Information Processing Systems (NeurIPS), 2019.
Y. Ovadia et al., “Can You Trust Your Model’s Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift,” in Advances in Neural Information Processing Systems (NeurIPS), 2019.
C. Elkan, “The Foundations of Cost-Sensitive Learning,” in Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI), 2001, pp. 973–978.
T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, “Focal Loss for Dense Object Detection,” in Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 2980–2988.
G. W. Brier, “Verification of Forecasts Expressed in Terms of Probability,” Mon. Weather Rev., vol. 78, no. 1, pp. 1–3, 1950.
J. Platt, “Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods,” Adv. Large Margin Classif., pp. 61–74, 1999.
M. Kull, T. M. S. Filho, and P. Flach, “Beta Calibration: A Well-Founded and Easily Implemented Improvement on Logistic Calibration for Binary Classifiers,” in Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), PMLR, 2017, pp. 623–631.
M. Kull, T. S. Filho, and P. Flach, “Beyond Temperature Scaling: Obtaining Well-Calibrated Multi-Class Probabilities with Dirichlet Calibration,” in Advances in Neural Information Processing Systems (NeurIPS), 2019.
W. Cherif, Y. Madani, and L. Khoukhi, “Credit Card Fraud Detection: Methods, Datasets, and Research Challenges,” IEEE Access, 2023.
V. N. Dornadula and S. Geetha, “Credit Card Fraud Detection Using Machine Learning Algorithms,” Procedia Comput. Sci., vol. 165, pp. 631–641, 2019.
D. Danang, T. Wahyono, I. Sembiring, T. Wellem, and N. H. Dzulkefly, “An Adaptive Framework Integrating ML Blockchain and TEE for Cloud Security,” in 2025 4th International Conference on Creative Communication and Innovative Technology (ICCIT), IEEE, 2025, pp. 1–7.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 International Journal of Information Technology and Business

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.






