An Intelligent Hybrid Data Mining Framework for Healthcare Fraud Detection Using Machine Learning and Association Rule Mining

المؤلفون

  • Osama Mohammed Qasim المؤلف

DOI:

https://doi.org/10.59992/IJCI.2026.v5n6p4

الكلمات المفتاحية:

Data Mining، Healthcare Fraud Detection، Machine Learning، Association Rule Mining، Random Forest، Isolation Forest، Artificial Intelligence

الملخص

Healthcare insurance fraud is one of the biggest healthcare challenges today and all over the world. The conventional fraud detection approaches are based on manual auditing and rule-based systems and tend to be ineffective in detecting fraudulent behaviors as they change over time. In this paper, Association Rule Mining (ARM), Random Forest (RF) and Isolation Forest (IF) algorithms are combined to provide a novel hybrid data mining for intelligent healthcare fraud detection. The novelty of the proposed approach is the fusion of supervised and unsupervised learning techniques as well as of the adaptive risk scoring, which will enhance the detection accuracy and reduce false positives. The framework uses an insurance claim data set from healthcare to identify any unusual patterns and transactions which could be suspicious. The experimental results show that the proposed hybrid model outperforms the conventional machine learning models in terms of accuracy, precision, recall, and F1-score. Under highly imbalanced datasets, the proposed framework achieved an accuracy of 97.2% and significantly enhanced the fraud detection performance. Research provides a scalable and explainable data mining solution appropriate for real-world health care systems.

السيرة الشخصية للمؤلف

  • Osama Mohammed Qasim

    Assistant Lecturer, Computer Science, College of Engineering, Al-Karkh University of Science, Iraq

المراجع

[1] J. Li, K. Huang, J. Jin, and J. Shi, (2008), “A Survey on Statistical Methods for Health Care Fraud Detection,” Health Care Management Science, vol. 11, no. 3, pp. 275–287.

[2] Z. Hamid, F. Khalique, S. Mahmood, A. Daud, A. Bukhari and B. Alshemaimri, (2024), “Healthcare insurance fraud detection using data mining,” BMC Medical Informatics and Decision Making, vol. 24, no. 112, pp. 1–15.

[3] A. du Preez, S. Bhattacharya, P. Beling and E. Bowen, (2025), “Fraud detection in healthcare claims using machine learning: A systematic review,” Artificial Intelligence in Medicine, vol. 160, pp. 102–118.

[4] E. D. Curtis, P. Billion-Polak, T. M. Khoshgoftaar and B. Furht, (2025), “A review of distinct machine learning classifiers for healthcare fraud detection,” Journal of Big Data, vol. 12, no. 238, pp. 1–27.

[5] E. Nabrawi and A. Alanazi, (2023), “Fraud Detection in Healthcare Insurance Claims Using Machine Learning,” Risks, vol. 11, no. 9, pp. 160–176.

[6] D. Dash, M. Kumar, S. Patra, A. Kumar and A. Ganguly, (2025), “Healthcare Fraud Detection Using an Integrated ML Approach with SMOTE,” Procedia Computer Science, vol. 258, pp. 800–810.

[7] I. Matloob, S. A. Khan, R. Rukaiya, M. A. K. Khattak and A. Munir, (2022), “A Sequence Mining-Based Novel Architecture for Detecting Fraudulent Transactions in Healthcare Systems,” IEEE Access, vol. 10, pp. 48447–48463.

[8] S. Shekhar, J. Leder-Luis, L. Akoglu, (2022), “Unsupervised Machine Learning for Explainable Health Care Fraud Detection,” arXiv preprint arXiv:2211.02927, pp. 1–14.

[9] C. Phua, V. Lee, K. Smith, and R. Gayler, (2010), “A Comprehensive Survey of Data Mining-based Fraud Detection Research,” arXiv preprint arXiv:1009.6119, pp. 1–30.

[10] K. P. Fourkiotis and A. Tsadiras, (2025), “Future Internet Applications in Healthcare: Big Data-Driven Fraud Detection with Machine Learning,” Future Internet, vol. 17, no. 10, pp. 460–472.

[11] N. Innan, M. A. Khan, M. Bennai, (2023), “Financial Fraud Detection: A Comparative Study of Quantum Machine Learning Models,” arXiv preprint arXiv:2308.05237, pp. 1–18.

[12] J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, (2011), 3rd ed. Burlington, MA, USA: Morgan Kaufmann.

[13] R. Agrawal, T. Imieliński, and A. Swami, (1993), “Mining Association Rules Between Sets of Items in Large Databases,” in Proc. ACM SIGMOD Int. Conf. Management of Data, Washington, DC, USA, pp. 207–216.

[14] L. Breiman, “Random Forests, (2001), ”Machine Learning, vol. 45, no. 1, pp. 5–32.

[15] F. T. Liu, K. M. Ting, and Z.-H. Zhou, “Isolation Forest,” in Proc. IEEE Int. Conf. Data Mining (ICDM), Pisa, Italy, 2008, pp. 413–422.

[16] T. Fawcett, (2006), “An Introduction to ROC Analysis,” Pattern Recognition Letters, vol. 27, no. 8, pp. 861–874.

التنزيلات

منشور

2026-06-15

إصدار

القسم

المقالات

كيفية الاقتباس

Osama Mohammed Qasim. (2026). An Intelligent Hybrid Data Mining Framework for Healthcare Fraud Detection Using Machine Learning and Association Rule Mining. المجلة الدولية للحاسبات والمعلوماتية, 5(6). https://doi.org/10.59992/IJCI.2026.v5n6p4