Application of LLMS to Fraud Detection

Malingu, Curthbert Jeremiah and Kabwama, Collin Arnold and Businge, Pius and Agaba, Ivan Asiimwe and Ankunda, Ian Asiimwe and Mugalu, Brian and Ariho, Joram Gumption and Musinguzi, Denis (2025) Application of LLMS to Fraud Detection. World Journal of Advanced Research and Reviews, 26 (2). pp. 178-183. ISSN 2581-9615

[thumbnail of WJARR-2025-1586.pdf] Article PDF
WJARR-2025-1586.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download ( 557kB)

Abstract

Fraud detection in financial systems remains a critical challenge due to highly imbalanced data, evolving fraudulent tactics, and strict privacy constraints that limit the availability of data. Traditionally, tree based models such as random forests, XGBoost, and LightGBM have been the backbone of fraud detection, offering robust performance through extensive feature engineering. However, recent advances in large language models (LLMS), pretrained on massive corpora and endowed with powerful in-context learning capabilities suggest that these models can be leveraged to enhance fraud detection even in low-data regimes. In this study, we explore the applications of LLMs to fraud detection on tabular data by converting structured inputs into natural language through various serialization techniques, including list templates, text templates, and a markdown-based t-table format. This conversion enables LLMs to exploit their pre-trained knowledge for zero-shot and few-shot learning scenarios. We evaluate the impact of different serialization methods on model performance and examine the sample efficiency of LLMs relative to conventional tree-based models. Our experimental results demonstrate that LLMs achieve competitive performance on fraud detection tasks, particularly when data is scarce, and offer a promising alternative to traditional approaches. This work provides valuable insights and guidelines for deploying LLMs in real-world financial applications, paving the way for more efficient, data driven fraud detection systems.

Item Type: Article
Official URL: https://doi.org/10.30574/wjarr.2025.26.2.1586
Uncontrolled Keywords: Large Language Models; Fraud detection; Natural Language Processing; Financial applications
Depositing User: Editor WJARR
Date Deposited: 25 Jul 2025 16:09
Related URLs:
URI: https://eprint.scholarsrepository.com/id/eprint/2477