A Novel Framework for Text-Based Fraud Detection in Banking Using Spark NLP and Tableau

Ram Ghadiyaram; Laxmi Vanam; Durga Krishnamoorthy; Jaya Eripilla

doi:10.63412/jed09w96

Authors

Ram Ghadiyaram, https://orcid.org/0009-0006-3730-0914
Laxmi Vanam, https://orcid.org/0009-0006-5535-1387
Durga Krishnamoorthy, https://orcid.org/0009-0004-6235-6077
Jaya Eripilla, https://orcid.org/0009-0005-4422-2523

DOI:

https://doi.org/10.63412/jed09w96

Keywords:

Fraud Detection, PySpark, NLP, Tableau, Great Expectations, AWS, Text Analytics, Banking, Contextual Fraud Scoring

Abstract

Fraud detection in banking is evolving beyond numerical analysis to include unstructured text data, such as transaction notes and customer communications. This paper proposes a novel framework that integrates Spark NLP for natural language processing, Great Expectations for data quality validation, AWS for scalable infrastructure, and Tableau for interactive visualization to detect fraudulent patterns in text. At its core is a contextual fraud scoring approach that combines entity recognition and sentiment analysis to identify suspicious activities with high precision. Designed for compliance with GDPR and PCI-DSS, the framework is scalable and modular, and can be adapted to a range of banking applications. Using example datasets, we illustrate its potential to transform fraud detection. Mermaid diagrams and accessible language keep the framework approachable for researchers, practitioners, and non-experts alike.
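To make the idea of contextual fraud scoring concrete, the following is a minimal sketch of how an entity signal and a sentiment signal might be blended into a single score. It is not the authors' implementation: the keyword lists, the weights, the saturation at two hits, and the names `RISKY_ENTITIES`, `NEGATIVE_CUES`, and `contextual_fraud_score` are all assumptions made for illustration, and plain keyword matching stands in for the entity recognition and sentiment models that an NLP pipeline such as Spark NLP would supply.

```python
from dataclasses import dataclass

# Hypothetical lexicons (assumptions for illustration only). A real pipeline
# would obtain entities and sentiment from trained NLP models, not keywords.
RISKY_ENTITIES = {"wire transfer", "gift card", "crypto wallet", "offshore account"}
NEGATIVE_CUES = {"urgent", "immediately", "threat", "locked", "penalty"}


@dataclass
class ScoredNote:
    text: str
    entity_hits: int   # number of risky entity terms found
    sentiment: float   # -1.0 (very negative) to 0.0 (neutral)
    score: float       # 0.0 (benign) to 1.0 (highly suspicious)


def contextual_fraud_score(text, w_entity=0.6, w_sentiment=0.4):
    """Blend an entity signal and a sentiment signal into one score.

    The weights are illustrative assumptions, not values from the paper.
    """
    lowered = text.lower()
    entity_hits = sum(1 for term in RISKY_ENTITIES if term in lowered)
    cue_hits = sum(1 for cue in NEGATIVE_CUES if cue in lowered)

    # Saturate each signal at two hits so long notes do not dominate.
    entity_signal = min(entity_hits / 2.0, 1.0)
    sentiment = -min(cue_hits / 2.0, 1.0)

    # More negative sentiment contributes a larger share of the score.
    score = w_entity * entity_signal + w_sentiment * (-sentiment)
    return ScoredNote(text, entity_hits, sentiment, round(score, 3))


if __name__ == "__main__":
    notes = [
        "URGENT: send wire transfer to offshore account immediately",
        "Please buy a gift card",
        "Monthly rent payment, thanks",
    ]
    for note in notes:
        result = contextual_fraud_score(note)
        print(f"{result.score:.2f}  {result.text}")
```

The point of the sketch is only that the score depends on context from two sources at once: a note that mentions a risky instrument in pressured, negative language ranks above one that mentions the instrument alone, and both rank above routine text.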
