Tavva, Gayatri (2025) Scalable data quality alerting powered by AI Models: Architecture and tooling for self-healing data pipelines. World Journal of Advanced Engineering Technology and Sciences, 16 (1). pp. 594-602. ISSN 2582-8266
![WJAETS-2025-1235.pdf [thumbnail of WJAETS-2025-1235.pdf]](https://eprint.scholarsrepository.com/style/images/fileicons/text.png)
WJAETS-2025-1235.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.
Abstract
The growing complexity and volume of contemporary data pipelines have boosted the significance of smart data quality monitoring infrastructures. The traditional rule-based techniques tend to fail or provide unreliable analytics in dynamic and high-throughput environments, causing silent failures. This review explores the possibility of artificial intelligence (AI) and machine learning (ML) leveraging the use of adaptive data quality alerting systems that can be implemented in scale. It gives importance to architecture concepts, model approaches, and tooling environments that help in anomaly detection and automated remediation through self-healing pipelines in real-time. The argument is furthered along the artifacts of anomaly detection models, streaming data platforms, orchestration frameworks, and feedback-based model retraining. Some important contributions are a proposal of a modular architecture that can perform real-time alerting and classification of tooling options depending on each stage of the pipeline, and an overview of governance considerations. The research areas are defined as gaps that need to be addressed in the field of model interpretability, real-time integration, and operational benchmarking of autonomous, intelligent data quality management systems in a distributed environment. The review ends with the suggested route of study development.
Item Type: | Article |
---|---|
Official URL: | https://doi.org/10.30574/wjaets.2025.16.1.1235 |
Uncontrolled Keywords: | Artificial Intelligence; Data Quality; Anomaly Detection; Self-Healing Pipelines; Data Engineering |
Depositing User: | Editor Engineering Section |
Date Deposited: | 22 Aug 2025 08:56 |
Related URLs: | |
URI: | https://eprint.scholarsrepository.com/id/eprint/5259 |