Copilot in the Cloud: Evaluating the Accuracy and Speed of LLMs in Data Engineering Tasks

Kesireddy, Sunny (2025) Copilot in the Cloud: Evaluating the Accuracy and Speed of LLMs in Data Engineering Tasks. World Journal of Advanced Engineering Technology and Sciences, 15 (3). pp. 1434-1441. ISSN 2582-8266

Abstract

The integration of large language models (LLMs) into enterprise workflows has opened new frontiers in cloud data engineering. This article presents a comprehensive evaluation of AI copilots in the development of scalable data pipelines across regulated environments. The article benchmarks LLMs on key engineering tasks including pipeline scaffolding, SQL optimization, IAM policy generation, and compliance rule encoding, providing insights into their capabilities and limitations in specialized technical contexts. It measures improvements in developer velocity, reduction in syntax errors, and overall impact on quality assurance cycles. Beyond automation, the article assesses how LLMs learn and generalize patterns from metadata-driven frameworks—making intelligent suggestions aligned with domain rules and architectural best practices. Special attention is given to the risks of hallucination, governance gaps, and security considerations that organizations must actively manage. It contributes to a deeper understanding of human-AI pair programming in high-stakes data systems, offering a framework for safely scaling AI-augmented development across data teams while preserving auditability, trust, and compliance.

Item Type:	Article
Official URL:	https://doi.org/10.30574/wjaets.2025.15.3.1049
Uncontrolled Keywords:	AI Copilots; Data Engineering; Code Hallucination; Governance Frameworks; Developer Productivity
Date Deposited:	16 Aug 2025 13:11
Related URLs:	https://journalwjaets.com/node/1100 https://journalwjaets.com/sites/default/... https://doi.org/10.30574/wjaets.2025.15....
URI:	https://eprint.scholarsrepository.com/id/eprint/4724

View Item