Ashok, Sangeetha (2025) Efficient ETL workflows for big data: Handling massive datasets at scale. International Journal of Science and Research Archive, 14 (2). pp. 1567-1574. ISSN 2582-8185
![IJSRA-2025-0531.pdf [thumbnail of IJSRA-2025-0531.pdf]](https://eprint.scholarsrepository.com/style/images/fileicons/text.png)
IJSRA-2025-0531.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.
Abstract
Extract, Transform, Load (ETL) processes are the backbone of data integration, enabling organizations to manage and analyze vast amounts of information. However, traditional ETL pipelines often struggle with scalability, performance, and efficiency when dealing with massive datasets in the era of big data. This article explores best practices, architectural considerations, and modern optimizations for designing efficient ETL workflows that can handle big data at scale. We discuss distributed processing, cloud-based ETL, automation, and real-time data ingestion to improve performance and reliability.
Item Type: | Article |
---|---|
Official URL: | https://doi.org/10.30574/ijsra.2025.14.2.0531 |
Uncontrolled Keywords: | Etl Workflows; Real-Time Big Data Processing; Cloud-Based Etl Solutions; Distributed Computing; Serverless Etl; Streaming Data Ingestion |
Depositing User: | Editor IJSRA |
Date Deposited: | 15 Jul 2025 16:46 |
Related URLs: | |
URI: | https://eprint.scholarsrepository.com/id/eprint/893 |