Reddy, Avinash Reddy Thimma (2025) Demystifying data lakes and data warehouses: A technical perspective. World Journal of Advanced Engineering Technology and Sciences, 15 (3). pp. 2056-2069. ISSN 2582-8266
![WJAETS-2025-1121.pdf [thumbnail of WJAETS-2025-1121.pdf]](https://eprint.scholarsrepository.com/style/images/fileicons/text.png)
WJAETS-2025-1121.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.
Abstract
This article examines the fundamental concepts, architectural distinctions, and strategic implications of data warehouses and data lakes in contemporary enterprise data management. As organizations face exponential growth in data volume and diversity, traditional siloed approaches prove increasingly insufficient to address the full spectrum of analytical requirements. The article provides a comprehensive technical analysis of data warehouse structures—characterized by subject-orientation, integration, time-variance, and non-volatility—alongside the defining features of data lakes, including schema-on-read flexibility, support for heterogeneous data types, and horizontal scalability. Through comparative assessment, the article explores how these paradigms differ in structure, query performance, governance requirements, and optimal use cases. Further examination reveals emerging convergence trends, particularly the lake house architecture that combines warehouse performance with lake flexibility, multi-tier processing workflows, and event-driven systems enabling real-time analytics. The article extends beyond technical implementation to address strategic considerations in enterprise data architecture design, governance implementation, and organizational structure, offering guidance on selecting appropriate technologies based on data characteristics, analytical maturity, technical capabilities, and resource constraints.
Item Type: | Article |
---|---|
Official URL: | https://doi.org/10.30574/wjaets.2025.15.3.1121 |
Uncontrolled Keywords: | Data Architecture; Enterprise Data Management; Data Governance; Lake House Paradigm; Analytical Workloads |
Depositing User: | Editor Engineering Section |
Date Deposited: | 22 Aug 2025 07:09 |
Related URLs: | |
URI: | https://eprint.scholarsrepository.com/id/eprint/4887 |