Tools and techniques for real-time data processing: A review

Rani, Sangeeta (2025) Tools and techniques for real-time data processing: A review. International Journal of Science and Research Archive, 14 (1). pp. 1872-1881. ISSN 2582 8185

[thumbnail of IJSRA-2025-0252.pdf] Text
IJSRA-2025-0252.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (514kB)

Abstract

Real-time data processing is an essential component in the modern data landscape, where vast amounts of data are generated continuously from various sources such as Internet of Things devices, social media, financial transactions, and manufacturing systems. Unlike traditional batch processing methods that analyse data in intervals, real-time data processing enables the continuous intake, manipulation, and analysis of data within milliseconds of generation. This capability is critical for applications requiring instant insights and rapid decision-making, including fraud detection, predictive maintenance, real-time analytics, and autonomous operations. This paper reviews the tools and techniques that have revolutionized real-time data processing, with a focus on cutting-edge platforms such as Apache Kafka and Apache Flink, as well as cloud-native solutions. These technologies offer scalable and fault-tolerant systems capable of managing high-volume data streams while ensuring low latency and data consistency. Apache Kafka provides a highly scalable distributed messaging system, while Apache Flink combines stateful and stateless processing to support complex event-driven applications. This review highlights the. This paper reviews key techniques and tools used in real-time data processing, including stream processing, complex event processing, in-memory computing, micro-batching, and real-time dashboards. In addition, it highlights advancements in real-time data processing frameworks, their capabilities, and their impact on modern business applications. Additionally, the paper explores various tools used in real-time data processing, including Apache Kafka for data ingestion, Apache Flink and Spark Streaming for stream processing, Redis and Apache Druid for real-time storage, and Grafana and Kibana for data visualization. By examining these techniques and tools, this paper highlights the importance of real-time data processing in enabling businesses to make data-driven decisions with minimal latency, ultimately gaining a competitive edge in the rapidly evolving digital world.

Item Type: Article
Uncontrolled Keywords: Real-time data processing; Stream processing; Complex event processing; In-memory computing; Micro-batching; Windowing
Subjects: Q Science > Q Science (General)
Depositing User: Editor IJSRA
Date Deposited: 09 Jul 2025 16:51
Last Modified: 09 Jul 2025 16:51
URI: https://eprint.scholarsrepository.com/id/eprint/241

Actions (login required)

View Item
View Item