Anomaly detection in network traffic using azure machine learning and log analytics

Manugula, Sai Yathin and Kalidindi, Dheeraj Varma and Gogikari, Sindhu Sri and Billakanti, Srinivas Rao (2025) Anomaly detection in network traffic using azure machine learning and log analytics. World Journal of Advanced Research and Reviews, 26 (3). pp. 864-883. ISSN 2581-9615

[thumbnail of WJARR-2025-2197.pdf] Article PDF
WJARR-2025-2197.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download ( 882kB)

Abstract

This study presents a scalable and efficient solution for advanced anomaly detection in network traffic using Azure Databricks and machine learning techniques. Modern networks generate massive volumes of traffic data, making manual detection of anomalies or cyber threats challenging. Traditional tools, such as RDBMS and Hadoop, are slow and not designed for real-time security monitoring. To address these challenges, the proposed system utilizes Azure Databricks, a unified cloud platform for big data processing and machine learning. Network traffic logs were cleaned and transformed using PySpark to extract features, such as IP addresses, session duration, data transfer, and packet counts. K-means clustering was then applied to group similar traffic patterns and identify anomalies without the need for labeled data. Model performance was evaluated using the Silhouette Score to ensure meaningful and well-separated clusters. The objective of this study is to provide a comprehensive overview of recent advancements in abnormality detection, focusing on emerging technologies and potential future opportunities. All stages, from data ingestion to anomaly detection, were executed within a single databricks notebook, thus requiring a minimal setup. The system performs efficiently even on low-cost Azure plans, making it accessible to small teams, students, and researchers. This solution enables real-time threat detection, automatic scaling, and quick incident response, offering a faster, smarter, and more cost-effective alternative to traditional network security methods.

Item Type: Article
Official URL: https://doi.org/10.30574/wjarr.2025.26.3.2197
Uncontrolled Keywords: Network Traffic; Anomaly Detection; Azure Databricks; K-Means Clustering; Silhouette Score
Depositing User: Editor WJARR
Date Deposited: 20 Aug 2025 12:06
Related URLs:
URI: https://eprint.scholarsrepository.com/id/eprint/4009