Leveraging Kubernetes for AI/ML Workloads: Case studies in autonomous driving and large language model infrastructure

Madabushini, Praneel (2025) Leveraging Kubernetes for AI/ML Workloads: Case studies in autonomous driving and large language model infrastructure. World Journal of Advanced Engineering Technology and Sciences, 15 (1). pp. 1044-1052. ISSN 2582-8266

[thumbnail of WJAETS-2025-0320.pdf] Article PDF
WJAETS-2025-0320.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download ( 555kB)

Abstract

This article explores how Kubernetes has become a critical solution for addressing the complex infrastructure challenges inherent in artificial intelligence and machine learning workloads. As AI models grow in size and complexity, organizations face significant hurdles in resource management, scaling, reliability, and operational efficiency. The article examines how Kubernetes provides dynamic resource allocation, intelligent scaling, self-healing capabilities, enhanced monitoring, and workload portability that directly address these challenges. Through industry-specific case studies, the article demonstrates how industry leaders leverage Kubernetes to manage massive computational demands, orchestrate distributed training, and deploy models efficiently. The analysis also covers the evolving Kubernetes AI ecosystem, including specialized tools like Kubeflow, TensorFlow operators, enhanced security technologies, and lightweight orchestration mechanisms that further extend its capabilities for AI workloads. The inquiry highlights how Kubernetes has enabled organizations to accelerate AI initiatives while maintaining operational efficiency in a rapidly growing market.

Item Type: Article
Official URL: https://doi.org/10.30574/wjaets.2025.15.1.0320
Uncontrolled Keywords: Kubernetes; Artificial Intelligence; Machine Learning Infrastructure; Container Orchestration; Distributed Training
Depositing User: Editor Engineering Section
Date Deposited: 04 Aug 2025 16:09
Related URLs:
URI: https://eprint.scholarsrepository.com/id/eprint/2880