Alla, Satya Sai Ram (2025) Demystifying AI-driven cloud resiliency: How machine learning enhances fault tolerance in hybrid cloud infrastructure. World Journal of Advanced Engineering Technology and Sciences, 15 (2). pp. 1203-1215. ISSN 2582-8266
![WJAETS-2025-0591.pdf [thumbnail of WJAETS-2025-0591.pdf]](https://eprint.scholarsrepository.com/style/images/fileicons/text.png)
WJAETS-2025-0591.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.
Abstract
The evolution of cloud infrastructure resilience has transitioned from traditional redundancy-based approaches to sophisticated AI-driven frameworks that enhance fault tolerance in hybrid and multi-cloud environments. This article examines how machine learning models improve cloud-native resiliency through predictive analytics, automated remediation, and intelligent resource allocation. Through systematic literature review and case studies across streaming media, container orchestration, and retail platforms, the effectiveness of various AI techniques is evaluated against traditional methods. The research demonstrates significant improvements in downtime reduction, false positive rates, and recovery metrics when employing AI-enhanced resilience mechanisms. Despite these benefits, implementation challenges persist in data quality, model drift, integration complexity, security implications, resource overhead, and organizational adaptation. The investigation reveals that successful implementations share common characteristics: comprehensive observability infrastructure, phased automation deployment, and cross-functional expertise. The integration of machine learning with established resilience patterns creates hybrid approaches that combine the predictive power of AI with proven fault tolerance strategies, fundamentally transforming cloud infrastructure management from reactive to proactive paradigms.
Item Type: | Article |
---|---|
Official URL: | https://doi.org/10.30574/wjaets.2025.15.2.0591 |
Uncontrolled Keywords: | Machine Learning Resilience; Hybrid Cloud Fault Tolerance; Predictive Maintenance; AI-Driven Self-Healing; Multi-Cloud Disaster Recovery |
Depositing User: | Editor Engineering Section |
Date Deposited: | 04 Aug 2025 16:33 |
Related URLs: | |
URI: | https://eprint.scholarsrepository.com/id/eprint/3722 |