Scaling AI Applications on the Cloud toward Optimized Cloud-Native Architectures, Model Efficiency, and Workload Distribution
Aravind Nuthalapati
Additional contact information
Aravind Nuthalapati: Microsoft, USA
International Journal of Latest Technology in Engineering, Management & Applied Science, 2025, vol. 14, issue 2, 200-206
Abstract:
The rapid growth of Artificial Intelligence (AI) has increasefd the demand for scalable, efficient, and cost-effective computational infrastructure. Traditional on-premise systems face limitations in scalability, resource allocation, and cost efficiency, making cloud computing a preferred solution. This paper examines cloud-native architectures, including containerization, Kubernetes orchestration, serverless computing, and microservices, as key enablers of AI scalability. Modern approaches for optimizing AI models involve using quantization and pruning and knowledge distillation approaches to make them more efficient without sacrificing their accuracy levels. The paper investigates workload distribution methods like federated learning together with distributed training plus adaptive AI scaling for improving resource efficiency and lowering response times. The implementation continues to face difficulties concerning expense control and latency reduction and scheduling resources efficiently while ensuring security standards. The research presents three possible solutions namely automated AI scaling, edge-cloud integration and provisioning with cost intelligent management systems to overcome current limitations. This examination features a study of present-day trends which consist of AI-native cloud orchestration along with AutoML-based optimization and quantum computing applications for the enhancement of AI scaling capabilities. This research provides comprehensive insights about cloud-based AI scalability which helps researchers as well as practitioners improve their deployment and optimization capabilities of high-performance AI systems.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.ijltemas.in/DigitalLibrary/Vol.14Issue2/200-206.pdf (application/pdf)
https://www.ijltemas.in/papers/volume-14-issue-2/200-206.html (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bjb:journl:v:14:y:2025:i:2:p:200-206
Access Statistics for this article
International Journal of Latest Technology in Engineering, Management & Applied Science is currently edited by Dr. Pawan Verma
More articles in International Journal of Latest Technology in Engineering, Management & Applied Science from International Journal of Latest Technology in Engineering, Management & Applied Science (IJLTEMAS)
Bibliographic data for series maintained by Dr. Pawan Verma ().