Dynamic Scaling Strategies for AI Workloads in Cloud Environments

Authors

  • Huan Yue Assistant Professor, Department of Computer Science, Nanjing Academy of Information Technology, Nanjing, China Author
  • Liwei Chen Professor and Chair, Department of Information Systems, Nanjing Academy of Information Technology, Nanjing, China Author

Keywords:

Dynamic scaling, AI workloads, Cloud computing, Auto-scaling, Machine learning, Resource management, Cost optimization.

Abstract

The integration of Artificial Intelligence (AI) applications with cloud computing has brought about unparalleled opportunities for scalability and flexibility. However, the dynamic nature of AI workloads poses significant challenges in terms of resource allocation and utilization within cloud environments. This research paper explores various dynamic scaling strategies tailored specifically for AI workloads in cloud environments. Through a comprehensive review of existing literature and empirical analysis, this paper evaluates the effectiveness of different scaling approaches in optimizing resource utilization, reducing costs, and enhancing performance for AI workloads.

Downloads

Published

19-04-2024

How to Cite

Dynamic Scaling Strategies for AI Workloads in Cloud Environments. (2024). Asian American Research Letters Journal, 1(2). https://aarlj.com/index.php/AARLJ/article/view/25

Similar Articles

1-10 of 53

You may also start an advanced similarity search for this article.