ABSTRACT
This paper develops a fully decentralized control architecture to address the workload consolidation problem in large-scale server clusters wherein the cluster's processing capacity is dynamically tuned to satisfy the service level agreements (SLAs) associated with the incoming workload while consolidating the workload onto the fewest number of servers. In a decentralized setting, this problem is decomposed into simpler subproblems, each of which is mapped to a server and solved by a controller assigned to that server. Though control loops on different servers run independently of each other, they are implicitly coupled via the shared high-level performance goal and interactions between controllers may result in undesired system behavior such as SLA violations and frequent switching of cores on and off. Using the proposed architecture as the reference, we analyze how the organization of individual controllers within the control structure affects its overall performance for large clusters of up to thousand servers. Our studies indicate that the control structure, when organized as a causal system in which a precedence relation exists among the individual controllers, achieves a high degree of SLA satisfaction (> 98%) while significantly reducing the corresponding switching cost.
- T. Atwood. Right architecture for the right workload: The application tier. Technical report, Sun Microsystems Report, %Enterprise Systems Products, Jul. 2004.Google Scholar
- Y. Chen, D. Gmach, C. Hyser, Z. Wang, C. Bash, C. Hoover, and S. Singhal. Integrated management of application performance, power and cooling in data centers. In Network Operations and Mgmt. Symposium, 2010.Google ScholarCross Ref
- R. Das, J. O. Kephart, C. Lefurgy, G. Tesauro, D. W. Levine, and H. Chan. Autonomic multi-agent management of power and performance in data centers. In Conf. Autonomous agents and multiagent systems, 2008. Google ScholarDigital Library
- W. B. Dunbar and R. M. Murray. Distributed receding horizon control for multi-vehicle formation stabilization. Automatica, 42(4):549--558, 2006. Google ScholarDigital Library
- A. Guez, I. Rusnak, and I. B. Kana. Multiple objectives optimization approach to adaptive and learning control. Intl. Journal of Control, 56(2):469--482, September 1992.Google ScholarCross Ref
- J. Hellerstein, S. Singhal, and Q. Wang. Research challenges in control engineering of computing systems. IEEE Trans. Network & Service Mgmt., 6(4):206--211, Dec. 2009. Google ScholarDigital Library
- J. L. Hellerstein, Y. Diao, S. Parekh, and D. M. Tilbury. Feedback Control of Computing Systems. Wiley-IEEE Press, 2004. Google ScholarDigital Library
- Y.-C. Ho and K.-C. Chu. Team decision theory and information structures in optimal control problems--Part I. Automatic Control, IEEE Transactions on, 17(1):15 -- 22, Feb. 1972.Google Scholar
- D. Kusic, J. Kephart, J. Hanson, N. Kandasamy, and G. Jiang. Power and performance management of virtualized computing environments via lookahead control. Cluster Computing, 12:1--15, 2009. Google ScholarDigital Library
- A. Leon-Garcia. Probability, statistics, and random processes for electrical engineering. Prentice Hall, 2008.Google Scholar
- J. M. Maciejowski. Predictive Control with Constraints. Prentice Hall, London, 2002.Google Scholar
- S. G. Makridakis, S. C. Wheelwright, and R. J. Hyndman. Forecasting: methods and applications. Wiley, 1998.Google Scholar
- X. Meng, C. Isci, J. Kephart, L. Zhang, E. Bouillet, and D. Pendarakis. Efficient resource provisioning in compute clouds via VM multiplexing. In Intl Conf. on Autonomic computing, New York, NY, USA, 2010. Google ScholarDigital Library
- G. Tesauro. Reinforcement learning in autonomic computing: A manifesto and case studies. IEEE Internet Computing, 11:22--30, 2007. Google ScholarDigital Library
- R. Wang and N. Kandasamy. Workload consolidation in virtualized computing systems via hierarchical control. Intel Technology Journal, 16, June 2012.Google Scholar
- R. Wang, D. M. Kusic, and N. Kandasamy. A distributed control framework for performance management of virtualized computing environments. In Intl. Conf. on Autonomic computing, 2010. Google ScholarDigital Library
- X. Wang, M. Chen, C. Lefurgy, and T. Keller. Ship: Scalable hierarchical power control for large-scale data centers. In Intl. Conf. on Parallel Architectures and Compilation Techniques, sept. 2009. Google ScholarDigital Library
- Y. Wang, X. Wang, M. Chen, and X. Zhu. Power-efficient response time guarantees for virtualized enterprise servers. In Real-Time Systems Symp., 2008. Google ScholarDigital Library
Index Terms
- On the design of decentralized control architectures for workload consolidation in large-scale server clusters
Recommendations
Server consolidation with migration control for virtualized data centers
Virtualization has become a key technology for simplifying service management and reducing energy costs in data centers. One of the challenges faced by data centers is to decide when, how, and which virtual machines (VMs) have to be consolidated into a ...
Energy proportionality and workload consolidation for latency-critical applications
SoCC '15: Proceedings of the Sixth ACM Symposium on Cloud ComputingEnergy proportionality and workload consolidation are important objectives towards increasing efficiency in large-scale datacenters. Our work focuses on achieving these goals in the presence of applications with μs-scale tail latency requirements. Such ...
An Approach for Detection of Overloaded Host to Consolidate Workload in Cloud Datacenter
This article describes the process of workload consolidation through detection of overloaded hosts in Cloud datacenter which leads to saving in energy consumption. Cloud computing is a novice paradigm where virtual resources are provisioned on pay-as-...
Comments