Abstract
Computational sprinting is a class of mechanisms that boost performance but dissipate additional power. We describe a sprinting architecture in which many, independent chip multiprocessors share a power supply and sprints are constrained by the chips' thermal limits and the rack's power limits. Moreover, we present the computational sprinting game, a multi-agent perspective on managing sprints. Strategic agents decide whether to sprint based on application phases and system conditions. The game produces an equilibrium that improves task throughput for data analytics workloads by 4--6x over prior greedy heuristics and performs within 90% of an upper bound on throughput from a globally optimized policy.
- Adlakha, S., Johari, R. Mean field equilibrium in dynamic games with strategic complementarities. Oper. Res. 61, 4 (2013), 971--989.Google ScholarCross Ref
- Brooks, D. Martonosi, M. Dynamic thermal management for high-performance microprocessors. In Proceedings of the 7th IEEE International Symposium on High Performance Computer Architecture (HPCA) (Monterrey, Nuevo Leon, Mexico, 2001), 171--182. Google ScholarDigital Library
- Chase, J.S., Anderson, D.C., Thakar, P.N., Vahdat, A.M., Doyle, R.P. Managing energy and server resources in hosting centers. In Proceedings of the 18th Symposium on Operating Systems Principles (SOSP) (Banff, Alberta, Canada, 2001), 103--116. Google ScholarDigital Library
- Fan, X., Weber, W.-D., Barroso, L.A. Power provisioning for a warehouse-sized computer. In Proceedings of the 34th Annual International Symposium on Computer Architecture (ISCA) (San Diego, CA, USA, 2007), 13--23. Google ScholarDigital Library
- Fu, X., Wang, X., Lefurgy, C. How much power oversubscription is safe and allowed in data centers. In Proceedings of the 8th ACM International Conference on Autonomic Computing (ICAC) (Karlsruhe, Germany, 2011), 21--30. Google ScholarDigital Library
- Ghodsi, A., Zaharia, M., Hindman, B., Konwinski, A., Shenker, S., Stoica, I. Dominant resource fairness: Fair allocation of multiple resource types. In Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation (NSDI) (Boston, MA, USA, 2011), 323--336. Google ScholarDigital Library
- Govindan, S., Sivasubramaniam, A., Urgaonkar, B. Benefits and limitations of tapping into stored energy for datacenters. In Proceeding of the 38th Annual International Symposium on Computer Architecture (ISCA) (San Jose, CA, USA, 2011), 341--351. Google ScholarDigital Library
- Guevara, M., Lubin, B., Lee, B.C.. Navigating heterogeneous processors with market mechanisms. In Proceeding of the 19th IEEE International Symposium on High Performance Computer Architecture (HPCA) (Shenzhen, China, 2013), 95--106. Google ScholarDigital Library
- Guevara, M., Lubin, B., Lee, B.C. Strategies for anticipating risk in heterogeneous system design. In Proceeding of the 20th IEEE International Symposium on High Performance Computer Architecture (HPCA) (Orlando, FL, USA, 2014), 154--164.Google ScholarCross Ref
- Hindman, B., Konwinski, A., Zaharia, M., Ghodsi, A., Joseph, A.D., Katz, R., Shenker, S., Stoica, I. Mesos: A platform for fine-grained resource sharing in the data center. In Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation (NSDI) (Boston, MA, USA, 2011), 295--308. Google ScholarDigital Library
- Liu, Z., Wierman, A., Chen, Y., Razon, B., Chen, N. Data center demand response: Avoiding the coincident peak via workload shifting and local generation. Perform. Eval. 70, 10 (2013), 770--791. Google ScholarDigital Library
- Llull, Q., Fan, S., Zahedi, S.M., Lee, B.C. Cooper: Task colocation with cooperative games. In Proceedings of the 23rd IEEE International Symposium on High-Performance Computer Architecture (HPCA) (Austin, TX, USA, 2017), 421--432.Google Scholar
- Raghavan, A., Emurian, L., Shao, L., Papaefthymiou, M., Pipe, K.P., Wenisch, T.F., Martin, M.M. Computational sprinting on a hardware/software testbed. In Proceedings of the 18th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) (Houston, TX, USA, 2013), 155--166. Google ScholarDigital Library
- Raghavan, A., Luo, Y., Chandawalla, A., Papaefthymiou, M., Pipe, K.P., Wenisch, T.F., Martin, M.M.K. Computational sprinting. In Proceedings of the 18th IEEE International Symposium on High Performance Computer Architecture (HPCA) (New Orleans, LA, USA, 2012), 1--12. Google ScholarDigital Library
- Shao, L., Raghavan, A., Emurian, L., Papaefthymiou, M.C., Wenisch, T.F., Martin, M.M., Pipe, K.P. On-chip phase change heat sinks designed for computational sprinting. In Proceedings of the 30th Annual Semiconductor Thermal Measurement and Management Symposium (San Jose, CA, USA, 2014),29--34.Google ScholarCross Ref
- Skach, M., Arora, M., Hsu, C.-H., Li, Q., Tullsen, D., Tang, L., Mars, J. Thermal time shifting: Leveraging phase change materials to reduce cooling costs in warehouse-scale computers. In Proceedings of the 42nd Annual International Symposium on Computer Architecture (ISCA) (Portland, OR, USA, 2015), 439--449. Google ScholarDigital Library
- Somu Muthukaruppan, T., Pathania, A., Mitra, T. Price theory based power management for heterogeneous multi-cores. In Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) (Salt Lake City, UT, USA, 2014), 161--176. Google ScholarDigital Library
- Wang, X., Chen, M., Lefurgy, C., Keller, T.W. Ship: A scalable hierarchical power control architecture for large-scale data centers. IEEE Trans. Parallel Distrib. Syst. 23, 1 (2012), 168--176. Google ScholarDigital Library
- Zahedi, S.M., Lee, B.C. Sharing incentives and fair division for multiprocessors. IEEE Micro 35, 3 (2015), 92--100.Google ScholarDigital Library
- Zahedi, S.M., Llull, Q., Lee, B.C. Amdahl's Law in the datacenter era: A market for fair processor allocation. In Proceedings of the 24rd IEEE International Symposium on High-Performance Computer Architecture (HPCA) (Vienna, Austria, 2018).Google ScholarCross Ref
- Zheng, W., Wang, X. Data center sprinting: Enabling computational sprinting at the data center level. In Proceedings of the 35th International Conference on Distributed Computing Systems (ICDCS) (Columbus, OH, USA, 2015), 175--184.Google ScholarCross Ref
Index Terms
Distributed strategies for computational sprints
Recommendations
The Computational Sprinting Game
ASPLOS'16Computational sprinting is a class of mechanisms that boost performance but dissipate additional power. We describe a sprinting architecture in which many, independent chip multiprocessors share a power supply and sprints are constrained by the chips' ...
ABC-Sprints: adapting Scrum to academic game development courses
FDG '10: Proceedings of the Fifth International Conference on the Foundations of Digital GamesWe propose a course design that fits a practical game development project into a regular game design lecture course. As this approach requires a consistent structure, our concept proposes an adaption of Scrum that is based on the game development life-...
Creative sprints: an unplanned broad agile evaluation and redesign process
NordiCHI '14: Proceedings of the 8th Nordic Conference on Human-Computer Interaction: Fun, Fast, FoundationalWe report how a request for routine usability work rapidly evolved into a novel agile process for evaluation and redesign. This process is described and then analysed to identify reasons for success. This analysis supports realistic knowledge transfer ...
Comments