skip to main content
10.1145/3185768.3186313acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
research-article

Evaluation of Energy Consumption of Replicated Tasks in a Volunteer Computing Environment

Published:02 April 2018Publication History

ABSTRACT

High Throughput Computing allows workloads of many thousands of tasks to be performed efficiently over many distributed resources and frees the user from the laborious process of managing task deployment, execution and result collection. However, in many cases the High Throughput Computing system is comprised from volunteer computational resources where tasks may be evicted by the owner of the resource. This has two main disadvantages. First, tasks may take longer to run as they may require multiple deployments before finally obtaining enough time on a resource to complete. Second, the wasted computation time will lead to wasted energy. We may be able to reduce the effect of the first disadvantage here by submitting multiple replicas of the task and take the results from the first one to complete. This, though, could lead to a significant increase in energy consumption. Thus we desire to only ever submit the minimum number of replicas required to run the task in the allocated time whilst simultaneously minimising energy. In this work we evaluate the use of fixed replica counts and Reinforcement Learning on the proportion of task which fail to finish in a given time-frame and the energy consumed by the system.

References

  1. David P Anderson. 2004. Boinc: A system for public-resource computing and storage Grid Computing, 2004. IEEE, 4--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. David P. Anderson, Jeff Cobb, Eric Korpela, Matt Lebofsky, and Dan Werthimer. 2002. {email protected}: An Experiment in Public-resource Computing. Commun. ACM Vol. 45, 11 (Nov. 2002), 56--61. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Peter Bodík, Rean Griffith, Charles Sutton, Armando Fox, Michael Jordan, and David Patterson. 2009. Statistical Machine Learning Makes Automatic Control Practical for Internet Datacenters. In USENIX HotCloud. Article 12. http://dl.acm.org/citation.cfm?id=1855533.1855545Google ScholarGoogle Scholar
  4. M. Forshaw, A.S. McGough, and N. Thomas. 2016. HTC-Sim: a trace-driven simulation framework for energy consumption in high-throughput computing systems. Concurrency and Computation: Practice and Experience 28, 12 (2016), 3260-3290. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. E. M. Heien, N. Fujimoto, and K. Hagihara. 2008. Computing low latency batches with unreliable workers in volunteer computing environments. In 2008 IEEE International Symposium on Parallel and Distributed Processing. 1-8.Google ScholarGoogle Scholar
  6. Derrick Kondo, Andrew A Chien, and Henri Casanova. 2004. Resource management for rapid application turnaround on enterprise desktop grids. In ACM/IEEE Supercomputing. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. Kondo, B. Javadi, P. Malecot, F. Cappello, and D. P. Anderson. 2009. Cost-benefit analysis of Cloud Computing versus desktop grids. In 2009 IEEE International Symposium on Parallel Distributed Processing. 1-12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Eric J. Korpela. 2012. SETI@home, BOINC, and Volunteer Distributed Computing. Annual Review of Earth and Planetary Sciences 40, 1 (2012), 69-87.Google ScholarGoogle ScholarCross RefCross Ref
  9. Antonios Litke, Dimitrios Skoutas, Konstantinos Tserpes, and Theodora Varvarigou. 2007. Efficient task replication and management for adaptive fault tolerance in Mobile Grid environments. Future Generation Computer Systems 23, 2 (2007), 163 - 178. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Litzkow, M. Livney, and M. W. Mutka. 1988. Condor-a hunter of idle workstations. In ICDCS.Google ScholarGoogle Scholar
  11. A.S. McGough, C. Gerrard, J. Noble, P. Robinson, and S. Wheater. 2011. Analysis of Power-Saving Techniques over a Large Multi-use Cluster. In IEEE DASC. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Stephen McGough and Matthew Forshaw. 2014. Reduction of wasted energy in a volunteer computing system through Reinforcement Learning. Sustainable Computing: Informatics and Systems 4, 4 (2014), 262 - 275.Google ScholarGoogle ScholarCross RefCross Ref
  13. Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Mingliang Liu, Yan Zhai, Wenguang Chen, and Weimin Zheng. 2013. Employing checkpoint to improve job scheduling in large-scale systems. In Job Scheduling Strategies for Parallel Processing. Springer, 36-55.Google ScholarGoogle Scholar
  14. R.S. Sutton and A.G. Barto. 1998. Reinforcement Learning: An Introduction. Bradford Book. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. The Condor Team. 2010. Condor Manual. http://www.cs.wisc.edu/condor/manual/. (Oct. 2010). University of Wisconsin.Google ScholarGoogle Scholar

Index Terms

  1. Evaluation of Energy Consumption of Replicated Tasks in a Volunteer Computing Environment

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ICPE '18: Companion of the 2018 ACM/SPEC International Conference on Performance Engineering
          April 2018
          212 pages
          ISBN:9781450356299
          DOI:10.1145/3185768

          Copyright © 2018 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 2 April 2018

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate252of851submissions,30%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader