skip to main content
10.1145/2612669.2612703acmconferencesArticle/Chapter ViewAbstractPublication PagesspaaConference Proceedingsconference-collections
abstract

Persistent unfairness arising from cache residency imbalance

Published:21 June 2014Publication History

ABSTRACT

We describe a counter-intuitive performance phenomena relevant to concurrency research. On a modern multicore system with a shared last-level cache, a set of concurrently running identical threads that loop -- each accessing the same quantity of distinct thread-private data -- can suffer significant relative progress imbalance. If one thread, or a small subset of the threads, manages to transiently enjoy higher cache residency than the other threads, that thread will tend to iterate faster and keep more of its data resident, thus increasing the odds that it will continue to run faster. This emergent behavior tends to be stable over surprisingly long periods.

References

  1. Y. Afek, D. Dice, and A. Morrison. Cache Index-aware Memory Allocation. In Proceedings of the International Symposium on Memory Management, ISMM '11, pages 55--64, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. B. Brett, P. Kumar, M. Kim, and H. Kim. CHiP: A Profiler to Measure the Effect of Cache Contention on Scalability. In Proceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum, IPDPSW '13, pages 1565--1574, Washington, DC, USA, 2013. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. K. Katti and V. Ramachandran. Competitive Cache Replacement Strategies for Shared Cache Environments. In Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS '12, pages 215--226, Washington, DC, USA, 2012. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. G. Marsaglia. Xorshift RNGs. Journal of Statistical Software, 8(14):1--6, 7 2003.Google ScholarGoogle ScholarCross RefCross Ref
  5. J. M. Mellor-Crummey and M. L. Scott. Algorithms for Scalable Synchronization on Shared-memory Multiprocessors. ACM Trans. Comput. Syst., 9:21--65, February 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Oracle Corporation. Oracle's SPARC T4--1, SPARC T4--2, SPARC T4--4, and SPARC T4--1B Server Architecture, 2012.Google ScholarGoogle Scholar

Index Terms

  1. Persistent unfairness arising from cache residency imbalance

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SPAA '14: Proceedings of the 26th ACM symposium on Parallelism in algorithms and architectures
        June 2014
        356 pages
        ISBN:9781450328210
        DOI:10.1145/2612669
        • General Chair:
        • Guy Blelloch,
        • Program Chair:
        • Peter Sanders

        Copyright © 2014 Owner/Author

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 21 June 2014

        Check for updates

        Qualifiers

        • abstract

        Acceptance Rates

        SPAA '14 Paper Acceptance Rate30of122submissions,25%Overall Acceptance Rate447of1,461submissions,31%

        Upcoming Conference

        SPAA '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader