Intel threading building blocks
July 2007
Publisher:
  • O'Reilly & Associates, Inc.
  • 103A Morris St. Sebastopol, CA
  • United States
ISBN: 978-0-596-51480-8
Published: 12 July 2007
Pages: 332
Abstract

No abstract available.

Cited By

  1. ACM
    Cai Q, Xiao G, Lin S, Yang W, Li K and Li K (2024). ABSS: An Adaptive Batch-Stream Scheduling Module for Dynamic Task Parallelism on Chiplet-based Multi-Chip Systems, ACM Transactions on Parallel Computing, 11:1, (1-24), Online publication date: 31-Mar-2024.
  2. ACM
    Shiina S and Taura K Itoyori: Reconciling Global Address Space and Global Fork-Join Task Parallelism Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-15)
  3. Garcia A, Griebler D, Schepke C and Fernandes L (2023). Micro-batch and data frequency for stream processing on multi-cores, The Journal of Supercomputing, 79:8, (9206-9244), Online publication date: 1-May-2023.
  4. Mangiras D, Chinnery D and Dimitrakopoulos G (2023). Task-Based Parallel Programming for Gate Sizing, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 42:4, (1309-1322), Online publication date: 1-Apr-2023.
  5. Sotoudeh M, Tao Z and Thakur A (2023). SyReNN: A tool for analyzing deep neural networks, International Journal on Software Tools for Technology Transfer (STTT), 25:2, (145-165), Online publication date: 1-Apr-2023.
  6. Andrade G, Griebler D, Santos R and Fernandes L (2023). A parallel programming assessment for stream processing applications on multi-core systems, Computer Standards & Interfaces, 84:C, Online publication date: 1-Mar-2023.
  7. ACM
    Schardl T and Lee I OpenCilk Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, (189-203)
  8. Löff J, Hoffmann R, Pieper R, Griebler D and Fernandes L (2022). DSParLib: A C++ Template Library for Distributed Stream Parallelism, International Journal of Parallel Programming, 50:5-6, (454-485), Online publication date: 1-Dec-2022.
  9. Chang S, Bi R, Sun J, Liu W, Yu Q, Deng Q and Gu Z (2022). Toward Minimum WCRT Bound for DAG Tasks Under Prioritized List Scheduling Algorithms, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 41:11, (3874-3885), Online publication date: 1-Nov-2022.
  10. ACM
    Wang H, Luo Q and Du Z A Hybrid Data-flow Visual Programing Language Workshop Proceedings of the 51st International Conference on Parallel Processing, (1-6)
  11. ACM
    Aguilar Mena J, Shaaban O, Lopez V, Garcia M, Carpenter P, Ayguade E and Labarta J Transparent load balancing of MPI programs using OmpSs-2@Cluster and DLB Proceedings of the 51st International Conference on Parallel Processing, (1-11)
  12. Martínez M, Fraguela B and Cabaleiro J (2022). A highly optimized skeleton for unbalanced and deep divide-and-conquer algorithms on multi-core clusters, The Journal of Supercomputing, 78:8, (10434-10454), Online publication date: 1-May-2022.
  13. Hoffmann R, Löff J, Griebler D and Fernandes L (2022). OpenMP as runtime for providing high-level stream parallelism on multi-cores, The Journal of Supercomputing, 78:6, (7655-7676), Online publication date: 1-Apr-2022.
  14. ACM
    Mitchell R, Stokes D, Frank E and Holmes G (2022). Bandwidth-Optimal Random Shuffling for GPUs, ACM Transactions on Parallel Computing, 9:1, (1-20), Online publication date: 31-Mar-2022.
  15. Chang S, Sun J, Hao Z, Deng Q and Guan N (2022). Computing exact WCRT for typed DAG tasks on heterogeneous multi-core processors, Journal of Systems Architecture: the EUROMICRO Journal, 124:C, Online publication date: 1-Mar-2022.
  16. Martínez M, Fraguela B and Cabaleiro J (2021). A Parallel Skeleton for Divide-and-conquer Unbalanced and Deep Problems, International Journal of Parallel Programming, 49:6, (820-845), Online publication date: 1-Dec-2021.
  17. Bednárek D, Kruliš M and Yaghob J (2021). Letting future programmers experience performance-related tasks, Journal of Parallel and Distributed Computing, 155:C, (74-86), Online publication date: 1-Sep-2021.
  18. Moreno P, Areias M and Rocha R (2021). On the implementation of memory reclamation methods in a lock-free hash trie design, Journal of Parallel and Distributed Computing, 155:C, (1-13), Online publication date: 1-Sep-2021.
  19. ACM
    Qian Z, Kavvos G and Birkedal L (2021). Client-server sessions in linear logic, Proceedings of the ACM on Programming Languages, 5:ICFP, (1-31), Online publication date: 22-Aug-2021.
  20. ACM
    Moreno M, Papa S and Ofria C Conduit Proceedings of the Genetic and Evolutionary Computation Conference Companion, (1795-1800)
  21. ACM
    Xu Y, Agrawal K and Lee I Efficient Parallel Determinacy Race Detection for Structured Futures Proceedings of the 33rd ACM Symposium on Parallelism in Algorithms and Architectures, (398-409)
  22. Han M, Park J and Baek W (2020). Design and Implementation of a Criticality- and Heterogeneity-Aware Runtime System for Task-Parallel Applications, IEEE Transactions on Parallel and Distributed Systems, 32:5, (1117-1132), Online publication date: 1-May-2021.
  23. Zheng R and Pai S Efficient execution of graph algorithms on CPU with SIMD extensions Proceedings of the 2021 IEEE/ACM International Symposium on Code Generation and Optimization, (262-276)
  24. ACM
    Raut E, Anderson J, Araya-Polo M and Meng J Porting and Evaluation of a Distributed Task-driven Stencil-based Application Proceedings of the 12th International Workshop on Programming Models and Applications for Multicores and Manycores, (21-30)
  25. ACM
    Shiina S, Iwasaki S, Taura K and Balaji P Lightweight preemptive user-level threads Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (374-388)
  26. Holbrook A, Loeffler C, Flaxman S and Suchard M (2021). Scalable Bayesian inference for self-excitatory stochastic processes applied to big American gunfire data, Statistics and Computing, 31:1, Online publication date: 1-Jan-2021.
  27. ACM
    Rinaldi L, Torquati M, Mencagli G and Danelutto M High-throughput stream processing with actors Proceedings of the 10th ACM SIGPLAN International Workshop on Programming Based on Actors, Agents, and Decentralized Control, (1-10)
  28. ACM
    Hoffmann R, Griebler D, Danelutto M and Fernandes L Stream Parallelism Annotations for Multi-Core Frameworks Proceedings of the 24th Brazilian Symposium on Context-Oriented Programming and Advanced Modularity, (48-55)
  29. Raut E, Meng J, Araya-Polo M and Chapman B Evaluating Performance of OpenMP Tasks in a Seismic Stencil Application OpenMP: Portable Multi-Level Parallelism on Modern Systems, (67-81)
  30. Olivier S Evaluating the Efficiency of OpenMP Tasking for Unbalanced Computation on Diverse CPU Architectures OpenMP: Portable Multi-Level Parallelism on Modern Systems, (18-33)
  31. Moreno P, Areias M and Rocha R A Compression-Based Design for Higher Throughput in a Lock-Free Hash Map Euro-Par 2020: Parallel Processing, (458-473)
  32. Minakova S, Tang E and Stefanov T Combining Task- and Data-Level Parallelism for High-Throughput CNN Inference on Embedded CPUs-GPUs MPSoCs Embedded Computer Systems: Architectures, Modeling, and Simulation, (18-35)
  33. ACM
    Cavelan A, Cabezón R, Grabarczyk M and Ciorba F A Smoothed Particle Hydrodynamics Mini-App for Exascale Proceedings of the Platform for Advanced Scientific Computing Conference, (1-11)
  34. ACM
    Prabhu I and Nandivada V Chunking loops with non-uniform workloads Proceedings of the 34th ACM International Conference on Supercomputing, (1-12)
  35. Griebler D, Vogel A, De Sensi D, Danelutto M and Fernandes L (2019). Simplifying and implementing service level objectives for stream parallelism, The Journal of Supercomputing, 76:6, (4603-4628), Online publication date: 1-Jun-2020.
  36. ACM
    Chuang H, Lyerly R, Lankes S and Ravindran B Scaling Shared Memory Multiprocessing Applications in Non-cache-coherent Domains Proceedings of the 13th ACM International Systems and Storage Conference, (13-24)
  37. ACM
    Kronbichler M and Kormann K (2019). Fast Matrix-Free Evaluation of Discontinuous Galerkin Finite Element Operators, ACM Transactions on Mathematical Software, 45:3, (1-40), Online publication date: 30-Sep-2019.
  38. ACM
    Pieper R, Griebler D and Fernandes L Structured Stream Parallelism for Rust Proceedings of the XXIII Brazilian Symposium on Programming Languages, (54-61)
  39. ACM
    Pérard-Gayot A, Membarth R, Leißa R, Hack S and Slusallek P (2019). Rodent, ACM Transactions on Graphics, 38:4, (1-12), Online publication date: 31-Aug-2019.
  40. Haber T and van Reeth F Improving the Runtime Performance of Non-linear Mixed-Effects Model Estimation Euro-Par 2019: Parallel Processing Workshops, (560-571)
  41. Yang C, Pichel J and Padua D Dataflow Execution of Hierarchically Tiled Arrays Euro-Par 2019: Parallel Processing, (304-316)
  42. Torquati M, Mencagli G, Drocco M, Aldinucci M, De Matteis T and Danelutto M (2019). On dynamic memory allocation in sliding-window parallel patterns for streaming analytics, The Journal of Supercomputing, 75:8, (4114-4131), Online publication date: 1-Aug-2019.
  43. Gowanlock M and Karsin B (2022). A hybrid CPU/GPU approach for optimizing sorting throughput, Parallel Computing, 85:C, (45-55), Online publication date: 1-Jul-2019.
  44. ACM
    Castillo E, Jain N, Casas M, Moreto M, Schulz M, Beivide R, Valero M and Bhatele A Optimizing computation-communication overlap in asynchronous task-based programs Proceedings of the ACM International Conference on Supercomputing, (380-391)
  45. ACM
    Das R, Tsai S, Duppala S, Lynch J, Arkin E, Chowdhury R, Mitchell J and Skiena S Data Races and the Discrete Resource-time Tradeoff Problem with Resource Reuse over Paths The 31st ACM Symposium on Parallelism in Algorithms and Architectures, (359-368)
  46. Coti C, Monniaux D and Yu H Parallel Parametric Linear Programming Solving, and Application to Polyhedral Computations Computational Science – ICCS 2019, (566-572)
  47. Cojean T, Guermouche A, Hugo A, Namyst R and Wacrenier P (2019). Resource aggregation for task-based Cholesky Factorization on top of modern architectures, Parallel Computing, 83:C, (73-92), Online publication date: 1-Apr-2019.
  48. Griebler D, Hoffmann R, Danelutto M and Fernandes L (2019). High-Level and Productive Stream Parallelism for Dedup, Ferret, and Bzip2, International Journal of Parallel Programming, 47:2, (253-271), Online publication date: 1-Apr-2019.
  49. Navarro A, Corbera F, Rodriguez A, Vilches A and Asenjo R (2019). Heterogeneous parallel_for Template for CPU-GPU Chips, International Journal of Parallel Programming, 47:2, (213-233), Online publication date: 1-Apr-2019.
  50. Pérez G and Yovine S (2019). Formal specification and implementation of an automated pattern-based parallel-code generation framework, International Journal on Software Tools for Technology Transfer (STTT), 21:2, (183-202), Online publication date: 1-Apr-2019.
  51. Thoman P, Zangerl P and Fahringer T (2019). Static Compiler Analyses for Application-specific Optimization of Task-Parallel Runtime Systems, Journal of Signal Processing Systems, 91:3-4, (303-320), Online publication date: 1-Mar-2019.
  52. Ousterhout A, Fried J, Behrens J, Belay A and Balakrishnan H Shenango Proceedings of the 16th USENIX Conference on Networked Systems Design and Implementation, (361-377)
  53. ACM
    Jordan H, Subotić P, Zhao D and Scholz B Brie Proceedings of the 10th International Workshop on Programming Models and Applications for Multicores and Manycores, (31-40)
  54. ACM
    Utterback R, Agrawal K, Fineman J and Lee I Efficient race detection with futures Proceedings of the 24th Symposium on Principles and Practice of Parallel Programming, (340-354)
  55. ACM
    Jordan H, Subotić P, Zhao D and Scholz B A specialized B-tree for concurrent datalog evaluation Proceedings of the 24th Symposium on Principles and Practice of Parallel Programming, (327-339)
  56. Fan X, Liao X and Jin H (2019). FunctionFlow, Frontiers of Computer Science: Selected Publications from Chinese Universities, 13:1, (73-85), Online publication date: 1-Feb-2019.
  57. Danelutto M, De Matteis T, De Sensi D, Mencagli G, Torquati M, Aldinucci M and Kilpatrick P (2019). The RePhrase Extended Pattern Set for Data Intensive Parallel Computing, International Journal of Parallel Programming, 47:1, (74-93), Online publication date: 1-Feb-2019.
  58. ACM
    Ramachandran A and Mittal N Improving efficacy of concurrent internal binary search trees using local recovery Proceedings of the 20th International Conference on Distributed Computing and Networking, (61-70)
  59. ACM
    Zhao H, Chen Q, Qiu Y, Wu M, Shen Y, Leng J, Li C and Guo M (2018). Bandwidth and Locality Aware Task-stealing for Manycore Architectures with Bandwidth-Asymmetric Memory, ACM Transactions on Architecture and Code Optimization, 15:4, (1-26), Online publication date: 31-Dec-2019.
  60. ACM
    Li M, Kaufman D, Kim V, Solomon J and Sheffer A (2018). OptCuts, ACM Transactions on Graphics, 37:6, (1-13), Online publication date: 31-Dec-2019.
  61. Demirci G, Marincic I and Hoffmann H A divide and conquer algorithm for DAG scheduling under power constraints Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, (1-12)
  62. Caheny P, Alvarez L, Valero M, Moretó M and Casas M Runtime-assisted cache coherence deactivation in task parallel programs Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, (1-12)
  63. Iwasaki S, Amer A, Taura K and Balaji P Lessons learned from analyzing dynamic promotion for user-level threading Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, (1-12)
  67. Sons M and Stiller C Efficient Multi-Drive Map Optimization towards Life-long Localization using Surround View 2018 21st International Conference on Intelligent Transportation Systems (ITSC), (2671-2677)
  68. del Rio Astorga D, Dolz M, Sánchez L, García J, Danelutto M and Torquati M (2018). Finding parallel patterns through static analysis in C++ applications, International Journal of High Performance Computing Applications, 32:6, (779-788), Online publication date: 1-Nov-2018.
  69. ACM
    Peng Z, Powell A, Wu B, Bicer T and Ren B Graphphi Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, (1-14)
  70. ACM
    Cho Y, Guzman C and Egger B Maximizing system utilization via parallelism management for co-located parallel applications Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, (1-14)
  71. Wang H, Xiao B, Wu J, Kwong S and Kuo C (2018). A Collaborative Scheduling-Based Parallel Solution for HEVC Encoding on Multicore Platforms, IEEE Transactions on Multimedia, 20:11, (2935-2948), Online publication date: 1-Nov-2018.
  72. ACM
    Leißa R, Boesche K, Hack S, Pérard-Gayot A, Membarth R, Slusallek P, Müller A and Schmidt B (2018). AnyDSL: a partial evaluation framework for programming high-performance libraries, Proceedings of the ACM on Programming Languages, 2:OOPSLA, (1-30), Online publication date: 24-Oct-2018.
  73. Chen T, Srinath S, Batten C and Suh G An architectural framework for accelerating dynamic parallel algorithms on reconfigurable hardware Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, (55-67)
  74. ACM
    Lee I and Schardl T (2018). Efficient Race Detection for Reducer Hyperobjects, ACM Transactions on Parallel Computing, 4:4, (1-40), Online publication date: 15-Sep-2018.
  75. Tagliavini G, Cesarini D and Marongiu A (2018). Unleashing Fine-Grained Parallelism on Embedded Many-Core Accelerators with Lightweight OpenMP Tasking, IEEE Transactions on Parallel and Distributed Systems, 29:9, (2150-2163), Online publication date: 1-Sep-2018.
  76. ACM
    De Goes F and James D (2018). Dynamic kelvinlets, ACM Transactions on Graphics, 37:4, (1-10), Online publication date: 31-Aug-2018.
  77. Ferrão P, Marques H and Paulino H Stream Processing on Hybrid CPU/Intel® Xeon Phi™ Systems Euro-Par 2018: Parallel Processing, (796-810)
  78. Griebler D, De Sensi D, Vogel A, Danelutto M and Fernandes L Service Level Objectives via C++11 Attributes Euro-Par 2018: Parallel Processing Workshops, (745-756)
  79. Vogel A, Griebler D, De Sensi D, Danelutto M and Fernandes L Autonomic and Latency-Aware Degree of Parallelism Management in SPar Euro-Par 2018: Parallel Processing Workshops, (28-39)
  80. ACM
    Agrawal K, Gilbert S and Lim W Parallel Working-Set Search Structures Proceedings of the 30th on Symposium on Parallelism in Algorithms and Architectures, (321-332)
  81. Kim M, Klasky S and Pugmire D Dense texture flow visualization using data-parallel primitives Proceedings of the Symposium on Parallel Graphics and Visualization, (57-61)
  82. Castellana V and Minutoli M SHAD Proceedings of the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, (442-451)
  83. Crutzen Y, Boman R, Papeleux L and Ponthot J (2018). Continuous roll forming including in-line welding and post-cut within an ALE formalism, Finite Elements in Analysis and Design, 143:C, (11-31), Online publication date: 1-Apr-2018.
  84. Archibald B, Maier P, McCreesh C, Stewart R and Trinder P (2018). Replicable parallel branch and bound search, Journal of Parallel and Distributed Computing, 113:C, (92-114), Online publication date: 1-Mar-2018.
  85. ACM
    von Koch T, Manilov S, Vasiladiotis C, Cole M and Franke B Towards a compiler analysis for parallel algorithmic skeletons Proceedings of the 27th International Conference on Compiler Construction, (174-184)
  86. ACM
    Rosà A, Rosales E and Binder W Analyzing and optimizing task granularity on the JVM Proceedings of the 2018 International Symposium on Code Generation and Optimization, (27-37)
  87. ACM
    Fürlinger K, Kowalewski R, Fuchs T and Lehmann B Investigating the performance and productivity of DASH using the Cowichan problems Proceedings of Workshops of HPC Asia, (11-20)
  88. ACM
    Matheou G and Evripidou P (2017). Data-Driven Concurrency for High Performance Computing, ACM Transactions on Architecture and Code Optimization, 14:4, (1-26), Online publication date: 20-Dec-2017.
  89. ACM
    Gebremedhin M and Fritzson P Parallelizing simulations with runtime profiling and scheduling Proceedings of the 8th International Workshop on Equation-Based Object-Oriented Modeling Languages and Tools, (19-25)
  90. Reiche O, Özkan M, Membarth R, Teich J and Hannig F Generating FPGA-based image processing accelerators with Hipacc Proceedings of the 36th International Conference on Computer-Aided Design, (1026-1033)
  91. Reiche O, Özkan M, Membarth R, Teich J and Hannig F Generating FPGA-based image processing accelerators with Hipacc: (Invited paper) 2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), (1026-1033)
  92. (2017). HW/SW Co-Design of the HOG algorithm on a Xilinx Zynq SoC, Journal of Parallel and Distributed Computing, 109:C, (50-62), Online publication date: 1-Nov-2017.
  93. Herrera J, Salmerón J, Hendrix E, Asenjo R and Casado L (2017). On parallel Branch and Bound frameworks for Global Optimization, Journal of Global Optimization, 69:3, (547-560), Online publication date: 1-Nov-2017.
  94. ACM
    Chu J, Zafar N and Yang X (2017). A Schur Complement Preconditioner for Scalable Parallel Fluid Simulation, ACM Transactions on Graphics, 36:5, (1-11), Online publication date: 17-Oct-2017.
  95. ACM
    Kim J, Jiang S, Torng C, Wang M, Srinath S, Ilbeyi B, Al-Hawaj K and Batten C Using intra-core loop-task accelerators to improve the productivity and performance of task-based parallel programs Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, (759-773)
  96. ACM
    Lifflander J and Krishnamoorthy S (2017). Cache locality optimization for recursive programs, ACM SIGPLAN Notices, 52:6, (1-16), Online publication date: 14-Sep-2017.
  97. González C and Fraguela B (2017). A general and efficient divide-and-conquer algorithm framework for multi-core clusters, Cluster Computing, 20:3, (2605-2626), Online publication date: 1-Sep-2017.
  98. ACM
    Yoga A and Nagarakatte S A fast causal profiler for task parallel programs Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering, (15-26)
  99. ACM
    Chu J, Zafar N and Yang X (2017). A Schur Complement Preconditioner for Scalable Parallel Fluid Simulation, ACM Transactions on Graphics, 36:4, (1), Online publication date: 20-Jul-2017.
  100. ACM
    Evans N, Olivier S, Barrett R and Stelle G Scheduling Chapel Tasks with Qthreads on Manycore Proceedings of the 7th International Workshop on Runtime and Operating Systems for Supercomputers ROSS 2017, (1-8)
  101. ACM
    Eftekhari A, Scheidegger S and Schenk O Parallelized Dimensional Decomposition for Large-Scale Dynamic Stochastic Economic Models Proceedings of the Platform for Advanced Scientific Computing Conference, (1-11)
  102. Kronbichler M, Kormann K, Pasichnyk I and Allalen M Fast Matrix-Free Discontinuous Galerkin Kernels on Modern Computer Architectures High Performance Computing, (237-255)
  103. ACM
    Gupta S, Shrivastava R and Nandivada V Optimizing recursive task parallel programs Proceedings of the International Conference on Supercomputing, (1-11)
  104. ACM
    Zlateski A and Seung H Compile-time optimized and statically scheduled N-D convnet primitives for multi-core and many-core (Xeon Phi) CPUs Proceedings of the International Conference on Supercomputing, (1-10)
  105. ACM
    Lifflander J and Krishnamoorthy S Cache locality optimization for recursive programs Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation, (1-16)
  106. Labasan S, Larsen M, Childs H and Rountree B Paviz Proceedings of the 17th Eurographics Symposium on Parallel Graphics and Visualization, (1-10)
  107. Agrawal P, Iqbal A, Russell B, Hazrati M, Kashyap V and Akhbari F PCE-SLAM: A real-time simultaneous localization and mapping using LiDAR data 2017 IEEE Intelligent Vehicles Symposium (IV), (1752-1757)
  108. Garcia V, Rico A, Villavieja C, Carpenter P, Navarro N and Ramirez A (2017). Adaptive Runtime-Assisted Block Prefetching on Chip-Multiprocessors, International Journal of Parallel Programming, 45:3, (530-550), Online publication date: 1-Jun-2017.
  109. Jannesari A, Huda Z, Atre R, Li Z and Wolf F Parallelizing audio analysis applications Proceedings of the 39th International Conference on Software Engineering: Software Engineering and Education Track, (57-66)
  110. Chwa H, Lee J, Lee J, Phan K, Easwaran A and Shin I (2017). Global EDF Schedulability Analysis for Parallel Tasks on Multi-Core Platforms, IEEE Transactions on Parallel and Distributed Systems, 28:5, (1331-1345), Online publication date: 1-May-2017.
  111. Han M, Park J and Baek W CHRT Proceedings of the Conference on Design, Automation & Test in Europe, (942-945)
  112. Bednárek D, Brabec M and Kruliš M (2017). Improving matrix-based dynamic programming on massively parallel accelerators, Information Systems, 64:C, (175-193), Online publication date: 1-Mar-2017.
  113. Sewall J, Pennycook S, Duran A, Tian X and Narayanaswamy R A modern memory management system for OpenMP Proceedings of the Third International Workshop on Accelerator Programming Using Directives, (25-35)
  114. Zlateski A, Lee K and Seung H ZNNi Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-12)
  115. ACM
    Yoga A, Nagarakatte S and Gupta A Parallel data race detection for task parallel programs with locks Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, (833-845)
  116. Shafique M, Ivanov A, Vogel B and Henkel J (2016). Scalable Power Management for On-Chip Systems with Malleable Applications, IEEE Transactions on Computers, 65:11, (3398-3412), Online publication date: 1-Nov-2016.
  117. ACM
    Danelutto M, De Matteis T, Mencagli G and Torquati M A divide-and-conquer parallel pattern implementation for multicores Proceedings of the 3rd International Workshop on Software Engineering for Parallel Systems, (10-19)
  118. Pirk H, Moll O, Zaharia M and Madden S (2016). Voodoo - a vector algebra for portable database performance on modern hardware, Proceedings of the VLDB Endowment, 9:14, (1707-1718), Online publication date: 1-Oct-2016.
  119. ACM
    Iwasaki S and Taura K A Static Cut-off for Task Parallel Programs Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, (139-150)
  120. ACM
    Reddy C, Kruse M and Cohen A Reduction Drawing Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, (87-97)
  121. ACM
    Turcksin B, Kronbichler M and Bangerth W (2016). WorkStream – A Design Pattern for Multicore-Enabled Finite Element Computations, ACM Transactions on Mathematical Software, 43:1, (1-29), Online publication date: 29-Aug-2016.
  122. Sethi D, Talupur M and Malik S (2016). Model checking unbounded concurrent lists, International Journal on Software Tools for Technology Transfer (STTT), 18:4, (375-391), Online publication date: 1-Aug-2016.
  123. ACM
    Boyer B, Eder C, Faugère J, Lachartre S and Martani F GBLA Proceedings of the ACM on International Symposium on Symbolic and Algebraic Computation, (135-142)
  124. ACM
    Jia X and Luo Q Multi-Assignment Single Joins for Parallel Cross-Match of Astronomic Catalogs on Heterogeneous Clusters Proceedings of the 28th International Conference on Scientific and Statistical Database Management, (1-12)
  125. ACM
    Yang C and Mellor-Crummey J A Practical Solution to the Cactus Stack Problem Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures, (61-70)
  126. ACM
    Agrawal K, Li J, Lu K and Moseley B Scheduling Parallelizable Jobs Online to Minimize the Maximum Flow Time Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures, (195-205)
  127. ACM
    Kaler T, Hasenplaugh W, Schardl T and Leiserson C (2016). Executing Dynamic Data-Graph Computations Deterministically Using Chromatic Scheduling, ACM Transactions on Parallel Computing, 3:1, (1-31), Online publication date: 28-Jun-2016.
  128. ACM
    Andreetta C, Bégot V, Berthold J, Elsman M, Henglein F, Henriksen T, Nordfang M and Oancea C (2016). FinPar, ACM Transactions on Architecture and Code Optimization, 13:2, (1-27), Online publication date: 27-Jun-2016.
  129. ACM
    Schaller M, Gonnet P, Chalk A and Draper P SWIFT Proceedings of the Platform for Advanced Scientific Computing Conference, (1-10)
  130. Sridi M, Raffin B and Faucher V (2016). Cache Aware Dynamics Data Layout for Efficient Shared Memory Parallelisation of EUROPLEXUS, Procedia Computer Science, 80:C, (1083-1092), Online publication date: 1-Jun-2016.
  131. Tran Tan A, Falcou J, Etiemble D and Kaiser H (2016). Automatic Task-Based Code Generation for High Performance Domain Specific Embedded Language, International Journal of Parallel Programming, 44:3, (449-465), Online publication date: 1-Jun-2016.
  132. ACM
    Roy P, Liu X and Song S SMT-Aware Instantaneous Footprint Optimization Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, (267-279)
  133. Liu Y, Hankeln T and Schmidt B (2016). Parallel and space-efficient construction of burrows-wheeler transform and suffix array for big genome data, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 13:3, (592-598), Online publication date: 1-May-2016.
  134. Vilches A, Navarro A, Asenjo R, Corbera F, Gran R and Garzarán M (2016). Mapping Streaming Applications on Commodity Multi-CPU and GPU On-Chip Processors, IEEE Transactions on Parallel and Distributed Systems, 27:4, (1099-1115), Online publication date: 1-Apr-2016.
  135. Briggs J, Pennycook S, Fergusson J, Jäykkä J and Shellard E (2016). Separable projection integrals for higher-order correlators of the cosmic microwave sky, Journal of Computational Physics, 310:C, (285-300), Online publication date: 1-Apr-2016.
  136. Charousset D, Hiesgen R and Schmidt T (2016). Revisiting actor programming in C++, Computer Languages, Systems and Structures, 45:C, (105-131), Online publication date: 1-Apr-2016.
  137. Hansen G, Xavier P, Mish S, Voth T, Heinstein M and Glass M (2016). An MPI+X implementation of contact global search using Kokkos, Engineering with Computers, 32:2, (295-311), Online publication date: 1-Apr-2016.
  138. Cesarini D, Marongiu A and Benini L An optimized task-based runtime system for resource-constrained parallel accelerators Proceedings of the 2016 Conference on Design, Automation & Test in Europe, (1261-1266)
  139. ACM
    Dolz M, del Rio Astorga D, Fernández J, García J, García-Carballeira F, Danelutto M and Torquati M Embedding Semantics of the Single-Producer/Single-Consumer Lock-Free Queue into a Race Detection Tool Proceedings of the 7th International Workshop on Programming Models and Applications for Multicores and Manycores, (20-29)
  140. Mansouri F, Huet S and Houzet D (2016). A domain-specific high-level programming model, Concurrency and Computation: Practice & Experience, 28:3, (750-767), Online publication date: 10-Mar-2016.
  141. ACM
    Yoga A and Nagarakatte S Atomicity violation checker for task parallel programs Proceedings of the 2016 International Symposium on Code Generation and Optimization, (239-249)
  142. Membarth R, Reiche O, Hannig F, Teich J, Korner M and Eckert W (2016). HIPAcc: A Domain-Specific Language and Compiler for Image Processing, IEEE Transactions on Parallel and Distributed Systems, 27:1, (210-224), Online publication date: 1-Jan-2016.
  143. ACM
    Zhang G, Horn W and Sanchez D Exploiting commutativity to reduce the cost of updates to shared data in cache-coherent systems Proceedings of the 48th International Symposium on Microarchitecture, (13-25)
  144. ACM
    Brooks A, Dang H, Dryden N and Snir M PPL Proceedings of the First International Workshop on Extreme Scale Programming Models and Middleware, (2-9)
  145. ACM
    Liu C and Kulkarni M Optimizing the LULESH stencil code using concurrent collections Proceedings of the 5th International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, (1-10)
  146. ACM
    Aga S, Krishnamoorthy S and Narayanasamy S CilkSpec Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-12)
  147. Chohra C, Langlois P and Parello D Efficiency of Reproducible Level 1 BLAS Scientific Computing, Computer Arithmetic, and Validated Numerics, (99-108)
  148. ACM
    Ren B, Jo Y, Krishnamoorthy S, Agrawal K and Kulkarni M (2015). Efficient execution of recursive programs on commodity vector hardware, ACM SIGPLAN Notices, 50:6, (509-520), Online publication date: 7-Aug-2015.
  149. ACM
    Emani M and O'Boyle M (2015). Celebrating diversity: a mixture of experts approach for runtime mapping in dynamic environments, ACM SIGPLAN Notices, 50:6, (499-508), Online publication date: 7-Aug-2015.
  150. ACM
    Bailey M Fundamentals seminar ACM SIGGRAPH 2015 Courses, (1-129)
  151. ACM
    Chen Q and Guo M (2015). Locality-Aware Work Stealing Based on Online Profiling and Auto-Tuning for Multisocket Multicore Architectures, ACM Transactions on Architecture and Code Optimization, 12:2, (1-24), Online publication date: 8-Jul-2015.
  152. Zhang J, Behzad B and Snir M (2015). Design of a Multithreaded Barnes-Hut Algorithm for Multicore Clusters, IEEE Transactions on Parallel and Distributed Systems, 26:7, (1861-1873), Online publication date: 1-Jul-2015.
  153. ACM
    Mohr M, Buchwald S, Zwinkau A, Erhardt C, Oechslein B, Schedel J and Lohmann D Cutting out the middleman: OS-level support for X10 activities Proceedings of the ACM SIGPLAN Workshop on X10, (13-18)
  154. ACM
    Lee I and Schardl T Efficiently Detecting Races in Cilk Programs That Use Reducer Hyperobjects Proceedings of the 27th ACM symposium on Parallelism in Algorithms and Architectures, (111-122)
  155. ACM
    Hou K, Wang H and Feng W ASPaS Proceedings of the 29th ACM on International Conference on Supercomputing, (383-392)
  156. ACM
    Ren B, Jo Y, Krishnamoorthy S, Agrawal K and Kulkarni M Efficient execution of recursive programs on commodity vector hardware Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, (509-520)
  157. ACM
    Emani M and O'Boyle M Celebrating diversity: a mixture of experts approach for runtime mapping in dynamic environments Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, (499-508)
  158. ACM
    Duan Y, Honarmand N and Torrellas J (2015). Asymmetric Memory Fences, ACM SIGARCH Computer Architecture News, 43:1, (531-543), Online publication date: 29-May-2015.
  159. ACM
    Duan Y, Honarmand N and Torrellas J (2015). Asymmetric Memory Fences, ACM SIGPLAN Notices, 50:4, (531-543), Online publication date: 12-May-2015.
  160. Harnie D, Vapirev A, Wegner J, Gedich A, Steijaert M, Wuyts R and De Meuter W Scaling machine learning for target prediction in drug discovery using Apache Spark Proceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, (871-879)
  161. ACM
    Duan Y, Honarmand N and Torrellas J Asymmetric Memory Fences Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, (531-543)
  162. ACM
    Beard J, Li P and Chamberlain R RaftLib Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, (96-105)
  163. ACM
    Ramachandran A and Mittal N A Fast Lock-Free Internal Binary Search Tree Proceedings of the 16th International Conference on Distributed Computing and Networking, (1-10)
  164. Marzulo L, Alves T, França F and Costa V (2014). Couillard, Parallel Computing, 40:10, (661-680), Online publication date: 1-Dec-2014.
  165. Chen J and Edelman A Parallel prefix polymorphism permits parallelization, presentation & proof Proceedings of the 1st Workshop for High Performance Technical Computing in Dynamic Languages, (47-56)
  166. ACM
    Kumar V, Zheng Y, Cavé V, Budimlić Z and Sarkar V HabaneroUPC++ Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, (1-10)
  167. Menezes N Parallel Initialization on Distributed Simulation Systems Proceedings of the 2014 IEEE/ACM 18th International Symposium on Distributed Simulation and Real Time Applications, (18-24)
  168. ACM
    Kumar V, Blackburn S and Grove D (2014). Friendly barriers, ACM SIGPLAN Notices, 49:7, (165-176), Online publication date: 25-Sep-2014.
  169. ACM
    Imam S and Sarkar V Habanero-Java library Proceedings of the 2014 International Conference on Principles and Practices of Programming on the Java platform: Virtual machines, Languages, and Tools, (75-86)
  170. ACM
    Totoo P and Loidl H Lazy data-oriented evaluation strategies Proceedings of the 3rd ACM SIGPLAN workshop on Functional high-performance computing, (63-74)
  171. ACM
    Lutz T and Grover V LambdaJIT Proceedings of the 3rd ACM SIGPLAN workshop on Functional high-performance computing, (99-108)
  172. Mansouri F, Huet S and Houzet D A Visual Programming Model to Implement Coarse-Grained DSP Applications on Parallel and Heterogeneous Clusters Revised Selected Papers, Part I, of the Euro-Par 2014 International Workshops on Parallel Processing - Volume 8805, (141-152)
  173. ACM
    Jung W, Park J and Lee J Versatile and scalable parallel histogram construction Proceedings of the 23rd international conference on Parallel architectures and compilation, (127-138)
  174. Imam S and Sarkar V Cooperative Scheduling of Parallel Tasks with General Synchronization Patterns Proceedings of the 28th European Conference on ECOOP 2014 --- Object-Oriented Programming - Volume 8586, (618-643)
  175. ACM
    Agrawal K, Fineman J, Lu K, Sheridan B, Sukha J and Utterback R Provably good scheduling for parallel programs that use data structures through implicit batching Proceedings of the 26th ACM symposium on Parallelism in algorithms and architectures, (84-95)
  176. ACM
    Kaler T, Hasenplaugh W, Schardl T and Leiserson C Executing dynamic data-graph computations deterministically using chromatic scheduling Proceedings of the 26th ACM symposium on Parallelism in algorithms and architectures, (154-165)
  177. Scogland T, Feng W, Rountree B and Supinski B CoreTSAR Proceedings of the 29th International Conference on Supercomputing - Volume 8488, (172-186)
  178. ACM
    Gilad E, Mackay E, Oskin M and Etsion Y O-structures Proceedings of the workshop on Memory Systems Performance and Correctness, (1-8)
  179. ACM
    Chen Q, Guo M and Guan H LAWS Proceedings of the 28th ACM international conference on Supercomputing, (3-12)
  180. ACM
    Imam S, Sarkar V, Leibs D and Kessler P Exploiting Implicit Parallelism in Dynamic Array Programming Languages Proceedings of ACM SIGPLAN International Workshop on Libraries, Languages, and Compilers for Array Programming, (1-7)
  181. ACM
    Sridharan S, Gupta G and Sohi G Adaptive, efficient, parallel execution of parallel programs Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation, (169-180)
  182. ACM
    Sridharan S, Gupta G and Sohi G (2014). Adaptive, efficient, parallel execution of parallel programs, ACM SIGPLAN Notices, 49:6, (169-180), Online publication date: 5-Jun-2014.
  183. ACM
    Katsogridakis P and Pratikakis P Micro-checkpointing in fault tolerant runtimes Proceedings of the 11th ACM Conference on Computing Frontiers, (1-10)
  184. ACM
    Morrison A and Afek Y (2014). Fence-free work stealing on bounded TSO processors, ACM SIGARCH Computer Architecture News, 42:1, (413-426), Online publication date: 5-Apr-2014.
  185. ACM
    Morrison A and Afek Y (2014). Fence-free work stealing on bounded TSO processors, ACM SIGPLAN Notices, 49:4, (413-426), Online publication date: 5-Apr-2014.
  186. Totoo P and Loidl H (2014). Parallel Haskell implementations of the N-body problem, Concurrency and Computation: Practice & Experience, 26:4, (987-1019), Online publication date: 25-Mar-2014.
  187. Loidl H and Singer J (2014). SICSA multicore challenge editorial preface, Concurrency and Computation: Practice & Experience, 26:4, (929-934), Online publication date: 25-Mar-2014.
  188. ACM
    Kumar V, Blackburn S and Grove D Friendly barriers Proceedings of the 10th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments, (165-176)
  189. ACM
    Morrison A and Afek Y Fence-free work stealing on bounded TSO processors Proceedings of the 19th international conference on Architectural support for programming languages and operating systems, (413-426)
  190. ACM
    Zhang W, Tardieu O, Grove D, Herta B, Kamada T, Saraswat V and Takeuchi M GLB Proceedings of the first workshop on Parallel programming for analytics applications, (31-40)
  191. ACM
    Chen Q, Zheng L and Guo M DWS Proceedings of Programming Models and Applications on Multicores and Manycores, (131-139)
  193. ACM
    Chen Q and Guo M (2014). Adaptive workload-aware task scheduling for single-ISA asymmetric multicore architectures, ACM Transactions on Architecture and Code Optimization, 11:1, (1-25), Online publication date: 1-Feb-2014.
  194. Hawick K and Playne D Developmental directions in parallel accelerators Proceedings of the Twelfth Australasian Symposium on Parallel and Distributed Computing - Volume 152, (21-27)
  195. ACM
    Totoni E, Dikmen M and Garzarán M (2013). Easy, fast, and energy-efficient object detection on heterogeneous on-chip architectures, ACM Transactions on Architecture and Code Optimization, 10:4, (1-25), Online publication date: 1-Dec-2013.
  196. Shafique M and Henkel J Agent-based distributed power management for kilo-core processors Proceedings of the International Conference on Computer-Aided Design, (153-160)
  197. ACM
    Moreland K, Geveci B, Ma K and Maynard R A classification of scientific visualization algorithms for massive threading Proceedings of the 8th International Workshop on Ultrascale Visualization, (1-10)
  198. Gastineau M and Laskar J Highly Scalable Multiplication for Distributed Sparse Multivariate Polynomials on Many-Core Systems Proceedings of the 15th International Workshop on Computer Algebra in Scientific Computing - Volume 8136, (100-115)
  199. Zakkak F, Chasapis D, Pratikakis P, Bilas A and Nikolopoulos D Inference and Declaration of Independence in Task-Parallel Programs Revised Selected Papers of the 10th International Symposium on Advanced Parallel Processing Technologies - Volume 8299, (1-16)
  200. Nanz S, West S and da Silveira K Examining the expert gap in parallel programming Proceedings of the 19th international conference on Parallel Processing, (434-445)
  201. ACM
    Cascaval C, Fowler S, Montesinos-Ortego P, Piekarski W, Reshadi M, Robatmili B, Weber M and Bhavsar V (2013). ZOOMM, ACM SIGPLAN Notices, 48:8, (271-280), Online publication date: 23-Aug-2013.
  202. ACM
    Meyerovich L, Torok M, Atkinson E and Bodik R (2013). Parallel schedule synthesis for attribute grammars, ACM SIGPLAN Notices, 48:8, (187-196), Online publication date: 23-Aug-2013.
  203. Schill M, Nanz S and Meyer B Handling Parallelism in a Concurrency Model Proceedings of the International Conference on Multicore Software Engineering, Performance, and Tools - Volume 8063, (37-48)
  204. ACM
    Benoit A, Çatalyürek Ü, Robert Y and Saule E (2013). A survey of pipelined workflow scheduling, ACM Computing Surveys, 45:4, (1-36), Online publication date: 1-Aug-2013.
  205. ACM
    Lee Y, Avizienis R, Bishara A, Xia R, Lockhart D, Batten C and Asanović K (2013). Exploring the Tradeoffs between Programmability and Efficiency in Data-Parallel Accelerators, ACM Transactions on Computer Systems, 31:3, (1-38), Online publication date: 1-Aug-2013.
  206. ACM
    Kaufman A, Haddon J, Imanishi I and Moser L Developing a unified pipeline with character Proceedings of the Symposium on Digital Production, (37-43)
  207. Best M, Vining N, Jacobsen D and Fedorova A Collection-focused parallelism Proceedings of the 5th USENIX Conference on Hot Topics in Parallelism, (3-3)
  208. ACM
    Lifflander J, Krishnamoorthy S and Kale L (2013). Steal Tree, ACM SIGPLAN Notices, 48:6, (507-518), Online publication date: 23-Jun-2013.
  209. ACM
    Paudel J, Tardieu O and Amaral J Hybrid parallel task placement in X10 Proceedings of the third ACM SIGPLAN X10 Workshop, (31-38)
  210. ACM
    Marvie J, Hirtzlin P and Gautron P Many-core event evaluation Proceedings of the 18th International Conference on 3D Web Technology, (181-189)
  211. ACM
    Gill H, Lin D, Han X, Nguyen C, Gill T and Loo B Scalanytics Proceedings of the 22nd international symposium on High-performance parallel and distributed computing, (61-72)
  213. ACM
    Lifflander J, Krishnamoorthy S and Kale L Steal Tree Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation, (507-518)
  214. ACM
    Sridharan S, Gupta G and Sohi G Holistic run-time parallelism management for time and energy efficiency Proceedings of the 27th international ACM conference on International conference on supercomputing, (337-348)
  215. Ettinger M, Broquedis F, Gautier T, Ploix S and Raffin B vtkSMP Proceedings of the 13th Eurographics Symposium on Parallel Graphics and Visualization, (41-48)
  216. ACM
    Serfass D and Tang P parallel_dp Proceedings of the 51st ACM Southeast Conference, (1-6)
  217. ACM
    Tang P A C++ library for rapid development of efficient parallel dense linear algebra codes for multicore computers Proceedings of the 51st ACM Southeast Conference, (1-6)
  218. Shafique M, Vogel B and Henkel J Self-adaptive hybrid dynamic power management for many-core systems Proceedings of the Conference on Design, Automation and Test in Europe, (51-56)
  219. ACM
    Cascaval C, Fowler S, Montesinos-Ortego P, Piekarski W, Reshadi M, Robatmili B, Weber M and Bhavsar V ZOOMM Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming, (271-280)
  220. ACM
    Meyerovich L, Torok M, Atkinson E and Bodik R Parallel schedule synthesis for attribute grammars Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming, (187-196)
  221. Feng M, Tan L and Gupta R Lightweight fault detection in parallelized programs Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization (CGO), (1-11)
  222. Jiménez J and Ruiz de Miras J (2012). Fast box-counting algorithm on GPU, Computer Methods and Programs in Biomedicine, 108:3, (1229-1242), Online publication date: 1-Dec-2012.
  223. Tang P (2012). Measuring the overhead of Intel C++ Concurrent Collections over Threading Building Blocks for Gauss–Jordan elimination, Concurrency and Computation: Practice & Experience, 24:18, (2282-2301), Online publication date: 1-Dec-2012.
  224. ACM
    Anderson Z (2012). Efficiently combining parallel software using fine-grained, language-level, hierarchical resource management policies, ACM SIGPLAN Notices, 47:10, (717-736), Online publication date: 15-Nov-2012.
  225. ACM
    Kumar V, Frampton D, Blackburn S, Grove D and Tardieu O (2012). Work-stealing without the baggage, ACM SIGPLAN Notices, 47:10, (297-314), Online publication date: 15-Nov-2012.
  226. Olivier S, de Supinski B, Schulz M and Prins J Characterizing and mitigating work time inflation in task parallel programs Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, (1-12)
  227. ACM
    McKenney P Beyond expert-only parallel programming? Proceedings of the 2012 ACM workshop on Relaxing synchronization for multicore and manycore scalability, (25-32)
  228. ACM
    Anderson Z Efficiently combining parallel software using fine-grained, language-level, hierarchical resource management policies Proceedings of the ACM international conference on Object oriented programming systems languages and applications, (717-736)
  229. ACM
    Kumar V, Frampton D, Blackburn S, Grove D and Tardieu O Work-stealing without the baggage Proceedings of the ACM international conference on Object oriented programming systems languages and applications, (297-314)
  230. ACM
    Foltzer A, Kulkarni A, Swords R, Sasidharan S, Jiang E and Newton R (2012). A meta-scheduler for the par-monad, ACM SIGPLAN Notices, 47:9, (235-246), Online publication date: 15-Oct-2012.
  231. ACM
    Raman A, Lee J and August D From sequential programming to flexible parallel execution Proceedings of the 2012 international conference on Compilers, architectures and synthesis for embedded systems, (37-40)
  232. ACM
    Gill H, Lin D, Sarna L, Mead R, Lee K and Loo B (2012). SP4, ACM SIGCOMM Computer Communication Review, 42:4, (75-76), Online publication date: 24-Sep-2012.
  233. ACM
    Totoo P, Deligiannis P and Loidl H Haskell vs. F# vs. Scala Proceedings of the 1st ACM SIGPLAN workshop on Functional high-performance computing, (49-60)
  234. ACM
    Tardieu O, Wang H and Lin H (2012). A work-stealing scheduler for X10's task parallelism with suspension, ACM SIGPLAN Notices, 47:8, (267-276), Online publication date: 11-Sep-2012.
  235. ACM
    Leiserson C, Schardl T and Sukha J (2012). Deterministic parallel random-number generation for dynamic-multithreading platforms, ACM SIGPLAN Notices, 47:8, (193-204), Online publication date: 11-Sep-2012.
  236. ACM
    Foltzer A, Kulkarni A, Swords R, Sasidharan S, Jiang E and Newton R A meta-scheduler for the par-monad Proceedings of the 17th ACM SIGPLAN international conference on Functional programming, (235-246)
  237. Galgonek J, Kruliš M and Hoksza D On the parallelization of the SProt measure and the TM-Score algorithm Proceedings of the 18th international conference on Parallel processing workshops, (238-247)
  238. Galgonek J, Skopal T and Hoksza D P3S Proceedings of the 18th international conference on Parallel processing workshops, (228-237)
  239. Aldinucci M, Danelutto M, Kilpatrick P, Meneghin M and Torquati M An efficient unbounded lock-free queue for multi-core systems Proceedings of the 18th international conference on Parallel Processing, (662-673)
  240. Agathos S, Kallimanis N and Dimakopoulos V Speeding up OpenMP tasking Proceedings of the 18th international conference on Parallel Processing, (650-661)
  241. ACM
    Gill H, Lin D, Sarna L, Mead R, Lee K and Loo B SP4 Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication, (75-76)
  242. ACM
    Feng M, Gupta R and Neamtiu I (2012). Effective parallelization of loops in the presence of I/O operations, ACM SIGPLAN Notices, 47:6, (487-498), Online publication date: 6-Aug-2012.
  243. ACM
    Watt M, Cutler L, Powell A, Duncan B, Hutchinson M and Ochs K LibEE Proceedings of the Digital Production Symposium, (59-66)
  244. Sethi D, Talupur M, Schwartz-Narbonne D and Malik S Parameterized model checking of fine grained concurrency Proceedings of the 19th international conference on Model Checking Software, (208-226)
  245. ACM
    Radenski A Integrating data-intensive cloud computing with multicores and clusters in an HPC course Proceedings of the 17th ACM annual conference on Innovation and technology in computer science education, (69-74)
  246. ACM
    Lee I, Shafi A and Leiserson C Memory-mapping support for reducer hyperobjects Proceedings of the twenty-fourth annual ACM symposium on Parallelism in algorithms and architectures, (287-297)
  247. ACM
    Chen Q, Guo M and Huang Z CATS Proceedings of the 26th ACM international conference on Supercomputing, (163-172)
  248. ACM
    Ma W and Krishnamoorthy S Data-driven fault tolerance for work stealing computations Proceedings of the 26th ACM international conference on Supercomputing, (79-90)
  249. ACM
    Lifflander J, Krishnamoorthy S and Kale L Work stealing and persistence-based load balancers for iterative overdecomposed applications Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing, (137-148)
  250. Sanjuan-Estrada J, Casado L, García I and Hendrix E Performance driven cooperation between kernel and auto-tuning multi-threaded interval b&b applications Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part I, (57-70)
  251. Versaci F and Pingali K Processor allocation for optimistic parallelization of irregular programs Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part I, (1-14)
  252. ACM
    Feng M, Gupta R and Neamtiu I Effective parallelization of loops in the presence of I/O operations Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language Design and Implementation, (487-498)
  253. Monteiro P HARPPIE: hyper algorithmic recipe for productive parallelism intensive endeavors Proceedings of the 34th International Conference on Software Engineering, (1559-1562)
  254. Christmann C, Hebisch E and Weisbecker A Oversubscription of computational resources on multicore desktop systems Proceedings of the 2012 international conference on Multicore Software Engineering, Performance, and Tools, (18-29)
  255. Lobachev O Parallel computation skeletons with premature termination property Proceedings of the 11th international conference on Functional and Logic Programming, (197-212)
  256. ACM
    Rahman S, Guo J, Bhat A, Garcia C, Sujon M, Yi Q, Liao C and Quinlan D Studying the impact of application-level optimizations on the power consumption of multi-core architectures Proceedings of the 9th conference on Computing Frontiers, (123-132)
  257. ACM
    Serfass D and Tang P Comparing parallel performance of Go and C++ TBB on a direct acyclic task graph using a dynamic programming problem Proceedings of the 50th Annual Southeast Regional Conference, (268-273)
  258. Ashraf R, Luna F, Dechev D and DeMara R Designing digital circuits for FPGAs using parallel genetic algorithms (WIP) Proceedings of the 2012 Symposium on Theory of Modeling and Simulation - DEVS Integrative M&S Symposium, (1-6)
  259. ACM
    Lauderdale C and Khan R Towards a codelet-based runtime for exascale computing Proceedings of the 2nd International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era, (21-26)
  260. ACM
    Edwards H and Sunderland D Kokkos Array performance-portable manycore programming model Proceedings of the 2012 International Workshop on Programming Models and Applications for Multicores and Manycores, (1-10)
  261. ACM
    Tardieu O, Wang H and Lin H A work-stealing scheduler for X10's task parallelism with suspension Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming, (267-276)
  262. ACM
    Leiserson C, Schardl T and Sukha J Deterministic parallel random-number generation for dynamic-multithreading platforms Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming, (193-204)
  263. ACM
    Burrows E and Haveraaen M Programmable data dependencies and placements Proceedings of the 7th workshop on Declarative aspects and applications of multicore programming, (31-40)
  264. ACM
    Feng M, Lin C and Gupta R (2012). PLDS, ACM Transactions on Architecture and Code Optimization, 8:4, (1-21), Online publication date: 1-Jan-2012.
  265. ACM
    Rico A, Cabarcas F, Villavieja C, Pavlovic M, Vega A, Etsion Y, Ramirez A and Valero M (2012). On the simulation of large-scale architectures using multiple application abstraction levels, ACM Transactions on Architecture and Code Optimization, 8:4, (1-20), Online publication date: 1-Jan-2012.
  266. ACM
    Bailey M and Cunningham S Introduction to computer graphics SIGGRAPH Asia 2011 Courses, (1-58)
  267. ACM
    Bangerth W, Burstedde C, Heister T and Kronbichler M (2011). Algorithms and data structures for massively parallel generic adaptive finite element codes, ACM Transactions on Mathematical Software, 38:2, (1-28), Online publication date: 1-Dec-2011.
  268. Pedemonte M, Nesmachnow S and Cancela H (2011). A survey on parallel ant colony optimization, Applied Soft Computing, 11:8, (5181-5197), Online publication date: 1-Dec-2011.
  269. ACM
    Davis T (2011). Algorithm 915, SuiteSparseQR, ACM Transactions on Mathematical Software, 38:1, (1-22), Online publication date: 1-Nov-2011.
  270. ACM
    Rhoden B, Klues K, Zhu D and Brewer E Improving per-node efficiency in the datacenter with new OS abstractions Proceedings of the 2nd ACM Symposium on Cloud Computing, (1-8)
  271. ACM
    Monteiro P, Monteiro M and Pingali K Parallelizing irregular algorithms Proceedings of the 18th Conference on Pattern Languages of Programs, (1-18)
  272. ACM
    Kobbe S, Bauer L, Lohmann D, Schröder-Preikschat W and Henkel J DistRM Proceedings of the seventh IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, (119-128)
  273. Nogina S, Unterweger K and Weinzierl T Autotuning of adaptive mesh refinement PDE solvers on shared memory architectures Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I, (671-680)
  274. ACM
    Saraswat V, Kambadur P, Kodali S, Grove D and Krishnamoorthy S (2011). Lifeline-based global load balancing, ACM SIGPLAN Notices, 46:8, (201-212), Online publication date: 7-Sep-2011.
  275. ACM
    Feng M, Gupta R and Hu Y (2011). SpiceC, ACM SIGPLAN Notices, 46:8, (69-80), Online publication date: 7-Sep-2011.
  276. Sreeram J and Pande S Parallelizing a real-time physics engine using transactional memory Proceedings of the 17th international conference on Parallel processing - Volume Part II, (206-223)
  277. Subotic V, Ferrer R, Sancho J, Labarta J and Valero M Quantifying the potential task-based dataflow parallelism in MPI applications Proceedings of the 17th international conference on Parallel processing - Volume Part I, (39-51)
  278. Heinecke A, Klemm M, Pflüger D, Bode A and Bungartz H Extending a highly parallel data mining algorithm to the Intel® Many Integrated Core architecture Proceedings of the 2011 international conference on Parallel Processing - Volume 2, (375-384)
  279. ACM
    Cavé V, Zhao J, Shirako J and Sarkar V Habanero-Java Proceedings of the 9th International Conference on Principles and Practice of Programming in Java, (51-61)
  280. Bocchino R and Adve V Types, regions, and effects for safe programming with object-oriented parallel frameworks Proceedings of the 25th European conference on Object-oriented programming, (306-332)
  281. Petrov E Scalable parallel interval propagation for sparse constraint satisfaction problems Proceedings of the 8th international conference on Perspectives of System Informatics, (302-312)
  282. ACM
    Pratikakis P, Vandierendonck H, Lyberis S and Nikolopoulos D A programming model for deterministic task parallelism Proceedings of the 2011 ACM SIGPLAN Workshop on Memory Systems Performance and Correctness, (7-12)
  283. ACM
    Raman A, Kim H, Oh T, Lee J and August D Parallelism orchestration using DoPE Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation, (26-37)
  284. ACM
    Raman A, Kim H, Oh T, Lee J and August D (2011). Parallelism orchestration using DoPE, ACM SIGPLAN Notices, 46:6, (26-37), Online publication date: 4-Jun-2011.
  285. ACM
    Lev Y and Moir M Lightweight parallel accumulators using C++ templates Proceedings of the 4th International Workshop on Multicore Software Engineering, (33-40)
  286. ACM
    Cledat R, Ravichandran K and Pande S Leveraging data-structure semantics for efficient algorithmic parallelism Proceedings of the 8th ACM International Conference on Computing Frontiers, (1-10)
  287. ACM
    Gray I and Audsley N (2011). Targeting complex embedded architectures by combining the multicore communications API (MCAPI) with compile-time virtualisation, ACM SIGPLAN Notices, 46:5, (51-60), Online publication date: 11-Apr-2011.
  288. ACM
    Gray I and Audsley N Targeting complex embedded architectures by combining the multicore communications API (MCAPI) with compile-time virtualisation Proceedings of the 2011 SIGPLAN/SIGBED conference on Languages, compilers and tools for embedded systems, (51-60)
  289. ACM
    Bhattacharjee A, Contreras G and Martonosi M (2011). Parallelization libraries, ACM Transactions on Architecture and Code Optimization, 8:1, (1-29), Online publication date: 1-Apr-2011.
  290. ACM
    Niño J Designing an undergraduate curriculum based on parallelism and concurrency Proceedings of the 49th Annual Southeast Regional Conference, (1-6)
  291. ACM
    Hohenstein U and Gleim U Using aspect-orientation to simplify concurrent programming Proceedings of the tenth international conference on Aspect-oriented software development companion, (29-40)
  292. ACM
    Dhanasekaran B and Rubin N A new method for GPU based irregular reductions and its application to k-means clustering Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units, (1-8)
  293. Membarth R, Hannig F, Teich J, Körner M and Eckert W Frameworks for multi-core architectures Proceedings of the 24th international conference on Architecture of computing systems, (62-73)
  294. ACM
    Gupta V, Knauerhase R and Schwan K (2011). Attaining system performance points, ACM SIGOPS Operating Systems Review, 45:1, (3-10), Online publication date: 18-Feb-2011.
  295. ACM
    Saraswat V, Kambadur P, Kodali S, Grove D and Krishnamoorthy S Lifeline-based global load balancing Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, (201-212)
  296. ACM
    Feng M, Gupta R and Hu Y SpiceC Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, (69-80)
  297. ACM
    Bailey M and Cunningham S Introduction to computer graphics ACM SIGGRAPH ASIA 2010 Courses, (1-100)
  298. ACM
    Trümper J, Bohnet J and Döllner J Understanding complex multithreaded software systems by using trace visualization Proceedings of the 5th international symposium on Software visualization, (133-142)
  299. Grossman M, Sbîrlea A, Budimlić Z and Sarkar V CnC-CUDA Proceedings of the 23rd international conference on Languages and compilers for parallel computing, (230-245)
  300. Nicolau A and Kejariwal A How many threads to spawn during program multithreading? Proceedings of the 23rd international conference on Languages and compilers for parallel computing, (166-183)
  301. Power R and Li J Piccolo Proceedings of the 9th USENIX conference on Operating systems design and implementation, (293-306)
  302. ACM
    Matsuzaki K and Emoto K Lessons from implementing the biCGStab method with SkeTo library Proceedings of the fourth international workshop on High-level parallel programming and applications, (15-24)
  303. Cera M, Lima J, Maillard N and Navaux P Challenges and issues of supporting task parallelism in MPI Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface, (302-305)
  304. Garrett D PMF Proceedings of the 11th international conference on Parallel problem solving from nature: Part II, (351-360)
  305. ACM
    Lee I, Boyd-Wickizer S, Huang Z and Leiserson C Using memory mapping to support cactus stacks in work-stealing runtime systems Proceedings of the 19th international conference on Parallel architectures and compilation techniques, (411-420)
  306. Mak J, Faxén K, Janson S and Mycroft A Estimating and exploiting potential parallelism by source-level dependence profiling Proceedings of the 16th international Euro-Par conference on Parallel processing: Part I, (26-37)
  307. Otto F, Schaefer C, Dempe M and Tichy W A language-based tuning mechanism for task and pipeline parallelism Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II, (328-340)
  308. Watanobe Y, Yoshioka R and Mirenkov N Embedded Clarity in Filmification of Methods Proceedings of the 2010 conference on New Trends in Software Methodologies, Tools and Techniques: Proceedings of the 9th SoMeT_10, (70-82)
  309. ACM
    Adve S and Boehm H (2010). Memory models, Communications of the ACM, 53:8, (90-101), Online publication date: 1-Aug-2010.
  310. ACM
    Gastineau M Parallel operations of sparse polynomials on multicores Proceedings of the 4th International Workshop on Parallel and Symbolic Computation, (44-52)
  311. Ostrowski K, Sakoda C and Birman K Self-replicating objects for multicore platforms Proceedings of the 24th European conference on Object-oriented programming, (452-477)
  312. ACM
    Kelm J, Johnson D, Tuohy W, Lumetta S and Patel S (2010). Cohesion, ACM SIGARCH Computer Architecture News, 38:3, (429-440), Online publication date: 19-Jun-2010.
  313. ACM
    Kelm J, Johnson D, Tuohy W, Lumetta S and Patel S Cohesion Proceedings of the 37th annual international symposium on Computer architecture, (429-440)
  314. Hong G, Hong K, Burgstaller B and Blieberger J AdaStreams Proceedings of the 15th Ada-Europe international conference on Reliable Software Technologies, (208-221)
  315. Duran A, Ferrer R, Klemm M, de Supinski B and Ayguadé E A proposal for user-defined reductions in OpenMP Proceedings of the 6th international conference on Beyond Loop Level Parallelism in OpenMP: accelerators, Tasking and more, (43-55)
  316. ACM
    Agrawal K, Lee I and Sukha J Brief announcement Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures, (186-188)
  317. ACM
    He Y, Leiserson C and Leiserson W The Cilkview scalability analyzer Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures, (145-156)
  318. ACM
    Pan H, Hindman B and Asanović K (2010). Composing parallel software efficiently with lithe, ACM SIGPLAN Notices, 45:6, (376-387), Online publication date: 12-Jun-2010.
  319. Kirschenmann W, Plagne L and Vialle S Multi-Target vectorization with MTPS C++ generic library Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume 2, (336-346)
  320. ACM
    Pan H, Hindman B and Asanović K Composing parallel software efficiently with lithe Proceedings of the 31st ACM SIGPLAN Conference on Programming Language Design and Implementation, (376-387)
  321. ACM
    Buss A, Harshvardhan, Papadopoulos I, Pearce O, Smith T, Tanase G, Thomas N, Xu X, Bianco M, Amato N and Rauchwerger L STAPL Proceedings of the 3rd Annual Haifa Experimental Systems Conference, (1-10)
  322. Li J, Wang S, Hsu P, Chen P and Lee J A multi-core software API for embedded MPSoC environments Proceedings of the Second Russia-Taiwan conference on Methods and tools of parallel programming multicomputers, (40-50)
  323. Martin S, Shen H and McCormick P Load-balanced isosurfacing on multi-GPU clusters Proceedings of the 10th Eurographics conference on Parallel Graphics and Visualization, (91-100)
  324. Howison M, Bethel E and Childs H MPI-hybrid parallelism for volume rendering on large, multi-core systems Proceedings of the 10th Eurographics conference on Parallel Graphics and Visualization, (1-10)
  325. ACM
    Agrawal K, Leiserson C and Sukha J (2010). Helper locks for fork-join parallel programming, ACM SIGPLAN Notices, 45:5, (245-256), Online publication date: 1-May-2010.
  326. ACM
    Coons K, Burckhardt S and Musuvathi M (2010). GAMBIT, ACM SIGPLAN Notices, 45:5, (15-24), Online publication date: 1-May-2010.
  327. ACM
    Chesebrough R and Turner I Parallel computing Proceedings of the 41st ACM technical symposium on Computer science education, (280-284)
  328. ACM
    Mou Z, Liu H and Hudak P Compress-and-conquer for optimal multicore computing Proceedings of the 5th ACM SIGPLAN workshop on Declarative aspects of multicore programming, (35-44)
  329. ACM
    Agrawal K, Leiserson C and Sukha J Helper locks for fork-join parallel programming Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (245-256)
  330. ACM
    Coons K, Burckhardt S and Musuvathi M GAMBIT Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (15-24)
  331. Akeila L, Sinnen O and Humadi W Object oriented parallelisation of graph algorithms using parallel iterator Proceedings of the Eighth Australasian Symposium on Parallel and Distributed Computing - Volume 107, (41-50)
  332. ACM
    Yee B, Sehr D, Dardyk G, Chen J, Muth R, Ormandy T, Okasaka S, Narula N and Fullagar N (2010). Native Client, Communications of the ACM, 53:1, (91-99), Online publication date: 1-Jan-2010.
  333. ACM
    Luk C, Hong S and Kim H Qilin Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, (45-55)
  334. Hadjidoukas P, Philos G and Dimakopoulos V (2009). Exploiting fine-grain thread parallelism on multicore architectures, Scientific Programming, 17:4, (309-323), Online publication date: 1-Dec-2009.
  335. ACM
    Kambadur P, Gupta A, Ghoting A, Avron H and Lumsdaine A PFunc Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, (1-11)
  336. ACM
    Berger E, Yang T, Liu T and Novark G Grace Proceedings of the 24th ACM SIGPLAN conference on Object oriented programming systems languages and applications, (81-96)
  337. ACM
    Berger E, Yang T, Liu T and Novark G (2009). Grace, ACM SIGPLAN Notices, 44:10, (81-96), Online publication date: 25-Oct-2009.
  338. Gudenkauf S A coordination-based model-driven method for parallel application development Proceedings of the 2009 international conference on Models in Software Engineering, (21-35)
  339. Matsuzaki K and Emoto K Implementing fusion-equipped parallel skeletons by expression templates Proceedings of the 21st international conference on Implementation and application of functional languages, (72-89)
  340. Paulino H On the definition of service abstractions for parallel computing Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part II, (74-81)
  341. Marowka A Empirical analysis of parallelism overheads on CMPs Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part I, (596-605)
  342. Best M, Fedorova A, Dickie R, Tagliasacchi A, Couture-Beil A, Mustard C, Mottishaw S, Brown A, Huang Z, Xu X, Ghazali N and Brownsword A Searching for Concurrent Design Patterns in Video Games Proceedings of the 15th International Euro-Par Conference on Parallel Processing, (912-923)
  343. Otto F, Pankratius V and Tichy W XJava Proceedings of the 15th International Euro-Par Conference on Parallel Processing, (875-886)
  344. Kegel P, Schellmann M and Gorlatch S Using OpenMP vs. Threading Building Blocks for Medical Imaging on Multi-cores Proceedings of the 15th International Euro-Par Conference on Parallel Processing, (654-665)
  345. ACM
    Sukha J Brief announcement Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures, (124-126)
  346. ACM
    Frigo M, Halpern P, Leiserson C and Lewin-Berlin S Reducers and other Cilk++ hyperobjects Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures, (79-90)
  347. ACM
    Nielsen M, Christensen B, Zafar N, Roble D and Museth K Guiding of smoke animations through variational coupling of simulations at different resolutions Proceedings of the 2009 ACM SIGGRAPH/Eurographics Symposium on Computer Animation, (217-226)
  348. Hsiung P, Lin C, Lin S, Chen Y, Lu C, Tong S, Su W, Shih C, Koong C, Hsueh N, Chang C and Chu W VERTAF/Multi-Core Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel Processing, (303-314)
  349. ACM
    Plagne L, Hülsemann F, Barthou D and Jaeger J Parallel expression template for large vectors Proceedings of the 8th workshop on Parallel/High-Performance Object-Oriented Scientific Computing, (1-8)
  350. ACM
    Kirschenmann W, Plagne L and Vialle S Multi-target C++ implementation of parallel skeletons Proceedings of the 8th workshop on Parallel/High-Performance Object-Oriented Scientific Computing, (1-10)
  351. ACM
    Han W and Lee J Dependency-aware reordering for parallelizing query optimization in multi-core CPUs Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, (45-58)
  352. Hindman B, Konwinski A, Zaharia M and Stoica I A common substrate for cluster computing Proceedings of the 2009 conference on Hot topics in cloud computing
  353. ACM
    Batista V, Millman D, Pion S and Singler J Parallel geometric algorithms for multi-core computers Proceedings of the twenty-fifth annual symposium on Computational geometry, (217-226)
  354. ACM
    Navarro A, Asenjo R, Tabik S and Caşcaval C Load balancing using work-stealing for pipeline parallelism in emerging applications Proceedings of the 23rd international conference on Supercomputing, (517-518)
  355. Hsiung P, Lin S, Yean-Ru Chen, Nien-Lin Hsueh, Chih-Hung Chang, Chih-Hsiong Shih, Chorng-Shiuh Koong, Chao-Sheng Lin, Chun-Hsien Lu, Sheng-Ya Tong, Wan-Ting Su and Chu W Model-driven development of multi-core embedded software Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering, (9-16)
  356. Ebnenasir A and Beik R Developing parallel programs Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering, (1-8)
  357. Hermann E, Raffin B and Faure F Interactive physical simulation on multicore architectures Proceedings of the 9th Eurographics conference on Parallel Graphics and Visualization, (1-8)
  358. Lima J and Maillard N (2009). Online mapping of MPI-2 dynamic tasks to processes and threads, International Journal of High Performance Systems Architecture, 2:2, (81-89), Online publication date: 1-Mar-2009.
  359. ACM
    Tallent N and Mellor-Crummey J (2009). Effective performance measurement and analysis of multithreaded applications, ACM SIGPLAN Notices, 44:4, (229-240), Online publication date: 14-Feb-2009.
  360. ACM
    Hiraishi T, Yasugi M, Umatani S and Yuasa T (2009). Backtracking-based load balancing, ACM SIGPLAN Notices, 44:4, (55-64), Online publication date: 14-Feb-2009.
  361. ACM
    Tallent N and Mellor-Crummey J Effective performance measurement and analysis of multithreaded applications Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, (229-240)
  362. ACM
    Hiraishi T, Yasugi M, Umatani S and Yuasa T Backtracking-based load balancing Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, (55-64)
  363. Traoré D, Roch J, Maillard N, Gautier T and Bernard J Deque-Free Work-Optimal Parallel STL Algorithms Proceedings of the 14th international Euro-Par conference on Parallel Processing, (887-897)
  364. Rodríguez A, López-De-Teruel P, Ruiz A, García-Mateos G and Fernández L A Design Pattern for Component Oriented Development of Agent Based Multithreaded Applications Proceedings of the 14th international Euro-Par conference on Parallel Processing, (709-718)
  365. Hoffmann R and Rauber T Fine-Grained Task Scheduling Using Adaptive Data Structures Proceedings of the 14th international Euro-Par conference on Parallel Processing, (253-262)
  366. ACM
    Seiler L, Carmean D, Sprangle E, Forsyth T, Abrash M, Dubey P, Junkins S, Lake A, Sugerman J, Cavin R, Espasa R, Grochowski E, Juan T and Hanrahan P Larrabee ACM SIGGRAPH 2008 papers, (1-15)
  367. ACM
    Seiler L, Carmean D, Sprangle E, Forsyth T, Abrash M, Dubey P, Junkins S, Lake A, Sugerman J, Cavin R, Espasa R, Grochowski E, Juan T and Hanrahan P (2008). Larrabee, ACM Transactions on Graphics, 27:3, (1-15), Online publication date: 1-Aug-2008.
  368. Brodman J, Fraguela B, Garzarán M and Padua D Design Issues in Parallel Array Languages for Shared Memory Proceedings of the 8th international workshop on Embedded Computer Systems: Architectures, Modeling, and Simulation, (208-217)
  369. Kambadur P, Gregor D and Lumsdaine A OpenMP extensions for generic libraries Proceedings of the 4th international conference on OpenMP in a new era of parallelism, (123-133)
  370. ACM
    Pankratius V, Schaefer C, Jannesari A and Tichy W Software engineering for multicore systems Proceedings of the 1st international workshop on Multicore software engineering, (53-60)
  371. ACM
    Willhalm T and Popovici N Putting Intel® threading building blocks to work Proceedings of the 1st international workshop on Multicore software engineering, (3-4)
  372. Guillon A and Loach D YetiSim Proceedings of the 2008 Spring simulation multiconference, (1-6)
  373. Pheatt C (2008). Intel® threading building blocks, Journal of Computing Sciences in Colleges, 23:4, (298-298), Online publication date: 1-Apr-2008.
  374. ACM
    Guo J, Bikshandi G, Fraguela B, Garzaran M and Padua D Programming with tiles Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, (111-122)
  375. Zumbusch G A container-iterator parallel programming model Proceedings of the 7th international conference on Parallel processing and applied mathematics, (1130-1139)
  376. Frachtenberg E and Schwiegelshohn U New challenges of parallel job scheduling Proceedings of the 13th international conference on Job scheduling strategies for parallel processing, (1-23)
Contributors
  • Intel Corporation

Recommendations