research-article

Public Access

From OpenACC to OpenMP 4: Toward Automatic Translation

Authors:
Nawrin Sultana

Department of Computer Science and Software Engineering, Auburn University, AL, USA

Department of Computer Science and Software Engineering, Auburn University, AL, USA
View Profile

,
Alexander Calvert

Department of Computer Science and Software Engineering, Auburn University, AL, USA

Department of Computer Science and Software Engineering, Auburn University, AL, USA
View Profile

,
Jeffrey L. Overbey

Department of Computer Science and Software Engineering, Auburn University, AL, USA

Department of Computer Science and Software Engineering, Auburn University, AL, USA
View Profile

,
Galen Arnold

National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, IL, USA

National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, IL, USA
View Profile

XSEDE16: Proceedings of the XSEDE16 Conference on Diversity, Big Data, and Science at ScaleJuly 2016Article No.: 44Pages 1–8https://doi.org/10.1145/2949550.2949654

Published:17 July 2016Publication History

XSEDE16: Proceedings of the XSEDE16 Conference on Diversity, Big Data, and Science at Scale

Pages 1–8

ABSTRACT

For the past few years, OpenACC has been the primary directive-based API for programming accelerator devices like GPUs. OpenMP 4.0 is now a competitor in this space, with support from different vendors. In this paper, we describe an algorithm to convert (a subset of) OpenACC to OpenMP 4; we implemented this algorithm in a prototype tool and evaluated it by translating the EPCC Level 1 OpenACC benchmarks. We discuss some of the challenges in the conversion process and propose what parts of the process should be automated, what should be done manually by the programmer, and what future research and development is necessary in this area.

References

J. R. Allen and K. Kennedy. Optimizing Compilers for Modern Architectures: A Dependence-based Approach. Morgan Kaufmann, San Francisco, CA, 2002. Google ScholarDigital Library
EPCC OpenACC benchmark suite. https://www.epcc.ed.ac.uk/research/computing/performance-characterisation-and-benchmarking/epcc-openacc-benchmark-suite. Accessed April 29, 2016.Google Scholar
S. Grauer-Gray, L. Xu, R. Searles, S. Ayalasomayajula, and J. Cavazos. Auto-tuning a high-level language targeted to GPU codes. In Innovative Parallel Computing (InPar), 2012, pages 1--10, May 2012.Google ScholarCross Ref
O. Hernandez, W. Ding, W. Joubert, D. Bernholdt, M. Eisenbach, and C. Kartsaklis. Porting OpenACC 2.0 to OpenMP 4.0: Key similarities and differences. http://openmpcon.org/wp-content/uploads/openmpcon2015-oscar-hernandez-portingacc.pdf. Accessed April 29, 2016.Google Scholar
O. Hernandez, W. Ding, W. Joubert, D. Bernholdt, M. Eisenbach, and C. Kartsaklis. YouTube: Porting OpenACC 2.0 to OpenMP 4.0: Key similarities and differences. https://www.youtube.com/watch?v=CHMrcMUXuuY. Accessed April 29, 2016.Google Scholar
D. B. Kirk and W.-m. Hwu. Programming massively parallel processors: a hands-on approach. Morgan-Kaufmann, 2012. Google ScholarDigital Library
S. Lee and J. S. Vetter. Early evaluation of directive-based GPU programming models for productive exascale computing. In Proc. SC12, page 23. IEEE Computer Society Press, 2012. Google ScholarDigital Library
The OpenACC application programming interface, version 2.5. http://www.openacc.org/sites/default/files/OpenACC_2pt5.pdf. Accessed June 15, 2016.Google Scholar
OpenMP 4.0 on NVIDIA CUDA GPUs. https://parallel-computing.pro/index.php/9-cuda/43-openmp-4-0-on-nvidia-cuda-gpus. Accessed April 29, 2016.Google Scholar
OpenMP application programming interface, version 4.5. http://www.openmp.org/mp-documents/openmp-4.5.pdf. Accessed June 15, 2016.Google Scholar
S. Wienke, C. Terboven, J. C. Beyer, and M. S. Müller. A pattern-based comparison of OpenACC and OpenMP for accelerator computing. In Euro-Par 2014 Parallel Processing, pages 812--823. Springer, 2014.Google Scholar
M. J. Wolfe. High Performance Compilers for Parallel Computing. Addison-Wesley, Boston, MA, 1995. Google ScholarDigital Library
R. Xu, S. Chandrasekaran, and B. Chapman. Exploring programming multi-GPUs using OpenMP and OpenACC-based hybrid model. In IPDPSW '13, pages 1169--1176. IEEE, 2013. Google ScholarDigital Library

From OpenACC to OpenMP 4: Toward Automatic Translation
1. Software and its engineering
  1. Software notations and tools
    1. General programming languages
      1. Language types

Recommendations

On the Performance Portability of OpenACC, OpenMP, Kokkos and RAJA
HPCAsia '22: International Conference on High Performance Computing in Asia-Pacific Region

Performance Portability frameworks are becoming more central and essential in heterogeneous computing systems. However, the developer toolbox lacks the tools to assess the performance portability degree of these frameworks.

This article presents a new ...
Read More
Benchmarking OpenCL, OpenACC, OpenMP, and CUDA: Programming Productivity, Performance, and Energy Consumption
ARMS-CC '17: Proceedings of the 2017 Workshop on Adaptive Resource Management and Scheduling for Cloud Computing

Many modern parallel computing systems are heterogeneous at their node level. Such nodes may comprise general purpose CPUs and accelerators (such as, GPU, or Intel Xeon Phi) that provide high performance with suitable energy-consumption characteristics. ...
Read More
Hybridizing S3D into an Exascale application using OpenACC: An approach for moving to multi-petaflops and beyond
SC '12: Proceedings of the 2012 International Conference for High Performance Computing, Networking, Storage and Analysis

Hybridization is the process of converting an application with a single level of parallelism to an application with multiple levels of parallelism. Over the past 15 years a majority of the applications that run on High Performance Computing systems have ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
XSEDE16: Proceedings of the XSEDE16 Conference on Diversity, Big Data, and Science at Scale
July 2016
405 pages
ISBN:9781450347556
DOI:10.1145/2949550
General Chair:
Kelly Gaither
Texas Advanced Computing Center
Copyright © 2016 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 17 July 2016
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
GPUs
OpenACC
OpenMP
accelerators
translation
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate129of190submissions,68%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 502
  Total Downloads
- Downloads (Last 12 months)38
- Downloads (Last 6 weeks)5
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

From OpenACC to OpenMP 4: Toward Automatic Translation

XSEDE16: Proceedings of the XSEDE16 Conference on Diversity, Big Data, and Science at Scale

ABSTRACT

References

Cited By

Recommendations

On the Performance Portability of OpenACC, OpenMP, Kokkos and RAJA

Benchmarking OpenCL, OpenACC, OpenMP, and CUDA: Programming Productivity, Performance, and Energy Consumption

Hybridizing S3D into an Exascale application using OpenACC: An approach for moving to multi-petaflops and beyond

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

From OpenACC to OpenMP 4: Toward Automatic Translation

XSEDE16: Proceedings of the XSEDE16 Conference on Diversity, Big Data, and Science at Scale

ABSTRACT

References

Cited By

Recommendations

On the Performance Portability of OpenACC, OpenMP, Kokkos and RAJA

Benchmarking OpenCL, OpenACC, OpenMP, and CUDA: Programming Productivity, Performance, and Energy Consumption

Hybridizing S3D into an Exascale application using OpenACC: An approach for moving to multi-petaflops and beyond

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media