Feld, Dustin, Jünger, Michael, Mallach, Sven ORCID: 0000-0001-5335-0678 and Soddemann, Thomas (2014). Hardware-Aware Automatic Code-Transformation to Support Compilers in Exploiting the Multi-Level Parallel Potential of Modern CPUs. In: Proceedings of the 2015 International Workshop on Code Optimisation for Multi and Many Cores (COSMIC), 2:1-2:10. ACM.

Full text not available from this repository.
DOI:http://doi.acm.org/10.1145/2723772.2723776

Abstract

Modern compilers offer more and more capabilities to automatically parallelize code-regions if these match certain properties. However, there are several application kernels that, although rather simple transformations would suffice in order to make them match these properties, are either not at all parallelized by state-of-the-art compilers or could at least be improved w.r.t. their performance. This paper proposes a loop-tiling approach focusing on automatic vectorization and multi-core parallelization, with emphasis on a smart cache exploitation. The method is based on polyhedral code transformations that are applied as a pre-compilation step and it is shown to help compilers in generating more and better parallel code-regions. It automatically adapts to hardware parameters such as the SIMD register width and cache sizes. Further, it takes memory-access patterns into account and is capable to minimize communication among tiles that are to be processed by different cores. An extensive computational study shows significant improvements in the number of instructions vectorized, cache miss rates, and running times for a range of application kernels. The method often outperforms the internal auto-parallelization techniques implemented into gcc and icc.

Item Type: Book Section, Proceedings Item or annotation in a legal commentary
Creators:
CreatorsEmailORCIDORCID Put Code
Feld, Dustindustin.feld@scai.fraunhofer.deUNSPECIFIEDUNSPECIFIED
Jünger, MichaelUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Mallach, SvenUNSPECIFIEDorcid.org/0000-0001-5335-0678UNSPECIFIED
Soddemann, ThomasUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
URN: urn:nbn:de:hbz:38-640781
DOI: http://doi.acm.org/10.1145/2723772.2723776
Title of Book: Proceedings of the 2015 International Workshop on Code Optimisation for Multi and Many Cores (COSMIC)
Series Name: ACM International Conference Proceedings Series (ACM ICPS)
Page Range: 2:1-2:10
Date: 2014
Publisher: ACM
Language: English
Faculty: Faculty of Mathematics and Natural Sciences
Divisions: Faculty of Mathematics and Natural Sciences > Department of Mathematics and Computer Science > Institute of Computer Science
Subjects: Data processing Computer science
Refereed: Yes
URI: http://kups.ub.uni-koeln.de/id/eprint/64078

Downloads

Downloads per month over past year

Altmetric

Export

Actions (login required)

View Item View Item