2014
Contribution to book  Restricted

Effective Data Access Patterns on Massively Parallel Processors

Capannini G., Baraglia R., Silvestri F., Nardini F. M.

GPU computing  k-model 

The new generation of microprocessors incorporates a huge number of cores on the same chip. This trades single-core performance off for the total amount of work done across multiple threads of execution. Graphics Processing Units (GPUs) are an example of this kind of architectures. The first generation of GPUs has been designed to support a fixed set of rendering functions. Nowa- days, GPUs are becoming easier to program. Therefore, they can be used for applications that have been traditionally handled by CPUs. The reasons of using General Purpose GPU (GPGPUs) in high-performance computations are: raw computing power, good performance per watt, and low costs. How- ever, some important issues limit a wide exploitation of GPGPUs. The main one concerns the heterogeneous and distributed nature of the memory hierar- chy. As a consequence, the speed-up of some applications depends on being able to efficiently access the data so that all cores are able to work at the same time. This chapter discusses the characteristics and the issues of the memory systems of this kind of architectures. We analyze these architectures from a theoretical point by using K-model, a model for capturing their performance constraints. K -model is used to estimate the complexity of a given algorithm defined on this model. This chapter describes how K-model can also be used to design efficient data access patterns for implementing efficient GPU algorithms. To this extent, we use K -model to derive an efficient realization of two popular algorithms, i.e., prefix sum and sorting. By means of reproducible experiments, we validate theoretical results showing that the optimization of an algorithm based on K-model corresponds to an actual optimization in practice.

Source: High-Performance Computing on Complex Environments, edited by Emmanuel Jeannot, Julius Zilinskas, pp. 115–134. Hoboken: John Wiley & Sons Inc., 2014

Publisher: John Wiley & Sons Inc., Hoboken, USA


Metrics



Back to previous page
BibTeX entry
@inbook{oai:it.cnr:prodotti:332959,
	title = {Effective Data Access Patterns on Massively Parallel Processors},
	author = {Capannini G. and Baraglia R. and Silvestri F. and Nardini F.  M.},
	publisher = {John Wiley \& Sons Inc., Hoboken, USA},
	doi = {10.1002/9781118711897.ch7},
	booktitle = {High-Performance Computing on Complex Environments, edited by Emmanuel Jeannot, Julius Zilinskas, pp. 115–134. Hoboken: John Wiley \& Sons Inc., 2014},
	year = {2014}
}