Towards the optimal synchronization granularity for dynamic scheduling of pipelined computations on heterogeneous computing systems. Riakiotakis, I., Ciorba, F. M., Andronikos, T., Papakonstantinou, G., & Chronopoulos, A. T. Concurrency and Computation: Practice and Experience, 24(18):2302–2327, John Wiley & Sons, Ltd, 2012.
Loops are the richest source of parallelism in scientific applications. A large number of loop scheduling schemes have therefore been devised for loops with and without data dependencies (modeled as dependence distance vectors) on heterogeneous clusters. The loops with data dependencies require synchronization via cross-node communication. Synchronization requires fine-tuning to overcome the communication overhead and to yield the best possible overall performance. In this paper, a theoretical model is presented to determine the granularity of synchronization that minimizes the parallel execution time of loops with data dependencies when these are parallelized on heterogeneous systems using dynamic self-scheduling algorithms. New formulas are proposed for estimating the total number of scheduling steps when a threshold for the minimum work assigned to a processor is assumed. The proposed model uses these formulas to determine the synchronization granularity that minimizes the estimated parallel execution time. The accuracy of the proposed model is verified and validated via extensive experiments on a heterogeneous computing system. The results show that the theoretically optimal synchronization granularity, as determined by the proposed model, is very close to the experimentally observed optimal synchronization granularity, with no deviation in the best case, and within 38.4% in the worst case. Copyright © 2012 John Wiley & Sons, Ltd.
@article{riakiotakis:2012,
	Abstract = {Loops are the richest source of parallelism in scientific applications. A large number of loop scheduling schemes have therefore been devised for loops with and without data dependencies (modeled as dependence distance vectors) on heterogeneous clusters. The loops with data dependencies require synchronization via cross-node communication. Synchronization requires fine-tuning to overcome the communication overhead and to yield the best possible overall performance. In this paper, a theoretical model is presented to determine the granularity of synchronization that minimizes the parallel execution time of loops with data dependencies when these are parallelized on heterogeneous systems using dynamic self-scheduling algorithms. New formulas are proposed for estimating the total number of scheduling steps when a threshold for the minimum work assigned to a processor is assumed. The proposed model uses these formulas to determine the synchronization granularity that minimizes the estimated parallel execution time. The accuracy of the proposed model is verified and validated via extensive experiments on a heterogeneous computing system. The results show that the theoretically optimal synchronization granularity, as determined by the proposed model, is very close to the experimentally observed optimal synchronization granularity, with no deviation in the best case, and within 38.4\% in the worst case. Copyright {\copyright} 2012 John Wiley \& Sons, Ltd.},
	Author = {Riakiotakis, Ioannis and Ciorba, Florina M. and Andronikos, Theodore and Papakonstantinou, George and Chronopoulos, Anthony T.},
	Date-Added = {2016-01-06 12:13:42 +0000},
	Date-Modified = {2016-01-06 12:16:39 +0000},
	Doi = {10.1002/cpe.2812},
	Issn = {1532-0634},
	Journal = {Concurrency and Computation: Practice and Experience},
	Keywords = {2012; loops with data dependencies, pipelined computations, dynamic load balancing, communication model, performance prediction, performance evaluation, inter-processor communication, synchronization, heterogeneous systems},
	Number = {18},
	Pages = {2302--2327},
	Publisher = {John Wiley \& Sons, Ltd},
	Title = {{Towards the optimal synchronization granularity for dynamic scheduling of pipelined computations on heterogeneous computing systems}},
	Url = {http://dx.doi.org/10.1002/cpe.2812},
	Volume = {24},
	Year = {2012},
	Bdsk-Url-1 = {http://dx.doi.org/10.1002/cpe.2812}}

%% This BibTeX bibliography file was created using BibDesk.
%% http://bibdesk.sourceforge.net/

%% Created for Florina M. Ciorba at 2020-01-04 21:30:16 +0100 


%% Saved with string encoding Unicode (UTF-8)
