Life would be so much easier if only we had the source code...
Home -> Publications
Home
  Publications
    
edited volumes
  Awards
  Research
  Teaching
  Miscellaneous
  Full CV [pdf]
  BLOG






  Events








  Past Events





Publications of Torsten Hoefler
Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:

 Accurately Measuring Collective Operations at Massive Scale

(In Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium, PMEO'08 Workshop, presented in Miami, FL, ISSN: 1530-2075, ISBN: 978-1-4244-1694-3, Apr. 2008)
Invited to a journal special issue on top picks from PMEO'08.

Abstract

Accurate, reproducible and comparable measurement of collective operations is a complicated task. Although Different measurement schemes are implemented in well-known benchmarks, many of these schemes introduce different systematic errors in their measurements. We characterize these errors and select a window-based approach as the most accurate method. However, this approach complicates measurements significantly and introduces a clock synchronization as a new source of systematic errors. We analyze approaches to avoid or correct those errors and develop a scalable synchronization scheme to conduct benchmarks on massively parallel systems. Our results are compared to the window-based scheme implemented in the SKaMPI benchmarks and show a reduction of the synchronization overhead by a factor of 16 on 128 processes.

Documents

download article:
download slides:
 

BibTeX

@inproceedings{hoefler-pmeo08,
  author={Torsten Hoefler and Timo Schneider and Andrew Lumsdaine},
  title={{Accurately Measuring Collective Operations at Massive Scale}},
  year={2008},
  month={Apr.},
  booktitle={Proceedings of the 22nd IEEE International Parallel \& Distributed Processing Symposium, PMEO'08 Workshop},
  location={Miami, FL},
  issn={1530-2075},
  isbn={978-1-4244-1694-3},
  source={http://www.unixer.de/~htor/publications/},
}


serving: 44.192.95.161:40800© Torsten Hoefler