Creating a Tool Set for Optimizing Topology-Aware Node Mappings

Martin Schulz, Abhinav Bhatele, Peer-Timo Bremer, Todd Gamblin, Katherine Isaacs, Joshua A. Levine, Valerio Pascucci
5th International Workshop on Parallel Tools for High Performance Computing
September, 2011

Modern HPC systems, such as Cray’s XE and IBM’s Blue Gene line, feature sophisticated network architectures, often in the form of high dimensional tori. In order to fully exploit the performance of these systems, it is necessary to carefully map an application’s communication structure to the underlying network topology. In this step, both latency (i.e., physical distance between nodes) and bandwidth (i.e., number of concurrently used links) have to be taken into account, leading to mappings that are often non-intuitive. To help developers with this complex problem, we are developing a set of tools that aim at helping users understand the communication behavior of their codes, map them onto the network architecture, and create better-performing topology-aware node mappings. In this paper, we present initial steps towards this goal, including a measurement environment capturing both communication patterns and network metrics within the same run, a methodology to compare these measurements, and a visualization tool that helps users understand the impact of their application’s characteristics on the network behavior.

[DOI/EE link]

@inproceedings{SBBGILP11,
  address   = {Dresden, Germany},
  author    = {Martin Schulz and Abhinav Bhatele and Peer-Timo Bremer and Todd Gamblin and Katherine Isaacs and Joshua A. Levine and Valerio Pascucci},
  booktitle = {5th International Workshop on Parallel Tools for High Performance Computing},
  ee        = {http://dx.doi.org/10.1007/978-3-642-31476-6_1},
  month     = {9},
  pages     = {1--12},
  publisher = {Springer Berlin Heidelberg},
  title     = {Creating a Tool Set for Optimizing Topology-Aware Node Mappings},
  year      = {2011}
}