Publications of Torsten Hoefler
Copyright Notice:
The documents distributed by this server have been provided by the
contributing authors as a means to ensure timely dissemination of
scholarly and technical work on a noncommercial basis. Copyright and all
rights therein are maintained by the authors or by other copyright
holders, notwithstanding that they have offered their works here
electronically. It is understood that all persons copying this
information will adhere to the terms and constraints invoked by each
author's copyright. These works may not be reposted without the explicit
permission of the copyright holder.
T. Hoefler, W. Gropp, R. Thakur and J. L. Traeff:
Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues
(Sep. 2010, Accepted at the 17th European MPI Users Group conference (EuroMPI'10))
Abstract: Designing and tuning parallel applications
with MPI, particularly at large scale, requires understanding the
performance implications of different choices of algorithms and
implementation options. Which algorithm is better depends in part
on the performance of the different possible communication
approaches, which in turn can depend on both the system hardware and
the MPI implementation. In the absence of detailed performance
models for different MPI implementations, application developers
often must select methods and tune codes without the means to
realistically estimate the achievable performance and rationally
defend their choices. In this paper, we demonstrate that by
constructing upper and lower bounds on performance and by taking
into account the effects of limits on network-injection rates and
effective bisection bandwidth, useful performance models can be
constructed. Since collective communication plays a crucial role in
enabling scalability, we also provide analytical models for
scalability of collective communication algorithms, such as
broadcast, allreduce, and all-to-all. We apply these models to an
IBM Blue Gene/P system and compare the analytical performance
estimates with experimentally measured values.
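
To make the idea of an analytical collective model concrete, here is a minimal sketch using the standard textbook latency-bandwidth (alpha-beta) cost model for a binomial-tree broadcast. It is an illustration only and is not taken from the paper: the function name, the parameters `alpha` (per-message latency in seconds) and `beta` (per-byte transfer time in seconds), and the example values are all assumptions, and the sketch ignores the injection-rate and bisection-bandwidth limits that the abstract mentions.

```python
def binomial_bcast_time(num_procs, msg_bytes, alpha, beta):
    """Estimate broadcast time under the alpha-beta model (illustrative only).

    Assumes a binomial tree: ceil(log2(num_procs)) communication rounds,
    each costing alpha + beta * msg_bytes. alpha and beta are hypothetical
    machine parameters, not values reported in the paper.
    """
    if num_procs < 1:
        raise ValueError("num_procs must be at least 1")
    if msg_bytes < 0:
        raise ValueError("msg_bytes must be non-negative")
    # (p - 1).bit_length() equals ceil(log2(p)) for every integer p >= 1,
    # and avoids floating-point rounding in the logarithm.
    rounds = (num_procs - 1).bit_length()
    return rounds * (alpha + beta * msg_bytes)


if __name__ == "__main__":
    # Hypothetical parameters: 5 microseconds latency, 1 ns per byte.
    for p in (2, 64, 4096):
        t = binomial_bcast_time(p, 8192, alpha=5e-6, beta=1e-9)
        print(f"P={p}: estimated broadcast time {t:.3e} s")
```

In this simplified model the estimated time grows logarithmically with the number of processes. Comparing such an estimate against measurements is the kind of check the abstract describes, although the paper's own models may differ in form.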
BibTeX:
@inproceedings{hoefler-mpi-model,
  author={T. Hoefler and W. Gropp and R. Thakur and J. L. Traeff},
  title={{Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues}},
  year={2010},
  month={Sep.},
  note={Accepted at the 17th European MPI Users Group conference (EuroMPI'10)},
  source={http://www.unixer.de/~htor/publications/},
}