Home   Publications     edited volumes   Awards   Research   Teaching   Miscellaneous   Full CV [pdf]   BLOG   bio
  
 
 
  
 
  
  Events
  
  
  
  
   
  
   Past Events
  
  
  
  
  
  
   
    | 
Publications of Torsten Hoefler  
Sabela Ramos, Torsten Hoefler:
 
  |  |   | Cache Line Aware Optimizations for ccNUMA Systems
   (In Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'15) (short paper), presented in Portland, OR, USA, pages 85--88, ACM, ISBN: 978-1-4503-3550-8, Jun. 2015) 
 
 AbstractCurrent shared memory systems utilize complex memory hierarchies to maintain scalability when increasing the number of processing units. Although hardware designers aim
    to hide this complexity from the programmer, ignoring the
    detailed architectural characteristics can harm performance
    significantly. We propose to expose the block-based design
    of caches in parallel computers to middleware designers to
    allow semi-automatic performance tuning with the systematic translation from algorithms to an analytic performance
    model. For this, we design a simple interface for cache line
    aware (CLa) optimization, a translation methodology, and a
    full performance model for cache line transfers in ccNUMA
    systems. Algorithms developed using CLa design perform
    up to 14x better than vendor and open-source libraries, and
2x better than existing ccNUMA optimizations.
 
 Documentsdownload article:  
  |  |   | BibTeX |  @inproceedings{cla_programming-hpdc15,   author={Sabela Ramos and Torsten Hoefler},   title={{Cache Line Aware Optimizations for ccNUMA Systems}},   year={2015},   month={Jun.},   pages={85--88},   booktitle={Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'15) (short paper)},   location={Portland, OR, USA},   publisher={ACM},   isbn={978-1-4503-3550-8},   source={http://www.unixer.de/~htor/publications/}, } |  
  |  
  
 
 |