Home Publications
Professional
Research
LogGOPSim NB Collectives
CG Solver
HPL
Kernels
LibNBC
NBCBench
Performance
MPI Topology
Netgauge
Networks
BTL eth
ORCS
Older Projects
Teaching
BLOG
Me(e)t me?
Miscellaneous
CV
|
Application Kernels using LibNBC
This page lists several application kernels that have been implemented
to demonstrate the optimization potential of LibNBC.
The only kernel available so far is the parallel implementation of a
three-dimensional Fast Fourier Transformation. The 1-dimensional
transformation is optimized for communication not for cache-reuse. This
means that the imlementation might be slower on standard machines than a
different domain decomposition.
3d-FFT (pre-release) - (5.6 kb)
|