Quantifying Performance Bottlenecks of Stencil Computations Using the Execution-Cache-Memory Model
Stencil algorithms on regular lattices appear in many fields of computational science, and much effort has been put into optimized implementations. Such activities are usually not guided by performance models that provide estimates of expected speedup. Understanding the performance properties and bottlenecks by performance modeling enables a clear view on …