Efficient cache use for stencil operations on structured discretization grids

Type: Preprint

Publication Date: 2000-01-01

Citations: 6

DOI: https://doi.org/10.48550/arxiv.cs/0007027

Locations

  • arXiv (Cornell University) - View
  • DataCite API - View

Similar Works

Action Title Year Authors
+ Minimizing Cache Misses in Scientific Computing Using Isoperimetric Bodies 2002 Michael Frumkin
Rob F. Van der Wijngaart
+ Minimizing Cache Misses in Scientific Computing Using Isoperimetric Bodies 2002 Michael Frumkin
Rob F. Van der Wijngaart
+ PDF Chat Quantifying Performance Bottlenecks of Stencil Computations Using the Execution-Cache-Memory Model 2015 Holger Stengel
Jan Treibig
Georg Hager
Gerhard Wellein
+ PDF Chat Employing polyhedral methods to optimize stencils on FPGAs with stencil-specific caches, data reuse, and wide data bursts 2024 Florian Mayer
Julian Brandner
Michæl Philippsen
+ PDF Chat GPU algorithms for Efficient Exascale Discretizations 2021 Ahmad Abdelfattah
Valeria Barra
Natalie Beams
Ryan Bleile
Jed Brown
Jean‐Sylvain Camier
Robert Carson
Noel Chalmers
Veselin Dobrev
Yohann Dudouit
+ Efficient Domain Partitioning for Stencil-based Parallel Operators 2018 Gaurav Saxena
+ Modeling and analyzing performance for highly optimized propagation steps of the lattice Boltzmann method on sparse lattices 2014 Markus Wittmann
Thomas Zeiser
Georg Hager
Gerhard Wellein
+ Cache friendly sparse matrix-vector multiplication 2010 Sardar Anisul Haque
Shahadat Hossain
Marc Moreno Maza
+ PDF Chat Locality optimized unstructured mesh algorithms on GPUs 2019 András Attila Sulyok
Gábor Dániel Balogh
István Z. Reguly
Gihan R. Mudalige
+ Beyond 16GB: Out-of-Core Stencil Computations 2017 István Z. Reguly
Gihan R. Mudalige
Michael B. Giles
+ Locality Optimized Unstructured Mesh Algorithms on GPUs 2018 András Attila Sulyok
Gábor Dániel Balogh
István Z. Reguly
Gihan R. Mudalige
+ Locality Optimized Unstructured Mesh Algorithms on GPUs 2018 András Attila Sulyok
Gábor Dániel Balogh
István Z. Reguly
Gihan R. Mudalige
+ PDF Chat Pushing memory bandwidth limitations through efficient implementations of Block-Krylov space solvers on GPUs 2018 M. A. Clark
Alexei Strelchenko
Alejandro Vaquero
Mathias Wagner
Evan Weinberg
+ PDF Chat Multidimensional Intratile Parallelization for Memory-Starved Stencil Computations 2017 Tareq M. Malas
Georg Hager
Hatem Ltaief
David E. Keyes
+ Optimizing the linear fascicle evaluation algorithm for many-core systems 2019 Karan Aggarwal
Uday Bondhugula
+ Multi-dimensional intra-tile parallelization for memory-starved stencil computations 2015 Tareq B. Malas
Georg Hager
Hatem Ltaief
David E. Keyes
+ Multi-dimensional intra-tile parallelization for memory-starved stencil computations 2015 Tareq B. Malas
Georg Hager
Hatem Ltaief
David E. Keyes
+ PDF Chat High-Level FPGA Accelerator Design for Structured-Mesh-Based Explicit Numerical Solvers 2021 Kamalavasan Kamalakkannan
Gihan R. Mudalige
István Z. Reguly
Suhaib A. Fahmy
+ High-Level FPGA Accelerator Design for Structured-Mesh-Based Explicit Numerical Solvers 2021 Kamalavasan Kamalakkannan
Gihan R. Mudalige
István Z. Reguly
Suhaib A. Fahmy
+ High-Level FPGA Accelerator Design for Structured-Mesh-Based Explicit Numerical Solvers 2021 Kamalavasan Kamalakkannan
Gihan R. Mudalige
István Z. Reguly
Suhaib A. Fahmy