Performance Models for Data Transfers: A Case Study with Molecular Chemistry Kernels

Type: Preprint

Publication Date: 2019-04-15

Citations: 1

Abstract

With the increasing complexity of hardware, systems with different memory nodes are ubiquitous in High Performance Computing (HPC). To exploit the full potential of these systems, it is essential to develop strategies that overlap data transfers between memory nodes with computations. In this article, we consider the problem of deciding the order of data transfers between two memory nodes for a set of independent tasks, with the objective of minimizing the makespan. We prove that, with limited memory capacity, obtaining the optimal order of data transfers is an NP-complete problem. We propose several heuristics for this problem and describe the situations in which each performs well. We analyze our heuristics on traces obtained by running two molecular chemistry kernels, Hartree-Fock (HF) and Coupled Cluster Single Double (CCSD), on 10 nodes of an HPC system. Our results show that some of our heuristics achieve significant overlap for moderate memory capacities and come very close to the lower bound on the makespan.
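The ordering problem described in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper. It assumes a simplified model in which each task has a transfer time, a compute time, and a memory footprint; there is one transfer link and one processing unit; tasks compute in the same order as their data arrives; and a task holds its memory from the start of its transfer until the end of its computation. The names `makespan` and `lower_bound` and the sample task values are hypothetical.

```python
from collections import deque


def makespan(tasks, capacity):
    """Makespan of one transfer order under a memory limit.

    tasks: list of (comm, comp, mem) tuples, already in transfer order.
    Assumed model (not from the paper): one link, one processor, same
    order on both, memory held from transfer start to compute end.
    """
    comm_free = 0   # time at which the link is next available
    comp_free = 0   # time at which the processor is next available
    used = 0        # memory currently held by in-flight tasks
    held = deque()  # (compute_end, mem) per in-flight task, in FIFO order

    for comm, comp, mem in tasks:
        if mem > capacity:
            raise ValueError("a task needs more memory than the capacity")
        start = comm_free
        # Wait for earlier tasks to finish until enough memory is free.
        while used + mem > capacity:
            end, freed = held.popleft()
            used -= freed
            start = max(start, end)
        used += mem
        comm_end = start + comm
        comp_end = max(comm_end, comp_free) + comp
        held.append((comp_end, mem))
        comm_free, comp_free = comm_end, comp_end
    return comp_free


def lower_bound(tasks):
    """Order-independent bound that ignores the memory limit."""
    comms = [t[0] for t in tasks]
    comps = [t[1] for t in tasks]
    # All transfers must finish before the last task's computation,
    # and no computation can start before the shortest transfer ends.
    return max(sum(comms) + min(comps), min(comms) + sum(comps))


if __name__ == "__main__":
    # Hypothetical (comm, comp, mem) values chosen for illustration.
    tasks = [(2, 3, 1), (1, 2, 1), (4, 1, 1)]
    for capacity in (3, 2, 1):
        print(capacity, makespan(tasks, capacity))
    print("lower bound:", lower_bound(tasks))
```

For this sample order, the simulated makespan is 8 with capacity 3 (equal to the lower bound), 10 with capacity 2, and 13 with capacity 1, where transfers and computations are fully serialized. This shows how a tighter memory limit reduces the overlap between transfers and computation under the assumed model.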

Locations

  • arXiv (Cornell University)
  • HAL (Le Centre pour la Communication Scientifique Directe)
  • HAL Portal Artxiker

Similar Works

  • Performance Models for Data Transfers (2019) - Suraj Kumar, Lionel Eyraud‐Dubois, Sriram Krishnamoorthy
  • Tunable approximations to control time-to-solution in an HPC molecular docking Mini-App (2020) - Davide Gadioli, Gianluca Palermo, Stefano Cherubin, Emanuele Vitali, Giovanni Agosta, Candida Manelfi, Andrea R. Beccari, Carlo Cavazzoni, Nico Sanna, Cristina Silvano
  • Exploring the Relation Between Two Levels of Scheduling Using a Novel Simulation Approach (2018) - Ahmed Eleliemy, Ali Mohammed, Florina M. Ciorba
  • A Review of High Performance Computing Foundations for Scientists (2012) - Pablo García‐Risueño, Pablo Ibáñez
  • Many-Task Computing and Blue Waters (2012) - Daniel S. Katz, Timothy G. Armstrong, Zhao Zhang, Michael Wilde, Justin M. Wozniak
  • An energy-efficient scheduling algorithm for shared facility supercomputer centers (2021) - E. A. Kiselev, P. N. Telegin, B. M. Shabanov
  • Scheduling on Two Types of Resources (2020) - Olivier Beaumont, Louis-Claude Canon, Lionel Eyraud‐Dubois, Giorgio Lucarelli, Loris Marchal, Clément Mommessin, Bertrand Simon, Denis Trystram
  • Exploring the Relation between Two Levels of Scheduling Using a Novel Simulation Approach (2017) - Ahmed Eleliemy, Ali Mohammed, Florina M. Ciorba
  • A Performance Model for the Communication in Fast Multipole Methods on HPC Platforms (2014) - Huda Ibeid, Rio Yokota, David E. Keyes
  • Exploiting OpenMP & OpenACC to Accelerate a Molecular Docking Mini-App in Heterogeneous HPC Nodes (2019) - Emanuele Vitali, Davide Gadioli, Gianluca Palermo, Andrea R. Beccari, Carlo Cavazzoni, Cristina Silvano
  • Parallel Computing Algorithms and Applications (2007) - Sudarshan Raghunathan
  • FIESTA 3: Cluster-parallelizable multiloop numerical calculations in physical regions (2014) - Alexander V. Smirnov
  • Co-scheduling Ensembles of In Situ Workflows (2022) - Tu Mai Anh, Loïc Pottier, Rafael Ferreira da Silva, Frédéric Suter, Silvina Caíno‐Lores, Michela Taufer, Ewa Deelman
  • OCCAM: a flexible, multi-purpose and extendable HPC cluster (2017) - Marco Aldinucci, S. Bagnasco, S. Lusso, Phillip A. Pasteris, Sergio Rabellino, S. Vallero
  • On the Scalability of Data Reduction Techniques in Current and Upcoming HPC Systems from an Application Perspective (2017) - Axel Huebl, René Widera, Felix Schmitt, Alexander Matthes, Norbert Podhorszki, Jong Youl Choi, Scott Klasky, Michael Bußmann
  • Benchmarking and tuning the MILC code on clusters and supercomputers (2001) - Steven A Gottlieb

Works Cited by This (0)
