Algorithms for a Topology-aware Massively Parallel Computation Model

Type: Article

Publication Date: 2021-06-18

Citations: 2

DOI: https://doi.org/10.1145/3452021.3458318

Download PDF

Abstract

Most of the prior work in massively parallel data processing assumes homogeneity, i.e., every computing unit has the same computational capability and can communicate with every other unit with the same latency and bandwidth. However, this strong assumption of a uniform topology rarely holds in practical settings, where computing units are connected through complex networks. To address this issue, Blanas et al. \citeblanas2020topology recently proposed a topology-aware massively parallel computation model that integrates the network structure and heterogeneity in the modeling cost. The network is modeled as a directed graph, where each edge is associated with a cost function that depends on the data transferred between the two endpoints. The computation proceeds in synchronous rounds and the cost of each round is measured as the maximum cost over all the edges in the network. In this work, we take the first step into investigating three fundamental data processing tasks in this topology-aware parallel model: set intersection, cartesian product, and sorting. We focus on network topologies that are tree topologies, and present both lower bounds as well as (asymptotically) matching upper bounds. Instead of assuming a worst-case distribution as in previous results, the optimality of our algorithms is with respect to the initial data distribution among the network nodes. Apart from the theoretical optimality of our results, our protocols are simple, use a constant number of rounds, and we believe can be implemented in practical settings as well.

Locations

  • arXiv (Cornell University) - View - PDF

Similar Works

Action Title Year Authors
+ Algorithms for a Topology-aware Massively Parallel Computation Model 2020 Xiao Hu
Paraschos Koutris
Spyros Blanas
+ Parallel Graph Algorithms in Constant Adaptive Rounds: Theory meets Practice 2020 Soheil Behnezhad
Laxman Dhulipala
Hossein Esfandiari
Jakub Łącki
Vahab Mirrokni
Warren Schudy
+ PDF Chat Near-Optimal Massively Parallel Graph Connectivity 2019 Soheil Behnezhad
Laxman Dhulipala
Hossein Esfandiari
Jakub Łącki
Vahab Mirrokni
+ Near-Optimal Massively Parallel Graph Connectivity 2019 Soheil Behnezhad
Laxman Dhulipala
Hossein Esfandiari
Jakub Łącki
Vahab Mirrokni
+ Near-Optimal Massively Parallel Graph Connectivity 2019 Soheil Behnezhad
Laxman Dhulipala
Hossein Esfandiari
Jakub Łącki
Vahab Mirrokni
+ The Hardness of Optimization Problems on the Weighted Massively Parallel Computation Model 2023 Hengzhao Ma
Jianzhong Li
Xiangyu Gao
+ PDF Chat Parallel Graph Connectivity in Log Diameter Rounds 2018 Alexandr Andoni
Zhao Song
Clifford Stein
Zhengyu Wang
Peilin Zhong
+ PDF Chat Parallel algorithms for geometric graph problems 2014 Alexandr Andoni
Aleksandar Nikolov
Krzysztof Onak
Grigory Yaroslavtsev
+ Computation-Aware Data Aggregation 2018 Bernhard Haeupler
D Ellis Hershkowitz
Anson Kahng
Ariel D. Procaccia
+ Dynamic Algorithms for the Massively Parallel Computation Model 2019 Giuseppe F. Italiano
Silvio Lattanzi
Vahab Mirrokni
Nikos Parotsidis
+ Dynamic Algorithms for the Massively Parallel Computation Model 2019 Giuseppe F. Italiano
Silvio Lattanzi
Vahab Mirrokni
Nikos Parotsidis
+ Dynamic Algorithms for the Massively Parallel Computation Model 2019 Giuseppe F. Italiano
Silvio Lattanzi
Vahab Mirrokni
Nikos Parotsidis
+ Distributed Computation of Large-scale Graph Problems 2014 Hartmut Klauck
Danupon Nanongkai
Gopal Pandurangan
Peter Robinson
+ Distributed Algorithms for Large-Scale Graphs 2013 Hartmut Klauck
Danupon Nanongkai
Gopal Pandurangan
Peter M. Robinson
+ Massively Parallel Computation via Remote Memory Access 2019 Soheil Behnezhad
Laxman Dhulipala
Hossein Esfandiari
Jakub Łącki
Warren Schudy
Vahab Mirrokni
+ Massively Parallel Computation via Remote Memory Access 2019 Soheil Behnezhad
Laxman Dhulipala
Hossein Esfandiari
Jakub Łącki
Warren Schudy
Vahab Mirrokni
+ PDF Chat Massively Parallel Computation via Remote Memory Access 2021 Soheil Behnezhad
Laxman Dhulipala
Hossein Esfandiari
Jakub Łącki
Vahab Mirrokni
Warren Schudy
+ Bandwidth Optimal Pipeline Schedule for Collective Communication 2023 Liangyu Zhao
Siddharth Pal
Prithwish Basu
Joud Khoury
Arvind Krishnamurthy
+ PDF Chat A Survey of Distributed Graph Algorithms on Massive Graphs 2024 Lingkai Meng
Yu Shao
Long Yuan
Longbin Lai
Peng Cheng
Xue Li
Wenyuan Yu
Wenjie Zhang
Xuemin Lin
Jingren Zhou
+ Communication Cost in Parallel Query Processing 2016 Paul Beame
Paraschos Koutris
Dan Suciu