OMEGA: A Low-Latency GNN Serving System for Large Graphs
OMEGA: A Low-Latency GNN Serving System for Large Graphs
Graph Neural Networks (GNNs) have been widely adopted for their ability to compute expressive node representations in graph datasets. However, serving GNNs on large graphs is challenging due to the high communication, computation, and memory overheads of constructing and executing computation graphs, which represent information flow across large neighborhoods. Existing …