ESG: Pipeline-Conscious Efficient Scheduling of DNN Workflows on
Serverless Platforms with Shareable GPUs
Recent years have witnessed increasing interest in machine learning inference on serverless computing for its auto-scaling and cost-effective properties. Existing serverless computing, however, lacks effective job scheduling methods to handle the schedule space dramatically expanded by GPU sharing, task batching, and inter-task relations. Prior solutions have dodged the issue …