Efficient task replication for fast response times in parallel computation
Efficient task replication for fast response times in parallel computation
Large-scale distributed computing systems divide a job into many independent tasks and run them in parallel on different machines. A challenge in such parallel computing is that the time taken by a machine to execute a task is inherently variable, and thus the slowest machine becomes the bottleneck in the …