Back to Search
Journal ArticleUnknown

BAR: An Efficient Data Locality Driven Task Scheduling Algorithm for Cloud Computing

Author Affiliations
Southeast University
Year2011
Citations88

Abstract

Large scale data processing is increasingly common in cloud computing systems like MapReduce, Hadoop, and Dryad in recent years. In these systems, files are split into many small blocks and all blocks are replicated over several servers. To process files efficiently, each job is divided into many tasks and each task is allocated to a server to deals with a file block. Because network bandwidth is a scarce resource in these systems, enhancing task data locality(placing tasks on servers that contain their input blocks) is crucial for the job completion time. Although there have been many approaches on improving data locality, most of them either are greedy and ignore global optimization, or suffer from high computation complexity. To address these…
View at Publisher

BORR does not host full-text PDFs. The button above takes you to the original publisher.