Back to Search
Journal ArticleUnknown

Task shape classification and workload characterization of google cluster trace

Author Affiliations
North South University
Year2014
Citations24

Abstract

Understanding workload characteristics is crucial for optimizing and improving the performance of large scale data produced by different industries. In this paper, we analyse a large scale production workload trace (version 2) [1] which is recently made publicly available by Google. We discuss statistical summary of the data. Further we perform k-means clustering to identify common groups of job. Cluster analysis provides insight into the data by dividing the objects into groups (clusters) of objects, such that objects in a cluster are more similar to each other than to the objects in other clusters. This work presents a simple technique for constructing workload characteristics and also provides production insights into understanding workload performance in cluster machine.
View at Publisher

BORR does not host full-text PDFs. The button above takes you to the original publisher.