Clustering of CPU Usage Data in Grid Environment using Evoc Algorithm

Main Article Content

Chan Huah Yong
Kee Sim Ee
Fazilah Haron

Abstract

Clustering is a process of organizing objects into groups whose members are similar in some way. It is one of the data mining techniques is an unsupervised learning. In a grid environment, the number of computing nodes and users may reach up to thousands or millions. The grid is said to be dynamic in that the behaviors and values of these resources change all the time. Hence, these data are not suitable to be processed in an off-line mode. The existing clustering techniques today however emphasize more on the data’s behaviors categorization but not the data’s stability. Furthermore, the normal clustering techniques are more suitable to be used for static data type in an off-line mode. This paper addresses these issues by presenting an Evolving Clustering (Evoc Algorithm) which is an improved version of Evolving Clustering Method (ECM). We apply both methods on CPU usage to identify computers behaviors. The algorithm has been evaluated using three main criteria; that is dynamicity, accuracy and the ability to identify the stable cluster members. Our results show the improvements of the algorithm to process the data in an on-line mode in the evaluation of the algorithm’s dynamicity and accuracy criteria compare to other existing clustering techniques. Furthermore, the stability evaluation was a success where we were able to identify the stable cluster members from the filtered stable clusters. However, the result was highly affected by three factors namely threshold value, stability value and stability hour.

Downloads

Download data is not yet available.

Article Details

How to Cite
Huah Yong, C., Sim Ee, K., & Haron, F. (2006). Clustering of CPU Usage Data in Grid Environment using Evoc Algorithm. Malaysian Journal of Computer Science, 19(2), 105–116. Retrieved from http://ojie.um.edu.my/index.php/MJCS/article/view/6273
Section
Articles