Survey on Different Enhanced K-Means Clustering Algorithm

  IJETT-book-cover  International Journal of Engineering Trends and Technology (IJETT)          
  
© 2015 by IJETT Journal
Volume-27 Number-4
Year of Publication : 2015
Authors : Kapil Joshi, Himanshu Gupta, Prashant Chaudhary, Punit Sharma
DOI :  10.14445/22315381/IJETT-V27P233

Citation 

Kapil Joshi, Himanshu Gupta, Prashant Chaudhary, Punit Sharma "Survey on Different Enhanced K-Means Clustering Algorithm", International Journal of Engineering Trends and Technology (IJETT), V27(4),178-182 September 2015. ISSN:2231-5381. www.ijettjournal.org. published by seventh sense research group

Abstract
Data Mining is justify technique used to extract ,meaningful information from mountain of data and Clustering is an important task in Data Mining process which can be used for the purpose to make groups or clusters of the particular given data set which is based on the similarity between them. K-Means clustering is a clustering procedure in which the given data set is divided into K i.e number of clusters. The impact factor of k-means is its simplicity, high efficiency and scalability. However, is also comprises of number of limitations: random selection of initial centroids, number of cluster K need to be initialized and influence by outliers. In view of these deficiencies, this paper represents a survey of improvements done to traditional k-means to handle such limitations and we will compare K-means clustering algorithm with various clustering algorithm.

 References

[1] M H Dunham, “Data Mining: Introductory and Advanced Topics,” Prentice Hall, 2002.
[2] Birdsall, C.K., Landon, “A.B.: Plasma Physics via Computer Simulation,” Adam Hilger, Bristol (1991).
[3] Wang Shunye “An Improved K-means Clustering Algorithm Based on Dissimilarity” 2013 International Conference on Mechatronic Sciences, Electric Engineering and Computer (MEC)Dec 20-22, 2013, Shenyang, China IEEE
[4] Navjot Kaur, Jaspreet Kaur Sahiwal, Navneet Kaur “EFFICIENT KMEANSCLUSTERING ALGORITHM USING RANKING METHOD IN DATA MINING” ISSN: 2278 – 1323 International Journal of Advanced Research in Computer Engineering & Technology Volume 1, Issue 3, May2012
[5] Don Kulasiri, Sijia Liu, Philip K. Maini and RadekErban, “DifFUZZY: A fuzzy clustering algorithm for complex data sets” , International Journal of Computational Intelligence in Bioinformatics and Systems Biology vol.1, no.4,pp. 402-417, 2010.
[6] Jenn-Long Liu, Yu-Tzu Hsu, Chih-Lung Hung , “ Development of Evolutionary Data Mining Algorithms and their Applications to Cardiac Disease Diagnosis”, WCCI 2012 IEEE World Congress on Computational Intelligence, June 2012.
[7] Md. Sohrab Mahmud, Md. Mostafizer Rahman, and Md.Nasim Akhtar “Improvement of K-means Clustering algorithm with better initial centroids based on weighted average” 2012 7th International Conference on Electrical and Computer Engineering 20-22 December, 2012, Dhaka, Bangladesh, 2012 IEEE
[8] Pallavi Purohit “A new Efficient Approach towards k-means Clustering Algorithm” ,International journal of Computer Applications,Vol 65-no 11,march 2013
[9] Juntao Wang & Xiaolong Su “An improved K-Means clustering algorithm” 2011 IEEE
[10] Raju G, Binu Thomas, Sonam Tobgay and Th. Shanta Kumar“Fuzzy Clustering Methods in Data Mining:A comparative Case Analysis” 2008 International Conference on advanced computer theory and engineering,2008 IEEE

Keywords
Data Mining, Clustering, K-means algorithm, Fuzzy C-means algorithm, Genetic algorithm, Genetic algorithm-K-Means (GAKM).