A Novel Multi View Clustering Technique for Group the Data Objects in Process Mining

International Journal of Engineering Trends and Technology (IJETT)
  
© 2017 by IJETT Journal
Volume-44 Number-1
Year of Publication : 2017
Authors : Panila Lokanadham, Jayanthi Rao Madina

Citation 

Panila Lokanadham, Jayanthi Rao Madina, "A Novel Multi View Clustering Technique for Group the Data Objects in Process Mining", International Journal of Engineering Trends and Technology (IJETT), V44(1), 24-27, February 2017. ISSN: 2231-5381. www.ijettjournal.org. Published by Seventh Sense Research Group.

Abstract
Clustering is the process of grouping objects based on some notion of similarity. It is commonly applied for exploratory analysis, segmentation, pre-processing, and data summarization. Similarity depends on the features that describe the data. Clustering ensembles are a common approach to the clustering problem: they combine a collection of clusterings into a superior solution. The key issues are how to generate different candidate solutions and how to combine them. The common approach to generating candidate clustering solutions ignores the multiple representations of the data, and the standard approach of simply selecting the best candidate ignores the fact that clusters drawn from different candidate solutions may together form a better clustering solution. Multi-view clustering can be applied at various stages of the clustering paradigm. This paper proposes a novel multi-view clustering algorithm that combines different ensemble techniques and uses various similarity metrics to measure the similarity between data objects. The algorithm consists of two main techniques. The first generates multiple partitions from each single view of a multi-view dataset. Once the views of the dataset have been prepared, clustering is performed on the multi-view dataset; in this paper the clustering is implemented with the K-Means algorithm. The second technique takes the resulting clusters and combines them, using a cluster-based similarity matrix, to obtain efficient cluster groups. With these techniques, the efficiency of the clustering process can be improved, and the resulting cluster groups contain the most relevant data.
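
The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy two-view dataset, the function names (`kmeans`, `co_association`, `consensus`), the use of three random seeds per view, and the 0.5 threshold are all assumptions made for illustration. In particular, the "cluster-based similarity matrix" is interpreted here as a co-association matrix (the fraction of partitions in which two objects share a cluster), and the final grouping uses a simple thresholded connected-components step as a stand-in for whatever combination procedure the paper actually uses.

```python
import random


def kmeans(points, k, seed, iters=50):
    """Lloyd's k-means on a list of numeric tuples; returns one label per point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # k distinct points as initial centers
    labels = None
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        new_labels = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels


def co_association(partitions):
    """Similarity matrix: fraction of partitions that put objects i and j together."""
    n = len(partitions[0])
    sim = [[0.0] * n for _ in range(n)]
    for labels in partitions:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    sim[i][j] += 1.0 / len(partitions)
    return sim


def consensus(sim, threshold=0.5):
    """Assumed stand-in: link objects whose similarity exceeds the threshold
    and return the connected components as the combined clusters."""
    n = len(sim)
    labels = [-1] * n
    next_label = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = next_label
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and sim[i][j] > threshold:
                    labels[j] = next_label
                    stack.append(j)
        next_label += 1
    return labels


# Hypothetical toy data: six objects described by two different views.
view_a = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.2), (5.0, 5.1), (5.2, 5.0), (5.1, 5.2)]
view_b = [(1.0,), (1.1,), (0.9,), (9.0,), (9.2,), (9.1,)]

# Stage 1: several K-Means partitions per view (different seeds).
partitions = [kmeans(view, k=2, seed=s) for view in (view_a, view_b) for s in range(3)]

# Stage 2: combine all partitions through the similarity matrix.
sim = co_association(partitions)
final = consensus(sim)
print(final)
```

On this well-separated toy data every partition agrees, so the first three objects end up in one combined cluster and the last three in another. On real data the partitions would disagree in places, and the choice of threshold (or of a different consensus procedure) would matter much more.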

Keywords
data mining, cluster, cluster based similarity matrix, process mining, dataset, data objects.