Fuzzy FP-Tree based Data Replication Management System in Cloud

  IJETT-book-cover  International Journal of Engineering Trends and Technology (IJETT)          
  
© 2016 by IJETT Journal
Volume-36 Number-9
Year of Publication : 2016
Authors : P.Elango, Dr. Kuppusamy
DOI :  10.14445/22315381/IJETT-V36P288

Citation 

P.Elango, Dr. Kuppusamy"Fuzzy FP-Tree based Data Replication Management System in Cloud", International Journal of Engineering Trends and Technology (IJETT), V36(9),481-489 June 2016. ISSN:2231-5381. www.ijettjournal.org. published by seventh sense research group

Abstract
Replication is a better way for enhancing high availability and increasing performance of accessing the data in database systems. The purpose of data replication is to improve transaction response time, throughput, and system availability in case of failure. The main aim of this paper is to provide a enhanced technique for solving the shortcomings of the currently existing works of data replica method in cloud environment. This work intends to propose Data Replication System based on data mining techniques. In proposed method, we will use a cloud computing with Combination of replication algorithm and job scheduling policy for data replication in the data cloud environment through monitoring all job process. The data replication will be done by identifying the frequently used data patterns in the large database of a node. This will be done by Fuzzy FpTree based frequent pattern mining algorithm. The system availability and replica is measured and identifying the location in which the replicated data will be stored and its performance shows most promising result compared to the apriori and FPGrowth approaches.

 References

1. Buyya, R., et al., Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility. Future Gener. Comput. Syst., 2009. 25(6): p. 599-616
2. Bj, M., et al., Optimizing service replication in clouds, in Proceedings of the Winter Simulation Conference. 2011, Winter Simulation Conference: Phoenix, Arizona. p. 3312-3322.
3. Ghemawat, S., H. Gobioff, and S.-T. Leung, The Google file system. SIGOPS Oper. Syst. Rev., 2003. 37(5): p. 29-43.
4. Shvachko, K., et al., The Hadoop Distributed File System, in Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST). 2010, IEEE Computer Society. p. 1-10.
5. Sun, D.-W., et al., Modeling a Dynamic Data Replication Strategy to Increase System Availability in Cloud Computing Environments. Journal of Computer Science and Technology, 2012. 27(2): p. 256-272.
6. Nguyen, T., A. Cutway, and W. Shi, Differentiated replication strategy in data centers, in Proceedings of the 2010 IFIP international conference on Network and parallel computing. 2010, Springer-Verlag: Zhengzhou, China. p. 277- 288.
7. Ghemawat, S., H. Gobioff, and S.-T. Leung, The Google file system. SIGOPS Oper. Syst. Rev., 2003. 37(5): p. 29-43.
8. Shvachko, K., et al., The Hadoop Distributed File System, in Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST). 2010, IEEE Computer Society. p. 1-10.
9. Dogan, A., A study on performance of dynamic file replication algorithms for real-time file access in Data Grids, Future Gener. Comput. Syst., 2009. 25(8): p. 829-839.
10. Wei, Q., et al., CDRM: A cost-effective dynamic replication management scheme for cloud storage cluster., in 2010, IEEE International on Cluster Computing. 2010. p. 188 - 196
11. Wang, S.-S., K.-Q. Yan, and S.-C. Wang, Achieving efficient agreement within a dual-failure cloud- computing environment. Expert Syst. Appl., 2011. 38(1): p. 906-915.
12. Y.K. Kwok, K. Karlapalem, I. Ahmad, N.M. Pun, Design and Evaluation of data allocation algorithms for distributed database systems, IEEE J. Selected Areas Comm. (Special Issue on Distributed Multimedia Systems), 14(7) (September 1996) 1332–1348
13. L.W. Dowdy, D.V. Foster, Comparative models of the file assignment problem, ACM Comput. Surveys 14(2) (June 1982), 287–313
14. B. Awerbuch, Y. Bartal, A. Fiat, Optimally-competitive distributed file allocation, 25th Annual ACM STOC, Victoria, BC, Canada, 1993, pp. 164–173.
15. O. Wolfson, S. Jajodia, Y. Huang, An adaptive data replication algorithm, ACM Trans. Database Systems 22(4), 255–314 (June 1997).
16. Agrawal, R. & Srikant, R. (1995). Mining sequential patterns. In The eleventh IEEE international conference on data engineering (pp. 3–14).
17. Han, J., Pei, J. & Yin, Y. (2000). Mining frequent patterns without candidate generation. In The 2000 ACM SIGMOD international conference on management of data (pp. 1–12)
18. F. Javier Lopez, Marta Cuadros, Armando Blanco and Angel Concha:“Unveiling Fuzzy Associations Between Breast Cancer Prognostic Factors and Gene Expression Data”, 20th International Workshop on Database and Expert Systems Application, pp.338-342,(2009).

Keywords
Data Replication, fptree, fpgrowth, cloud storage, data mining.