Semantic RDF Based Integration Framework for Heterogeneous XML Data Sources

  IJETT-book-cover  International Journal of Engineering Trends and Technology (IJETT)          
  
© 2015 by IJETT Journal
Volume-19 Number-3
Year of Publication : 2015
Authors : Deniz KILINÇ, Fatma BOZYİĞİT, Pelin YILDIRIM
DOI :  10.14445/22315381/IJETT-V19P229

Citation 

Deniz KILINÇ , Fatma BOZY???T , Pelin YILDIRIM "Semantic RDF Based Integration Framework for Heterogeneous XML Data Sources", International Journal of Engineering Trends and Technology (IJETT), V19(3), 168-173 Jan 2015. ISSN:2231-5381. www.ijettjournal.org. published by seventh sense research group

Abstract

A significant amount of data on the Web is in the XML format or may easily be converted to XML or to its variations. XML is still the most appropriate language for data interchange and serialization. In this paper, a new framework which can integrate any heterogeneous XML data sources is presented. Each data source is translated into semantically meaningful regular expressions without changing original data source. Proposed framework has two major phases for data preparation. In the first phase, each data source is processed to obtain regular expressions which accommodate with the design choices that made in target by utilizing known global semantic vocabulary as an input. The second phase combines these regular expressions to get a global schema by preserving the original source data. A regular expression generator tool which produces regular expressions by regarding vocabulary and an integrator tool box which integrates and processes regular expressions, are also introduced.

References

[1] T. Bray, J. Paoli, C. M. Sperberg-McQueen, Markup Language(XML) 1.0 W3C Recommendation, February 1988.
[2] D. K?l?nç and A. Kut, “XML teknolojisine gerçekçi yakla??m”, in Türkiye`de Internet" Konferans? (Inet-tr’09), 2003.
[3] (2014) RDF specification. [Online]. Available: http:// www.w3c.org/RDF/
[4] (1999) XPath specification. [Online]. Available: http://www.w3.org/TR/xpath/.
[5] J. Clarke, XSL Transformations (XSLT) version 1.0. W3C Recommendation, November 1999.
[6] A. Halevy, O. Etzioni, A. Doan, Z. Ives, J. Madhavan, L. McDowell and I. Tatarinov, Crossing the Structure Chasm, in CIDT’03, 2003.
[7] L. Popa, Y. Velegrakis, R. J. Miller, M. A. Hernandez and R. Fagin, “Translating Web Data”, in Proceedings of the 28th VLDB Conference, pp. 598–609, 2002.
[8] B. Amann, C. Beeri, I. Fundulaki and M. Scholl, “Ontology-based integration of XML web resources”, The Semantic Web — ISWC 2002, vol. 2342, pp. 117–131, May 2002.
[9] R. Vdovjak and G. Houben, “RDF Based Architecture for Semantic Integration of Heterogeneous Information Sources”, in International Workshop on Information Integration on the Web, pp. 51-57, April 2001.
[10] I. F. Cruz, H. Xiao, and F. Hsu. “An Ontology-based Framework for Semantic Interoperability between XML Sources”, in Eighth International Database Engineering & Applications Symposium (IDEAS 2004), July 2004.
[11] (2004) XQuery specification [Online]. Available: http://www.w3.org/XML/Schema.
[12] G. Premkumar, K. Ramamurthy and Sree Nilakanta “Implementation of Electronic Data Interchange: An Innovation Diffusion Perspective”, Journal of Management Information Systems, vol. 11, no. 2, pp. 157-186, Fall 1994.
[13] P. Godefroid, and W. Pförtsch, “Business-to-business-marketing”, Kiehl, 2003.

Keywords
Integration, XML Data Source, RDF, XPath, XQuery