Pre-processing of Domain Ontology Graph Generation System in Punjabi

International Journal of Engineering Trends and Technology (IJETT)

© 2014 by IJETT Journal
Volume-17 Number-3
Year of Publication: 2014
Authors: Rajveer Kaur, Saurabh Sharma
DOI: 10.14445/22315381/IJETT-V17P229

Citation 

Rajveer Kaur, Saurabh Sharma. "Pre-processing of Domain Ontology Graph Generation System in Punjabi", International Journal of Engineering Trends and Technology (IJETT), V17(3), 141-146, Nov 2014. ISSN: 2231-5381. www.ijettjournal.org. Published by Seventh Sense Research Group.

Abstract

This paper describes the pre-processing phase of an ontology graph generation system for Punjabi text documents from different domains. Pre-processing converts the input text into a structured representation. In this system it consists of applying input restrictions to the text, removing special symbols and punctuation marks, removing duplicate terms, removing stop words, and extracting terms by matching the input terms against dictionary and gazetteer lists.
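
The steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `preprocess`, the tiny `STOP_WORDS` and `DOMAIN_TERMS` sets, and the choice to model the input restriction as "keep only Gurmukhi characters" are all assumptions made for illustration. A real system would load full stop-word, dictionary, and gazetteer lists from files.

```python
import re

# Hypothetical sample resources (assumptions, not taken from the paper).
STOP_WORDS = {"ਦਾ", "ਦੇ", "ਦੀ", "ਹੈ", "ਅਤੇ", "ਨੂੰ"}
# Stands in for the dictionary and gazetteer lists of one domain (sports).
DOMAIN_TERMS = {"ਹਾਕੀ", "ਕ੍ਰਿਕਟ"}

# Matches anything that is neither a Gurmukhi character (U+0A00-U+0A7F)
# nor whitespace. This covers ASCII punctuation, Latin letters, ASCII digits,
# and the danda (U+0964), which lies outside the Gurmukhi block.
NON_GURMUKHI = re.compile(r"[^\u0A00-\u0A7F\s]")


def preprocess(text, stop_words=STOP_WORDS, domain_terms=DOMAIN_TERMS):
    """Return the domain terms found in `text`, in order of first appearance."""
    # Input restriction plus removal of special symbols and punctuation.
    cleaned = NON_GURMUKHI.sub(" ", text)
    tokens = cleaned.split()
    # Remove duplicate terms while keeping first-occurrence order.
    unique = list(dict.fromkeys(tokens))
    # Remove stop words.
    content = [t for t in unique if t not in stop_words]
    # Keep only terms that match the dictionary / gazetteer lists.
    return [t for t in content if t in domain_terms]


if __name__ == "__main__":
    sample = "ਹਾਕੀ ਅਤੇ ਕ੍ਰਿਕਟ, ਹਾਕੀ ਦਾ ਮੈਚ ਹੈ।"
    print(preprocess(sample))
```

Duplicate removal here uses `dict.fromkeys`, which preserves insertion order, so the extracted terms come out in the order they first occur in the document. Whether the original system preserves order or matches multi-word gazetteer entries is not stated in the abstract.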

Keywords
Ontology, Pre-processing phase, Ontology Graph, Knowledge Representation, Natural Language Processing.