Natural Language Chhattisgarhi: A Literature Survey

  IJETT-book-cover  International Journal of Engineering Trends and Technology (IJETT)          
© 2014 by IJETT Journal
Volume-12 Number-2
Year of Publication : 2014
Authors : Rijuka Pathak , Somesh Dewangan


Rijuka Pathak , Somesh Dewangan. "Natural Language Chhattisgarhi: A Literature Survey", International Journal of Engineering Trends and Technology (IJETT), V12(2),113-117 June 2014. ISSN:2231-5381. published by seventh sense research group


Chhattishgarhi is a official language in the Indian state of Chhattisgarh. Spoken by 17.5 million people. In this paper we will see the work has been done in the field of natural language processing (NLP) using Chhattisgarhi language and other state languages .main goal of NLP is to create machine learning, create translator, create dictionary and create POS tagger. POS tagger is one of the important tools that are used to develop language translator and information extraction so that computer based be compatible for natural language processing. Part-of-speech tagging is the process of assigning a part-of-speech like noun, verb, pronoun, preposition, adverb, adjective or other lexical class marker to each word in a sentence. There are different types POS tagger are exist, are based on probabilistic approach and some based on morphological approaches. So in this paper we will see various developments of POS tagger and the major work has been done using Chhattishgarhi and other Indian state languages.


[1] Gobinda G.Chowdhary.”Natura language processing”, ,Dept of computer and information science,university of strathclyde,Glasgow G1 1XH,UK.
[2] Artificial intelligence by Elaine rich and Kevin knight.
[3] .Department of Information Technology Ministry of Communications & Information Technology Govt. of India “Unified Parts of Speech (POS) Standard in Indian Languages - Draft Standard –Version 1.0 “
[4] Antony P J ,Research Scholar, Computational Engineering ,and Networking (CEN), “Parts Of Speech Tagging for Indian Languages: A Literature Survey”
[5] fahim Muhammad hasan.”comparison of unigram,bigram,HMM and brill’s POS tagging approaches for some south asian languages”.,
[6] GeorgiGeorgiev and Valentin ZhikovPetyaOsenova and KirilSimov,”Feature-Rich Part-of-speech Tagging for Morphologically Complex Languages: Application to Bulgarian”
[7] .Apart of speech tagger for Indian language(POS tagger) tagset developed at IIT-Hyderabad.
[8] Manish Shrivastava and Pushpak Bhattacharyya (2008), “Hindi POS Tagger Using Naive Stemming : Harnessing Morphological Information Without Extensive Linguistic Knowledge”, Department of Computer Science and Engineering, Indian Institute of Technology, Bombay.
[9] juananotoniop’erez-ortiz and Mikel L.Forcada,”part of speech tagging with recurrent Neural networks”universitatd’Alacant,Spain 2002
[10] HimanshuAgrawal,Anirudh Mani” Part of Speech Tagging and Chunking with Conditional Random Fields.”
[11] Smriti Singh, Kuhoo Gupta“Morphological Richness Offsets Resource Demand- Experiences in Constructing a POS Tagger for Hindi”Department of Computer Science and EngineeringIndian Institute of Technology, BombayPowai, Mumbai
[12] Sathish Chandra Pammi,KishorePrahallad” POS Tagging and Chunking using Decision Forests”
[13] RejwanulHaque, SandipanDandapat, Ankit Kumar Srivastava, Sudip Kumar Naskar and Andy Way” English—Hindi Transliteration Using Context-Informed PB-SMT: the DCU System for NEWS 2009 “
[14] R. Mahesh K. Sinha” using rich morphology in resolving certain hindi-english machine translation divergence
[15] Miguel E. RuizBharathDandala” Evaluating Stemmers and Retrieval Fusion Approaches forHindi: UNT at FIRE 2010”
[16] PhaniGadde, Meher Vijay YeletiIn” Improving statistical POS tagging using Linguistic feature for Hindiand Telugu”
[17] AniketDalalKumarNagarajUmaSawant” .Building Feature Rich POS Tagger for Morphologically Rich Languages:Experiences in Hindi”
[18] AkshaySingh,SushmaBendre.RajeevSangal” HMM Based Chunker for Hindi”
[19] G.M. Ravi Sastry ,SourishChaudhuri ,P. Nagender Reddy” .An HMM based Part-Of-Speech tagger and statistical chunker for 3 Indian languages”
[20] Cohan SujayCarlos,MonojitChoudhurySandipanDandapat” Large-Coverage Root Lexicon Extraction for Hindi”
[21] .Pattabhi R K Rao T, Vijay Sundar Ram R, Vijayakrishna R and Sobha” .A Text Chunker and Hybrid POS Tagger for Indian Languages”
[22] “Word Sense Disambiguation in English to Hindi Machine Translation”
[23] SandipanDandapat” Part-of-Speech Tagging and Chunking with Maximum Entropy Model SandipanDandapat”
[24] Fahim Muhammad Hasan, NaushadUzZaman,Mumit Khan” Comparison of Unigram, Bigram, HMM and Brill’s POS Tagging Approaches for some South Asian Languages”
[25] DebasisMandal, SandipanDandapat, Mayank Gupta, Pratyush Banerjee, Sudeshna Sarkar ” Bengali and Hindi to English Cross-languageText Retrieval under Limited Resources”
[26] Vishal Chourasia ,SamudravijayaK,ManoharChandwani” Phonetically Rich Hindi Sentence Corpus for Creation of Speech Database
[29] ardeep Kumar, VishalGoyal”Development of Hindi-Punjabi Parallel Corpus Using Existing Hindi-Punjabi Machine Translation System and Using Sentence Alignments”
[30] natural language processing by ela kumar


POS Tagger,Chhattisgarhi.