Automatic Normalization of Punjabi Words

Volume-6 Number-7
Year of Publication : 2013
Authors : Vishal Gupta


Vishal Gupta. "Automatic Normalization of Punjabi Words". International Journal of Engineering Trends and Technology (IJETT). V6(7):353-357 Dec 2013. ISSN:2231-5381. Published by Seventh Sense Research Group.


For any language, automatic normalization of words is a basic linguistic resource required to develop Natural Language Processing (NLP) applications with high accuracy, such as machine translation, document classification, document clustering, text question answering, topic tracking, text summarization, and keyword extraction. Without automatic normalization of words, NLP applications for a language cannot achieve high accuracy. This paper concentrates on automatic normalization of Punjabi words. Punjabi is the official language of the state of Punjab, but it is an under-resourced language: very few computational-linguistic resources are available for it. This is the first time that automatic normalization of Punjabi words has been implemented, and the system can be very useful in building other Punjabi applications with good efficiency. For example, it can be applied in NLP applications such as machine translation, document association, document clustering, topic tracking, and text summarization.


