A Survey Paper on Text Mining Techniques

C.Uma; S.Krithika; C.Kalaivani

doi:https://doi.org/10.14445/22315381/IJETT-V40P237

Research Article | Open Access | Download PDF

Volume 40 | Number 1 | Year 2016 | Article Id. IJETT-V40P237 | DOI : https://doi.org/10.14445/22315381/IJETT-V40P237

A Survey Paper on Text Mining Techniques

C.Uma, S.Krithika, C.Kalaivani

Citation :

C.Uma, S.Krithika, C.Kalaivani, "A Survey Paper on Text Mining Techniques," International Journal of Engineering Trends and Technology (IJETT), vol. 40, no. 1, pp. 225-229, 2016. Crossref, https://doi.org/10.14445/22315381/IJETT-V40P237

Abstract

Text mining, also referred to as text data mining, roughly equivalent to text analytics, refers to the process of deriving high-quality information from text. High-quality information is typically derived through the devising of patterns and trends through means such as statistical pattern learning. There are many techniques for text mining. In this paper we describe the techniques, Information Extraction, Information retrieval, Query processing, Natural Language processing, Categorization, Clustering. We also discuss future challenges of this area using different techniques, particularly rough set based text mining techniques, improvements and research directions in this paper.

Keywords

Data mining, Text mining, Rough sets, Classification, Summarization, and Text categorization, Clustering, Information Extraction, Information Retrieval.

References

[1] Berry Michael W., (2004), “Automatic Discovery of Similar Words”, in “Survey of Text Mining: Clustering, Classification and Retrieval”, Springer Verlag, New York, LLC, 24-43.
[2] Navathe, Shamkant B., and Elmasri Ramez, (2000), “Data Warehousing And Data Mining”, in “Fundamentals of Database Systems”, Pearson Education pvt Inc, Singapore, 841-872.
[3] Shaidah Jusoh and Hejab M. Alfawareh, “Techniques, Applications and Challenging Issue in Text Mining”, IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 6, No 2, November 2012, ISSN (Online): 1694-0814.
[4] Fred Popowich, “Using Text Mining and Natural Language Processing for Health Care Claims Processing”
[5] Hejab M. Alfawareh, Shaidah Jusoh, “Resolving Ambiguous Entity through Context Knowledge and Fuzzy Approach”, International Journal on Computer Science and Engineering (IJCSE), ISSN : 0975-3397 Vol. 3 No. 1 Jan 2011
[6] D. Cutting, D. Karger, J. Pedersen, J. Tukey. Scatter/Gather: ACluster-based Approach to Browsing Large Document Collections.ACM SIGIR Conference, 1992.
[7] L. Baker, A. McCallum. Distributional Clustering of Words for Text Classification,ACM SIGIR Conference , 1998.
[8] R. Bekkerman, R. El-Yaniv, Y. Winter, N. Tishby. On Feature Dis-tributional Clustering for Text Categorization.ACM SIGIR Con-ference, 2001.
[9] K. Nigam, A. McCallum, S. Thrun, T. Mitchell. Learning to classify text from labeled and unlabeled documents.AAAI Conference,1998.