Discovering Patterns in Text Mining: A Survey

  IJETT-book-cover  International Journal of Engineering Trends and Technology (IJETT)          
© 2014 by IJETT Journal
Volume-13 Number-1                          
Year of Publication : 2014
Authors : Mr. A. P. Katade , Prof L.J Sankpal


Mr. A. P. Katade , Prof L.J Sankpal. "Discovering Patterns in Text Mining: A Survey", International Journal of Engineering Trends and Technology (IJETT), V13(1),45-49 July 2014. ISSN:2231-5381. published by seventh sense research group


There are many approaches which search the text documents based on the term provided to them, Text mining is a branch of data mining that deals with searching of useful information from large amounts of text documents but these approaches suffer from polysemy and synonymy thus we use pattern based approach and results also have shown that pattern based approaches are better than term based approaches. There are different techniques proposed for discovering patterns in text. We get meaningless pattern when we searched them some of the unidentified pattern also get searched for pruning this pattern we used PTM i.e. Pattern taxonomy model that illustrates the relationship between patterns in documents, to improve the performance of discovered pattern and to get more semantic information .This paper present a techniques of pattern taxonomy for pruning meaning less patterns.


[1] F. Sebastiani., “ Machine learning in automated text categorization,” ACM Computing Surveys, ,34(1):147, 2002
[2] J. Han, J. Pei, and Y. Yin, “Mining Frequent Patterns without Candidate Generation”, Proc. ACM SIGMOD Intl Conf. Management of Data (SIGMOD 00), pp. 1-12, 2000.
[3] J. Han and K.C.-C. Chang,”Data Mining for Web Intelligence,” vol. 35, no. 11, pp. 64-70, Nov. 2002.
[4] K. Aas and L. Eikvil, ‘Text Categorisation: A Survey”, Technical Report Raport NR 941, Norwegian Computing Center, 1999..
[5] Ning Zhong, Yuefeng Li, and Sheng-Tang Wu, “Effective Pattern Dis-covery for Text Mining”, vol.24,No.1,Jan.2012
[6] R. Agrawal and R. Srikant, “Fast Algorithms for Mining Association Rules in Large Databases”, Proc. 20th Intl Conf. Very Large Data Bases (VLDB 94), pp. 478-499, 1994.
[7] S. Scott and S. Matwin, “Feature engineering for text classification”, In ICML99, pages 379388, 1999.
[8] S.-T. Wu, Y. Li, Y. Xu, B. Pham, and P. Chen, “Automatic pattern-taxonomy extraction for web mining. ”, In WI04,pages 242248, 2004.
[9] S.-T. Wu, Y. Li, and Y. Xu, “Deploying Approaches for Pattern Refine-ment in Text Mining”, Proc. IEEE Sixth Intl Conf. Data Mining (ICDM 06), pp. 1157-1161, 2006.
[10] X. Li and B. Liu, “Learning to Classify Texts Using Positive and Unlabeled Data”, Proc. Intl Joint Conf. Artificial Intelligence (IJCAI03), pp. 587-594, 2003.
[11] X. Yan, J. Han, and R. Afshar, ”Clospan: Mining Closed Sequential Patterns in Large Datasets”, , Proc. SIAM Intl Conf. Data Mining (SDM 03), pp. 166-177, 2003.
[12] Y. Huang and S. Lin, “Mining Sequential Patterns Using GraphSearch Techniques”, Proc. 27th Ann. Intl Computer Software andApplications Conf., pp. 4-9, 2003.
[13] Y. Li, C. Zhang, and J.R. Swan, ”An Information Filtering Model on the Web and Its Application in Jobagent,” Knowledge-Based Systems, vol. 13, no. 5, pp. 285-296, 2000

Closed sequential pattern, Information filtering, Pattern mining, Pattern evolution, Text mining.