News Recommendation Systems Using Web Mining: A Study

International Journal of Engineering Trends and Technology (IJETT)
© 2014 by IJETT Journal
Volume-12 Number-6
Year of Publication: 2014
Authors: Dr. M. Durairaj, K. Muthu Kumar


Dr. M. Durairaj, K. Muthu Kumar. "News Recommendation Systems Using Web Mining: A Study", International Journal of Engineering Trends and Technology (IJETT), V12(6), 293-299, June 2014. ISSN: 2231-5381. Published by Seventh Sense Research Group.


News reading has changed with the advance of the World Wide Web (WWW), from the traditional model of consuming news through physical newspaper subscriptions to accessing thousands of sources via the internet. Online news reading has become very popular because the web provides access to news articles from millions of sources around the world. Web users are also undergoing a transformation: they now express themselves by sharing opinions on an item through ratings, reviews, or comments, by sharing and tagging content, or by contributing new content. Recommendation, filtering, and summarization of Web news have received much attention in Web intelligence, with the aim of finding interesting news and producing concise summaries for users. In this paper, we survey different proposals and approaches that harness users' collective intelligence through their interactions with content, their contributions, and their navigation patterns in order to suggest the best recommendations. The paper also compares the various models used to address the problems of news recommendation.



Web Mining, Lexical Chains, Bayesian Framework for User Interest Prediction, Topic Analysis Model, Keyword Extraction Algorithm, Newsletters System, Collective Intelligence.