A Survey on Feature Selection Using FAST Approach to Reduce High Dimensional Data

International Journal of Engineering Trends and Technology (IJETT)
© 2014 by IJETT Journal
Volume-8 Number-5
Year of Publication: 2014
Authors: R. Munieswari, S. Saranya
DOI: 10.14445/22315381/IJETT-V8P242

Citation 

R. Munieswari, S. Saranya. "A Survey on Feature Selection Using FAST Approach to Reduce High Dimensional Data", International Journal of Engineering Trends and Technology (IJETT), V8(5), 229-231, February 2014. ISSN: 2231-5381. www.ijettjournal.org. Published by Seventh Sense Research Group.

Abstract

Feature selection is the process of identifying the most useful subset of features. This survey summarises most of the existing feature selection methods and algorithms. Feature selection methods typically consist of four basic steps, and existing algorithms are classified in terms of their generation methods and evaluation functions. Representative methods are chosen from each category, and the strengths and weaknesses of the different algorithms are explained. The aim is to select some of the features to form a feature subset. Feature selection has been an effective technique for dimensionality reduction, removing irrelevant data, increasing learning accuracy, and improving comprehensibility. The increase in the dimensionality of data poses a severe challenge to many existing feature selection methods with respect to efficiency and effectiveness: in finding a subset of features, efficiency relates to the time required, and effectiveness relates to the quality of the selected subset. Many existing feature selection algorithms remove only irrelevant features, whereas the FAST algorithm removes both irrelevant and redundant features. This survey mainly focuses on a comparison of various techniques and algorithms for the feature selection process.
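
To make the distinction between irrelevant and redundant features concrete, the following is a minimal Python sketch of a filter-style selector. It is not the FAST algorithm itself, which groups features by clustering; it is a simplified greedy filter written for illustration. The function names, the thresholds relevance_min and redundancy_max, and the toy data are all assumptions that do not come from the paper. Symmetric uncertainty is used as the correlation measure, and features are assumed to take discrete values.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def symmetric_uncertainty(x, y):
    """SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)), a value in [0, 1]."""
    hx, hy = entropy(x), entropy(y)
    if hx + hy == 0:
        return 0.0  # both variables are constant
    mutual_info = hx + hy - entropy(list(zip(x, y)))
    return 2.0 * mutual_info / (hx + hy)


def select_features(features, target, relevance_min=0.1, redundancy_max=0.9):
    """Greedy filter (illustrative only): drop irrelevant, then redundant features.

    features: dict mapping a feature name to its list of discrete values.
    The two thresholds are hypothetical defaults, not values from the paper.
    """
    # Step 1: keep only features that are sufficiently relevant to the target.
    relevance = {
        name: symmetric_uncertainty(column, target)
        for name, column in features.items()
    }
    candidates = sorted(
        (name for name, su in relevance.items() if su >= relevance_min),
        key=lambda name: relevance[name],
        reverse=True,
    )
    # Step 2: visit candidates from most to least relevant and skip any
    # feature that is nearly a copy of one already selected.
    selected = []
    for name in candidates:
        if all(
            symmetric_uncertainty(features[name], features[kept]) < redundancy_max
            for kept in selected
        ):
            selected.append(name)
    return selected


# Toy data, made up for illustration.
target = [0, 0, 1, 1, 0, 1, 0, 1]
features = {
    "a": [0, 0, 1, 1, 0, 1, 0, 1],       # matches the target: relevant
    "a_copy": [1, 1, 0, 0, 1, 0, 1, 0],  # inverse of "a": relevant but redundant
    "noise": [0, 1, 0, 1, 1, 0, 0, 1],   # independent of the target: irrelevant
}

print(select_features(features, target))  # ['a']
```

In this toy run, "noise" is discarded in the first step because it carries no information about the target, and "a_copy" is discarded in the second step because it duplicates the information in "a". A selector that performed only the first step would have kept both "a" and "a_copy".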

Keywords
Feature selection, classification, filter method, hybrid method, redundant features, irrelevant features.