[C54] E. Kalapanidas and N. Avouris, Feature Selection using a Genetic Algorithm applied on an Air Quality Forecasting Problem, 3rd BESAI, Proc. ECAI 2002, Lyon, July 2002. (pdf)
Feature selection is a process followed in order to improve the generalization and the performance of several classification and/or regression algorithms. Feature selection processes are divided in two categories, the filter and the wrapper approach. The formal is performed independently of the learning algorithm while the later makes use of the algorithm in an iterative way.
This paper focuses on the exploitation of a genetic algorithm used to extract an optimal feature subset of a large database containing pollutant concentration measurements, following the wrapper approach. The feature subset feeds a nearest neighbor algorithm in order to predict the daily maximum concentration for two pollutants. The encoding problem of the complexity of representation of the features in the genomes is tackled. Results of the experimentation on an air quality forecasting problem will be presented, as well as slight alterations on the standard simple genetic algorithm paradigm that guided the algorithm to a mature convergence and gave good solutions.