Douglas B. Pereira, Alexandre Plastino, Rafael B. Pereira, Bianca Zadrozny, Luiz Henrique de C. Merschmann, Alex A. Freitas.
Attribute selection is a data preprocessing step used to identify at- tributes relevant to the classification task. Recently, a lazy technique which postpones the choice of attributes to the moment an instance is submitted to classification was proposed. In the original lazy technique proposal, a measure based on the entropy concept was presented to evaluate the quality of the attri- butes. In this work, we propose four new measures, based on: the chi-square statistic test, the Cramer coefficient, the Gini index and the gain ratio concept. Experimental results show the relevance of this proposal since, for a large num- ber of datasets, the best performance of the lazy selection strategy was achieved when the new measures were used.
http://www.lbd.dcc.ufmg.br/colecoes/enia/2011/0026.pdf
Caso o link acima esteja inválido, faça uma busca pelo texto completo na Web: Buscar na Web