BDBComp
Parceria:
SBC
Análise do Impacto do Gerador de Conjuntos de Dados em Experimentos de Deduplicação de Dados

Levy de Souza SilvaMirella M. Moro

Using tools to create synthetic datasets is the only solution for evaluating data duplication algorithms when real datasets are not available. However,the evaluation results may be affected by the diversity and levels of parameters available in such tools. Our goal is to verify which parameters and levelsimpact more on the results of deduplication experiments. Hence, we performfactorial projects on datasets created with the most used tool. Results show thattwo parameters explain the largest variation of results.

http://www.lbd.dcc.ufmg.br/colecoes/sbbd/2017/029.pdf

Caso o link acima esteja inválido, faça uma busca pelo texto completo na Web: Buscar na Web

Biblioteca Digital Brasileira de Computação - Contato: bdbcomp@lbd.dcc.ufmg.br
     Mantida por:
LBD