Building a hierarchy of text themes and subthemes

Marco Gonzalez, Ana Martins, Fernanda Barao, Marceu Leite, Vera L. S. de Lima

This paper presents a contribution to the identification of themes and subthemes in Portuguese texts. We build hierarchical structures from terms extracted from a text, aiming to determine their relative order of importance and how they group together. We use techniques such as stemming, extraction of lexical relations, and frequency weighting, together with techniques from algorithms and data structures, such as the construction of maximum spanning trees.
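
To illustrate the last of these techniques, the following is a minimal sketch of maximum spanning tree construction over a graph of terms, using Kruskal's algorithm with the edges sorted by decreasing weight. It is not the authors' implementation: the function name maximum_spanning_tree, the example terms, and the edge weights are hypothetical, and we assume only that each weight expresses how strongly two terms are related.

```python
from typing import Dict, List, Tuple

# An edge links two terms with a weight; higher means more strongly related.
Edge = Tuple[str, str, float]


def maximum_spanning_tree(edges: List[Edge]) -> List[Edge]:
    """Kruskal's algorithm, taking the heaviest edges first."""
    parent: Dict[str, str] = {}

    def find(term: str) -> str:
        # Union-find lookup with path halving.
        parent.setdefault(term, term)
        while parent[term] != term:
            parent[term] = parent[parent[term]]
            term = parent[term]
        return term

    tree: List[Edge] = []
    for a, b, weight in sorted(edges, key=lambda e: e[2], reverse=True):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # The edge joins two separate components, so it creates no cycle.
            parent[root_a] = root_b
            tree.append((a, b, weight))
    return tree


# Hypothetical term pairs and weights, for illustration only.
edges = [
    ("theme", "text", 5.0),
    ("theme", "subtheme", 4.0),
    ("text", "subtheme", 1.0),
    ("text", "term", 3.0),
    ("subtheme", "term", 2.0),
]
tree = maximum_spanning_tree(edges)
for a, b, weight in tree:
    print(a, b, weight)
```

The resulting tree keeps the strongest links that connect all terms without cycles. Rooting it at a chosen term would give one possible hierarchy, although how the root is selected is a separate decision that this sketch does not address.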

