Shallow Parsing Based on Comma Values

Diego Garat

In the belief that punctuation can aid sentence structure analysis, this work focuses on assigning values to commas in Spanish texts before parsing. Supervised machine learning techniques are applied to learn comma classifiers, taking positional information and part-of-speech tags as input attributes. One of these comma classifiers is combined with a rule-based analyzer to recognize and label text structures. The prior assignment of values to commas made it possible to simplify the recognition rules, with very encouraging results.
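As an illustration of the kind of input attributes mentioned above, the following minimal sketch builds one attribute vector per comma from a sentence that has already been tagged. It is not the feature set used in this work: the function name `comma_features`, the window size, the attribute names, the `PAD` filler and the coarse tag labels are all assumptions made for the example.

```python
from typing import Dict, List, Tuple

Token = Tuple[str, str]  # (word, part-of-speech tag); the tag set is an assumption


def comma_features(tagged: List[Token], window: int = 2) -> List[Dict[str, object]]:
    """Build one attribute dictionary per comma in a tagged sentence.

    Hypothetical attributes: positional information (token index, order of
    the comma among the commas of the sentence, total number of commas) and
    the part-of-speech tags of the neighbours inside a symmetric window.
    """
    comma_positions = [i for i, (word, _) in enumerate(tagged) if word == ","]
    rows: List[Dict[str, object]] = []
    for order, i in enumerate(comma_positions):
        row: Dict[str, object] = {
            "token_index": i,
            "comma_order": order,
            "commas_in_sentence": len(comma_positions),
        }
        for offset in range(-window, window + 1):
            if offset == 0:
                continue  # skip the comma itself
            j = i + offset
            # Positions outside the sentence receive a padding tag.
            row[f"pos[{offset:+d}]"] = tagged[j][1] if 0 <= j < len(tagged) else "PAD"
        rows.append(row)
    return rows


# Example sentence: "Juan, mi vecino, canta."
sentence = [
    ("Juan", "NOUN"), (",", "PUNCT"), ("mi", "DET"), ("vecino", "NOUN"),
    (",", "PUNCT"), ("canta", "VERB"), (".", "PUNCT"),
]
features = comma_features(sentence)
```

Each dictionary in `features` could then be paired with a manually assigned comma value and passed to any supervised learner; the choice of learner and of comma values is left open here.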

