Alejandro Mosquera, Paloma Moreda.
The study of text informality can provide us with valuable information for different NLP tasks. In the particular case of social media texts, their special characteristics like the presence of emoticons, slang or colloquial words can be used for obtaining additional information about their informality level. This paper demonstrates that the discovery of informality levels in Web 2.0 texts can be improved by incorporating formality and informality scores. The classification method based on our proposal reaches a 78% F1 using unsupervised machine learning techniques.
http://www.lbd.dcc.ufmg.br/colecoes/stil/2011/0023.pdf
Caso o link acima esteja inválido, faça uma busca pelo texto completo na Web: Buscar na Web