Hate speech annotation: Analysis of an italian twitter corpus

Abstract

The paper describes the development of a corpus from social media built with the aim of representing and analysing hate speech against some minority groups in Italy. The issues related to data collection and annotation are introduced, focusing on the challenges we addressed in designing a multifaceted set of labels where the main features of verbal hate expressions may be modelled. Moreover, an analysis of the disagreement among the annotators is presented in order to carry out a preliminary evaluation of the data set and the

Publication
4th Italian Conference on Computational Linguistics, CLiC-it 2017

Main contributions

Coming soon

Limitations

Coming soon

Future Directions

Coming soon