An Italian Twitter Corpus of Hate Speech against Immigrants

Abstract

The paper describes a recently-created Twitter corpus of about 6,000 tweets, annotated for hate speech against immigrants, and developed to be a reference dataset for an automatic system of hate speech monitoring. The annotation scheme was therefore specifically designed to account for the multiplicity of factors that can contribute to the definition of a hate speech notion, and to offer a broader tagset capable of better representing all those factors, which may increase, or rather mitigate, the impact of the message. This resulted in a scheme

Publication
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
