A dictionary-based approach to racism detection in Dutch social media

Abstract

We present a dictionary-based approach to racism detection in Dutch social media comments, which were retrieved from two public Belgian social media sites likely to attract racist reactions. These comments were labeled as racist or non-racist by multiple annotators. For our approach, three discourse dictionaries were created: first, we created a dictionary by retrieving possibly racist and more neutral terms from the training data, and then augmenting these with more general words to remove some bias. A second dictionary was created

Publication
arXiv preprint arXiv:1608.08738

Main contributions

Coming soon

Limitations

Coming soon

Future directions

Coming soon
