HateMonitors: Language Agnostic Abuse Detection in Social Media

Punyajoy Saha, Binny Mathews, Animesh Mukherjee, Pawan Goyal

December 2019

Preprint PDF Code Dataset

Abstract

Reducing hateful and offensive content in online social media pose a dual problem for the moderators. On the one hand, rigid censorship on social media cannot be imposed. On the other, the free flow of such content cannot be allowed. Hence, we require efficient abusive language detection system to detect such harmful content in socialmedia. In this paper, we present our machine learning model, HateMonitor, developed for Hate Speech and Offensive Content Identification in Indo-European Languages (HASOC), a shared task at FIRE 2019.W e have used Gradient Boosting model, along with BERT and LASER embeddings, to make the system language agnostic. Our model cameatFirst positionfor the German sub-task A.

Type

Conference paper

Publication

CEUR Workshop Proceedings