Toxicity detection

Toxicity detection is the automatic identification and classification of toxic, offensive, and abusive language in text. It covers a range of harmful content, including hate speech, harassment, insults, and offensive remarks. Detection is challenging because toxicity is often implicit, depends on cultural context, and is judged differently from one community to another.

Key papers