Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech
Employing language models to generate explanations for an incoming implicit hate post is an active area of research. The explanation is intended to make explicit the underlying stereotype and aid content moderators. The training often combines top-k relevant knowledge graph (KG) tuples to provide world knowledge and improve performance on …