Citation

The unappreciated role of intent in algorithmic moderation of abusive content on social media

Author:
Wang, Xinyu; Koneru, Sai; Venkit, Pranav Narayanan; Frischmann, Brett; Rajtmajer, Sarah
Publication:
Harvard Kennedy School Misinformation Review
Year:
2025

A significant body of research is dedicated to developing language models that can detect various types of online abuse, such as hate speech and cyberbullying. However, there is a disconnect between platform policies, which often consider the author’s intention as a criterion for content moderation, and the current capabilities of detection models, which typically lack efforts