The unappreciated role of intent in algorithmic moderation of abusive content on social media

Author:: Wang, Xinyu; Koneru, Sai; Venkit, Pranav Narayanan; Frischmann, Brett; Rajtmajer, Sarah
Publication:: Harvard Kennedy School Misinformation Review
Year:: 2025

A significant body of research is dedicated to developing language models that can detect various types of online abuse, for example, hate speech, cyberbullying. However, there is a disconnect between platform policies, which often consider the author’s intention as a criterion for content moderation, and the current capabilities of detection models, which typically lack efforts

See citation in ‘MediaWell Citations Library’ Zotero library

Go to citation source