It Is Not Only the Negative that Deserves Attention!

Understanding, Generation & Evaluation of (Positive) Moderation

Authors

Iman Jundi, Eva Maria Vecchi, Carlotta Quensel, Neele Falk, Gabriella Lapesa

Abstract

Moderation is essential for maintaining and improving the quality of online discussions. This involves: (1) countering negativity, e.g. hate speech and toxicity, and (2) promoting positive discourse, e.g. broadening the discussion to involve other users and perspectives. While significant efforts have focused on addressing negativity, driven by the urgency of such issues, this has left moderation that promotes positive discourse (henceforth Positive Moderation) under-studied. With the recent advancements in LLMs, Positive Moderation can potentially be scaled to vast conversations, fostering more thoughtful discussions and bridging the increasing divide in online interactions. We advance the understanding of Positive Moderation by annotating a dataset on 13 moderation properties, e.g. neutrality, clarity and curiosity. We extract instructions from professional moderation guidelines and use them to prompt LLaMA to generate such moderation. This is followed by an extensive evaluation showing that (1) annotators rate generated moderation higher than professional moderation, but still slightly prefer professional moderation in pairwise comparison, and (2) LLMs can be used to estimate human evaluation as an efficient alternative.

Details

Organisational unit(s)
Fachgebiet Maschinelle Sprachverarbeitung
External organisation(s)
Universität Stuttgart
Heinrich-Heine-Universität Düsseldorf
GESIS - Leibniz-Institut für Sozialwissenschaften
Type
Paper in conference proceedings
Pages
11360-11395
Number of pages
36
Publication date
04.2025
Publication status
Published
Peer-reviewed
Yes
ASJC Scopus subject areas
Computer Networks and Communications, Hardware and Architecture, Information Systems, Software
Electronic version(s)
https://doi.org/10.18653/v1/2025.naacl-long.567 (Access: Open)
