Publication Details

Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness

authored by
Maximilian Spliethöver, Sai Nikhil Menon, Henning Wachsmuth
Abstract

Dialects introduce syntactic and lexical variations in language that occur in regional or social groups. Most NLP methods are not sensitive to such variations. This may lead to unfair behavior of the methods, conveying negative bias towards dialect speakers. While previous work has studied dialect-related fairness for aspects like hate speech, other aspects of biased language, such as lewdness, remain fully unexplored. To fill this gap, we investigate performance disparities between dialects in the detection of five aspects of biased language and how to mitigate them. To alleviate bias, we present a multitask learning approach that models dialect language as an auxiliary task to incorporate syntactic and lexical variations. In our experiments with African-American English dialect, we provide empirical evidence that complementing common learning approaches with dialect modeling improves their fairness. Furthermore, the results suggest that multitask learning achieves state-of-the-art performance and helps to detect properties of biased language more reliably.

Organisation(s)
Institute of Artificial Intelligence
Natural Language Processing Section
External Organisation(s)
Paderborn University
Type
Conference contribution
Pages
9294-9313
No. of pages
20
Publication date
08.2024
Publication status
Published
Peer reviewed
Yes
ASJC Scopus subject areas
Language and Linguistics, Computer Science Applications, Linguistics and Language
Electronic version(s)
https://doi.org/10.48550/arXiv.2406.09977 (Access: Open)
https://doi.org/10.18653/v1/2024.findings-acl.553 (Access: Open)