Quality and Agreement in Multilabel Emotion Annotation: A Case Study and Evaluation Framework

arXiv:2606.21069v1 Announce Type: new Abstract: Emotion annotation is inherently subjective, yet most NLP pipelines still assume "gold" labels, typically produced by majority voting, and treat annotator variation as noise. In this paper, we present a multilabel emotion annotation case study and use it to examine how annotator behavior and aggregation choices affect both agreement estimates and downstream emotion classifiers. Rather than collapsing disagreement into a single label, we represent t...

arXiv cs.CL ·Emily \"Ohman, Anna Koufakou ·
compartilhar: