Modelling Variability in Human Annotator Simulation

Wen Wu*, Wenlin Chen*, Chao Zhang, Phil Woodland

August 2024

Abstract

Human annotator simulation (HAS) serves as a cost-effective substitute for human evaluation tasks such as data annotation and system assessment. Incorporating the variability present in human evaluation into HAS is important, since doing so helps capture diverse subjective interpretations and mitigate potential biases and over-representation. This work introduces a novel framework for modelling variability in HAS. Conditional softmax flow (S-CNF) is proposed to model the distribution of subjective human annotations; it leverages diverse human annotations via meta-learning. This enables efficient generation of annotations that exhibit human variability for unlabelled inputs. In addition, a wide range of evaluation metrics are adopted to assess the capability and efficiency of HAS systems in predicting the aggregated behaviours of human annotators, matching the distribution of human annotations, and simulating inter-annotator disagreement. Results demonstrate that the proposed method achieves state-of-the-art performance on two real-world human evaluation tasks: emotion recognition and toxic speech detection.

Type

Conference paper

Publication

Findings of the Association for Computational Linguistics: ACL 2024

Tags: Uncertainty
