Search | BVS Regional Portal

Editorial: Human-centered AI: Crowd computing.

Yang, Jie; Bozzon, Alessandro; Gadiraju, Ujwal; Lease, Matthew.

Front Artif Intell ; 6: 1161006, 2023.

Article in English | MEDLINE | ID: mdl-37009200

In Search of Ambiguity: A Three-Stage Workflow Design to Clarify Annotation Guidelines for Crowd Workers.

Pradhan, Vivek Krishna; Schaekermann, Mike; Lease, Matthew.

Front Artif Intell ; 5: 828187, 2022.

Article in English | MEDLINE | ID: mdl-35664506

ABSTRACT

We propose a novel three-stage FIND-RESOLVE-LABEL workflow for crowdsourced annotation to reduce ambiguity in task instructions and, thus, improve annotation quality. Stage 1 (FIND) asks the crowd to find examples whose correct label seems ambiguous given task instructions. Workers are also asked to provide a short tag that describes the ambiguous concept embodied by the specific instance found. We compare collaborative vs. non-collaborative designs for this stage. In Stage 2 (RESOLVE), the requester selects one or more of these ambiguous examples to label (resolving ambiguity). The new label(s) are automatically injected back into task instructions in order to improve clarity. Finally, in Stage 3 (LABEL), workers perform the actual annotation using the revised guidelines with clarifying examples. We compare three designs using these examples: examples only, tags only, or both. We report image labeling experiments over six task designs using Amazon's Mechanical Turk. Results show improved annotation accuracy and further insights regarding effective design for crowdsourced annotation tasks.
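The RESOLVE stage described above, where requester labels for flagged examples are injected back into the instructions, can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the class names `AmbiguousExample` and `Guidelines`, the `resolve` function, and the `mode` values are all hypothetical, chosen to mirror the three LABEL designs the abstract mentions (examples only, tags only, or both).

```python
from dataclasses import dataclass, field


@dataclass
class AmbiguousExample:
    """An instance a worker flagged in the FIND stage (hypothetical structure)."""
    item_id: str
    tag: str  # short worker-supplied description of the ambiguous concept


@dataclass
class Guidelines:
    """Task instructions plus clarifying examples added in the RESOLVE stage."""
    text: str
    clarifications: list = field(default_factory=list)

    def render(self, mode="both"):
        """Render instructions showing examples only, tags only, or both."""
        lines = [self.text]
        for example, label in self.clarifications:
            if mode == "examples":
                lines.append(f"{example.item_id} -> {label}")
            elif mode == "tags":
                lines.append(f"'{example.tag}' -> {label}")
            elif mode == "both":
                lines.append(f"{example.item_id} ('{example.tag}') -> {label}")
            else:
                raise ValueError(f"unknown mode: {mode}")
        return "\n".join(lines)


def resolve(guidelines, found, requester_labels):
    """RESOLVE: add the requester's labels for selected examples to the guidelines.

    Examples the requester did not label are left out of the instructions.
    """
    for example in found:
        if example.item_id in requester_labels:
            label = requester_labels[example.item_id]
            guidelines.clarifications.append((example, label))
    return guidelines
```

A hypothetical use: given `found = [AmbiguousExample("img_07", "toy dog")]` and a requester decision `{"img_07": "not a dog"}`, calling `resolve(Guidelines("Label images that contain a dog."), found, {"img_07": "not a dog"})` yields guidelines whose `render("tags")` output appends the clarified tag to the original instruction text. The item identifiers and the dog-labeling task here are made up for illustration.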

Aggregating and Predicting Sequence Labels from Crowd Annotations.

Nguyen, An T; Wallace, Byron C; Li, Junyi Jessy; Nenkova, Ani; Lease, Matthew.

Proc Conf Assoc Comput Linguist Meet ; 2017: 299-309, 2017.

Article in English | MEDLINE | ID: mdl-29093611

ABSTRACT

Despite sequences being core to NLP, scant work has considered how to handle noisy sequence labels from multiple annotators for the same text. Given such annotations, we consider two complementary tasks: (1) aggregating sequential crowd labels to infer a best single set of consensus annotations; and (2) using crowd annotations as training data for a model that can predict sequences in unannotated text. For aggregation, we propose a novel Hidden Markov Model variant. To predict sequences in unannotated text, we propose a neural approach using Long Short Term Memory. We evaluate a suite of methods across two different applications and text genres: Named-Entity Recognition in news articles and Information Extraction from biomedical abstracts. Results show improvement over strong baselines. Our source code and data are available online.
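To make the aggregation task concrete, the sketch below shows token-level majority voting over crowd labels. This is a simple baseline swapped in for illustration, not the Hidden Markov Model variant the abstract proposes; the function name `majority_vote` and the BIO-style tags in the usage note are assumptions.

```python
from collections import Counter


def majority_vote(annotations):
    """Aggregate per-token labels from several annotators by majority vote.

    annotations: list of label sequences, one per annotator, all the same length.
    Ties go to the label encountered first among the annotators.
    """
    if not annotations:
        return []
    length = len(annotations[0])
    if any(len(seq) != length for seq in annotations):
        raise ValueError("all annotators must label the same tokens")
    consensus = []
    # zip(*annotations) groups the labels that all annotators gave to one token.
    for token_labels in zip(*annotations):
        consensus.append(Counter(token_labels).most_common(1)[0][0])
    return consensus
```

For instance, three annotators labeling a three-token sentence could be passed as `majority_vote([["B-PER", "I-PER", "O"], ["B-PER", "O", "O"], ["O", "I-PER", "O"]])`. Because each token is voted on independently, this baseline ignores dependencies between adjacent labels, which is the kind of structure a sequence model can take into account.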
