ABSTRACT
How do societies learn and maintain social norms? Here we use multiagent reinforcement learning to investigate the learning dynamics of enforcement and compliance behaviors. Artificial agents populate a foraging environment and need to learn to avoid a poisonous berry. Agents learn to avoid eating poisonous berries more reliably when doing so is taboo, i.e., when the behavior is punished by other agents. The taboo helps overcome a credit assignment problem in discovering delayed health effects. Critically, introducing an additional taboo, which results in punishment for eating a harmless berry, further improves overall returns. This "silly rule" counterintuitively has a positive effect because it gives agents more practice in learning rule enforcement. By probing what individual agents have learned, we demonstrate that normative behavior relies on a sequence of learned skills: learning rule compliance builds upon prior learning of rule enforcement by other agents. Our results highlight the benefit of a multiagent reinforcement learning computational model in which agents must learn to implement complex actions.
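The credit assignment claim can be illustrated with a minimal sketch. All numbers here (reward magnitudes, the `poison_delay`, the function name) are illustrative assumptions, not values from the paper: without a taboo, the first negative feedback for eating a poisonous berry arrives only after a delay, whereas an enforced taboo delivers the sanction at the step of the transgression itself, shortening the horizon over which credit must be assigned.

```python
def reward_timeline(eat_step, horizon, poison_delay=5, taboo=False):
    """Per-step rewards after an agent eats a poisonous berry.

    Without a taboo, the only negative signal is the delayed health
    penalty, so credit must be assigned across `poison_delay` steps.
    With an enforced taboo, other agents punish immediately, and the
    negative feedback coincides with the transgression.
    All values are illustrative, not taken from the paper.
    """
    rewards = [0.0] * horizon
    rewards[eat_step] += 1.0                     # eating is immediately rewarding
    if taboo:
        rewards[eat_step] -= 2.0                 # immediate social sanction
    if eat_step + poison_delay < horizon:
        rewards[eat_step + poison_delay] -= 3.0  # delayed health cost
    return rewards


def first_negative(rewards):
    """Index of the earliest net-negative step, i.e., the first learning signal."""
    return next(t for t, r in enumerate(rewards) if r < 0)


print(first_negative(reward_timeline(0, 10)))             # 5 (delayed signal)
print(first_negative(reward_timeline(0, 10, taboo=True))) # 0 (immediate signal)
```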
Subjects
Learning, Reinforcement (Psychology), Social Norms, Environment, Humans
ABSTRACT
As governments and industry turn to increased use of automated decision systems, it becomes essential to consider how closely such systems can reproduce human judgment. We identify a core potential failure, finding that annotators label objects differently depending on whether they are being asked a factual question or a normative question. This challenges a natural assumption maintained in many standard machine-learning (ML) data acquisition procedures: that there is no difference between predicting the factual classification of an object and an exercise of judgment about whether an object violates a rule premised on those facts. We find that using factual labels to train models intended for normative judgments introduces a notable measurement error. We show that models trained using factual labels yield significantly different judgments than those trained using normative labels and that the impact of this effect on model performance can exceed that of other factors (e.g., dataset size) that routinely attract attention from ML researchers and practitioners.
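The factual-versus-normative gap can be made concrete with a toy sketch. The labels below are synthetic and purely illustrative, not the paper's data: the same items receive a factual label (e.g., "does the post contain profanity?") and a normative label (e.g., "does the post violate the civility rule?"), and training targets drawn from the factual column systematically over-count violations relative to the normative judgment.

```python
# Synthetic, illustrative labels (not the paper's data): 1 = yes, 0 = no.
factual   = [1, 1, 1, 0, 1, 0, 1, 0]  # "does the item contain the fact?"
normative = [1, 0, 1, 0, 0, 0, 1, 0]  # "does the item violate the rule?"


def label_gap(factual, normative):
    """Disagreement rate and over-flagging rate of factual vs. normative labels.

    A model trained on the factual column inherits the over-flagging
    rate as measurement error on the normative task.
    """
    n = len(factual)
    disagreement = sum(f != v for f, v in zip(factual, normative)) / n
    over_flagging = (sum(factual) - sum(normative)) / n
    return disagreement, over_flagging


print(label_gap(factual, normative))  # (0.25, 0.25)
```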