Pesquisa | BVS Doenças Infecciosas e Parasitárias

Designing all-pay auctions using deep learning and multi-agent simulation.

Gemp, Ian; Anthony, Thomas; Kramar, Janos; Eccles, Tom; Tacchetti, Andrea; Bachrach, Yoram.

Sci Rep ; 12(1): 16937, 2022 10 08.

Artigo em Inglês | MEDLINE | ID: mdl-36209288

RESUMO

We propose a multi-agent learning approach for designing crowdsourcing contests and All-Pay auctions. Prizes in contests incentivise contestants to expend effort on their entries, with different prize allocations resulting in different incentives and bidding behaviors. In contrast to auctions designed manually by economists, our method searches the possible design space using a simulation of the multi-agent learning process, and can thus handle settings where a game-theoretic equilibrium analysis is not tractable. Our method simulates agent learning in contests and evaluates the utility of the resulting outcome for the auctioneer. Given a large contest design space, we assess through simulation many possible contest designs within the space, and fit a neural network to predict outcomes for previously untested contest designs. Finally, we apply mirror ascent to optimize the design so as to achieve more desirable outcomes. Our empirical analysis shows our approach closely matches the optimal outcomes in settings where the equilibrium is known, and can produce high quality designs in settings where the equilibrium strategies are not solvable analytically.

Assuntos

Crowdsourcing , Aprendizado Profundo , Simulação por Computador , Motivação

Negotiation and honesty in artificial intelligence methods for the board game of Diplomacy.

Kramár, János; Eccles, Tom; Gemp, Ian; Tacchetti, Andrea; McKee, Kevin R; Malinowski, Mateusz; Graepel, Thore; Bachrach, Yoram.

Nat Commun ; 13(1): 7214, 2022 12 06.

Artigo em Inglês | MEDLINE | ID: mdl-36473833

RESUMO

The success of human civilization is rooted in our ability to cooperate by communicating and making joint plans. We study how artificial agents may use communication to better cooperate in Diplomacy, a long-standing AI challenge. We propose negotiation algorithms allowing agents to agree on contracts regarding joint plans, and show they outperform agents lacking this ability. For humans, misleading others about our intentions forms a barrier to cooperation. Diplomacy requires reasoning about our opponents' future plans, enabling us to study broken commitments between agents and the conditions for honest cooperation. We find that artificial agents face a similar problem as humans: communities of communicating agents are susceptible to peers who deviate from agreements. To defend against this, we show that the inclination to sanction peers who break contracts dramatically reduces the advantage of such deviators. Hence, sanctioning helps foster mostly truthful communication, despite conditions that initially favor deviations from agreements.

Assuntos

Inteligência Artificial , Humanos

Competition-level code generation with AlphaCode.

Li, Yujia; Choi, David; Chung, Junyoung; Kushman, Nate; Schrittwieser, Julian; Leblond, Rémi; Eccles, Tom; Keeling, James; Gimeno, Felix; Dal Lago, Agustin; Hubert, Thomas; Choy, Peter; de Masson d'Autume, Cyprien; Babuschkin, Igor; Chen, Xinyun; Huang, Po-Sen; Welbl, Johannes; Gowal, Sven; Cherepanov, Alexey; Molloy, James; Mankowitz, Daniel J; Sutherland Robson, Esme; Kohli, Pushmeet; de Freitas, Nando; Kavukcuoglu, Koray; Vinyals, Oriol.

Science ; 378(6624): 1092-1097, 2022 12 09.

Artigo em Inglês | MEDLINE | ID: mdl-36480631

RESUMO

Programming is a powerful and ubiquitous problem-solving tool. Systems that can assist programmers or even generate programs themselves could make programming more productive and accessible. Recent transformer-based neural network models show impressive code generation abilities yet still perform poorly on more complex tasks requiring problem-solving skills, such as competitive programming problems. Here, we introduce AlphaCode, a system for code generation that achieved an average ranking in the top 54.3% in simulated evaluations on recent programming competitions on the Codeforces platform. AlphaCode solves problems by generating millions of diverse programs using specially trained transformer-based networks and then filtering and clustering those programs to a maximum of just 10 submissions. This result marks the first time an artificial intelligence system has performed competitively in programming competitions.

Mastering the game of Stratego with model-free multiagent reinforcement learning.

Perolat, Julien; De Vylder, Bart; Hennes, Daniel; Tarassov, Eugene; Strub, Florian; de Boer, Vincent; Muller, Paul; Connor, Jerome T; Burch, Neil; Anthony, Thomas; McAleer, Stephen; Elie, Romuald; Cen, Sarah H; Wang, Zhe; Gruslys, Audrunas; Malysheva, Aleksandra; Khan, Mina; Ozair, Sherjil; Timbers, Finbarr; Pohlen, Toby; Eccles, Tom; Rowland, Mark; Lanctot, Marc; Lespiau, Jean-Baptiste; Piot, Bilal; Omidshafiei, Shayegan; Lockhart, Edward; Sifre, Laurent; Beauguerlange, Nathalie; Munos, Remi; Silver, David; Singh, Satinder; Hassabis, Demis; Tuyls, Karl.

Science ; 378(6623): 990-996, 2022 12 02.

Artigo em Inglês | MEDLINE | ID: mdl-36454847

RESUMO

We introduce DeepNash, an autonomous agent that plays the imperfect information game Stratego at a human expert level. Stratego is one of the few iconic board games that artificial intelligence (AI) has not yet mastered. It is a game characterized by a twin challenge: It requires long-term strategic thinking as in chess, but it also requires dealing with imperfect information as in poker. The technique underpinning DeepNash uses a game-theoretic, model-free deep reinforcement learning method, without search, that learns to master Stratego through self-play from scratch. DeepNash beat existing state-of-the-art AI methods in Stratego and achieved a year-to-date (2022) and all-time top-three ranking on the Gravon games platform, competing with human expert players.

Assuntos

Inteligência Artificial , Reforço Psicológico , Jogos de Vídeo , Humanos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA