Marc Lanctot

Cituota

	Visi	Nuo 2019
Šaltiniai	38417	31587
h-rodyklė	39	35
i10-rodyklė	66	59

8000

4000

2000

6000

20142015201620172018201920202021202220232024116 122 910 1810 3157 4382 5333 6184 6574 7191 1913

Viešas pasiekiamumas

Peržiūrėti viską

5 straipsniai

0 straipsnių

pasiekiami

nepasiekiami

Pagal finansavimo įpareigojimus

Bendraautoriai

Thore GraepelGlobal Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCLPatvirtintas el. paštas ucl.ac.uk
Karl TuylsResearch Scientist, Google DeepMind and Professor of computer science, University of LiverpoolPatvirtintas el. paštas google.com
David SilverDeepMind, UCLPatvirtintas el. paštas google.com
Michael BowlingUniversity of AlbertaPatvirtintas el. paštas ualberta.ca
Laurent SifreGoogle DeepMindPatvirtintas el. paštas polytechnique.edu
Arthur GuezGoogle DeepMindPatvirtintas el. paštas google.com
Julian SchrittwieserDeepMindPatvirtintas el. paštas furidamu.org
Joel Z LeiboResearch scientistPatvirtintas el. paštas google.com
julien perolatDeepMindPatvirtintas el. paštas google.com
Timothy P. LillicrapDirector of Research, Google DeepMindPatvirtintas el. paštas google.com
Audrūnas GruslysPatvirtintas el. paštas gruslys.com
Chris J. MaddisonUniversity of TorontoPatvirtintas el. paštas cs.toronto.edu
Aja HuangDeepMindPatvirtintas el. paštas google.com
George van den DriesscheDeepMindPatvirtintas el. paštas deepmind.com
Neil BurchSony AI & Alberta Machine Intelligence Institute, University of AlbertaPatvirtintas el. paštas ualberta.ca
Vinicius ZambaldiGoogle DeepmindPatvirtintas el. paštas google.com
Thomas HubertGoogle DeepmindPatvirtintas el. paštas google.com
Rémi MunosDeepMindPatvirtintas el. paštas inria.fr
Nal KalchbrennerGoogle DeepMindPatvirtintas el. paštas google.com
koray kavukcuogluDeepMindPatvirtintas el. paštas kavukcuoglu.org

Stebėti

Marc Lanctot

Research Scientist, Google DeepMind

Patvirtintas el. paštas google.com - Pagrindinis puslapis

Artificial Intelligence Game Theory Search Multiagent Systems Reinforcement Learning


Pavadinimas Rūšiuoti pagal šaltinius Rūšiuoti pagal metus Rūšiuoti pagal pavadinimą	Cituota Cituota	Metai
Mastering the game of Go with deep neural networks and tree search D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ... Nature 529 (7587), 484-489, 2016	18586	2016
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ... Science 362 (6419), 1140-1144, 2018	6171*	2018
Dueling Network Architectures for Deep Reinforcement Learning Z Wang, T Schaul, M Hessel, H van Hasselt, M Lanctot, N de Freitas arXiv preprint arXiv:1511.06581, 2016	4736	2016
Value-decomposition networks for cooperative multi-agent learning based on team reward P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... Proceedings of the 17th international conference on autonomous agents and …, 2018	1576*	2018
Deep Q-learning from Demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Association for the Advancement of Artificial Intelligence (AAAI), 2018	1166	2018
Multi-agent Reinforcement Learning in Sequential Social Dilemmas JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel AAMAS, 2017	894	2017
A unified game-theoretic approach to multiagent reinforcement learning M Lanctot, V Zambaldi, A Gruslys, A Lazaridou, K Tuyls, J Pérolat, D Silver, ... arXiv preprint arXiv:1711.00832, 2017	709	2017
The hanabi challenge: A new frontier for ai research N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ... Artificial Intelligence 280, 103216, 2020	372	2020
Fictitious Self-Play in Extensive-Form Games J Heinrich, M Lanctot, D Silver International Conference on Machine Learning, 2015	365	2015
Monte Carlo sampling for regret minimization in extensive games M Lanctot, K Waugh, M Zinkevich, M Bowling Advances in neural information processing systems 22, 1078-1086, 2009	359	2009
Memory-efficient backpropagation through time A Gruslys, R Munos, I Danihelka, M Lanctot, A Graves Advances In Neural Information Processing Systems, 4125-4133, 2016	244*	2016
OpenSpiel: A Framework for Reinforcement Learning in Games M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ... arXiv preprint arXiv:1908.09453, 2019	232	2019
Emergent Communication through Negotiation K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark arXiv preprint arXiv:1804.03980, 2018	177	2018
Actor-critic policy optimization in partially observable multiagent environments S Srinivasan, M Lanctot, V Zambaldi, J Pérolat, K Tuyls, R Munos, ... Advances in Neural Information Processing Systems, 3422-3435, 2018	158	2018
Mastering the game of Stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	142	2022
Convolution by evolution: Differentiable pattern producing networks C Fernando, D Banarse, M Reynolds, F Besse, D Pfau, M Jaderberg, ... Proceedings of the Genetic and Evolutionary Computation Conference 2016, 109-116, 2016	135	2016
α-Rank: Multi-Agent Evaluation by Evolution S Omidshafiei, C Papadimitriou, G Piliouras, K Tuyls, M Rowland, ... Scientific reports 9 (1), 9937, 2019	121	2019
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research JZ Leibo, E Hughes, M Lanctot, T Graepel arXiv preprint arXiv:1903.00742, 2019	113	2019
Real-Time Monte-Carlo Tree Search in Ms Pac-Man T Pepels, MHM Winands, M Lanctot Transactions on Computation Intelligence and AI in Games, 2014	113	2014
Efficient Nash equilibrium approximation through Monte Carlo counterfactual regret minimization. M Johanson, N Bard, M Lanctot, RG Gibson, M Bowling AAMAS, 837-846, 2012	107	2012

Sistema negali atlikti operacijos. Bandykite vėliau dar kartą.

Straipsniai 1–20

Šaltinių per metus

Dubliuoti šaltiniai

Sujungti šaltiniai

Pridėti bendraautoriusBendraautoriai

Stebėti

Cituota

Bendraautoriai