Bilal Piot

Cituota

	Visi	Nuo 2019
Šaltiniai	15000	14152
h-rodyklė	36	33
i10-rodyklė	46	44

4500

2250

1125

3375

2014201520162017201820192020202120222023202448 43 91 128 467 842 1324 2455 3656 4464 1398

Viešas pasiekiamumas

Peržiūrėti viską

3 straipsniai

0 straipsnių

pasiekiami

nepasiekiami

Pagal finansavimo įpareigojimus

Bendraautoriai

Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Patvirtintas el. paštas univ-lille.fr
Mohammad Gheshlaghi AzarCohere AIPatvirtintas el. paštas google.com
Zhaohan Daniel GuoDeepMindPatvirtintas el. paštas google.com
Rémi MunosDeepMindPatvirtintas el. paštas inria.fr
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindPatvirtintas el. paštas meta.com
Florent AltchéResearch Engineer, DeepMindPatvirtintas el. paštas google.com
Jean-bastien GrillPatvirtintas el. paštas google.com
Florian STRUBDeepMindPatvirtintas el. paštas google.com
Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Patvirtintas el. paštas univ-lorraine.fr
Corentin TallecDeepMindPatvirtintas el. paštas google.com
Pierre RichemondGoogle DeepMindPatvirtintas el. paštas deepmind.com
Charles BlundellResearch Scientist at DeepMindPatvirtintas el. paštas google.com
Todd HesterWaymoPatvirtintas el. paštas waymo.com
Pablo SprechmannResearch Scientist at Google DeepMindPatvirtintas el. paštas google.com
Steven KapturowskiDeepMindPatvirtintas el. paštas google.com
Mel VecerikDeepMind, University College LondonPatvirtintas el. paštas ucl.ac.uk
Dan HorganGoogle DeepMindPatvirtintas el. paštas google.com
Adrià Puigdomènech BadiaDeepMindPatvirtintas el. paštas google.com
Alex VitvitskyiDeepMindPatvirtintas el. paštas google.com
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLPatvirtintas el. paštas google.com

Stebėti

Bilal Piot

Google Deepmind

Patvirtintas el. paštas google.com

reinforcement learning inverse reinforcement learning


Pavadinimas Rūšiuoti pagal šaltinius Rūšiuoti pagal metus Rūšiuoti pagal pavadinimą	Cituota Cituota	Metai
Bootstrap your own latent: A new approach to self-supervised learning JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ... arXiv preprint arXiv:2006.07733, 2020	5828	2020
Rainbow: Combining improvements in deep reinforcement learning M Hessel, J Modayil, H Van Hasselt, T Schaul, G Ostrovski, W Dabney, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2484	2018
Deep q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1162	2018
Noisy Networks for Exploration M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ... arXiv preprint arXiv:1706.10295 2018, 2017	1114*	2017
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ... arXiv preprint arXiv:1707.08817, 2017	741	2017
Agent57: Outperforming the atari human benchmark AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ... International conference on machine learning, 507-517, 2020	594	2020
Never give up: Learning directed exploration strategies AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020	316	2020
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	229	2020
Learning from demonstrations for real world reinforcement learning T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ... arXiv preprint arXiv:1704.03732, 2017	175	2017
Mastering the game of stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	139	2022
Bootstrap latent-predictive representations for multitask reinforcement learning ZD Guo, BA Pires, B Piot, JB Grill, F Altché, R Munos, MG Azar International Conference on Machine Learning, 3875-3886, 2020	138	2020
Observe and look further: Achieving consistent performance on atari T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ... arXiv preprint arXiv:1805.11593, 2018	128	2018
Inverse reinforcement learning through structured classification E Klein, M Geist, B Piot, O Pietquin Advances in neural information processing systems 25, 2012	119	2012
Approximate dynamic programming for two-player zero-sum markov games J Perolat, B Scherrer, B Piot, O Pietquin International Conference on Machine Learning, 1321-1329, 2015	113	2015
Bridging the gap between imitation learning and inverse reinforcement learning B Piot, M Geist, O Pietquin IEEE transactions on neural networks and learning systems 28 (8), 1814-1826, 2016	99	2016
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning A Gruslys, W Dabney, MG Azar, B Piot, M Bellemare, R Munos arXiv preprint arXiv:1704.04651, 2017	97	2017
Hindsight credit assignment A Harutyunyan, W Dabney, T Mesnard, M Gheshlaghi Azar, B Piot, ... Advances in neural information processing systems 32, 2019	89	2019
Boosted bellman residual minimization handling expert demonstrations B Piot, M Geist, O Pietquin Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014	87	2014
Byol works even without batch statistics PH Richemond, JB Grill, F Altché, C Tallec, F Strub, A Brock, S Smith, ... arXiv preprint arXiv:2010.10241, 2020	85	2020
Neural predictive belief representations ZD Guo, MG Azar, B Piot, BA Pires, R Munos arXiv preprint arXiv:1811.06407, 2018	84	2018

Sistema negali atlikti operacijos. Bandykite vėliau dar kartą.

Straipsniai 1–20

Šaltinių per metus

Dubliuoti šaltiniai

Sujungti šaltiniai

Pridėti bendraautoriusBendraautoriai

Stebėti

Cituota

Bendraautoriai