Urmăriți
Marc G. Bellemare
Marc G. Bellemare
Google Brain
Adresă de e-mail confirmată pe google.com - Pagina de pornire
Titlu
Citat de
Citat de
Anul
Human-level control through deep reinforcement learning
V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness, MG Bellemare, ...
nature 518 (7540), 529-533, 2015
291622015
The Arcade Learning Environment: An Evaluation Platform for General Agents
MG Bellemare, Y Naddaf, J Veness, M Bowling
Journal of Artificial Intelligence Research 47, 253--279, 2013
34152013
A distributional perspective on reinforcement learning
MG Bellemare, W Dabney, R Munos
International conference on machine learning, 449-458, 2017
16362017
Unifying count-based exploration and intrinsic motivation
M Bellemare, S Srinivasan, G Ostrovski, T Schaul, D Saxton, R Munos
Advances in neural information processing systems 29, 2016
15972016
An introduction to deep reinforcement learning
V François-Lavet, P Henderson, R Islam, MG Bellemare, J Pineau
Foundations and Trends® in Machine Learning 11 (3-4), 219-354, 2018
15642018
Distributional reinforcement learning with quantile regression
W Dabney, M Rowland, M Bellemare, R Munos
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
7352018
Count-based exploration with neural density models
G Ostrovski, MG Bellemare, A Oord, R Munos
International conference on machine learning, 2721-2730, 2017
6832017
Safe and efficient off-policy reinforcement learning
R Munos, T Stepleton, A Harutyunyan, M Bellemare
Advances in neural information processing systems 29, 2016
6782016
Revisiting the arcade learning environment: Evaluation protocols and open problems for general agents
MC Machado, MG Bellemare, E Talvitie, J Veness, M Hausknecht, ...
Journal of Artificial Intelligence Research 61, 523-562, 2018
5862018
Automated curriculum learning for neural networks
A Graves, MG Bellemare, J Menick, R Munos, K Kavukcuoglu
international conference on machine learning, 1311-1320, 2017
5802017
Deep reinforcement learning at the edge of the statistical precipice
R Agarwal, M Schwarzer, PS Castro, AC Courville, M Bellemare
Advances in neural information processing systems 34, 29304-29320, 2021
4762021
The cramer distance as a solution to biased wasserstein gradients
MG Bellemare, I Danihelka, W Dabney, S Mohamed, ...
arXiv preprint arXiv:1705.10743, 2017
4012017
The hanabi challenge: A new frontier for ai research
N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ...
Artificial Intelligence 280, 103216, 2020
3652020
Autonomous navigation of stratospheric balloons using reinforcement learning
MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ...
Nature 588 (7836), 77-82, 2020
3232020
A laplacian framework for option discovery in reinforcement learning
MC Machado, MG Bellemare, M Bowling
International Conference on Machine Learning, 2295-2304, 2017
2972017
Deepmdp: Learning continuous latent space models for representation learning
C Gelada, S Kumar, J Buckman, O Nachum, MG Bellemare
International conference on machine learning, 2170-2179, 2019
2902019
Dopamine: A research framework for deep reinforcement learning
PS Castro, S Moitra, C Gelada, S Kumar, MG Bellemare
arXiv preprint arXiv:1812.06110, 2018
2702018
Count-based exploration with the successor representation
MC Machado, MG Bellemare, M Bowling
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5125-5133, 2020
1802020
Contrastive behavioral similarity embeddings for generalization in reinforcement learning
R Agarwal, MC Machado, PS Castro, MG Bellemare
arXiv preprint arXiv:2101.05265, 2021
1712021
Increasing the action gap: New operators for reinforcement learning
MG Bellemare, G Ostrovski, A Guez, P Thomas, R Munos
Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016
1672016
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–20