Theophane Weber
Research Scientist at DeepMind
Verified email at google.com
Title
Cited by
Year
Deep reinforcement learning in large discrete action spaces
G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt, ...
arXiv preprint arXiv:1512.07679, 2015
Cited by 711 (2015)
Neural scene representation and rendering
SMA Eslami, D Jimenez Rezende, F Besse, F Viola, AS Morcos, ...
Science 360 (6394), 1204-1210, 2018
Cited by 710 (2018)
Imagination-augmented agents for deep reinforcement learning
T Weber, S Racaniere, DP Reichert, L Buesing, A Guez, DJ Rezende, ...
arXiv preprint arXiv:1707.06203, 2017
Cited by 693* (2017)
Attend, infer, repeat: Fast scene understanding with generative models
SM Eslami, N Heess, T Weber, Y Tassa, D Szepesvari, GE Hinton
Advances in neural information processing systems 29, 2016
Cited by 586 (2016)
Gradient estimation using stochastic computation graphs
J Schulman, N Heess, T Weber, P Abbeel
Advances in neural information processing systems 28, 2015
Cited by 447 (2015)
Visual interaction networks: Learning a physics simulator from video
N Watters, D Zoran, T Weber, P Battaglia, R Pascanu, A Tacchetti
Advances in neural information processing systems 30, 2017
Cited by 406 (2017)
Relational recurrent neural networks
A Santoro, R Faulkner, D Raposo, J Rae, M Chrzanowski, T Weber, ...
Advances in neural information processing systems 31, 2018
Cited by 265 (2018)
Woulda, coulda, shoulda: Counterfactually-guided policy search
L Buesing, T Weber, Y Zwols, S Racaniere, A Guez, JB Lespiau, N Heess
arXiv preprint arXiv:1811.06272, 2018
Cited by 158 (2018)
Automated variational inference in probabilistic programming
D Wingate, T Weber
arXiv preprint arXiv:1301.1299, 2013
Cited by 156 (2013)
Temporal difference variational auto-encoder
K Gregor, G Papamakarios, F Besse, L Buesing, T Weber
arXiv preprint arXiv:1806.03107, 2018
Cited by 146 (2018)
Learning model-based planning from scratch
R Pascanu, Y Li, O Vinyals, N Heess, L Buesing, S Racanière, D Reichert, ...
arXiv preprint arXiv:1707.06170, 2017
Cited by 119 (2017)
Learning and querying fast generative models for reinforcement learning
L Buesing, T Weber, S Racaniere, SM Eslami, D Rezende, DP Reichert, ...
arXiv preprint arXiv:1802.03006, 2018
Cited by 109 (2018)
An investigation of model-free planning
A Guez, M Mirza, K Gregor, R Kabra, S Racanière, T Weber, D Raposo, ...
International Conference on Machine Learning, 2464-2473, 2019
Cited by 95 (2019)
Learning to search with MCTSnets
A Guez, T Weber, I Antonoglou, K Simonyan, O Vinyals, D Wierstra, ...
International Conference on Machine Learning, 1822-1831, 2018
Cited by 93 (2018)
System linearization
T Weber, B Vigoda, P Pratt, J Park, M McCormick
US Patent App. 13/678,904, 2013
Cited by 80 (2013)
Muesli: Combining improvements in policy optimization
M Hessel, I Danihelka, F Viola, A Guez, S Schmitt, L Sifre, T Weber, ...
International Conference on Machine Learning, 4214-4226, 2021
Cited by 78 (2021)
On the role of planning in model-based deep reinforcement learning
JB Hamrick, AL Friesen, F Behbahani, A Guez, F Viola, S Witherspoon, ...
arXiv preprint arXiv:2011.04021, 2020
Cited by 75 (2020)
Counterfactual credit assignment in model-free reinforcement learning
T Mesnard, T Weber, F Viola, S Thakoor, A Saade, A Harutyunyan, ...
arXiv preprint arXiv:2011.09464, 2020
Cited by 67 (2020)
Combining Q-learning and search with amortized value estimates
JB Hamrick, V Bapst, A Sanchez-Gonzalez, T Pfaff, T Weber, L Buesing, ...
arXiv preprint arXiv:1912.02807, 2019
Cited by 55 (2019)
Credit assignment techniques in stochastic computation graphs
T Weber, N Heess, L Buesing, D Silver
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
Cited by 53 (2019)