Theophane Weber

Cited by

	All	Since 2019
Citations	5927	5126
h-index	31	29
i10-index	44	41

Citations per year

Year	Citations
2011	23
2012	12
2013	21
2014	16
2015	30
2016	86
2017	147
2018	415
2019	764
2020	950
2021	968
2022	981
2023	951
2024	508

Public access

1 article, based on funding mandates

Co-authors

Name	Affiliation	Verified email domain
Lars Buesing	Google DeepMind	google.com
Danilo J. Rezende	Director at DeepMind	google.com
Arthur Guez	Google DeepMind	google.com
Peter Battaglia	Research Scientist, DeepMind	google.com
Sébastien Racanière	Research Scientist, DeepMind	google.com
David P. Reichert	Google DeepMind	google.com
S. M. Ali Eslami	Google DeepMind	google.com
Oriol Vinyals	Research Scientist at Google DeepMind	google.com
Timothy P. Lillicrap	Director of Research, Google DeepMind	google.com
David Silver	DeepMind, UCL	google.com
Adrià Puigdomènech Badia	DeepMind	google.com
Yujia Li	Research Scientist, Google DeepMind	google.com
John Schulman	Research Scientist, OpenAI	openai.com
Justin Dauwels	Delft University of Technology	tudelft.nl
Andrzej Cichocki	Systems Research Institute, Nicolaus Copernicus University, RIKEN (AIP)	riken.jp
David Wingate	Associate professor of Computer Science, Brigham Young University	cs.byu.edu
David Alan Goldberg	Associate Professor, Operations Research and Information Engineering (ORIE), Cornell University	cornell.edu
David Gamarnik	Professor of Operations Research, MIT	mit.edu

Theophane Weber

Research Scientist at DeepMind

Verified email at google.com

Artificial Intelligence, Machine Learning, Reinforcement Learning, Probabilistic Modeling


Title	Cited by	Year
Deep reinforcement learning in large discrete action spaces. G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt, ... arXiv preprint arXiv:1512.07679, 2015	711	2015
Neural scene representation and rendering. SMA Eslami, D Jimenez Rezende, F Besse, F Viola, AS Morcos, ... Science 360 (6394), 1204-1210, 2018	710	2018
Imagination-augmented agents for deep reinforcement learning. T Weber, S Racaniere, DP Reichert, L Buesing, A Guez, DJ Rezende, ... arXiv preprint arXiv:1707.06203, 2017	693*	2017
Attend, infer, repeat: Fast scene understanding with generative models. SM Eslami, N Heess, T Weber, Y Tassa, D Szepesvari, GE Hinton. Advances in Neural Information Processing Systems 29, 2016	586	2016
Gradient estimation using stochastic computation graphs. J Schulman, N Heess, T Weber, P Abbeel. Advances in Neural Information Processing Systems 28, 2015	447	2015
Visual interaction networks: Learning a physics simulator from video. N Watters, D Zoran, T Weber, P Battaglia, R Pascanu, A Tacchetti. Advances in Neural Information Processing Systems 30, 2017	406	2017
Relational recurrent neural networks. A Santoro, R Faulkner, D Raposo, J Rae, M Chrzanowski, T Weber, ... Advances in Neural Information Processing Systems 31, 2018	265	2018
Woulda, coulda, shoulda: Counterfactually-guided policy search. L Buesing, T Weber, Y Zwols, S Racaniere, A Guez, JB Lespiau, N Heess. arXiv preprint arXiv:1811.06272, 2018	158	2018
Automated variational inference in probabilistic programming. D Wingate, T Weber. arXiv preprint arXiv:1301.1299, 2013	156	2013
Temporal difference variational auto-encoder. K Gregor, G Papamakarios, F Besse, L Buesing, T Weber. arXiv preprint arXiv:1806.03107, 2018	146	2018
Learning model-based planning from scratch. R Pascanu, Y Li, O Vinyals, N Heess, L Buesing, S Racanière, D Reichert, ... arXiv preprint arXiv:1707.06170, 2017	119	2017
Learning and querying fast generative models for reinforcement learning. L Buesing, T Weber, S Racaniere, SM Eslami, D Rezende, DP Reichert, ... arXiv preprint arXiv:1802.03006, 2018	109	2018
An investigation of model-free planning. A Guez, M Mirza, K Gregor, R Kabra, S Racanière, T Weber, D Raposo, ... International Conference on Machine Learning, 2464-2473, 2019	95	2019
Learning to search with MCTSnets. A Guez, T Weber, I Antonoglou, K Simonyan, O Vinyals, D Wierstra, ... International Conference on Machine Learning, 1822-1831, 2018	93	2018
System linearization. T Weber, B Vigoda, P Pratt, J Park, M McCormick. US Patent App. 13/678,904, 2013	80	2013
Muesli: Combining improvements in policy optimization. M Hessel, I Danihelka, F Viola, A Guez, S Schmitt, L Sifre, T Weber, ... International Conference on Machine Learning, 4214-4226, 2021	78	2021
On the role of planning in model-based deep reinforcement learning. JB Hamrick, AL Friesen, F Behbahani, A Guez, F Viola, S Witherspoon, ... arXiv preprint arXiv:2011.04021, 2020	75	2020
Counterfactual credit assignment in model-free reinforcement learning. T Mesnard, T Weber, F Viola, S Thakoor, A Saade, A Harutyunyan, ... arXiv preprint arXiv:2011.09464, 2020	67	2020
Combining Q-learning and search with amortized value estimates. JB Hamrick, V Bapst, A Sanchez-Gonzalez, T Pfaff, T Weber, L Buesing, ... arXiv preprint arXiv:1912.02807, 2019	55	2019
Credit assignment techniques in stochastic computation graphs. T Weber, N Heess, L Buesing, D Silver. The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	53	2019
