Urmăriți
Alex Vitvitskyi
Alex Vitvitskyi
DeepMind
Adresă de e-mail confirmată pe google.com
Titlu
Citat de
Citat de
Anul
Agent57: Outperforming the atari human benchmark
AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ...
International conference on machine learning, 507-517, 2020
6382020
Never give up: Learning directed exploration strategies
AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ...
arXiv preprint arXiv:2002.06038, 2020
3262020
A generalist neural algorithmic learner
B Ibarz, V Kurin, G Papamakarios, K Nikiforou, M Bennani, R Csordás, ...
Learning on graphs conference, 2: 1-2: 23, 2022
522022
Beyond fine-tuning: Transferring behavior in reinforcement learning
V Campos, P Sprechmann, S Hansen, A Barreto, S Kapturowski, ...
arXiv preprint arXiv:2102.13515, 2021
232021
Coverage as a principle for discovering transferable behavior in reinforcement learning
V Campos, P Sprechmann, SS Hansen, A Barreto, C Blundell, A Vitvitskyi, ...
92021
Never give up: Learning directed exploration strategies. arXiv
AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ...
arXiv preprint arXiv:2002.06038, 2020
82020
Never give up: Learning directed exploration strategies
A Puigdomènech Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, ...
arXiv e-prints, arXiv: 2002.06038, 2020
72020
Agent57: Outperforming the atari human benchmark. arXiv 2020
AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, D Guo, ...
arXiv preprint arXiv:2003.13350, 0
7
Agent57: Outperforming the Atari Human Benchmark. arXiv e-prints, page
AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, D Guo, ...
arXiv preprint arXiv:2003.13350, 2020
52020
Transformers need glasses! Information over-squashing in language tasks
F Barbero, A Banino, S Kapturowski, D Kumaran, JGM Araújo, A Vitvitskyi, ...
arXiv preprint arXiv:2406.04267, 2024
22024
Transformers meet Neural Algorithmic Reasoners
W Bounsi, B Ibarz, A Dudzik, JB Hamrick, L Markeeva, A Vitvitskyi, ...
arXiv preprint arXiv:2406.09308, 2024
2024
The CLRS-Text Algorithmic Reasoning Language Benchmark
L Markeeva, S McLeish, B Ibarz, W Bounsi, O Kozlova, A Vitvitskyi, ...
arXiv preprint arXiv:2406.04229, 2024
2024
Jointly learning exploratory and non-exploratory action selection policies
AP Badia, P Sprechmann, A Vitvitskyi, Z Guo, B Piot, SJ Kapturowski, ...
US Patent App. 18/334,112, 2024
2024
Jointly learning exploratory and non-exploratory action selection policies
AP Badia, P Sprechmann, A Vitvitskyi, Z Guo, B Piot, SJ Kapturowski, ...
US Patent 11,714,990, 2023
2023
Reinforcement learning with adaptive return computation schemes
AP Badia, B Piot, P Sprechmann, SJ Kapturowski, A Vitvitskyi, Z Guo, ...
US Patent App. 17/797,878, 2023
2023
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–15