Yunhao Tang

Cited by

	All	Since 2019
Citations	1593	1585
h-index	17	17
i10-index	26	26

640

320

160

480

20182019202020212022202320246 39 127 190 237 362 626

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Rémi MunosDeepMindVerified email at inria.fr
Krzysztof ChoromanskiGoogle Brain Robotics New York & Columbia UniversityVerified email at columbia.edu
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindVerified email at meta.com
Aldo PacchianoBroad Institute of MIT and HarvardVerified email at broadinstitute.org
Mark RowlandResearch Scientist, Google DeepMindVerified email at google.com
Will DabneyDeepMindVerified email at google.com
Shipra AgrawalColumbia universityVerified email at columbia.edu
Tamás SarlósGoogleVerified email at google.com
Vikas SindhwaniGoogle DeepMind RoboticsVerified email at google.com
Tadashi KozunoOmron Sinic XVerified email at sinicx.com
Wenbo GaoColumbia UniversityVerified email at columbia.edu
Florent AltchéResearch Engineer, DeepMindVerified email at google.com
Yuri FaenzaAssociate Professor, IEOR, Columbia UniversityVerified email at columbia.edu
Alp KucukelbirAdjunct Professor of Computer Science, Columbia UniversityVerified email at cs.columbia.edu
Adrian WellerDirector of Research, Machine Learning, University of CambridgeVerified email at eng.cam.ac.uk
Anna ChoromanskaNew York UniversityVerified email at nyu.edu
Michael I. JordanProfessor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC BerkeleyVerified email at cs.berkeley.edu
Jiri HronResearch Scientist, Google DeepMindVerified email at google.com
Steven KapturowskiDeepMindVerified email at google.com
David AbelResearch Scientist, DeepMindVerified email at deepmind.com

Yunhao Tang

Research Scientist, DeepMind

Verified email at columbia.edu - Homepage

Reinforcement Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	485	2023
Reinforcement learning for integer programming: Learning to cut Y Tang, S Agrawal, Y Faenza International conference on machine learning, 9367-9376, 2020	190	2020
Es-maml: Simple hessian-free meta learning X Song, W Gao, Y Yang, K Choromanski, A Pacchiano, Y Tang arXiv preprint arXiv:1910.01215, 2019	126	2019
Discretizing continuous action space for on-policy optimization Y Tang, S Agrawal Proceedings of the aaai conference on artificial intelligence 34 (04), 5981-5988, 2020	115	2020
Monte-Carlo tree search as regularized policy optimization JB Grill, F Altché, Y Tang, T Hubert, M Valko, I Antonoglou, R Munos International Conference on Machine Learning, 3769-3778, 2020	69	2020
Byol-explore: Exploration by bootstrapped prediction Z Guo, S Thakoor, M Pîslar, B Avila Pires, F Altché, C Tallec, A Saade, ... Advances in neural information processing systems 35, 31855-31870, 2022	53	2022
From complexity to simplicity: Adaptive es-active subspaces for blackbox optimization KM Choromanski, A Pacchiano, J Parker-Holder, Y Tang, V Sindhwani Advances in Neural Information Processing Systems 32, 2019	49	2019
Orthogonal estimation of Wasserstein distances M Rowland, J Hron, Y Tang, K Choromanski, T Sarlos, A Weller The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	46	2019
Provably robust blackbox optimization for reinforcement learning K Choromanski, A Pacchiano, J Parker-Holder, Y Tang, D Jain, Y Yang, ... CoRR, abs/1903.02993, 2019	42	2019
Exploration by distributional reinforcement learning Y Tang, S Agrawal arXiv preprint arXiv:1805.01907, 2018	40	2018
Learning to Score Behaviors for Guided Policy Optimization A Pacchiano, J Parker-Holder, Y Tang, A Choromanska, K Choromanski, ... arXiv preprint arXiv:1906.04349, 2019	39	2019
Boosting trust region policy optimization by normalizing flows policy Y Tang, S Agrawal arXiv preprint arXiv:1809.10326, 2018	33	2018
Self-imitation learning via generalized lower bound q-learning Y Tang Advances in neural information processing systems 33, 13964-13975, 2020	23	2020
Understanding self-predictive learning for reinforcement learning Y Tang, ZD Guo, PH Richemond, BA Pires, Y Chandak, R Munos, ... International Conference on Machine Learning, 33632-33656, 2023	21	2023
Hindsight expectation maximization for goal-conditioned reinforcement learning Y Tang, A Kucukelbir International Conference on Artificial Intelligence and Statistics, 2863-2871, 2021	20	2021
Revisiting Peng’s Q() for Modern Reinforcement Learning T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ... International Conference on Machine Learning, 5794-5804, 2021	18	2021
Taylor expansion policy optimization Y Tang, M Valko, R Munos International Conference on Machine Learning, 9397-9406, 2020	17	2020
Nash learning from human feedback R Munos, M Valko, D Calandriello, MG Azar, M Rowland, ZD Guo, Y Tang, ... arXiv preprint arXiv:2312.00886, 2023	15	2023
An analysis of quantile temporal-difference learning M Rowland, R Munos, MG Azar, Y Tang, G Ostrovski, A Harutyunyan, ... arXiv preprint arXiv:2301.04462, 2023	15	2023
Online hyper-parameter tuning in off-policy learning via evolutionary strategies Y Tang, K Choromanski arXiv preprint arXiv:2006.07554, 2020	14	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors