Hengshuai Yao

Citat de

	Toate	Din 2019
Referințe bibliografice	1004	930
h-index	17	17
i10-index	24	23

320

160

240

201120122013201420152016201720182019202020212022202320244 8 2 2 5 15 11 17 37 87 152 231 302 117

Acces public

Afișați-le pe toate

4 articole

0 articole

disponibile

indisponibile

Pe baza cerințelor privind finanțarea

Coautori

Linglong KongProfessor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, AmiiAdresă de e-mail confirmată pe ualberta.ca
Csaba SzepesvariDeepMind & University of AlbertaAdresă de e-mail confirmată pe cs.ualberta.ca
Shangtong ZhangUniversity of VirginiaAdresă de e-mail confirmată pe virginia.edu
Bei JiangAssociate Professor of Statistics, University of AlbertaAdresă de e-mail confirmată pe ualberta.ca
Richard S. SuttonKeen, Amii, and University of AlbertaAdresă de e-mail confirmată pe richsutton.com
Shalabh BhatnagarProfessor in the Department of Computer Science and Automation, Indian Institute of ScienceAdresă de e-mail confirmată pe iisc.ac.in
Borislav MavrinUniversity of AlbertaAdresă de e-mail confirmată pe ualberta.ca
Randy GoebelProfessor of Computing Science, University of AlbertaAdresă de e-mail confirmată pe ualberta.ca
Masoud S. Nosrati, PhDSenior ML Software Engineer, Meta (Facebook)Adresă de e-mail confirmată pe fb.com
Martha WhiteUniversity of AlbertaAdresă de e-mail confirmată pe ualberta.ca
Shahin AtakishiyevPhD Candidate in Computing Science, University of AlbertaAdresă de e-mail confirmată pe ualberta.ca
Peyman YadmellatWoven Planet HoldingsAdresă de e-mail confirmată pe woven-planet.global
Martin JagersandUniversity of AlbertaAdresă de e-mail confirmată pe cs.ualberta.ca
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoAdresă de e-mail confirmată pe cs.ox.ac.uk
Bo LiuAAAI SM, IEEE SMAdresă de e-mail confirmată pe cs.umass.edu
Amir-massoud FarahmandUniversity of TorontoAdresă de e-mail confirmată pe cs.toronto.edu
Mennatullah SiamOntario Tech UniversityAdresă de e-mail confirmată pe ontariotechu.ca
Naren DoraiswamyUniversity of MichiganAdresă de e-mail confirmată pe umich.edu
Boris N. OreshkinPrincipal Scientist at AmazonAdresă de e-mail confirmată pe amazon.com
Jincheng MeiResearch Scientist, Google BrainAdresă de e-mail confirmată pe google.com

Urmăriți

Hengshuai Yao

Sony AI

Adresă de e-mail confirmată pe ualberta.ca - Pagina de pornire

Deep Representation Decision Boundary SGD Reinforcement Learning step-size adaptation


Titlu Sortați după descrierea bibliografică Sortați după an Sortați după titlu	Citat de Citat de	Anul
Explainable artificial intelligence for autonomous driving: A comprehensive overview and field guide for future research directions S Atakishiyev, M Salameh, H Yao, R Goebel arXiv preprint arXiv:2112.11561, 2021	103	2021
Distributional Reinforcement Learning for Efficient Exploration B Mavrin, S Zhang, H Yao, K Kong, Linglong, Wu, Y Yu https://arxiv.org/abs/1905.06125, 2019	91	2019
Negative log likelihood ratio loss for deep neural network classification H Yao, D Zhu, B Jiang, P Yu Proceedings of the Future Technologies Conference (FTC) 2019: Volume 1, 276-282, 2020	87	2020
Discounted reinforcement learning is not an optimization problem A Naik, R Shariff, N Yasui, H Yao, RS Sutton arXiv preprint arXiv:1910.02140, 2019	60	2019
Mapless navigation among dynamics with social-safety-awareness: a reinforcement learning approach from 2d laser scans J Jin, NM Nguyen, N Sakib, D Graves, H Yao, M Jagersand 2020 IEEE international conference on robotics and automation (ICRA), 6979-6985, 2020	54	2020
Provably convergent two-timescale off-policy actor-critic with function approximation S Zhang, B Liu, H Yao, S Whiteson International Conference on Machine Learning, 11204-11213, 2020	53	2020
Weakly supervised few-shot object segmentation using co-attention with visual and semantic embeddings M Siam, N Doraiswamy, BN Oreshkin, H Yao, M Jagersand arXiv preprint arXiv:2001.09540, 2020	48	2020
Universal Option Models H Yao, C Szepesvari, R Sutton, S Bhatnagar, J Modayil	45*	2014
Breaking the deadly triad with a target network S Zhang, H Yao, S Whiteson International Conference on Machine Learning, 12621-12631, 2021	40	2021
A multi-component framework for the analysis and design of explainable artificial intelligence MY Kim, S Atakishiyev, HKB Babiker, N Farruque, R Goebel, OR Zaïane, ... Machine Learning and Knowledge Extraction 3 (4), 900-921, 2021	36	2021
Method of prediction of a state of an object in the environment using an action model of a neural network H Yao, SM Nosrati, H Chen, P Yadmellat, Y Zhang US Patent 10,997,491, 2021	34	2021
Multi-step dyna planning for policy evaluation and control H Yao, S Bhatnagar, D Diao Advances in neural information processing systems 22, 2009	34*	2009
Quota: The quantile option architecture for reinforcement learning S Zhang, H Yao Proceedings of the AAAI conference on artificial intelligence 33 (01), 5797-5804, 2019	32	2019
Ace: An actor ensemble algorithm for continuous control with tree search S Zhang, H Yao Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 5789-5796, 2019	31	2019
Pseudo-MDPs and Factored Linear Action Models H Yao, C Szepesvari, BA Pires, X Zhang IEEE ADPRL, 2014	26	2014
Method of selection of an action for an object using a neural network H Yao, H Chen, SM Nosrati, P Yadmellat, Y Zhang US Patent 10,935,982, 2021	21	2021
Approximate policy iteration with linear action models H Yao, C Szepesvári Proceedings of the AAAI Conference on Artificial Intelligence 26 (1), 1212-1218, 2012	18	2012
Hill climbing on value estimates for search-control in Dyna Y Pan, H Yao, A Farahmand, M White arXiv preprint arXiv:1906.07791, 2019	17	2019
Preconditioned temporal difference learning H Yao, ZQ Liu Proceedings of the 25th international conference on Machine learning, 1208-1215, 2008	17	2008
Towards practical hierarchical reinforcement learning for multi-lane autonomous driving MS Nosrati, EA Abolfathi, M Elmahgiubi, P Yadmellat, J Luo, Y Zhang, ...	16	2018

Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.

Articole 1–20

Referințe bibliografice pe an

Citate duplicat

Citate fuzionate

Adăugați coautoriCoautori

Urmăriți

Citat de

Coautori