Gandharv Patil
McGill University, Mila
Verified email at mail.mcgill.ca
Title
Cited by
Year
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
G Patil, LA Prashanth, D Nagaraj, D Precup
International Conference on Artificial Intelligence and Statistics, 5438-5448, 2023
Cited by: 10; Year: 2023
Variance penalized on-policy and off-policy actor-critic
A Jain, G Patil, A Jain, K Khetarpal, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 35 (9), 7899-7907, 2021
Cited by: 10; Year: 2021
On learning history-based policies for controlling Markov decision processes
G Patil, A Mahajan, D Precup
International Conference on Artificial Intelligence and Statistics, 3511-3519, 2024
Cited by: 5; Year: 2024