Urmăriți
Ruiqi Zhang
Ruiqi Zhang
Ph.D. Student, Statistics Department at University of California, Berkeley
Adresă de e-mail confirmată pe berkeley.edu - Pagina de pornire
Titlu
Citat de
Citat de
Anul
Trained Transformers Learn Linear Models In-Context
R Zhang, S Frei, PL Bartlett
arXiv preprint arXiv:2306.09927, 2023
512023
Off-policy fitted q-evaluation with differentiable function approximators: Z-estimation and inference theory
R Zhang, X Zhang, C Ni, M Wang
International Conference on Machine Learning, 26713-26749, 2022
162022
Optimal Estimation of Policy Gradient via Double Fitted Iteration
C Ni, R Zhang, X Ji, X Zhang, M Wang
International Conference on Machine Learning, 16724-16783, 2022
4*2022
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data
R Zhang, A Zanette
Advances in Neural Information Processing Systems, 2024, 2024
22024
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition
Z Chen, Z Zhao, Z Zhu, R Zhang, X Li, B Raj, H Yao
arXiv preprint arXiv:2402.11452, 2024
2024
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–5