Urmăriți
Long Ouyang
Long Ouyang
OpenAI
Adresă de e-mail confirmată pe openai.com - Pagina de pornire
Titlu
Citat de
Citat de
Anul
Training language models to follow instructions with human feedback
L Ouyang, J Wu, X Jiang, D Almeida, C Wainwright, P Mishkin, C Zhang, ...
Advances in neural information processing systems 35, 27730-27744, 2022
76882022
Gpt-4 technical report
J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ...
arXiv preprint arXiv:2303.08774, 2023
24452023
Learning to summarize with human feedback
N Stiennon, L Ouyang, J Wu, D Ziegler, R Lowe, C Voss, A Radford, ...
Advances in Neural Information Processing Systems 33, 3008-3021, 2020
12962020
Webgpt: Browser-assisted question-answering with human feedback
R Nakano, J Hilton, S Balaji, J Wu, L Ouyang, C Kim, C Hesse, S Jain, ...
arXiv preprint arXiv:2112.09332, 2021
8092021
Improving image generation with better captions
J Betker, G Goh, L Jing, T Brooks, J Wang, L Li, L Ouyang, J Zhuang, ...
Computer Science. https://cdn. openai. com/papers/dall-e-3. pdf 2 (3), 8, 2023
3342023
Recursively summarizing books with human feedback
J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano
arXiv preprint arXiv:2109.10862, 2021
2142021
Training language models to follow instructions with human feedback, 2022
L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ...
URL https://arxiv. org/abs/2203.02155 13, 1, 2022
1842022
Self-critiquing models for assisting human evaluators
W Saunders, C Yeh, J Wu, S Bills, L Ouyang, J Ward, J Leike
arXiv preprint arXiv:2206.05802, 2022
1432022
Training language models to follow instructions with human feedback. arXiv
L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ...
arXiv preprint arXiv:2203.02155, 2022
822022
Training language models to follow instructions with human feedback. arXiv 2022
L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ...
arXiv preprint arXiv:2203.02155 10, 2022
392022
Practical optimal experiment design with probabilistic programs
L Ouyang, MH Tessler, D Ly, N Goodman
arXiv preprint arXiv:1608.05046, 2016
222016
Semantic coherence facilitates distributional learning
L Ouyang, L Boroditsky, MC Frank
Cognitive science 41, 855-884, 2017
192017
Learning to summarize from human feedback, 2020
N Stiennon, L Ouyang, J Wu, DM Ziegler, R Lowe, C Voss, A Radford, ...
URL https://arxiv. org/abs, 2009
112009
Fabular: Regression formulas as probabilistic programming
J Borgström, AD Gordon, L Ouyang, C Russo, A ¦cibior, M Szymczak
Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of …, 2016
82016
webppl-oed: A practical optimal experiment design system.
L Ouyang, MH Tessler, D Ly, ND Goodman
CogSci, 2018
72018
Recursively summarizing books with human feedback, 2021
J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano
URL https://arxiv. org/abs/2109.10862, 0
7
Semantic coherence facilitates distributional learning of word meanings
L Ouyang, L Boroditsky, M Frank
Proceedings of the Annual Meeting of the Cognitive Science Society 34 (34), 2012
32012
Bayesian inference of regular expressions from human-generated example strings
L Ouyang
arXiv preprint arXiv:1805.08427, 2018
22018
Pedagogical learning
L Ouyang, MC Frank
arXiv preprint arXiv:1711.09401, 2017
12017
The Effect of Learning on Learning
L Ouyang
Stanford University, 2015
2015
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–20