Learning to summarize with human feedback N Stiennon, L Ouyang, J Wu, D Ziegler, R Lowe, C Voss, A Radford, ... Advances in Neural Information Processing Systems 33, 3008-3021, 2020 | 1886 | 2020 |
Fine-tuning language models from human preferences DM Ziegler, N Stiennon, J Wu, TB Brown, A Radford, D Amodei, ... arXiv preprint arXiv:1909.08593, 2019 | 1543 | 2019 |
Recursively summarizing books with human feedback J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano arXiv preprint arXiv:2109.10862, 2021 | 265 | 2021 |
Fine-tuning language models from human preferences (2020) DM Ziegler, N Stiennon, J Wu, TB Brown, A Radford, D Amodei, ... URL: http://arxiv. org/abs/1909.08593, 1909 | 54 | 1909 |
Fine-tuning language models from human preferences. arXiv 2019 DM Ziegler, N Stiennon, J Wu, TB Brown, A Radford, D Amodei, ... arXiv preprint arXiv:1909.08593, 1909 | 48 | 1909 |
Learning to summarize from human feedback, 2020 N Stiennon, L Ouyang, J Wu, DM Ziegler, R Lowe, C Voss, A Radford, ... URL https://arxiv. org/abs, 2009 | 24 | 2009 |
Recursively summarizing books with human feedback, 2021 J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano URL https://arxiv. org/abs/2109.10862, 0 | 11 | |
The moduli space of real curves and a Z/2-equivariant Madsen-Weiss theorem N Stiennon Stanford University, 2013 | 8 | 2013 |
Research Preparation Criterion S Jia, VB Kolachalama, DM Ziegler, N Stiennon, J Wu, TB Brown, ... arXiv preprint arXiv:1909.08593, 2019 | | 2019 |
“Loudness”: On priors over preference relations (Brief technical note) B Fallenstein, N Stiennon Tech. rep. 2014. url: https://intelligence. org/files/LoudnessPriors. pdf, 2014 | | 2014 |
RECURSIVELY-DEFINED LOGICAL THEORIES ARE WELL-DEFINED (BRIEF TECHNICAL NOTE) N STIENNON | | |