Follow
Adhiguna Kuncoro
Adhiguna Kuncoro
Oxford University and Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Scaling language models: Methods, analysis & insights from training gopher
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
arXiv preprint arXiv:2112.11446, 2021
7592021
Localizing syntactic predictions using recurrent neural network grammars
JR Brennan, C Dyer, A Kuncoro, JT Hale
Neuropsychologia 146, 107479, 2020
7252020
Dynet: The dynamic neural network toolkit
G Neubig, C Dyer, Y Goldberg, A Matthews, W Ammar, A Anastasopoulos, ...
arXiv preprint, 2017
443*2017
What do recurrent neural network grammars learn about syntax?
A Kuncoro, M Ballesteros, L Kong, C Dyer, G Neubig, NA Smith
Proceedings of EACL 2017 1, 1249-1258, 2017
1632017
LSTMs can learn syntax-sensitive dependencies well, but modeling structure makes them better
A Kuncoro, C Dyer, J Hale, D Yogatama, S Clark, P Blunsom
Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018
1582018
Finding syntax in human encephalography with beam search
J Hale, C Dyer, A Kuncoro, JR Brennan
arXiv preprint arXiv:1806.04127, 2018
1462018
Unsupervised recurrent neural network grammars
Y Kim, AM Rush, L Yu, A Kuncoro, C Dyer, G Melis
arXiv preprint arXiv:1904.03746, 2019
1442019
Mind the gap: Assessing temporal generalization in neural language models
A Lazaridou, A Kuncoro, E Gribovskaya, D Agrawal, A Liska, T Terzi, ...
Advances in Neural Information Processing Systems 34, 29348-29363, 2021
124*2021
Distilling an ensemble of greedy dependency parsers into one MST parser
A Kuncoro, M Ballesteros, L Kong, C Dyer, NA Smith
Proceedings of EMNLP, 1744-1753, 2016
842016
Cyprien de Masson d’Autume
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
802021
IndoNLG: Benchmark and resources for evaluating Indonesian natural language generation
S Cahyawijaya, GI Winata, B Wilie, K Vincentio, X Li, A Kuncoro, S Ruder, ...
arXiv preprint arXiv:2104.08200, 2021
632021
Memory architectures in recurrent neural network language models
D Yogatama, Y Miao, G Melis, W Ling, A Kuncoro, C Dyer, P Blunsom
International Conference on Learning Representations, 2018
602018
Cyprien de Masson d’Autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego de Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew J
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, HF Song, J Aslanides, ...
Johnson, Blake A. Hechtman, Laura Weidinger, Iason Gabriel, William S. Isaac …, 2021
482021
Syntactic structure distillation pretraining for bidirectional encoders
A Kuncoro, L Kong, D Fried, D Yogatama, L Rimell, C Dyer, P Blunsom
Transactions of the Association for Computational Linguistics 8, 776-794, 2020
47*2020
Scalable syntax-aware language models using knowledge distillation
A Kuncoro, C Dyer, L Rimell, S Clark, P Blunsom
arXiv preprint arXiv:1906.06438, 2019
402019
Cyprien de Masson d’Autume, Tomáš Kociský, Sebastian Ruder, Dani Yogatama, Kris Cao, Susannah Young, and Phil Blunsom. 2021. Mind the gap: Assessing temporal generalization in …
A Lazaridou, A Kuncoro, E Gribovskaya, D Agrawal, A Liska, T Terzi, ...
Advances in Neural Information Processing Systems 34, 6-14, 0
40
Transformer grammars: Augmenting transformer language models with syntactic inductive biases at scale
L Sartran, S Barrett, A Kuncoro, M Stanojević, P Blunsom, C Dyer
Transactions of the Association for Computational Linguistics 10, 1423-1439, 2022
382022
A systematic investigation of commonsense knowledge in large language models
XL Li, A Kuncoro, J Hoffmann, CM d'Autume, P Blunsom, A Nematzadeh
arXiv preprint arXiv:2111.00607, 2021
372021
The perils of natural behaviour tests for unnatural models: the case of number agreement
A Kuncoro, C Dyer, J Hale, P Blunsom
Poster presented at Learning Language in Humans and in Machines, Paris, Fr …, 2018
92018
DiLoCo: Distributed Low-Communication Training of Language Models
A Douillard, Q Feng, AA Rusu, R Chhaparia, Y Donchev, A Kuncoro, ...
arXiv preprint arXiv:2311.08105, 2023
62023
The system can't perform the operation now. Try again later.
Articles 1–20