Multi30K: Multilingual English-German image descriptions D Elliott, S Frank, K Sima'an, L Specia Proceedings of the 5th Workshop on Vision and Language, 2016 | 533 | 2016 |
Automatic description generation from images: A survey of models, datasets, and evaluation measures R Bernardi, R Cakici, D Elliott, A Erdem, E Erdem, N Ikizler-Cinbis, ... Journal of Artificial Intelligence Research 55, 409-442, 2016 | 413 | 2016 |
Image Description using Visual Dependency Representations D Elliott, F Keller Conference on Empirical Methods in Natural Language Processing, 1292-1302, 2013 | 345 | 2013 |
A shared task on multimodal machine translation and crosslingual image description L Specia, S Frank, K Sima'an, D Elliott Proceedings of the First Conference on Machine Translation: Volume 2, Shared …, 2016 | 237 | 2016 |
How2: a large-scale dataset for multimodal language understanding R Sanabria, O Caglayan, S Palaskar, D Elliott, L Barrault, L Specia, ... Workshop on Visually Grounded Interaction and Language, 2018 | 228 | 2018 |
Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description D Elliott, S Frank, L Barrault, F Bougares, L Specia Proceedings of the Second Conference on Machine Translation, 215-233, 2017 | 218 | 2017 |
Findings of the third shared task on multimodal machine translation L Barrault, F Bougares, L Specia, C Lala, D Elliott, S Frank Proceedings of the Third Conference on Machine Translation, 308-327, 2018 | 149 | 2018 |
Comparing Automatic Evaluation Measures for Image Description D Elliott, F Keller Proceedings of the 52nd Annual Meeting of the Association for Computational …, 2014 | 146 | 2014 |
Imagination improves multimodal translation D Elliott, A Kádár Proceedings of the Eighth International Joint Conference on Natural Language …, 2017 | 143 | 2017 |
Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs E Bugliarello, R Cotterell, N Okazaki, D Elliott Transactions of the Association for Computational Linguistics, 2021 | 110 | 2021 |
Multi-language Image Description with Neural Sequence Models D Elliott, S Frank, E Hasler arXiv preprint arXiv:1510.04709, 2015 | 98 | 2015 |
Adversarial Evaluation of Multimodal Machine Translation D Elliott Proceedings of the 2018 Conference on Empirical Methods in Natural Language …, 2018 | 83 | 2018 |
Visually Grounded Reasoning across Languages and Cultures F Liu, E Bugliarello, EM Ponti, S Reddy, N Collier, D Elliott Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 66 | 2021 |
Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers S Frank, E Bugliarello, D Elliott Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 64 | 2021 |
Describing Images using Inferred Visual Dependency Representations D Elliott, A de Vries Proceedings of the 53rd Annual Meeting of the Association for Computational …, 2015 | 60 | 2015 |
Multimodal machine translation through visuals and speech U Sulubacak, O Caglayan, SA Grönroos, A Rouhe, D Elliott, L Specia, ... Machine Translation 34, 97-147, 2020 | 57 | 2020 |
Measuring the diversity of automatic image descriptions E Van Miltenburg, D Elliott, P Vossen Proceedings of the 27th International Conference on Computational …, 2018 | 46 | 2018 |
Adversarial removal of demographic attributes revisited M Barrett, Y Kementchedjhieva, Y Elazar, D Elliott, A Søgaard Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019 | 45 | 2019 |
DCU-UvA Multimodal MT System Report I Calixto, D Elliott, S Frank Proceedings of the First Conference on Machine Translation, Berlin, Germany, 2016 | 45 | 2016 |
Compositional Generalization in Image Captioning M Nikolaus, M Abdou, M Lamm, R Aralikatte, D Elliott Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019 | 41 | 2019 |