Thomas Wolf
Co-founder at HuggingFace
Verified email at polytechnique.edu
Title · Cited by · Year
Transformers: State-of-the-art natural language processing
T Wolf, L Debut, V Sanh, J Chaumond, C Delangue, A Moi, P Cistac, ...
Proceedings of the 2020 conference on empirical methods in natural language …, 2020
Cited by 11322* · 2020
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
V Sanh, L Debut, J Chaumond, T Wolf
arXiv preprint arXiv:1910.01108, 2019
Cited by 5448* · 2019
Multitask prompted training enables zero-shot task generalization
V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ...
arXiv preprint arXiv:2110.08207, 2021
Cited by 840 · 2021
BLOOM: A 176B-parameter open-access multilingual language model
BigScience Workshop, TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, ...
arXiv preprint arXiv:2211.05100, 2022
Cited by 713 · 2022
Transfer learning in natural language processing
S Ruder, ME Peters, S Swayamdipta, T Wolf
Proceedings of the 2019 conference of the North American chapter of the …, 2019
Cited by 605 · 2019
TransferTransfo: A transfer learning approach for neural network based conversational agents
T Wolf, V Sanh, J Chaumond, C Delangue
arXiv preprint arXiv:1901.08149, 2019
Cited by 462 · 2019
Datasets: A community library for natural language processing
Q Lhoest, AV del Moral, Y Jernite, A Thakur, P von Platen, S Patil, ...
arXiv preprint arXiv:2109.02846, 2021
Cited by 350* · 2021
Two-dimensional superconductivity at a Mott insulator/band insulator interface LaTiO3/SrTiO3
J Biscaras, N Bergeal, A Kushwaha, T Wolf, A Rastogi, RC Budhani, ...
Nature communications 1 (1), 89, 2010
Cited by 329 · 2010
Movement pruning: Adaptive sparsity by fine-tuning
V Sanh, T Wolf, A Rush
Advances in Neural Information Processing Systems 33, 20378-20389, 2020
Cited by 288 · 2020
A hierarchical multi-task approach for learning embeddings from semantic tasks
V Sanh, T Wolf, S Ruder
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 6949-6956, 2019
Cited by 248 · 2019
Natural language processing with transformers
L Tunstall, L Von Werra, T Wolf
O'Reilly Media, Inc., 2022
Cited by 159 · 2022
StarCoder: may the source be with you!
R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ...
arXiv preprint arXiv:2305.06161, 2023
Cited by 127* · 2023
Diffusers: State-of-the-art diffusion models
P von Platen, S Patil, A Lozhkov, P Cuenca, N Lambert, K Rasul, ...
Cited by 113 · 2022
Large-scale transfer learning for natural language generation
S Golovanov, R Kurbanov, S Nikolenko, K Truskovskyi, A Tselousov, ...
Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019
Cited by 93 · 2019
Learning from others' mistakes: Avoiding dataset biases without modeling them
V Sanh, T Wolf, Y Belinkov, AM Rush
arXiv preprint arXiv:2012.01300, 2020
Cited by 77 · 2020
HuggingFace's Transformers: State-of-the-art natural language processing
T Wolf, L Debut, V Sanh, J Chaumond, C Delangue, A Moi, P Cistac, ...
arXiv preprint arXiv:1910.03771, 2019
Cited by 57 · 2019
Strong field-matching effects in superconducting YBaCuO films with vortex energy landscapes engineered via masked ion irradiation
I Swiecicki, C Ulysse, T Wolf, R Bernard, N Bergeal, J Briatico, G Faini, ...
Physical Review B 85 (22), 224502, 2012
Cited by 57 · 2012
VIMPAC: Video pre-training via masked token prediction and contrastive learning
H Tan, J Lei, T Wolf, M Bansal
arXiv preprint arXiv:2106.11250, 2021
Cited by 52 · 2021
HuggingFace's Transformers: State-of-the-art natural language processing
T Wolf, L Debut, V Sanh, J Chaumond, C Delangue, A Moi, AM Rush
arXiv preprint arXiv:1910.03771, 2019
Cited by 52 · 2019
The Stack: 3 TB of permissively licensed source code
D Kocetkov, R Li, LB Allal, J Li, C Mou, CM Ferrandis, Y Jernite, M Mitchell, ...
arXiv preprint arXiv:2211.15533, 2022
Cited by 45 · 2022
Articles 1–20