Follow
Benoît Sagot
Benoît Sagot
Directeur de recherches at Inria, head of the ALMAnaCH team
Verified email at inria.fr - Homepage
Title
Cited by
Cited by
Year
What does BERT learn about the structure of language?
G Jawahar, B Sagot, D Seddah
57th Annual Meeting of the Association for Computational Linguistics (ACL …, 2019
12392019
CamemBERT: a Tasty French Language Model
L Martin, B Muller, PJ Ortiz Suárez, Y Dupont, L Romary, ...
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
9832020
Bloom: A 176b-parameter open-access multilingual language model
BS Workshop, TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, ...
arXiv preprint arXiv:2211.05100, 2022
9612022
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures
PJ Ortiz Suárez, B Sagot, L Romary
Challenges in the Management of Large Corpora (CMLC-7) 2019, 9, 2019
361*2019
The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French
B Sagot
LREC 2010, 2010
296*2010
Building a free French wordnet from multilingual resources
B Sagot, D Fišer
Ontolex 2008, 2008
2532008
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
PJ Ortiz Suárez, L Romary, B Sagot
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
186*2020
Coupling an annotated corpus and a morphosyntactic lexicon for state-of-the-art POS tagging with less human effort
P Denis, B Sagot
PACLIC 2009, 2009
1712009
Controllable sentence simplification
L Martin, B Sagot, E de la Clergerie, A Bordes
arXiv preprint arXiv:1910.02677, 2019
1442019
MUSS: multilingual unsupervised sentence simplification by mining paraphrases
L Martin, A Fan, É De La Clergerie, A Bordes, B Sagot
arXiv preprint arXiv:2005.00352, 2020
1212020
ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations
F Alva-Manchego, L Martin, A Bordes, C Scarton, B Sagot, L Specia
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
1202020
Universal dependencies 2.5
D Zeman, J Nivre, et al.
LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied …, 2020
1182020
The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
B Sagot, L Clément, E de La Clergerie, P Boullier
LREC 2006, 2006
1102006
Quality at a glance: An audit of web-crawled multilingual datasets
J Kreutzer, I Caswell, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ...
Transactions of the Association for Computational Linguistics 10, 50-72, 2022
1092022
When being unseen from mBERT is just the beginning: Handling new languages with multilingual language models
B Muller, A Anastasopoulos, B Sagot, D Seddah
arXiv preprint arXiv:2010.12858, 2020
1052020
Coupling an annotated corpus and a lexicon for state-of-the-art POS tagging
P Denis, B Sagot
Language resources and evaluation 46 (4), 721-736, 2012
992012
Influence of pre-annotation on POS-tagged corpus development
K Fort, B Sagot
The fourth ACL linguistic annotation workshop, 56--63, 2010
992010
Morphology based automatic acquisition of large-coverage lexica
L Clément, B Lang, B Sagot
LREC 2004, 2004
832004
The CoMeRe corpus for French: structuring and annotating heterogeneous CMC genres
T Chanier, C Poudat, B Sagot, G Antoniadis, CR Wigham, L Hriba, ...
Journal for language technology and computational linguistics 29 (2), 1-30, 2014
752014
SxPipe 2: architecture pour le traitement pré-syntaxique de corpus bruts
B Sagot, P Boullier
Revue TAL 49 (2), 155-188, 2008
742008
The system can't perform the operation now. Try again later.
Articles 1–20