Follow
Vít Suchomel
Vít Suchomel
Masaryk University and Lexical Computing Ltd.
Verified email at mail.muni.cz
Title
Cited by
Cited by
Year
The Sketch Engine: ten years on
A Kilgarriff, V Baisa, J Bušta, M Jakubíček, V Kovář, J Michelfeit, P Rychlý, ...
Lexicography 1 (1), 7-36, 2014
38902014
The TenTen corpus family
M Jakubíček, A Kilgarriff, V Kovář, P Rychlý, V Suchomel
7th international corpus linguistics conference CL, 125-127, 2013
6232013
HindEnCorp-Hindi-English and Hindi-only Corpus for Machine Translation.
O Bojar, V Diatka, P Rychlý, P Stranák, V Suchomel, A Tamchyna, ...
LREC, 3550-3555, 2014
1472014
Efficient web crawling for large text corpora
V Suchomel, J Pomikálek
Proceedings of the seventh Web as Corpus Workshop (WAC7), 39-43, 2012
143*2012
SkELL: Web Interface for English Language Learning.
V Baisa, V Suchomel
RASLAN, 63-70, 2014
1052014
arTenTen: Arabic corpus and word sketches
T Arts, Y Belinkov, N Habash, A Kilgarriff, V Suchomel
Journal of King Saud University-Computer and Information Sciences 26 (4 …, 2014
742014
Finding terms in corpora for many languages with the Sketch Engine
M Jakubíček, A Kilgarriff, V Kovář, P Rychlý, V Suchomel
Proceedings of the Demonstrations at the 14th Conference of the European …, 2014
742014
Text Tokenisation Using unitok.
J Michelfeit, J Pomikálek, V Suchomel
RASLAN, 71-75, 2014
542014
csTenTen17, a Recent Czech Web Corpus.
V Suchomel
RASLAN, 111-123, 2018
272018
Large corpora for Turkic languages and unsupervised morphological analysis
V Baisa, V Suchomel
Proceedings of the Eighth conference on International Language Resources and …, 2012
252012
Recent Czech Web Corpora.
V Suchomel
RASLAN, 77-83, 2012
242012
Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
T Erjavec, M Ogrodniczuk, P Osenova, N Ljubešić, K Simov, V Grigorova, ...
CLARIN ERIC, 2021
222021
Current challenges in web corpus building
M Jakubíček, V Kovář, P Rychlý, V Suchomel
Proceedings of the 12th Web as Corpus Workshop, 1-4, 2020
172020
MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages
M Banón, M Espla-Gomis, ML Forcada, C García-Romero, T Kuzman, ...
23rd Annual Conference of the European Association for Machine Translation …, 2022
162022
Annotated amharic corpora
P Rychlý, V Suchomel
Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Brno …, 2016
152016
arTenTen: a new, vast corpus for Arabic
Y Belinkov, N Habash, A Kilgarriff, N Ordan, R Roth, V Suchomel
Proceedings of WACL 20, 2013
152013
Terminology extraction for academic Slovene using sketch engine
D Fišer, V Suchomel, M Jakubícek
Tenth Workshop on Recent Advances in Slavonic Natural Language Processing …, 2016
122016
Building a 50M Corpus of Tajik Language.
G Dovudov, J Pomikálek, V Suchomel, P Smerk
RASLAN, 89-95, 2011
112011
Better web corpora for corpus linguistics and NLP
V Suchomel
Masaryk University, 2020
102020
HindMonoCorp 0.5
O Bojar, V Diatka, P Rychlý, P Straňák, V Suchomel, A Tamchyna, ...
Charles University, Faculty of Mathematics and Physics, Institute of Formal …, 2014
92014
The system can't perform the operation now. Try again later.
Articles 1–20