Urmăriți
Sebastian Gehrman
Sebastian Gehrman
Head of NLP, CTO Office, Bloomberg LP
Adresă de e-mail confirmată pe bloomberg.net - Pagina de pornire
Titlu
Citat de
Citat de
Anul
PaLM: Scaling language modeling with pathways
A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
arXiv preprint arXiv:2204.02311, 2022
29102022
Bloom: A 176b-parameter open-access multilingual language model
BS Workshop, TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, ...
arXiv preprint arXiv:2211.05100, 2022
9962022
Bottom-up abstractive summarization
S Gehrmann, Y Deng, AM Rush
EMNLP 2018, 2018
7812018
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
6182022
Palm 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
6162023
LSTMVis: A tool for visual analysis of hidden state dynamics in recurrent neural networks
H Strobelt*, S Gehrmann*, H Pfister, AM Rush
IEEE transactions on visualization and computer graphics 24 (1), 667-676, 2017
4902017
GLTR: Statistical detection and visualization of generated text
S Gehrmann*, H Strobelt*, AM Rush
ACL Demo 2019, 2019
3392019
BloombergGPT: A large language model for finance
S Wu, O Irsoy, S Lu, V Dabravolski, M Dredze, S Gehrmann, P Kambadur, ...
arXiv preprint arXiv:2303.17564, 2023
3192023
Investigating gender bias in language models using causal mediation analysis
J Vig*, S Gehrmann*, Y Belinkov*, S Qian, D Nevo, Y Singer, S Shieber
NeurIPS 2021 33, 12388-12401, 2020
312*2020
ToTTo: A controlled table-to-text generation dataset
AP Parikh, X Wang, S Gehrmann, M Faruqui, B Dhingra, D Yang, D Das
EMNLP 2020, 2020
2942020
Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives
S Gehrmann, F Dernoncourt, Y Li, ET Carlson, JT Wu, J Welt, J Foote Jr, ...
PloS one 13 (2), e0192360, 2018
259*2018
Seq2Seq-Vis: A visual debugging tool for sequence-to-sequence models
H Strobelt*, S Gehrmann*, M Behrisch, A Perer, H Pfister, AM Rush
IEEE transactions on visualization and computer graphics 25 (1), 353-363, 2018
2382018
Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations
P Das, T Sercu, K Wadhawan, I Padhi, S Gehrmann, F Cipcigan, ...
Nature Biomedical Engineering 5 (6), 613-623, 2021
2222021
exBERT: A visual analysis tool to explore learned representations in transformers models
B Hoover, H Strobelt, S Gehrmann
EMNLP Demo 2019, 2019
1592019
The language interpretability tool: Extensible, interactive visualizations and analysis for NLP models
I Tenney, J Wexler, J Bastings, T Bolukbasi, A Coenen, S Gehrmann, ...
ACL Demo 2020, 2020
1532020
The GEM benchmark: Natural language generation, its evaluation and metrics
S Gehrmann, T Adewumi, K Aggarwal, PS Ammanamanchi, ...
GEM Workshop at ACL 2021, 2021
1272021
Challenging big-bench tasks and whether chain-of-thought can solve them
M Suzgun, N Scales, N Schärli, S Gehrmann, Y Tay, HW Chung, ...
ACL Findings 2023, 2022
1102022
Repairing the cracked foundation: A survey of obstacles in evaluation practices for generated text
S Gehrmann, E Clark, T Sellam
JAIR, 2022
892022
End-to-end content and plan selection for data-to-text generation
S Gehrmann, FZ Dai, H Elder, AM Rush
INLG 2018, 2018
812018
H Chi, Denny Zhou, et al. Challenging big-bench tasks and whether chain-of-thought can solve them
M Suzgun, N Scales, N Schärli, S Gehrmann, Y Tay, HW Chung, ...
arXiv preprint arXiv:2210.09261, 2022
782022
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–20