Urmăriți
Olivier Pietquin
Olivier Pietquin
Cohere | ex Google DeepMind (On leave - Professor at University of Lille)
Adresă de e-mail confirmată pe univ-lille.fr - Pagina de pornire
Titlu
Citat de
Citat de
Anul
Deep q-learning from demonstrations
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
12252018
Noisy Networks for Exploration
SL Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian ...
International Conference on Learning Representations (ICLR), 2018
1177*2018
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards
M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ...
arXiv preprint arXiv:1707.08817, 2017
7822017
Modulating early visual processing by language
H De Vries, F Strub, J Mary, H Larochelle, O Pietquin, AC Courville
Advances in neural information processing systems 30, 2017
5522017
Guesswhat?! visual object discovery through multi-modal dialogue
H De Vries, F Strub, S Chandar, O Pietquin, H Larochelle, A Courville
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
4532017
Audiolm: a language modeling approach to audio generation
Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ...
IEEE/ACM transactions on audio, speech, and language processing 31, 2523-2533, 2023
3642023
Listen and translate: A proof of concept for end-to-end speech-to-text translation
A Bérard, O Pietquin, C Servan, L Besacier
arXiv preprint arXiv:1612.01744, 2016
3152016
A theory of regularized markov decision processes
M Geist, B Scherrer, O Pietquin
International Conference on Machine Learning, 2160-2169, 2019
3102019
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
2392020
What matters for on-policy deep actor-critic methods? A large-scale empirical study
M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ...
arXiv preprint arXiv:2006.05990, 2020
237*2020
End-to-end automatic speech translation of audiobooks
A Bérard, L Besacier, AC Kocabiyikoglu, O Pietquin
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
2202018
A probabilistic framework for dialog simulation and optimal strategy learning
O Pietquin, T Dutoit
IEEE Transactions on Audio, Speech, and Language Processing 14 (2), 589-599, 2006
2032006
Learning from demonstrations for real world reinforcement learning
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
arXiv preprint arXiv:1704.03732, 2017, 2018
1892018
What matters for on-policy deep actor-critic methods? a large-scale study
M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ...
International conference on learning representations, 2020
1642020
Machine learning for spoken dialogue systems
O Lemon, O Pietquin
European Conference on Speech Communication and Technologies (Interspeech'07 …, 2007
1502007
A framework for unsupervised learning of dialogue strategies
O Pietquin
Presses univ. de Louvain, 2005
1482005
A survey on metrics for the evaluation of user simulations
O Pietquin, H Hastie
The knowledge engineering review 28 (1), 59-73, 2013
1402013
Observe and look further: Achieving consistent performance on atari
T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ...
arXiv preprint arXiv:1805.11593, 2018
1372018
Primal wasserstein imitation learning
R Dadashi, L Hussenot, M Geist, O Pietquin
arXiv preprint arXiv:2006.04678, 2020
1302020
Kalman temporal differences
M Geist, O Pietquin
Journal of artificial intelligence research 39, 483-532, 2010
1242010
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–20