Follow
Johan Ferret
Johan Ferret
Research Scientist, Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Gemini: a Family of Highly Capable Multimodal Models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
14902023
Gemma: Open Models Based on Gemini Research and Technology
G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...
arXiv preprint arXiv:2403.08295, 2024
4252024
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
H Lee, S Phatale, H Mansoor, T Mesnard, J Ferret, K Lu, C Bishop, E Hall, ...
International Conference on Machine Learning (ICML 2024), 2023
3082023
Acme: A Research Framework for Distributed Reinforcement Learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
2532020
Adversarially Guided Actor-Critic
Y Flet-Berliac*, J Ferret*, O Pietquin, P Preux, M Geist
International Conference on Learning Representations (ICLR 2021), 2021
842021
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
P Roit*, J Ferret*, L Shani*, R Aharoni, G Cideron, R Dadashi, M Geist, ...
ACL, 2023
552023
Direct Language Model Alignment from Online AI Feedback
S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ...
arXiv preprint arXiv:2402.04792, 2024
522024
Gemma 2: Improving open language models at a practical size
G Team, M Riviere, S Pathak, PG Sessa, C Hardin, S Bhupatiraju, ...
arXiv preprint arXiv:2408.00118, 2024
402024
WARM: On the Benefits of Weight Averaged Reward Models
A Ramé, N Vieillard, L Hussenot, R Dadashi, G Cideron, O Bachem, ...
International Conference on Machine Learning (ICML 2024), 2024
332024
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning
J Ferret, R Marinier, M Geist, O Pietquin
International Joint Conference on Artificial Intelligence (IJCAI 2020), 2019
332019
Self-Imitation Advantage Learning
J Ferret, O Pietquin, M Geist
International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020
272020
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
N Grinsztajn*, J Ferret*, O Pietquin, P Preux, M Geist
Advances in Neural Information Processing Systems (NeurIPS 2021), 2021
222021
Lazy-MDPs: Towards Interpretable Reinforcement Learning By Learning When To Act
A Jacq*, J Ferret*, O Pietquin, M Geist
International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2022
20*2022
Credit assignment as a proxy for transfer in reinforcement learning
J Ferret, R Marinier, M Geist, O Pietquin
Learning Transferrable Skills Workshop, NeurIPS, 2019
62019
Bond: Aligning llms with best-of-n distillation
PG Sessa, R Dadashi, L Hussenot, J Ferret, N Vieillard, A Ramé, ...
arXiv preprint arXiv:2407.14622, 2024
52024
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning
E Pignatelli, J Ferret, M Geist, T Mesnard, H van Hasselt, L Toni
Transactions on Machine Learning Research (TMLR), 2023
52023
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ...
arXiv preprint arXiv:2404.07839, 2024
42024
Warp: On the benefits of weight averaged rewarded policies
A Ramé, J Ferret, N Vieillard, R Dadashi, L Hussenot, PL Cedoz, ...
arXiv preprint arXiv:2406.16768, 2024
22024
More efficient exploration with symbolic priors on action sequence equivalences
T Johnstone, N Grinsztajn, J Ferret, P Preux
Deep Reinforcement Learning Workshop, NeurIPS, 2022
2*2022
On actions that matter: Credit assignment and interpretability in reinforcement learning
J Ferret
Université de Lille, 2022
22022
The system can't perform the operation now. Try again later.
Articles 1–20