ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.14331
  4. Cited By
Maximum entropy GFlowNets with soft Q-learning
v1v2 (latest)

Maximum entropy GFlowNets with soft Q-learning

21 December 2023
Sobhan Mohammadpour
Emmanuel Bengio
Emma Frejinger
Pierre-Luc Bacon
ArXiv (abs)PDFHTMLGithub (285★)

Papers citing "Maximum entropy GFlowNets with soft Q-learning"

16 / 16 papers shown
gfnx: Fast and Scalable Library for Generative Flow Networks in JAX
gfnx: Fast and Scalable Library for Generative Flow Networks in JAX
D. Tiapkin
Artem Agarkov
Nikita Morozov
Ian Maksimov
Askar Tsyganov
Timofei Gritsaev
S. Samsonov
155
2
0
20 Nov 2025
Reinforced sequential Monte Carlo for amortised sampling
Reinforced sequential Monte Carlo for amortised sampling
Sanghyeok Choi
Sarthak Mittal
Victor Elvira
Jinkyoo Park
Nikolay Malkin
164
1
0
13 Oct 2025
FlowRL: Matching Reward Distributions for LLM Reasoning
FlowRL: Matching Reward Distributions for LLM Reasoning
Xuekai Zhu
Daixuan Cheng
D. Zhang
Hengli Li
Kaiyan Zhang
...
J. Gao
Xiaodong Liu
Bowen Zhou
Hongyuan Mei
Zhouhan Lin
LRM
428
19
0
18 Sep 2025
Relative Trajectory Balance is equivalent to Trust-PCL
Relative Trajectory Balance is equivalent to Trust-PCL
T. Deleu
Padideh Nouri
Yoshua Bengio
Doina Precup
OffRL
183
1
0
01 Sep 2025
Discrete Compositional Generation via General Soft Operators and Robust Reinforcement Learning
Discrete Compositional Generation via General Soft Operators and Robust Reinforcement Learning
Marco Jiralerspong
E. Derman
Danilo Vucetic
Nikolay Malkin
Bilun Sun
Tianyu Zhang
Pierre-Luc Bacon
Gauthier Gidel
OffRL
416
1
0
20 Jun 2025
Scalable and Cost-Efficient de Novo Template-Based Molecular Generation
Scalable and Cost-Efficient de Novo Template-Based Molecular Generation
Piotr Gaiñski
Oussama Boussif
Andrei Rekesh
Dmytro Shevchuk
Ali Parviz
Mike Tyers
Robert A. Batey
Michał Koziarski
252
5
0
10 Jun 2025
Symmetry-Aware GFlowNets
Symmetry-Aware GFlowNets
Hohyun Kim
Seunggeun Lee
Min-hwan Oh
297
1
0
03 Jun 2025
Adaptive Destruction Processes for Diffusion Samplers
Adaptive Destruction Processes for Diffusion Samplers
Timofei Gritsaev
Nikita Morozov
Kirill Tamogashev
D. Tiapkin
S. Samsonov
A. Naumov
Dmitry Vetrov
Nikolay Malkin
354
5
0
02 Jun 2025
Beyond the Proxy: Trajectory-Distilled Guidance for Offline GFlowNet Training
Beyond the Proxy: Trajectory-Distilled Guidance for Offline GFlowNet Training
Ruishuo Chen
Xun Wang
Rui Hu
Zhuoran Li
Longbo Huang
336
0
0
26 May 2025
RL-finetuning LLMs from on- and off-policy data with a single algorithm
RL-finetuning LLMs from on- and off-policy data with a single algorithm
Yunhao Tang
Taco Cohen
David W. Zhang
Michal Valko
Rémi Munos
OffRL
434
11
0
25 Mar 2025
Genetic-guided GFlowNets for Sample Efficient Molecular Optimization
Genetic-guided GFlowNets for Sample Efficient Molecular OptimizationNeural Information Processing Systems (NeurIPS), 2024
Hyeon-Seob Kim
Minsu Kim
Sanghyeok Choi
Jinkyoo Park
443
24
0
31 Dec 2024
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood MaximizationInternational Conference on Learning Representations (ICLR), 2024
Timofei Gritsaev
Nikita Morozov
S. Samsonov
D. Tiapkin
331
7
0
20 Oct 2024
Action abstractions for amortized sampling
Action abstractions for amortized samplingInternational Conference on Learning Representations (ICLR), 2024
Oussama Boussif
Léna Néhale Ezzine
J. Viviano
Michał Koziarski
Moksh Jain
Nikolay Malkin
Emmanuel Bengio
Rim Assouel
Yoshua Bengio
261
3
0
19 Oct 2024
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion
  Models: A Tutorial and Review
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review
Masatoshi Uehara
Yulai Zhao
Tommaso Biancalani
Sergey Levine
356
65
0
18 Jul 2024
Ant Colony Sampling with GFlowNets for Combinatorial Optimization
Ant Colony Sampling with GFlowNets for Combinatorial OptimizationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Minsu Kim
Sanghyeok Choi
Hyeon-Seob Kim
Jiwoo Son
Jinkyoo Park
Yoshua Bengio
478
62
0
11 Mar 2024
Investigating Generalization Behaviours of Generative Flow Networks
Investigating Generalization Behaviours of Generative Flow Networks
Lazar Atanackovic
Emmanuel Bengio
AI4CE
371
8
0
07 Feb 2024
1
Page 1 of 1