2112.10504
Cited By
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
16 December 2021
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
Papers citing
"Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic"
21 / 21 papers shown
HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking
Runquan Gui
Liang Luo
Jun Wang
Chi Ma
Huiling Zhen
Mingxuan Yuan
Jianye Hao
Defu Lian
Tong Xu
Feng Wu
LRM
589
9
0
05 May 2025
Apollo-MILP: An Alternating Prediction-Correction Neural Solving Framework for Mixed-Integer Linear Programming
International Conference on Learning Representations (ICLR), 2025
Haoyang Liu
Jie Wang
Zijie Geng
Xijun Li
Yuxuan Zong
Fangzhou Zhu
Jianye Hao
Feng Wu
263
11
0
03 Mar 2025
Towards Empowerment Gain through Causal Structure Learning in Model-Based RL
Hongye Cao
Fan Feng
Meng Fang
Shaokang Dong
Zhenxing Ge
Jing Huo
Yang Gao
266
3
0
14 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
341
14
0
13 Feb 2025
STLight: a Fully Convolutional Approach for Efficient Predictive Learning by Spatio-Temporal joint Processing
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Andrea Alfarano
Alberto Alfarano
Linda Friso
Andrea Bacciu
Irene Amerini
Fabrizio Silvestri
234
1
0
15 Nov 2024
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
Neural Information Processing Systems (NeurIPS), 2024
Rui Yang
Jie Wang
Guoping Wu
Yangqiu Song
AAML
OffRL
355
8
0
01 Nov 2024
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
Xu-Hui Liu
Tian-Shuo Liu
Shengyi Jiang
Ruifeng Chen
Zhilong Zhang
Xinwei Chen
Yang Yu
OffRL
OnRL
264
8
0
17 Jul 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
319
5
0
29 May 2024
Scalable and Effective Arithmetic Tree Generation for Adder and Multiplier Designs
Yao Lai
Jinxin Liu
Ping Luo
277
8
0
10 May 2024
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
M. Ostaszewski
Marek Cygan
214
0
0
01 Mar 2024
Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming
Haotian Ling
Zhihai Wang
Jie Wang
260
8
0
31 Jan 2024
State Sequences Prediction via Fourier Transform for Representation Learning
Neural Information Processing Systems (NeurIPS), 2023
Mingxuan Ye
Yufei Kuang
Jie Wang
Rui Yang
Wen-gang Zhou
Houqiang Li
Feng Wu
AI4TS
178
12
0
24 Oct 2023
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL
International Conference on Learning Representations (ICLR), 2024
Xiyao Wang
Ruijie Zheng
Yanchao Sun
Ruonan Jia
Wichayaporn Wongkamjan
Huazhe Xu
Furong Huang
OffRL
259
17
0
11 Oct 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization
Neural Information Processing Systems (NeurIPS), 2023
Hai Zhang
Hang Yu
Siyue Tao
Di Zhang
Chang Huang
Hongtu Zhou
Xiao Zhang
Chen Ye
245
12
0
22 Sep 2023
A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design
International Conference on Machine Learning (ICML), 2023
Zhihai Wang
Lei Chen
Jie Wang
Xing Li
Yinqi Bai
Xijun Li
Mingxuan Yuan
Jianye Hao
Yongdong Zhang
Feng Wu
158
12
0
22 Aug 2023
Transformers in Reinforcement Learning: A Survey
Pranav Agarwal
A. Rahman
P. St-Charles
Simon J. D. Prince
Samira Ebrahimi Kahou
OffRL
216
27
0
12 Jul 2023
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer
International Conference on Machine Learning (ICML), 2023
Yao Lai
Jinxin Liu
Zhentao Tang
Bin Wang
Jianye Hao
Ping Luo
OffRL
146
56
0
26 Jun 2023
Generalization in Visual Reinforcement Learning with the Reward Sequence Distribution
Jie Wang
Rui Yang
Zijie Geng
Zhihao Shi
Mingxuan Ye
Qi Zhou
Shuiwang Ji
Bin Li
Yongdong Zhang
Feng Wu
170
6
0
19 Feb 2023
NARS vs. Reinforcement learning: ONA vs. Q-Learning
Ali Beikmohammadi
169
0
0
23 Dec 2022
Learning Task-relevant Representations for Generalization via Characteristic Functions of Reward Sequence Distributions
Knowledge Discovery and Data Mining (KDD), 2022
Rui Yang
Jie Wang
Zijie Geng
Mingxuan Ye
Shuiwang Ji
Bin Li
Feng Wu
OOD
255
24
0
20 May 2022
A Review for Deep Reinforcement Learning in Atari: Benchmarks, Challenges, and Solutions
Jiajun Fan
OffRL
272
24
0
08 Dec 2021