ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

International Conference on Machine Learning (ICML), 2018
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,375 papers shown
NaviSplit: Dynamic Multi-Branch Split DNNs for Efficient Distributed Autonomous Navigation
NaviSplit: Dynamic Multi-Branch Split DNNs for Efficient Distributed Autonomous NavigationIEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM), 2024
Timothy K Johnsen
Ian Harshbarger
Zixia Xia
Marco Levorato
204
4
0
10 Apr 2026
NaviSlim: Adaptive Context-Aware Navigation and Sensing via Dynamic Slimmable Networks
NaviSlim: Adaptive Context-Aware Navigation and Sensing via Dynamic Slimmable NetworksInternational Conference on Internet-of-Things Design and Implementation (IoTDI), 2024
Timothy K Johnsen
Marco Levorato
229
3
0
10 Apr 2026
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
Xuyang Li
Romit Maulik
FAtt
550
0
0
10 Apr 2026
ALPINE: Closed-Loop Adaptive Privacy Budget Allocation for Mobile Edge Crowdsensing
ALPINE: Closed-Loop Adaptive Privacy Budget Allocation for Mobile Edge Crowdsensing
Guanjie Cheng
Siyang Liu
Junqin Huang
Xinkui Zhao
Yin Wang
Mengying Zhu
Linghe Kong
166
1
0
10 Apr 2026
CACTO-SL: Using Sobolev Learning to improve Continuous Actor-Critic with Trajectory Optimization
CACTO-SL: Using Sobolev Learning to improve Continuous Actor-Critic with Trajectory Optimization
Elisa Alboni
Gianluigi Grandesso
G. P. R. Papini
Justin Carpentier
Andrea Del Prete
238
8
0
30 Mar 2026
Enhancing Deep Deterministic Policy Gradients on Continuous Control Tasks with Decoupled Prioritized Experience Replay
Enhancing Deep Deterministic Policy Gradients on Continuous Control Tasks with Decoupled Prioritized Experience Replay
Mehmet Efe Lorasdagi
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
198
0
0
04 Dec 2025
BiTAgent: A Task-Aware Modular Framework for Bidirectional Coupling between Multimodal Large Language Models and World Models
BiTAgent: A Task-Aware Modular Framework for Bidirectional Coupling between Multimodal Large Language Models and World Models
Yu-Wei Zhan
Xin Wang
Pengzhe Mao
Tongtong Feng
Ren Wang
Wenwu Zhu
268
0
0
04 Dec 2025
World Models for Autonomous Navigation of Terrestrial Robots from LIDAR Observations
World Models for Autonomous Navigation of Terrestrial Robots from LIDAR Observations
Raul Steinmetz
Fabio Demo Rosa
V. A. Kich
J. A. Bottega
Ricardo B. Grando
D. T. Gamarra
3DV
446
1
0
03 Dec 2025
Towards better dense rewards in Reinforcement Learning Applications
Towards better dense rewards in Reinforcement Learning Applications
Shuyuan Zhang
OffRL
146
0
0
03 Dec 2025
Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning
Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning
Franki Nguimatsia Tiofack
Théotime Le Hellard
Fabian Schramm
Nicolas Perrin-Gilbert
Justin Carpentier
320
1
0
03 Dec 2025
Learning Sim-to-Real Humanoid Locomotion in 15 Minutes
Learning Sim-to-Real Humanoid Locomotion in 15 Minutes
Younggyo Seo
Carmelo Sferrazza
Juyue Chen
Guanya Shi
Rocky Duan
Pieter Abbeel
264
5
0
01 Dec 2025
Forecasting in Offline Reinforcement Learning for Non-stationary Environments
Forecasting in Offline Reinforcement Learning for Non-stationary Environments
S. E. Ada
Georg Martius
Emre Ugur
Erhan Öztop
OffRL
262
0
0
01 Dec 2025
Hardware-Software Collaborative Computing of Photonic Spiking Reinforcement Learning for Robotic Continuous Control
Hardware-Software Collaborative Computing of Photonic Spiking Reinforcement Learning for Robotic Continuous Control
Mengting Yu
Shuiying Xiang
Changjian Xie
Yonghang Chen
Haowen Zhao
Xingxing Guo
Yahui Zhang
Yanan Han
Yue Hao
141
0
0
29 Nov 2025
Independent policy gradient-based reinforcement learning for economic and reliable energy management of multi-microgrid systems
Independent policy gradient-based reinforcement learning for economic and reliable energy management of multi-microgrid systems
Junkai Hu
Li Xia
420
0
0
26 Nov 2025
Reinforcing Action Policies by Prophesying
Reinforcing Action Policies by Prophesying
Jiahui Zhang
Ze Huang
Chun Gu
Zipei Ma
Li Zhang
286
8
0
25 Nov 2025
Learning Massively Multitask World Models for Continuous Control
Learning Massively Multitask World Models for Continuous Control
Nicklas Hansen
Hao Su
Xiaolong Wang
OffRLCLLLM&Ro
618
3
0
24 Nov 2025
First-order Sobolev Reinforcement Learning
First-order Sobolev Reinforcement Learning
Fabian Schramm
Nicolas Perrin-Gilbert
Justin Carpentier
93
0
0
24 Nov 2025
How to Train Your Latent Control Barrier Function: Smooth Safety Filtering Under Hard-to-Model Constraints
How to Train Your Latent Control Barrier Function: Smooth Safety Filtering Under Hard-to-Model Constraints
Kensuke Nakamura
Arun L. Bishop
Steven Man
Aaron M. Johnson
Zachary Manchester
Andrea V. Bajcsy
140
3
0
23 Nov 2025
MOMA-AC: A preference-driven actor-critic framework for continuous multi-objective multi-agent reinforcement learning
MOMA-AC: A preference-driven actor-critic framework for continuous multi-objective multi-agent reinforcement learningNeurocomputing (Neurocomputing), 2025
Adam Callaghan
Karl Mason
Patrick Mannion
AI4CE
127
0
0
22 Nov 2025
RL-AD-Net: Reinforcement Learning Guided Adaptive Displacement in Latent Space for Refined Point Cloud Completion
RL-AD-Net: Reinforcement Learning Guided Adaptive Displacement in Latent Space for Refined Point Cloud Completion
Bhanu Pratap Paregi
Vaibhav Kumar
3DPC
215
0
0
21 Nov 2025
Mitigating Estimation Bias with Representation Learning in TD Error-Driven Regularization
Mitigating Estimation Bias with Representation Learning in TD Error-Driven Regularization
Haohui Chen
Zhiyong Chen
Aoxiang Liu
Wentuo Fang
189
0
0
20 Nov 2025
Revisiting Fairness-aware Interactive Recommendation: Item Lifecycle as a Control Knob
Revisiting Fairness-aware Interactive Recommendation: Item Lifecycle as a Control Knob
Yun Lu
Xiaoyu Shi
Hong Xie
Chongjun Xia
Zhenhui Gong
Mingsheng Shang
116
0
0
20 Nov 2025
Optimizing Operation Recipes with Reinforcement Learning for Safe and Interpretable Control of Chemical Processes
Optimizing Operation Recipes with Reinforcement Learning for Safe and Interpretable Control of Chemical Processes
D. Brandner
Sergio Lucia
198
0
0
20 Nov 2025
Stabilizing Policy Gradient Methods via Reward Profiling
Stabilizing Policy Gradient Methods via Reward Profiling
Shihab Ahmed
El Houcine Bergou
A. Dutta
Yue Wang
OffRL
269
0
0
20 Nov 2025
Socially aware navigation for mobile robots: a survey on deep reinforcement learning approaches
Socially aware navigation for mobile robots: a survey on deep reinforcement learning approaches
Ibrahim Khalil Kabir
Muhammad Faizan Mysorewala
129
3
0
18 Nov 2025
DiffFP: Learning Behaviors from Scratch via Diffusion-based Fictitious Play
DiffFP: Learning Behaviors from Scratch via Diffusion-based Fictitious Play
Akash Karthikeyan
Yash Vardhan Pant
167
0
0
17 Nov 2025
Cryptocurrency Portfolio Management with Reinforcement Learning: Soft Actor--Critic and Deep Deterministic Policy Gradient Algorithms
Cryptocurrency Portfolio Management with Reinforcement Learning: Soft Actor--Critic and Deep Deterministic Policy Gradient Algorithms
Kamal Paykan
AIFin
183
0
0
16 Nov 2025
Intelligent Collaborative Optimization for Rubber Tyre Film Production Based on Multi-path Differentiated Clipping Proximal Policy Optimization
Intelligent Collaborative Optimization for Rubber Tyre Film Production Based on Multi-path Differentiated Clipping Proximal Policy Optimization
Yinghao Ruan
Wei Pang
Shuaihao Liu
Huili Yang
Leyi Han
Xinghui Dong
246
0
0
15 Nov 2025
Incorporating Spatial Information into Goal-Conditioned Hierarchical Reinforcement Learning via Graph Representations
Incorporating Spatial Information into Goal-Conditioned Hierarchical Reinforcement Learning via Graph Representations
Shuyuan Zhang
Zihan Wang
Xiao-Wen Chang
Doina Precup
143
2
0
14 Nov 2025
Dynamic Weight Adaptation in Spiking Neural Networks Inspired by Biological Homeostasis
Dynamic Weight Adaptation in Spiking Neural Networks Inspired by Biological Homeostasis
Yunduo Zhou
B. Dong
Chang Li
Y. Wang
Xuefeng Yin
Yang Wang
Xin Yang
208
0
0
13 Nov 2025
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
Yunchang Ma
Tenglong Liu
Yixing Lan
Xin Yin
Changxin Zhang
Xinglong Zhang
Xin Xu
OffRL
298
0
0
12 Nov 2025
Balance Equation-based Distributionally Robust Offline Imitation Learning
Balance Equation-based Distributionally Robust Offline Imitation Learning
Rishabh Agrawal
Yusuf Alvi
R. Jain
A. Nayyar
OffRLOOD
280
0
0
11 Nov 2025
Beyond Distributions: Geometric Action Control for Continuous Reinforcement Learning
Beyond Distributions: Geometric Action Control for Continuous Reinforcement Learning
Zhihao Lin
326
0
0
11 Nov 2025
Secure Low-altitude Maritime Communications via Intelligent Jamming
Secure Low-altitude Maritime Communications via Intelligent JammingScience China Information Sciences (Sci. China Inf. Sci.), 2025
Jiawei Huang
Aimin Wang
Geng Sun
Jiahui Li
Jiacheng Wang
Weijie Yuan
Dusit Niyato
Xianbin Wang
150
0
0
10 Nov 2025
Physically-Grounded Goal Imagination: Physics-Informed Variational Autoencoder for Self-Supervised Reinforcement Learning
Physically-Grounded Goal Imagination: Physics-Informed Variational Autoencoder for Self-Supervised Reinforcement Learning
Lan Thi Ha Nguyen
Kien Ton Manh
Anh Do Duc
Nam Pham Hai
DRLSSLAI4CE
612
0
0
10 Nov 2025
Shocks Under Control: Taming Transonic Compressible Flow over an RAE2822 Airfoil with Deep Reinforcement Learning
Shocks Under Control: Taming Transonic Compressible Flow over an RAE2822 Airfoil with Deep Reinforcement Learning
Trishit Mondal
Ricardo Vinuesa
Ameya D. Jagtap
AI4CE
166
1
0
10 Nov 2025
Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization
Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization
Sayambhu Sen
Shalabh Bhatnagar
148
0
0
10 Nov 2025
Dynamics-Decoupled Trajectory Alignment for Sim-to-Real Transfer in Reinforcement Learning for Autonomous Driving
Dynamics-Decoupled Trajectory Alignment for Sim-to-Real Transfer in Reinforcement Learning for Autonomous Driving
Thomas Steinecker
Alexander Bienemann
Denis Trescher
Thorsten Luettel
Mirko Maehlisch
323
0
0
10 Nov 2025
From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos
From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos
Xun Wang
Zhuoran Li
Yanshan Lin
Hai Zhong
Longbo Huang
OffRLLLMAG
201
0
0
04 Nov 2025
Natural-gas storage modelling by deep reinforcement learning
Natural-gas storage modelling by deep reinforcement learning
Tiziano Balaconi
Aldo Glielmo
Marco Taboga
128
0
0
04 Nov 2025
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Yixiu Mao
Yun Qu
Qi Wang
Xiangyang Ji
OffRL
194
1
0
04 Nov 2025
Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization
Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization
Ziqi Wang
Jiashun Liu
L. Pan
263
2
0
03 Nov 2025
L2T-Tune:LLM-Guided Hybrid Database Tuning with LHS and TD3
L2T-Tune:LLM-Guided Hybrid Database Tuning with LHS and TD3
Xinyue Yang
Chen Zheng
Yaoyang Hou
Renhao Zhang
Y. Zhang
Yanjun Wu
Heng Zhang
330
0
0
03 Nov 2025
Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments
Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments
Xiaoyi He
Danggui Chen
Zhenshuo Zhang
Zimeng Bai
126
1
0
30 Oct 2025
Real-DRL: Teach and Learn in Reality
Real-DRL: Teach and Learn in Reality
Y. Mao
Yihao Cai
L. Sha
OffRL
210
0
0
30 Oct 2025
Morphology-Aware Graph Reinforcement Learning for Tensegrity Robot Locomotion
Morphology-Aware Graph Reinforcement Learning for Tensegrity Robot Locomotion
Chi Zhang
Mingrui Li
W. Tong
X. Y. Huang
AI4CE
130
0
0
30 Oct 2025
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
Wenli Xiao
Haotian Lin
Andy Peng
Haoru Xue
Tairan He
...
Jimmy Wu
Zhengyi Luo
Linxi Fan
Guanya Shi
Yuke Zhu
VLM
694
23
0
30 Oct 2025
Accelerating Real-World Overtaking in F1TENTH Racing Employing Reinforcement Learning Methods
Accelerating Real-World Overtaking in F1TENTH Racing Employing Reinforcement Learning Methods
Emily Steiner
Daniel van der Spuy
Futian Zhou
Afereti Pama
Minas V. Liarokapis
Henry Williams
125
1
0
30 Oct 2025
Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
Sagalpreet Singh
Rishi Saket
A. Raghuveer
OOD
155
0
0
29 Oct 2025
Off-policy Reinforcement Learning with Model-based Exploration Augmentation
Off-policy Reinforcement Learning with Model-based Exploration Augmentation
Likun Wang
Xiangteng Zhang
Yinuo Wang
Guojian Zhan
Wenxuan Wang
Haoyu Gao
Jingliang Duan
Shengbo Eben Li
OffRL
221
1
0
29 Oct 2025
1234...464748
Next
Page 1 of 48
Pageof 48