ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.07193
  4. Cited By
Lipschitz Continuity in Model-based Reinforcement Learning

Lipschitz Continuity in Model-based Reinforcement Learning

19 April 2018
Kavosh Asadi
Dipendra Kumar Misra
Michael L. Littman
    KELM
ArXivPDFHTML

Papers citing "Lipschitz Continuity in Model-based Reinforcement Learning"

50 / 88 papers shown
Title
URECA: The Chain of Two Minimum Set Cover Problems exists behind Adaptation to Shifts in Semantic Code Search
URECA: The Chain of Two Minimum Set Cover Problems exists behind Adaptation to Shifts in Semantic Code Search
Seok-Ung Choi
Joonghyuk Hahn
Yo-Sub Han
51
0
0
11 Feb 2025
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Shenghong He
OffRL
177
0
0
10 Feb 2025
Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes
Zijian Wang
Bin Wang
Mingwen Shao
Hongbo Dou
Boxiang Tao
36
0
0
06 Jan 2025
Local Linearity: the Key for No-regret Reinforcement Learning in
  Continuous MDPs
Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restelli
39
0
0
31 Oct 2024
Topological Foundations of Reinforcement Learning
Topological Foundations of Reinforcement Learning
David Krame Kadurha
20
0
0
25 Sep 2024
Agent-state based policies in POMDPs: Beyond belief-state MDPs
Agent-state based policies in POMDPs: Beyond belief-state MDPs
Amit Sinha
Aditya Mahajan
26
2
0
24 Sep 2024
LLM-Empowered State Representation for Reinforcement Learning
LLM-Empowered State Representation for Reinforcement Learning
Boyuan Wang
Yun Qu
Yuhang Jiang
Jianzhun Shao
Chang-rui Liu
Wenming Yang
Xiangyang Ji
40
7
0
18 Jul 2024
An Optimal Tightness Bound for the Simulation Lemma
An Optimal Tightness Bound for the Simulation Lemma
Sam Lobel
Ronald E. Parr
28
2
0
24 Jun 2024
Learning Variable Compliance Control From a Few Demonstrations for
  Bimanual Robot with Haptic Feedback Teleoperation System
Learning Variable Compliance Control From a Few Demonstrations for Bimanual Robot with Haptic Feedback Teleoperation System
Tatsuya Kamijo
C. C. Beltran-Hernandez
Masashi Hamaya
60
11
0
21 Jun 2024
Early Detection of Misinformation for Infodemic Management: A Domain
  Adaptation Approach
Early Detection of Misinformation for Infodemic Management: A Domain Adaptation Approach
Minjia Mao
Xiaohang Zhao
Xiao Fang
40
0
0
02 Jun 2024
Resisting Stochastic Risks in Diffusion Planners with the Trajectory
  Aggregation Tree
Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree
Lang Feng
Pengjie Gu
Jingyi Wang
Gang Pan
42
2
0
28 May 2024
PDE Control Gym: A Benchmark for Data-Driven Boundary Control of Partial
  Differential Equations
PDE Control Gym: A Benchmark for Data-Driven Boundary Control of Partial Differential Equations
Luke Bhan
Yuexin Bian
Miroslav Krstic
Yuanyuan Shi
OOD
AI4CE
27
6
0
18 May 2024
Projection by Convolution: Optimal Sample Complexity for Reinforcement
  Learning in Continuous-Space MDPs
Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restelli
42
3
0
10 May 2024
A Note on Loss Functions and Error Compounding in Model-based
  Reinforcement Learning
A Note on Loss Functions and Error Compounding in Model-based Reinforcement Learning
Nan Jiang
27
5
0
15 Apr 2024
Model-based Reinforcement Learning for Parameterized Action Spaces
Model-based Reinforcement Learning for Parameterized Action Spaces
Renhao Zhang
Haotian Fu
Yilin Miao
George Konidaris
31
3
0
03 Apr 2024
A Case for Validation Buffer in Pessimistic Actor-Critic
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
M. Ostaszewski
Marek Cygan
34
0
0
01 Mar 2024
Beyond A*: Better Planning with Transformers via Search Dynamics
  Bootstrapping
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Lucas Lehnert
Sainbayar Sukhbaatar
DiJia Su
Qinqing Zheng
Paul Mcvay
Michael Rabbat
Yuandong Tian
32
53
0
21 Feb 2024
Performance Improvement Bounds for Lipschitz Configurable Markov
  Decision Processes
Performance Improvement Bounds for Lipschitz Configurable Markov Decision Processes
Alberto Maria Metelli
13
0
0
21 Feb 2024
No-Regret Reinforcement Learning in Smooth MDPs
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restell
36
5
0
06 Feb 2024
Long-term Safe Reinforcement Learning with Binary Feedback
Long-term Safe Reinforcement Learning with Binary Feedback
Akifumi Wachi
Wataru Hashimoto
Kazumune Hashimoto
OffRL
35
3
0
08 Jan 2024
Anytime-Competitive Reinforcement Learning with Policy Prior
Anytime-Competitive Reinforcement Learning with Policy Prior
Jianyi Yang
Pengfei Li
Tongxin Li
Adam Wierman
Shaolei Ren
43
2
0
02 Nov 2023
CCIL: Continuity-based Data Augmentation for Corrective Imitation
  Learning
CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning
Liyiming Ke
Yunchu Zhang
Abhay Deshpande
S. Srinivasa
Abhishek Gupta
OffRL
27
12
0
19 Oct 2023
Adaptive Online Replanning with Diffusion Models
Adaptive Online Replanning with Diffusion Models
Siyuan Zhou
Yilun Du
Shun Zhang
Mengdi Xu
Yikang Shen
Wei Xiao
Dit-Yan Yeung
Chuang Gan
30
22
0
14 Oct 2023
A Bayesian Approach to Robust Inverse Reinforcement Learning
A Bayesian Approach to Robust Inverse Reinforcement Learning
Ran Wei
Siliang Zeng
Chenliang Li
Alfredo García
Anthony D. McDonald
Mingyi Hong
OffRL
31
4
0
15 Sep 2023
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating
  The Worst Kernel
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel
Kaixin Wang
Uri Gadot
Navdeep Kumar
Kfir Y. Levy
Shie Mannor
34
2
0
09 Jun 2023
Finding Counterfactually Optimal Action Sequences in Continuous State
  Spaces
Finding Counterfactually Optimal Action Sequences in Continuous State Spaces
Stratis Tsirtsis
Manuel Gomez Rodriguez
CML
OffRL
30
9
0
06 Jun 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
34
9
0
29 May 2023
Self-Supervised Reinforcement Learning that Transfers using Random
  Features
Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Kaipeng Zhang
Abhishek Gupta
OffRL
SSL
22
6
0
26 May 2023
GELU Activation Function in Deep Learning: A Comprehensive Mathematical
  Analysis and Performance
GELU Activation Function in Deep Learning: A Comprehensive Mathematical Analysis and Performance
Minhyeok Lee
21
29
0
20 May 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and
  Bidirectional Curriculum
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
Jigang Kim
Daesol Cho
H. J. Kim
22
3
0
17 May 2023
A Survey on Offline Model-Based Reinforcement Learning
A Survey on Offline Model-Based Reinforcement Learning
Haoyang He
OffRL
24
7
0
05 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Bayesian Reinforcement Learning with Limited Cognitive Load
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
34
8
0
05 May 2023
Planning with Sequence Models through Iterative Energy Minimization
Planning with Sequence Models through Iterative Energy Minimization
Hongyi Chen
Yilun Du
Yiye Chen
J. Tenenbaum
Patricio A. Vela
32
6
0
28 Mar 2023
Some Fundamental Aspects about Lipschitz Continuity of Neural Networks
Some Fundamental Aspects about Lipschitz Continuity of Neural Networks
Grigory Khromov
Sidak Pal Singh
29
7
0
21 Feb 2023
Is Model Ensemble Necessary? Model-based RL via a Single Model with
  Lipschitz Regularized Value Function
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
Ruijie Zheng
Xiyao Wang
Huazhe Xu
Furong Huang
48
13
0
02 Feb 2023
Tight Performance Guarantees of Imitator Policies with Continuous
  Actions
Tight Performance Guarantees of Imitator Policies with Continuous Actions
Davide Maran
Alberto Maria Metelli
Marcello Restelli
OffRL
23
4
0
07 Dec 2022
Representation Learning for Continuous Action Spaces is Beneficial for
  Efficient Policy Learning
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
19
1
0
23 Nov 2022
Reward-Predictive Clustering
Reward-Predictive Clustering
Lucas Lehnert
M. Frank
Michael L. Littman
OffRL
22
0
0
07 Nov 2022
Model-based Reinforcement Learning with a Hamiltonian Canonical ODE
  Network
Model-based Reinforcement Learning with a Hamiltonian Canonical ODE Network
Yao Feng
Yuhong Jiang
Hang Su
Dong Yan
Jun Zhu
15
1
0
02 Nov 2022
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement
  Learning
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
28
4
0
30 Oct 2022
Language Control Diffusion: Efficiently Scaling through Space, Time, and
  Tasks
Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks
Edwin Zhang
Yujie Lu
William Wang
Amy Zhang
DiffM
LM&Ro
32
16
0
27 Oct 2022
On Many-Actions Policy Gradient
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
19
0
0
24 Oct 2022
When to Update Your Model: Constrained Model-based Reinforcement
  Learning
When to Update Your Model: Constrained Model-based Reinforcement Learning
Tianying Ji
Yu-Juan Luo
Gang Hua
Mingxuan Jing
Fengxiang He
Wen-bing Huang
24
18
0
15 Oct 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
50
101
0
19 Jun 2022
Deciding What to Model: Value-Equivalent Sampling for Reinforcement
  Learning
Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning
Dilip Arumugam
Benjamin Van Roy
OffRL
33
15
0
04 Jun 2022
Truly Deterministic Policy Optimization
Truly Deterministic Policy Optimization
Ehsan Saleh
Saba Ghaffari
Timothy Bretl
Matthew West
OffRL
24
3
0
30 May 2022
Faster Deep Reinforcement Learning with Slower Online Network
Faster Deep Reinforcement Learning with Slower Online Network
Kavosh Asadi
Rasool Fakoor
Omer Gottesman
Taesup Kim
Michael L. Littman
Alexander J. Smola
OnRL
11
6
0
10 Dec 2021
On Effective Scheduling of Model-based Reinforcement Learning
On Effective Scheduling of Model-based Reinforcement Learning
Hang Lai
Jian Shen
Weinan Zhang
Yimin Huang
Xingzhi Zhang
Ruiming Tang
Yong Yu
Zhenguo Li
28
18
0
16 Nov 2021
Coarse-Grained Smoothness for RL in Metric Spaces
Coarse-Grained Smoothness for RL in Metric Spaces
Giorgio Giannone
Kavosh Asadi
Cameron Allen
Sam Lobel
George Konidaris
Michael Littman
40
3
0
23 Oct 2021
Block Contextual MDPs for Continual Learning
Block Contextual MDPs for Continual Learning
Shagun Sodhani
Franziska Meier
Joelle Pineau
Amy Zhang
CLL
33
25
0
13 Oct 2021
12
Next