ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.07854
  4. Cited By
End-to-End Robotic Reinforcement Learning without Reward Engineering

End-to-End Robotic Reinforcement Learning without Reward Engineering

16 April 2019
Avi Singh
Larry Yang
Kristian Hartikainen
Chelsea Finn
Sergey Levine
    SSL
    OffRL
ArXivPDFHTML

Papers citing "End-to-End Robotic Reinforcement Learning without Reward Engineering"

50 / 54 papers shown
Title
BiasBench: A reproducible benchmark for tuning the biases of event cameras
BiasBench: A reproducible benchmark for tuning the biases of event cameras
Andreas Ziegler
David Joseph
Thomas Gossard
Emil Moldovan
A. Zell
29
0
0
25 Apr 2025
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Sinan Ibrahim
Mostafa Mostafa
Ali Jnadi
Hadi Salloum
Pavel Osinenko
OffRL
39
12
0
31 Dec 2024
On-Robot Reinforcement Learning with Goal-Contrastive Rewards
On-Robot Reinforcement Learning with Goal-Contrastive Rewards
Ondrej Biza
Thomas Weng
Lingfeng Sun
Karl Schmeckpeper
Tarik Kelestemur
Yecheng Jason Ma
Robert C. Platt
Jan Willem van de Meent
Lawson L. S. Wong
OffRL
43
0
0
25 Oct 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
64
1
0
22 Aug 2024
Adapt2Reward: Adapting Video-Language Models to Generalizable Robotic
  Rewards via Failure Prompts
Adapt2Reward: Adapting Video-Language Models to Generalizable Robotic Rewards via Failure Prompts
Yanting Yang
Minghao Chen
Qibo Qiu
Jiahao Wu
Wenxiao Wang
Binbin Lin
Ziyu Guan
Xiaofei He
LM&Ro
39
2
0
20 Jul 2024
Affordance-Guided Reinforcement Learning via Visual Prompting
Affordance-Guided Reinforcement Learning via Visual Prompting
Olivia Y. Lee
Annie Xie
Kuan Fang
Karl Pertsch
Chelsea Finn
OffRL
LM&Ro
74
7
0
14 Jul 2024
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation
Kuo-Han Hung
Pang-Chi Lo
Jia-Fong Yeh
Han-Yuan Hsu
Yi-Ting Chen
Winston H. Hsu
30
0
0
26 May 2024
An Optimization-Based Planner with B-spline Parameterized
  Continuous-Time Reference Signals
An Optimization-Based Planner with B-spline Parameterized Continuous-Time Reference Signals
Chuyuan Tao
Sheng Cheng
Yang Zhao
Fanxin Wang
N. Hovakimyan
32
0
0
29 Mar 2024
Leveraging Symmetry in RL-based Legged Locomotion Control
Leveraging Symmetry in RL-based Legged Locomotion Control
Zhi Su
Xiaoyu Huang
Daniel Felipe Ordoñez Apraez
Yunfei Li
Zhongyu Li
...
Giulio Turrisi
Massimiliano Pontil
Claudio Semini
Yi Wu
K. Sreenath
33
14
0
26 Mar 2024
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
35
4
0
06 Nov 2023
On-Robot Bayesian Reinforcement Learning for POMDPs
On-Robot Bayesian Reinforcement Learning for POMDPs
Hai V. Nguyen
Sammie Katt
Yuchen Xiao
Chris Amato
OffRL
18
1
0
22 Jul 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning
  from Observations
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
28
16
0
30 Mar 2023
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement
  Learning
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning
Archit Sharma
Ahmed M. Ahmed
Rehaan Ahmad
Chelsea Finn
SSL
46
17
0
02 Mar 2023
A Supervisory Learning Control Framework for Autonomous & Real-time Task
  Planning for an Underactuated Cooperative Robotic task
A Supervisory Learning Control Framework for Autonomous & Real-time Task Planning for an Underactuated Cooperative Robotic task
Sander De Witte
Tom Lefebvre
Thijs Van Hauwermeiren
Guillaume Crevecoeur
15
0
0
22 Feb 2023
Effective Multimodal Reinforcement Learning with Modality Alignment and
  Importance Enhancement
Effective Multimodal Reinforcement Learning with Modality Alignment and Importance Enhancement
Jinming Ma
Feng Wu
Yingfeng Chen
Xianpeng Ji
Yu-qiong Ding
OffRL
27
4
0
18 Feb 2023
Learning passive policies with virtual energy tanks in robotics
Learning passive policies with virtual energy tanks in robotics
R. Zanella
G. Palli
Stefano Stramigioli
Federico Califano
14
3
0
30 Jan 2023
Training Robots to Evaluate Robots: Example-Based Interactive Reward
  Functions for Policy Learning
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
28
5
0
17 Dec 2022
MoDem: Accelerating Visual Model-Based Reinforcement Learning with
  Demonstrations
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
26
49
0
12 Dec 2022
Time-Efficient Reward Learning via Visually Assisted Cluster Ranking
Time-Efficient Reward Learning via Visually Assisted Cluster Ranking
David Zhang
Micah Carroll
Andreea Bobu
Anca Dragan
22
4
0
30 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
23
0
0
23 Nov 2022
Rewards Encoding Environment Dynamics Improves Preference-based
  Reinforcement Learning
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning
Katherine Metcalf
Miguel Sarabia
B. Theobald
OffRL
32
4
0
12 Nov 2022
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for
  Industrial Insertion of Novel Connectors from Vision
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision
Ashvin Nair
Brian Zhu
Gokul Narayanan
Eugen Solowjow
Sergey Levine
OffRL
OnRL
25
14
0
27 Oct 2022
Holo-Dex: Teaching Dexterity with Immersive Mixed Reality
Holo-Dex: Teaching Dexterity with Immersive Mixed Reality
Sridhar Pandian Arunachalam
Irmak Güzey
Soumith Chintala
Lerrel Pinto
34
67
0
12 Oct 2022
MSVIPER: Improved Policy Distillation for Reinforcement-Learning-Based
  Robot Navigation
MSVIPER: Improved Policy Distillation for Reinforcement-Learning-Based Robot Navigation
Aaron M. Roth
Jing Liang
Ram D. Sriram
Elham Tabassi
Dinesh Manocha
24
1
0
19 Sep 2022
Co-Imitation: Learning Design and Behaviour by Imitation
Co-Imitation: Learning Design and Behaviour by Imitation
C. Rajani
Karol Arndt
David Blanco Mulero
K. Luck
Ville Kyrki
6
4
0
02 Sep 2022
Can Foundation Models Perform Zero-Shot Task Specification For Robot
  Manipulation?
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?
Yuchen Cui
S. Niekum
Abhi Gupta
Vikash Kumar
Aravind Rajeswaran
LM&Ro
21
73
0
23 Apr 2022
On-Robot Learning With Equivariant Models
On-Robot Learning With Equivariant Models
Dian Wang
Ming Jia
Xu Zhu
Robin G. Walters
Robert W. Platt
OffRL
SSL
28
35
0
09 Mar 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
A General, Evolution-Inspired Reward Function for Social Robotics
A General, Evolution-Inspired Reward Function for Social Robotics
Thomas Kingsford
16
0
0
01 Feb 2022
Improving Behavioural Cloning with Human-Driven Dynamic Dataset
  Augmentation
Improving Behavioural Cloning with Human-Driven Dynamic Dataset Augmentation
Federico Malato
Joona Jehkonen
Ville Hautamaki
11
1
0
19 Jan 2022
Robot Skill Adaptation via Soft Actor-Critic Gaussian Mixture Models
Robot Skill Adaptation via Soft Actor-Critic Gaussian Mixture Models
Iman Nematollahi
Erick Rosete-Beas
Adrian Rofer
Tim Welschehold
Abhinav Valada
Wolfram Burgard
17
15
0
25 Nov 2021
Rearranging the Environment to Maximize Energy with a Robotic Circuit
  Drawing
Rearranging the Environment to Maximize Energy with a Robotic Circuit Drawing
X. Tan
Zhikang Liu
Chenxiao Yu
A. Rosendo
14
0
0
15 Nov 2021
Example-Driven Model-Based Reinforcement Learning for Solving
  Long-Horizon Visuomotor Tasks
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks
Bohan Wu
Suraj Nair
Li Fei-Fei
Chelsea Finn
OffRL
LM&Ro
38
24
0
21 Sep 2021
Learning Language-Conditioned Robot Behavior from Offline Data and
  Crowd-Sourced Annotation
Learning Language-Conditioned Robot Behavior from Offline Data and Crowd-Sourced Annotation
Suraj Nair
E. Mitchell
Kevin Chen
Brian Ichter
Silvio Savarese
Chelsea Finn
LM&Ro
OffRL
25
154
0
02 Sep 2021
A Study on Dense and Sparse (Visual) Rewards in Robot Policy Learning
A Study on Dense and Sparse (Visual) Rewards in Robot Policy Learning
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
8
16
0
06 Aug 2021
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of
  Sparse Reward Iterative Tasks
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks
Albert Wilcox
Ashwin Balakrishna
Brijen Thananjeyan
Joseph E. Gonzalez
Ken Goldberg
8
11
0
10 Jul 2021
Programming and Deployment of Autonomous Swarms using Multi-Agent
  Reinforcement Learning
Programming and Deployment of Autonomous Swarms using Multi-Agent Reinforcement Learning
Jayson G. Boubin
Codi Burley
Peida Han
Bowen Li
Barry Porter
Christopher Stewart
19
3
0
21 May 2021
Robotic Surgery With Lean Reinforcement Learning
Robotic Surgery With Lean Reinforcement Learning
Yotam Barnoy
Molly O'Brien
W. Wang
Gregory D. Hager
OffRL
35
20
0
03 May 2021
Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human
  Videos
Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos
Annie S. Chen
Suraj Nair
Chelsea Finn
30
136
0
31 Mar 2021
Replacing Rewards with Examples: Example-Based Policy Search via
  Recursive Classification
Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
29
50
0
23 Mar 2021
Unsupervised Feature Learning for Manipulation with Contrastive Domain
  Randomization
Unsupervised Feature Learning for Manipulation with Contrastive Domain Randomization
Carmel Rabinovitz
Niko A. Grupen
Aviv Tamar
OOD
SSL
19
3
0
20 Mar 2021
Semi-supervised reward learning for offline reinforcement learning
Semi-supervised reward learning for offline reinforcement learning
Ksenia Konyushkova
Konrad Zolna
Y. Aytar
Alexander Novikov
Scott E. Reed
Serkan Cabi
Nando de Freitas
SSL
OffRL
58
23
0
12 Dec 2020
Learning Dense Rewards for Contact-Rich Manipulation Tasks
Learning Dense Rewards for Contact-Rich Manipulation Tasks
Zheng Wu
Wenzhao Lian
Vaibhav Unhelkar
M. Tomizuka
S. Schaal
8
37
0
17 Nov 2020
COG: Connecting New Skills to Past Experience with Offline Reinforcement
  Learning
COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning
Avi Singh
Albert Yu
Jonathan Yang
Jesse Zhang
Aviral Kumar
Sergey Levine
SSL
OffRL
OnRL
29
103
0
27 Oct 2020
MELD: Meta-Reinforcement Learning from Images via Latent State Models
MELD: Meta-Reinforcement Learning from Images via Latent State Models
Tony Zhao
Anusha Nagabandi
Kate Rakelly
Chelsea Finn
Sergey Levine
OffRL
16
36
0
26 Oct 2020
Human-guided Robot Behavior Learning: A GAN-assisted Preference-based
  Reinforcement Learning Approach
Human-guided Robot Behavior Learning: A GAN-assisted Preference-based Reinforcement Learning Approach
Huixin Zhan
Feng Tao
Yongcan Cao
35
26
0
15 Oct 2020
Vision-Based Goal-Conditioned Policies for Underwater Navigation in the
  Presence of Obstacles
Vision-Based Goal-Conditioned Policies for Underwater Navigation in the Presence of Obstacles
Travis Manderson
J. A. G. Higuera
Stefan Wapnick
J. Tremblay
Florian Shkurti
D. Meger
Gregory Dudek
16
50
0
29 Jun 2020
Self-supervised Learning for Precise Pick-and-place without Object Model
Self-supervised Learning for Precise Pick-and-place without Object Model
Lars Berscheid
Pascal Meissner
Torsten Kröger
SSL
DRL
18
66
0
15 Jun 2020
Language Conditioned Imitation Learning over Unstructured Data
Language Conditioned Imitation Learning over Unstructured Data
Corey Lynch
P. Sermanet
LM&Ro
30
241
0
15 May 2020
Balance Between Efficient and Effective Learning: Dense2Sparse Reward
  Shaping for Robot Manipulation with Environment Uncertainty
Balance Between Efficient and Effective Learning: Dense2Sparse Reward Shaping for Robot Manipulation with Environment Uncertainty
Yongle Luo
Kun Dong
Lili Zhao
Zhiyong Sun
Chao Zhou
Bo Song
31
13
0
05 Mar 2020
12
Next