ResearchTrend.AI
VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning


24 May 2025
Guanxing Lu
Wenkai Guo
Chubin Zhang
Yuheng Zhou
Haonan Jiang
Zifeng Gao
Yansong Tang
Ziwei Wang
    OffRL

Papers citing "VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning"

22 / 72 papers shown
Title
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale
Yao Lu
Karol Hausman
Yevgen Chebotar
Mengyuan Yan
Eric Jang
...
Ted Xiao
A. Irpan
Mohi Khansari
Dmitry Kalashnikov
Sergey Levine
OffRL
197
61
0
09 Nov 2021
Bridge Data: Boosting Generalization of Robotic Skills with Cross-Domain Datasets
F. Ebert
Yanlai Yang
Karl Schmeckpeper
Bernadette Bucher
G. Georgakis
Kostas Daniilidis
Chelsea Finn
Sergey Levine
253
236
0
27 Sep 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL AI4TS AI4CE ALM AIMat
807
10,658
0
17 Jun 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
201
1,671
0
02 Jun 2021
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Dmitry Kalashnikov
Jacob Varley
Yevgen Chebotar
Benjamin Swanson
Rico Jonschkowski
Chelsea Finn
Sergey Levine
Karol Hausman
OffRL
146
280
0
16 Apr 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLL OffRL
148
324
0
25 Dec 2020
Transfer Learning in Deep Reinforcement Learning: A Survey
Zhuangdi Zhu
Kaixiang Lin
Anil K. Jain
Jiayu Zhou
OffRL LRM
149
606
0
16 Sep 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL OnRL
164
620
0
16 Jun 2020
The Ingredients of Real-World Robotic Reinforcement Learning
Henry Zhu
Justin Yu
Abhishek Gupta
Dhruv Shah
Kristian Hartikainen
Avi Singh
Vikash Kumar
Sergey Levine
OffRL
158
181
0
27 Apr 2020
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman
121
435
0
25 Oct 2019
RoboNet: Large-Scale Multi-Robot Learning
Sudeep Dasari
F. Ebert
Stephen Tian
Suraj Nair
Bernadette Bucher
Karl Schmeckpeper
Siddharth Singh
Sergey Levine
Chelsea Finn
LM&Ro
123
304
0
24 Oct 2019
Solving Rubik's Cube with a Robot Hand
OpenAI
Ilge Akkaya
Marcin Andrychowicz
Maciek Chociej
Mateusz Litwin
...
Peter Welinder
Lilian Weng
Qiming Yuan
Wojciech Zaremba
Lei Zhang
ODL
207
1,236
0
16 Oct 2019
RoboTurk: A Crowdsourcing Platform for Robotic Skill Learning through Imitation
Ajay Mandlekar
Yuke Zhu
Animesh Garg
Jonathan Booher
Max Spero
...
John Emmons
Anchit Gupta
Emre Orbay
Silvio Savarese
Li Fei-Fei
OffRL
103
293
0
07 Nov 2018
Robot Learning in Homes: Improving Generalization and Reducing Dataset Bias
Abhinav Gupta
Adithyavairavan Murali
Dhiraj Gandhi
Lerrel Pinto
134
153
0
18 Jul 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
...
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
197
1,474
0
27 Jun 2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Yuke Zhu
Ziyun Wang
J. Merel
Andrei A. Rusu
Tom Erez
...
S. Tunyasuvunakool
János Kramár
R. Hadsell
Nando de Freitas
N. Heess
SSL
140
320
0
26 Feb 2018
Ray: A Distributed Framework for Emerging AI Applications
Philipp Moritz
Robert Nishihara
Stephanie Wang
Alexey Tumanov
Richard Liaw
...
Melih Elibol
Zongheng Yang
William Paul
Michael I. Jordan
Ion Stoica
GNN
168
1,270
0
16 Dec 2017
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
Giulia Vezzani
John Schulman
E. Todorov
Sergey Levine
245
1,107
0
28 Sep 2017
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Matej Vecerík
Todd Hester
Jonathan Scholz
Fumin Wang
Olivier Pietquin
Bilal Piot
N. Heess
Thomas Rothörl
Thomas Lampe
Martin Riedmiller
OffRL
125
671
0
27 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
706
19,377
0
20 Jul 2017
Supersizing Self-supervision: Learning to Grasp from 50K Tries and 700 Robot Hours
Lerrel Pinto
Abhinav Gupta
SSL
116
1,152
0
23 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
169
3,453
0
08 Jun 2015