ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.16189
  4. Cited By
OpenRL: A Unified Reinforcement Learning Framework

OpenRL: A Unified Reinforcement Learning Framework

20 December 2023
Shiyu Huang
Wentse Chen
Yiwen Sun
Fuqing Bie
Weijuan Tu
ArXivPDFHTML

Papers citing "OpenRL: A Unified Reinforcement Learning Framework"

6 / 6 papers shown
Title
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
Xintong Duan
Yutong He
Fahim Tajwar
Wen-Tse Chen
Ruslan Salakhutdinov
Jeff Schneider
OffRL
AI4CE
89
0
0
22 Jan 2025
MQE: Unleashing the Power of Interaction with Multi-agent Quadruped
  Environment
MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment
Ziyan Xiong
Bo Chen
Shiyu Huang
Weijuan Tu
Zhaofeng He
Yang Gao
27
4
0
24 Mar 2024
PrefixRL: Optimization of Parallel Prefix Circuits using Deep
  Reinforcement Learning
PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning
Rajarshi Roy
Jonathan Raiman
Neel Kant
Ilyas Elkin
Robert M. Kirby
Michael Siu
S. Oberman
Saad Godil
Bryan Catanzaro
35
38
0
14 May 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
132
76
0
01 Feb 2021
Fine-Tuning Language Models from Human Preferences
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
275
1,561
0
18 Sep 2019
1