2406.03102
Cited By
DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays
5 June 2024
Bo Xia
Yilun Kong
Yongzhe Chang
Bo Yuan
Zhiheng Li
Xueqian Wang
Bin Liang
OffRL
Papers citing
"DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays"
5 / 5 papers shown
Title
Probing the Safety Response Boundary of Large Language Models via Unsafe Decoding Path Generation
Haoyu Wang
Bingzhe Wu
Yatao Bian
Yongzhe Chang
Xueqian Wang
Peilin Zhao
55
2
0
20 Aug 2024
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning
Yilun Kong
Hangyu Mao
Qi Zhao
Bin Zhang
Jingqing Ruan
Li Shen
Yongzhe Chang
Xueqian Wang
Rui Zhao
Dacheng Tao
OffRL
29
1
0
20 Aug 2024
Trajectory Planning for Teleoperated Space Manipulators Using Deep Reinforcement Learning
Bo Xia
Xianru Tian
Bo Yuan
Zhiheng Li
Bin Liang
Xueqian Wang
27
0
0
10 Aug 2024
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,730
0
04 Mar 2022
Reinforcement Learning with Random Delays
Simon Ramstedt
Yann Bouteiller
Giovanni Beltrame
C. Pal
Jonathan Binas
107
59
0
06 Oct 2020