arXiv:2505.22338
Text2Grad: Reinforcement Learning from Natural Language Feedback
28 May 2025
Hanyang Wang
Lu Wang
Chaoyun Zhang
Tianjun Mao
Si Qin
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
Papers citing "Text2Grad: Reinforcement Learning from Natural Language Feedback"
5 / 5 papers shown
UFO2: The Desktop AgentOS
Chaoyun Zhang
He Huang
Chiming Ni
J. Mu
Si Qin
...
Minghua Ma
Jian-Guang Lou
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
LLMAG
230
5
0
20 Apr 2025
Reinforcement Learning from Human Feedback
Nathan Lambert
OffRL
AI4CE
124
23
0
16 Apr 2025
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding
Zhangchen Xu
Yang Liu
Yueqin Yin
Mingyuan Zhou
Radha Poovendran
ALM
OffRL
130
18
0
04 Mar 2025
HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation
Zhaojian Yu
Yilun Zhao
Arman Cohan
Xiao-Ping Zhang
LRM
117
10
0
03 Jan 2025
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators
Yann Dubois
Balázs Galambosi
Percy Liang
Tatsunori Hashimoto
ALM
179
403
0
06 Apr 2024