A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning


25 May 2025
Yuzheng Hu
Fan Wu
Haotian Ye
David A. Forsyth
James Y. Zou
Nan Jiang
Jiaqi W. Ma
Han Zhao
    OffRL

Papers citing "A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning"

9 / 9 papers shown
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Yiping Wang
Qing Yang
Zhiyuan Zeng
Liliang Ren
Liu Liu
...
Jianfeng Gao
Weizhu Chen
Shuaiqiang Wang
Simon Shaolei Du
Yelong Shen
OffRL, ReLM, LRM
429
91
0
29 Apr 2025
Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning
Yixuan Even Xu
Yash Savani
Fei Fang
Zico Kolter
OffRL
185
20
0
18 Apr 2025
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
Taiwei Shi
Yiyang Wu
Linxin Song
Wanrong Zhu
Jieyu Zhao
LRM
239
27
0
07 Apr 2025
Behaviour Discovery and Attribution for Explainable Reinforcement Learning
Rishav Rishav
Somjit Nath
Vincent Michalski
Samira Ebrahimi Kahou
FAtt, OffRL
226
1
0
19 Mar 2025
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Qiying Yu
Zheng Zhang
Ruofei Zhu
Yufeng Yuan
Xiaochen Zuo
...
Ya Zhang
Lin Yan
Mu Qiao
Yonghui Wu
Mingxuan Wang
OffRL, LRM
368
480
0
18 Mar 2025
A Survey on Explainable Deep Reinforcement Learning
Zelei Cheng
Jiahao Yu
Masashi Sugiyama
OffRL
113
5
0
08 Feb 2025
Reviving The Classics: Active Reward Modeling in Large Language Model Alignment
Yunyi Shen
Hao Sun
Jean-Francois Ton
83
4
0
04 Feb 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM, VLM, OffRL, AI4TS, LRM
573
3,336
0
22 Jan 2025
Most Influential Subset Selection: Challenges, Promises, and Beyond
Yuzheng Hu
Pingbang Hu
Han Zhao
Jiaqi W. Ma
TDI
275
11
0
10 Jan 2025