Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2508.12800
Cited By
v1
v2
v3 (latest)
Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward
18 August 2025
Yong Deng
Guoqing Wang
ZhenZhe Ying
Xiaofeng Wu
Jinzhen Lin
Wenwen Xiong
Yuqin Dai
Shuo Yang
Zhanwei Zhang
Qiwen Wang
Yang Qin
Yuan Wang
Quanxing Zha
Sunhao Dai
Changhua Meng
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (4 upvotes)
Github (88★)
Papers citing
"Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward"
13 / 13 papers shown
PRInTS: Reward Modeling for Long-Horizon Information Seeking
Jaewoo Lee
Archiki Prasad
Justin Chih-Yao Chen
Zaid Khan
Elias Stengel-Eskin
Mohit Bansal
OffRL
LRM
217
0
0
24 Nov 2025
InfoFlow: Reinforcing Search Agent Via Reward Density Optimization
Kun Luo
Hongjin Qian
Zheng Liu
Ziyi Xia
Shitao Xiao
Siqi Bao
Jun Zhao
Kang Liu
124
0
0
30 Oct 2025
Repurposing Synthetic Data for Fine-grained Search Agent Supervision
Yida Zhao
Kuan Li
Xixi Wu
Liwen Zhang
Dingchu Zhang
...
Xinyu Wang
Kewei Tu
Pengjun Xie
Jingren Zhou
Yong Jiang
137
3
0
28 Oct 2025
MR-Align: Meta-Reasoning Informed Factuality Alignment for Large Reasoning Models
Xinming Wang
Jian Xu
Bin Yu
Sheng Lian
Hongzhu Yi
...
Boran Wang
Hongming Yang
Han Hu
Xu-Yao Zhang
Cheng-Lin Liu
HILM
LRM
285
0
0
27 Oct 2025
Search Self-play: Pushing the Frontier of Agent Capability without Supervision
Hongliang Lu
Yuhang Wen
Pengyu Cheng
Ruijin Ding
Haotian Xu
Jiaqi Guo
Chutian Wang
Haonan Chen
Xiaoxi Jiang
Guanjun Jiang
LRM
135
4
0
21 Oct 2025
A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications
Minhua Lin
Zongyu Wu
Zhichao Xu
Hui Liu
Xianfeng Tang
Qi He
Charu C. Aggarwal
Hui Liu
Xiang Zhang
Suhang Wang
AI4TS
LRM
572
2
0
19 Oct 2025
Code-driven Number Sequence Calculation: Enhancing the inductive Reasoning Abilities of Large Language Models
Kedi Chen
Zhikai Lei
Xu Guo
Xuecheng Wu
Siyuan Zeng
...
J. Zhou
Liang He
Qipeng Guo
Kai Chen
Wei-na Zhang
AIMat
AI4TS
LRM
334
0
0
16 Oct 2025
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents
Guoqing Wang
Sunhao Dai
Guangze Ye
Zeyu Gan
Wei Yao
Yong Deng
Xiaofeng Wu
ZhenZhe Ying
OffRL
188
3
0
16 Oct 2025
Deep Research with Open-Domain Evaluation and Multi-Stage Guardrails for Safety
Wei-Chieh Huang
Henry Peng Zou
Y. Wu
Dongyuan Li
Yankai Chen
...
Liancheng Fang
Langzhou He
Renhe Jiang
Philip S. Yu
Philip S. Yu
171
2
0
13 Oct 2025
Agentic-KGR: Co-evolutionary Knowledge Graph Construction through Multi-Agent Reinforcement Learning
Jing Li
Zhijie Sun
Z. Zhou
Suming Qiu
J. Huang
Haijia Sun
Linyuan Qiu
134
0
0
10 Oct 2025
Gradient Coupling: The Hidden Barrier to Generalization in Agentic Reinforcement Learning
Jingyu Liu
xiaopeng Wu
Jingquan Peng
Kehan Chen
Chuan Yu
Lizhong Ding
Yong Liu
181
0
0
28 Sep 2025
EviNote-RAG: Enhancing RAG Models via Answer-Supportive Evidence Notes
Yuqin Dai
Guoqing Wang
Yuan Wang
Kairan Dou
Kaichen Zhou
...
Can Yi
Changhua Meng
Yuchen Zhou
Yongliang Shen
Shuai Lu
RALM
253
4
0
31 Aug 2025
Large Language Models for Information Retrieval: A Survey
Yutao Zhu
Huaying Yuan
Shuting Wang
Jiongnan Liu
Wenhan Liu
Chenlong Deng
Haonan Chen
Zheng Liu
Zhicheng Dou
Ji-Rong Wen
KELM
637
465
0
14 Aug 2023
1
Page 1 of 1