Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.09332
Cited By
v1
v2
v3 (latest)
WebGPT: Browser-assisted question-answering with human feedback
17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"WebGPT: Browser-assisted question-answering with human feedback"
50 / 1,123 papers shown
Icon
2
^{2}
2
: Aligning Large Language Models Using Self-Synthetic Preference Data via Inherent Regulation
Qiyuan Chen
Hongsen Huang
Qian Shao
Jiahe Chen
Jintai Chen
H. Xu
Renjie Hua
Ren Chuan
Jian Wu
110
0
0
06 Sep 2025
Towards a Unified View of Large Language Model Post-Training
Xingtai Lv
Yuxin Zuo
Youbang Sun
Hongyi Liu
Yuntian Wei
...
Lixuan He
Xuekai Zhu
Kaiyan Zhang
Bingning Wang
Ning Ding
OffRL
108
11
0
04 Sep 2025
Why Language Models Hallucinate
Adam Tauman Kalai
Ofir Nachum
Santosh Vempala
Edwin Zhang
HILM
LRM
388
78
0
04 Sep 2025
Explainable Knowledge Graph Retrieval-Augmented Generation (KG-RAG) with KG-SMILE
Zahra Zehtabi Sabeti Moghaddam
Zeinab Dehghani
Maneeha Rani
Mohammed Naveed Akram
B. Mishra
R. R. Kureshi
D. Thakker
167
0
0
03 Sep 2025
DeepTRACE: Auditing Deep Research AI Systems for Tracking Reliability Across Citations and Evidence
Pranav Narayanan Venkit
Philippe Laban
Yilun Zhou
Kung-Hsiang Huang
Yixin Mao
Chien-Sheng Wu
HILM
HAI
140
1
0
02 Sep 2025
EviNote-RAG: Enhancing RAG Models via Answer-Supportive Evidence Notes
Yuqin Dai
Guoqing Wang
Yuan Wang
Kairan Dou
Kaichen Zhou
...
Can Yi
Changhua Meng
Yuchen Zhou
Yongliang Shen
Shuai Lu
RALM
243
4
0
31 Aug 2025
Can Compact Language Models Search Like Agents? Distillation-Guided Policy Optimization for Preserving Agentic RAG Capabilities
Rikuto Kotoge
Mai Nishimura
Jiaxin Ma
LM&Ro
LRM
232
0
0
27 Aug 2025
ReSURE: Regularizing Supervision Unreliability for Multi-turn Dialogue Fine-tuning
Yiming Du
Yifan Xiang
Bin Liang
Dahua Lin
Kam-Fai Wong
Fei Tan
OffRL
179
1
0
27 Aug 2025
Better Language Model-Based Judging Reward Modeling through Scaling Comprehension Boundaries
Meiling Ning
Zhongbao Zhang
Junda Ye
Jiabao Guo
Qingyuan Guan
LRM
132
0
0
25 Aug 2025
CoCoA: Confidence and Context-Aware Adaptive Decoding for Resolving Knowledge Conflicts in Large Language Models
Anant Khandelwal
Manish Gupta
Puneet Agrawal
193
1
0
25 Aug 2025
WebSight: A Vision-First Architecture for Robust Web Agents
Tanvir Bhathal
Asanshay Gupta
LRM
119
2
0
23 Aug 2025
Decoding Alignment: A Critical Survey of LLM Development Initiatives through Value-setting and Data-centric Lens
Ilias Chalkidis
OffRL
ALM
156
1
0
23 Aug 2025
Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
Huichi Zhou
Yihang Chen
Siyuan Guo
Xue Yan
Kin-Hei Lee
...
Ka Yiu Lee
Guchun Zhang
Youssef Attia El Hili
Linyi Yang
Jun Wang
LLMAG
424
13
0
22 Aug 2025
Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering
Bolei He
Xinran He
Run Shao
Shanfu Shu
Xianwei Xue
Mingquan Cheng
Haifeng Li
Zhenhua Ling
RALM
LRM
258
1
0
21 Aug 2025
Comp-X: On Defining an Interactive Learned Image Compression Paradigm With Expert-driven LLM Agent
Yixin Gao
Xin Li
Xiaohan Pan
Runsen Feng
Bingchen Li
Y. Qi
Y. Lu
Zhengxue Cheng
Zhibo Chen
Jörn Ostermann
134
0
0
21 Aug 2025
From Bits to Boardrooms: A Cutting-Edge Multi-Agent LLM Framework for Business Excellence
Zihao Wang
Junming Zhang
LLMAG
224
0
0
21 Aug 2025
Foundational Design Principles and Patterns for Building Robust and Adaptive GenAI-Native Systems
Frederik Vandeputte
AI4TS
154
2
0
21 Aug 2025
Multimodal Data Storage and Retrieval for Embodied AI: A Survey
Yihao Lu
Hao Tang
144
2
0
19 Aug 2025
Deep Research: A Survey of Autonomous Research Agents
Wenlin Zhang
Xiaopeng Li
Yingyi Zhang
Pengyue Jia
Yichao Wang
Huifeng Guo
Yong Liu
Xiangyu Zhao
LLMAG
112
11
0
18 Aug 2025
A Functionality-Grounded Benchmark for Evaluating Web Agents in E-commerce Domains
Xianren Zhang
Shreyas Prasad
Di Wang
Qiuhai Zeng
Suhang Wang
Wenbo Yan
Mat Hans
119
3
0
18 Aug 2025
Fast, Slow, and Tool-augmented Thinking for LLMs: A Review
Xinda Jia
Jinpeng Li
Zezhong Wang
Jingjing Li
Xingshan Zeng
Yasheng Wang
Weinan Zhang
Yong Yu
Weiwen Liu
LRM
136
0
0
17 Aug 2025
Thinking Inside the Mask: In-Place Prompting in Diffusion LLMs
Xiangqi Jin
Y. Wang
Yifeng Gao
Zichen Wen
Biqing Qi
Dongrui Liu
Linfeng Zhang
LRM
184
8
0
14 Aug 2025
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
Shilong Li
Xingyuan Bu
Wenjie Wang
Jiaheng Liu
Jun Dong
...
Wenhao Huang
Wangchunshu Zhou
Zhaoxiang Zhang
Ruizhe Ding
Shilei Wen
LLMAG
LRM
313
6
0
14 Aug 2025
Improving and Evaluating Open Deep Research Agents
Doaa Allabadi
Kyle Bradbury
Jordan M. Malof
101
0
0
13 Aug 2025
OpenCUA: Open Foundations for Computer-Use Agents
Xinyuan Wang
Bowen Wang
Dunjie Lu
Junlin Yang
Tianbao Xie
...
Victor Zhong
Flood Sung
Y.Charles
Zhilin Yang
Tao Yu
ELM
VLM
256
27
0
12 Aug 2025
HGMF: A Hierarchical Gaussian Mixture Framework for Scalable Tool Invocation within the Model Context Protocol
Wenpeng Xing
Zhipeng Chen
C. D. Lin
Meng Han
82
0
0
11 Aug 2025
Towards Theoretical Understanding of Transformer Test-Time Computing: Investigation on In-Context Linear Regression
Xingwu Chen
Miao Lu
Beining Wu
Difan Zou
149
0
0
11 Aug 2025
Remote Sensing Image Intelligent Interpretation with the Language-Centered Perspective: Principles, Methods and Challenges
Haifeng Li
Wang Guo
Haiyang Wu
Mengwei Wu
Jipeng Zhang
Qing Zhu
Yu Liu
Xin Huang
Chao Tao
139
1
0
09 Aug 2025
Chain of Questions: Guiding Multimodal Curiosity in Language Models
Nima Iji
Kia Dashtipour
LRM
165
0
0
06 Aug 2025
Large Language Model's Multi-Capability Alignment in Biomedical Domain
Weilei Wang
Linqing Chen
Hanmeng Zhong
Wentao Wu
LM&MA
ELM
136
0
0
06 Aug 2025
ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients"
Zhongyi Zhou
Kohei Uehara
Haoyu Zhang
Jingtao Zhou
Lin Gu
Ruofei Du
Zheng Xu
Tatsuya Harada
AI4Ed
217
1
0
06 Aug 2025
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience
Zeyi Sun
Ziyu Liu
Yuhang Zang
Yuhang Cao
Xiaoyi Dong
Tong Wu
Dahua Lin
Yuan Liu
LLMAG
248
17
0
06 Aug 2025
AttnTrace: Attention-based Context Traceback for Long-Context LLMs
Yanting Wang
Runpeng Geng
Ying Chen
Jinyuan Jia
LLMAG
198
1
1
05 Aug 2025
SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based Agents
Jiaye Lin
Yifu Guo
Yuzhen Han
Sen Hu
Ziyi Ni
...
Chen Hu
Daxin Jiang
Binxing Jiao
Chen-Hao Hu
Huacan Wang
LLMAG
LRM
359
18
0
04 Aug 2025
CUPID: Evaluating Personalized and Contextualized Alignment of LLMs from Interactions
Tae Soo Kim
Yoonjoo Lee
Yoonah Park
Jiho Kim
Young-Ho Kim
Juho Kim
236
1
0
03 Aug 2025
MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning
Hongjin Qian
Zheng Liu
LM&Ro
226
1
0
01 Aug 2025
BAR Conjecture: the Feasibility of Inference Budget-Constrained LLM Services with Authenticity and Reasoning
Jinan Zhou
Rajat Ghosh
Vaishnavi Bhargava
Debojyoti Dutta
Aryan Singhal
186
0
0
31 Jul 2025
Phi-Ground Tech Report: Advancing Perception in GUI Grounding
Miaosen Zhang
Ziqiang Xu
Jialiang Zhu
Qi Dai
Kai Qiu
...
Chong Luo
Tianyi Chen
Justin Wagle
Tim Franklin
Baining Guo
LRM
230
10
0
31 Jul 2025
Improving Generative Ad Text on Facebook using Reinforcement Learning
Daniel Jiang
Alex Nikulkov
Yu-Chia Chen
Yang Bai
Zheqing Zhu
209
2
0
29 Jul 2025
Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes
Rui Jiao
Yue Zhang
Jinku Li
LRM
204
0
0
25 Jul 2025
Understanding Human Limits in Pattern Recognition: A Computational Model of Sequential Reasoning in Rock, Paper, Scissors
Logan Cross
Erik Brockbank
Tobias Gerstenberg
Judith E. Fan
Daniel L. K. Yamins
Nick Haber
123
0
0
25 Jul 2025
A Systematic Review of Key Retrieval-Augmented Generation (RAG) Systems: Progress, Gaps, and Future Directions
Agada Joseph Oche
Ademola Glory Folashade
Tirthankar Ghosal
Arpan Biswas
3DV
VLM
358
17
0
25 Jul 2025
Thinking Isn't an Illusion: Overcoming the Limitations of Reasoning Models via Tool Augmentations
Zhao Song
Song Yue
Jiahao Zhang
LRM
198
4
0
23 Jul 2025
Theoretical Foundations and Mitigation of Hallucination in Large Language Models
Esmail Gumaan
HILM
125
2
0
20 Jul 2025
SAND: Boosting LLM Agents with Self-Taught Action Deliberation
Yu Xia
Yiran Shen
Junda Wu
Tong Yu
Sungchul Kim
Ryan Rossi
Lina Yao
Julian McAuley
LLMAG
LRM
183
1
0
10 Jul 2025
Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation
Anirban Saha Anik
Xiaoying Song
Elliott Wang
Bryan Wang
Bengisu Yarimbas
Lingzi Hong
RALM
231
1
0
09 Jul 2025
Agentic-R1: Distilled Dual-Strategy Reasoning
Weihua Du
Pranjal Aggarwal
Sean Welleck
Yiming Yang
LRM
176
3
0
08 Jul 2025
iPanda: An LLM-based Agent for Automated Conformance Testing of Communication Protocols
Xikai Sun
Fan Dang
Shiqi Jiang
Jingao Xu
Kebin Liu
...
Zihao Yang
Weichen Zhang
Haimo Lu
Yawen Zheng
Yunhao Liu
183
0
0
01 Jul 2025
WebArXiv: Evaluating Multimodal Agents on Time-Invariant arXiv Tasks
Zihao Sun
Ling Chen
LLMAG
158
0
0
01 Jul 2025
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model
Xinping Zhao
Xinshuo Hu
Zifei Shan
Shouzheng Huang
Yao Zhou
...
Meishan Zhang
Haofen Wang
Jun-chen Yu
Baotian Hu
Min Zhang
VLM
413
7
0
26 Jun 2025
Previous
1
2
3
4
5
6
...
21
22
23
Next