Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.09332
Cited By
v1
v2
v3 (latest)
WebGPT: Browser-assisted question-answering with human feedback
17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"WebGPT: Browser-assisted question-answering with human feedback"
50 / 1,123 papers shown
Inference-Time Reward Hacking in Large Language Models
Hadi Khalaf
C. M. Verdun
Alex Oesterling
Himabindu Lakkaraju
Flavio du Pin Calmon
237
2
0
24 Jun 2025
Deep Research Agents: A Systematic Examination And Roadmap
Y. Huang
Yihao Chen
Haozheng Zhang
Kang Li
Huichi Zhou
...
Lifeng Shang
Songcen Xu
Jianye Hao
Youssef Attia El Hili
Jun Wang
LLMAG
286
48
0
22 Jun 2025
Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models
Yukun Huang
Sanxing Chen
Jian Pei
Manzil Zaheer
Bhuwan Dhingra
KELM
HILM
RALM
LRM
407
0
0
21 Jun 2025
Relic: Enhancing Reward Model Generalization for Low-Resource Indic Languages with Few-Shot Examples
Soumya Suvra Ghosal
Vaibhav Singh
Akash Ghosh
Soumyabrata Pal
Subhadip Baidya
Sriparna Saha
Dinesh Manocha
215
2
0
19 Jun 2025
Reranking-based Generation for Unbiased Perspective Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Narutatsu Ri
Nicholas Deas
Kathleen McKeown
OffRL
171
0
0
19 Jun 2025
Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective
Léo Gagnon
Eric Elmoznino
Sarthak Mittal
Tom Marty
Tejas Kasetty
Dhanya Sridhar
Guillaume Lajoie
230
0
0
19 Jun 2025
MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents
Zijian Zhou
Ao Qu
Zhaoxuan Wu
Sunghwan Kim
Alok Prakash
Daniela Rus
Jinhua Zhao
Bryan Kian Hsiang Low
Paul Liang
LLMAG
OffRL
LRM
394
50
0
18 Jun 2025
Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs
Jing Yang Lee
Kong-Aik Lee
Woon-Seng Gan
243
1
0
18 Jun 2025
OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents
Thomas Kuntz
Agatha Duzan
Hao Zhao
Francesco Croce
Zico Kolter
Nicolas Flammarion
Maksym Andriushchenko
LLMAG
ELM
296
17
0
17 Jun 2025
Min-p, Max Exaggeration: A Critical Analysis of Min-p Sampling in Language Models
Rylan Schaeffer
Joshua Kazdan
Yegor Denisov-Blanch
315
0
0
16 Jun 2025
Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives
Wei Zeng
Hengshu Zhu
Chuan Qin
Han Wu
Yihang Cheng
...
Xiaowei Jin
Yinuo Shen
Zhenxing Wang
Feimin Zhong
Hui Xiong
AI4TS
438
0
0
11 Jun 2025
Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization
Zixuan Huang
Yikun Ban
Lean Fu
Xiaojie Li
Zhongxiang Dai
Jianxin Li
Deqing Wang
347
2
0
08 Jun 2025
Human-assisted Robotic Policy Refinement via Action Preference Optimization
Wenke Xia
Yichu Yang
Hongtao Wu
Xiao Ma
Tao Kong
Di Hu
369
2
0
08 Jun 2025
C-SEO Bench: Does Conversational SEO Work?
Haritz Puerto
Martin Gubri
Tommaso Green
Seong Joon Oh
Sangdoo Yun
ELM
789
2
0
06 Jun 2025
Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
Tennison Liu
M. Schaar
AIFin
LRM
390
2
0
05 Jun 2025
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration
Quan Shi
Carlos E. Jimenez
Shunyu Yao
Nick Haber
Diyi Yang
Karthik Narasimhan
337
1
0
05 Jun 2025
Micro-Act: Mitigating Knowledge Conflict in LLM-based RAG via Actionable Self-Reasoning
Nan Huo
Jinyang Li
Bowen Qin
Ge Qu
Xiaolong Li
Xiaodong Li
Chenhao Ma
Reynold Cheng
RALM
346
1
0
05 Jun 2025
Kinetics: Rethinking Test-Time Scaling Laws
Ranajoy Sadhukhan
Zhuoming Chen
Haizhong Zheng
Yang Zhou
Emma Strubell
Beidi Chen
457
6
0
05 Jun 2025
TracLLM: A Generic Framework for Attributing Long Context LLMs
Yanting Wang
Wei Zou
Runpeng Geng
Jinyuan Jia
LLMAG
512
4
0
04 Jun 2025
AI Agents for Conversational Patient Triage: Preliminary Simulation-Based Evaluation with Real-World EHR Data
Sina Rashidian
Nan Li
Jonathan Amar
Jong Ha Lee
Sam Pugh
Eric Yang
Geoff Masterson
Myoung Cha
Yugang Jia
Akhil Vaid
203
1
0
04 Jun 2025
Does Thinking More always Help? Mirage of Test-Time Scaling in Reasoning Models
Soumya Suvra Ghosal
Souradip Chakraborty
Avinash Reddy
Yifu Lu
Mengdi Wang
Dinesh Manocha
Furong Huang
Mohammad Ghavamzadeh
Amrit Singh Bedi
ReLM
LRM
387
17
0
04 Jun 2025
Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework
Zhaorui Yang
Bo Pan
Han Wang
Yiyao Wang
Xingyu Liu
...
Bo Zhang
Wei Chen
Minfeng Zhu
Bo Zhang
Wei Chen
337
6
0
03 Jun 2025
DeepShop: A Benchmark for Deep Research Shopping Agents
Yougang Lyu
Xiaoyu Zhang
Lingyong Yan
Maarten de Rijke
Zhaochun Ren
Xiuying Chen
337
12
0
03 Jun 2025
Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs
Nguyen-Khang Le
Quan Minh Bui
Minh Nguyen
Hiep Nguyen
Trung Vo
Son T. Luu
Shoshin Nomura
Minh Le Nguyen
183
4
0
03 Jun 2025
Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights
M. Andreux
Breno Baldas Skuk
Hamza Benchekroun
Emilien Biré
Antoine Bonnet
...
Marc Thibault
L. Thiry
Léo Tronchon
Nicolas Usunier
Tony Wu
LLMAG
196
0
0
03 Jun 2025
Psi-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models
Taehoon Yoon
Yunhong Min
Kyeongmin Yeo
Minhyuk Sung
364
0
0
02 Jun 2025
Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences
Hyojin Bahng
Caroline Chan
F. Durand
Phillip Isola
EGVM
412
7
0
02 Jun 2025
HADA: Human-AI Agent Decision Alignment Architecture
Tapio Pitkäranta
Leena Pitkäranta
193
1
0
01 Jun 2025
MCP-Zero: Active Tool Discovery for Autonomous LLM Agents
Xiang Fei
Xiawu Zheng
Hao Feng
LLMAG
505
0
0
01 Jun 2025
Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents
Yaxin Luo
Zhaoyi Li
Jiacheng Liu
Jiacheng Cui
Xiaohan Zhao
Zhiqiang Shen
LLMAG
LRM
VLM
276
7
0
30 May 2025
When Large Multimodal Models Confront Evolving Knowledge:Challenges and Pathways
Kailin Jiang
Yuntao Du
Yukai Ding
Yuchen Ren
Ning Jiang
Zhi Gao
Zilong Zheng
Lei Liu
Bin Li
Qing Li
KELM
222
2
0
30 May 2025
Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data
Seohyeong Lee
Eunwon Kim
Hwaran Lee
Buru Chang
315
1
0
29 May 2025
Proximalized Preference Optimization for Diverse Feedback Types: A Decomposed Perspective on DPO
Kaiyang Guo
Yinchuan Li
Zhitang Chen
352
2
0
29 May 2025
AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models
Jinchuan Zhang
Lu Yin
Yan Zhou
Songlin Hu
LLMAG
LM&Ro
214
3
0
29 May 2025
Text2Grad: Reinforcement Learning from Natural Language Feedback
Hanyang Wang
Lu Wang
Chaoyun Zhang
Tianjun Mao
Si Qin
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
225
0
0
28 May 2025
RISE: Reasoning Enhancement via Iterative Self-Exploration in Multi-hop Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Bolei He
Xinran He
Mengke Chen
Xianwei Xue
Ying Zhu
Zhenhua Ling
ReLM
LRM
250
1
0
28 May 2025
Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation
Yuhao Wang
Ruiyang Ren
Yucheng Wang
Wayne Xin Zhao
Jing Liu
Hua Wu
Haifeng Wang
RALM
OffRL
230
2
0
27 May 2025
CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Xiaqiang Tang
Jian Li
Keyu Hu
Du Nan
Xiaolong Li
Xi Zhang
Weigao Sun
Sihong Xie
HILM
440
2
0
27 May 2025
Select, Read, and Write: A Multi-Agent Framework of Full-Text-based Related Work Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Xiaochuan Liu
Ruihua Song
Xiting Wang
Xu Chen
264
1
0
26 May 2025
SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety
Geon-hyeong Kim
Youngsoo Jang
Yu Jin Kim
Byoungjip Kim
Honglak Lee
Kyunghoon Bae
Moontae Lee
257
17
0
26 May 2025
Token-level Accept or Reject: A Micro Alignment Approach for Large Language Models
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Y. Zhang
Yu Yu
Bo Tang
Yu Zhu
Chuxiong Sun
...
Jie Hu
Zipeng Xie
Zhiyu Li
Feiyu Xiong
Edward Chung
483
0
0
26 May 2025
Understanding the Performance Gap in Preference Learning: A Dichotomy of RLHF and DPO
Ruizhe Shi
Minhak Song
Runlong Zhou
Zihan Zhang
Maryam Fazel
S. S. Du
309
6
0
26 May 2025
ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment
Xiaoqiang Lin
Arun Verma
Zhongxiang Dai
Daniela Rus
See-Kiong Ng
Bryan Kian Hsiang Low
717
3
0
25 May 2025
ChartLens: Fine-grained Visual Attribution in Charts
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Manan Suri
Puneet Mathur
Nedim Lipka
Franck Dernoncourt
Ryan Rossi
Dinesh Manocha
212
1
0
25 May 2025
Retrieval-Augmented Generation for Service Discovery: Chunking Strategies and Benchmarking
Robin D. Pesl
Jerin G. Mathew
Massimo Mecella
Marco Aiello
229
3
0
25 May 2025
Dynamic Risk Assessments for Offensive Cybersecurity Agents
Boyi Wei
Benedikt Stroebl
Jiacen Xu
Joie Zhang
Zhou Li
Peter Henderson
575
4
0
23 May 2025
VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models
Yuchen Yan
Jin Jiang
Zhenbang Ren
Yijun Li
Xudong Cai
...
Mengdi Zhang
Jian Shao
Yongliang Shen
Jun Xiao
Yueting Zhuang
OffRL
ALM
LRM
409
8
0
21 May 2025
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
Tianbao Xie
Jiaqi Deng
Xiaochuan Li
Junlin Yang
Haoyuan Wu
...
Yiheng Xu
Junli Wang
Doyen Sahoo
Tao Yu
Caiming Xiong
405
52
0
19 May 2025
Web Intellectual Property at Risk: Preventing Unauthorized Real-Time Retrieval by Large Language Models
Yisheng Zhong
Yizhu Wen
Junfeng Guo
Mehran Kafai
Heng Huang
Hanqing Guo
Zhuangdi Zhu
279
0
0
19 May 2025
Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Hongru Wang
Wenyu Huang
Yufei Wang
Yuanhao Xi
Jianqiao Lu
Huan Zhang
Nan Hu
Zeming Liu
Jeff Z. Pan
Kam-Fai Wong
LLMAG
315
7
0
19 May 2025
Previous
1
2
3
4
5
...
21
22
23
Next