Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.09332
Cited By
v1
v2
v3 (latest)
WebGPT: Browser-assisted question-answering with human feedback
17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"WebGPT: Browser-assisted question-answering with human feedback"
50 / 1,123 papers shown
Authenticated Delegation and Authorized AI Agents
Tobin South
Samuele Marro
Thomas Hardjono
Robert Mahari
Cedric Deslandes Whitney
Dazza Greenwood
Alan Chan
Alex Pentland
419
27
0
17 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
IEEE Reviews in Biomedical Engineering (RBME), 2024
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
778
72
0
17 Jan 2025
WebWalker: Benchmarking LLMs in Web Traversal
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jialong Wu
Wenbiao Yin
Yong Jiang
Zhenglin Wang
Zekun Xi
...
Linhai Zhang
Yulan He
Deyu Zhou
Pengjun Xie
Fei Huang
619
88
0
13 Jan 2025
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Hiroki Furuta
Yutaka Matsuo
Aleksandra Faust
Izzeddin Gur
CLL
608
19
0
03 Jan 2025
Enhancing Preference-based Linear Bandits via Human Response Time
Neural Information Processing Systems (NeurIPS), 2024
Shen Li
Yuyang Zhang
Tongzheng Ren
Claire Liang
Na Li
J. Shah
490
1
0
03 Jan 2025
PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations
Ruosen Li
Teerth Patel
Xinya Du
LLMAG
ALM
553
126
0
03 Jan 2025
AutoPresent: Designing Structured Visuals from Scratch
Computer Vision and Pattern Recognition (CVPR), 2025
Jiaxin Ge
Zora Z. Wang
Xuhui Zhou
Yi-Hao Peng
Sanjay Subramanian
...
Maarten Sap
Alane Suhr
Daniel Fried
Graham Neubig
Trevor Darrell
278
8
0
01 Jan 2025
Zero-Indexing Internet Search Augmented Generation for Large Language Models
Guangxin He
Zonghong Dai
Jiangcheng Zhu
Binqiang Zhao
Qicheng Hu
Chenyue Li
You Peng
Chen Wang
Binhang Yuan
369
1
0
31 Dec 2024
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Weiwei Sun
Lingyong Yan
Xinyu Ma
Shuaiqiang Wang
Sudipta Singha Roy
Zhumin Chen
D. Yin
Zhaochun Ren
RALM
ALM
ELM
LRM
LM&MA
688
427
0
31 Dec 2024
Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning
Alex Beutel
Kai Y. Xiao
Johannes Heidecke
Lilian Weng
AAML
183
17
0
24 Dec 2024
Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models
Cameron R. Jones
Benjamin Bergen
462
14
0
22 Dec 2024
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Ziang Ye
Zizhuo Zhang
Yang Zhang
Jianxin Ma
Junyang Lin
Fuli Feng
LRM
269
3
0
19 Dec 2024
Relational Programming with Foundation Models
AAAI Conference on Artificial Intelligence (AAAI), 2024
Ziyang Li
Jiani Huang
Jason Liu
Felix Zhu
Eric Zhao
William Dodds
Neelay Velingker
Rajeev Alur
Mayur Naik
313
10
0
19 Dec 2024
LDC: Learning to Generate Research Idea with Dynamic Control
Ruochen Li
Liqiang Jing
Chi Han
Jiawei Zhou
Xinya Du
LRM
304
13
0
19 Dec 2024
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Zhuoran Jin
Hongbang Yuan
Tianyi Men
Pengfei Cao
Yubo Chen
Kang Liu
Jun Zhao
ALM
441
22
0
18 Dec 2024
Context-DPO: Aligning Language Models for Context-Faithfulness
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Baolong Bi
Shaohan Huang
Longji Xu
Tianchi Yang
Zihan Zhang
...
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
Shenghua Liu
312
33
0
18 Dec 2024
CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers
Dimitrios Mallis
Ahmet Serdar Karadeniz
Sebastian Cavada
Danila Rukhovich
Niki Maria Foteinopoulou
K. Cherenkova
Anis Kacem
Djamila Aouada
605
16
0
18 Dec 2024
EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Cheng Qian
Peixuan Han
Qinyu Luo
Bingxiang He
Xiusi Chen
...
Jiarui Yao
Xiaocheng Yang
Denghui Zhang
Yunzhu Li
Heng Ji
LLMAG
LRM
520
3
0
18 Dec 2024
RareAgents: Autonomous Multi-disciplinary Team for Rare Disease Diagnosis and Treatment
Xuanzhong Chen
Ye Jin
Xiaohao Mao
Lun Wang
Shuyang Zhang
Ting Chen
332
7
0
17 Dec 2024
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
Junjie Lin
Jian Zhao
Lin Liu
Yue Deng
Youpeng Zhao
Lanxiao Huang
Xia Lin
Wengang Zhou
Haoyang Li
267
2
0
16 Dec 2024
Attention with Dependency Parsing Augmentation for Fine-Grained Attribution
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Qiang Ding
Lvzhou Luo
Yixuan Cao
Ping Luo
294
4
0
16 Dec 2024
Beyond the Binary: Capturing Diverse Preferences With Reward Regularization
Vishakh Padmakumar
Chuanyang Jin
Hannah Rose Kirk
He He
254
8
0
05 Dec 2024
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems
Rafael Teixeira de Lima
Shubham Gupta
Cesar Berrospi
Lokesh Mishra
Michele Dolfi
Peter W. J. Staar
Panagiotis Vagenas
258
10
0
29 Nov 2024
Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation
International Conference on Advanced Information Systems Engineering (CAiSE), 2024
Robin D. Pesl
Jerin G. Mathew
Massimo Mecella
Marco Aiello
272
6
0
29 Nov 2024
Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Rong-Cheng Tu
Zi-Ao Ma
Tian Lan
Yuehao Zhao
Heyan Huang
Xian-Ling Mao
MLLM
VLM
EGVM
362
11
0
23 Nov 2024
SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world Human-Machine Interactions
Yaqi Wang
Haipei Xu
LLMAG
235
1
0
21 Nov 2024
Value Imprint: A Technique for Auditing the Human Values Embedded in RLHF Datasets
Neural Information Processing Systems (NeurIPS), 2024
Ike Obi
Rohan Pant
Srishti Shekhar Agrawal
Maham Ghazanfar
Aaron Basiletti
233
9
0
18 Nov 2024
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
Xinyan Guan
Yanjiang Liu
Xinyu Lu
Boxi Cao
Xianpei Han
...
Le Sun
Jie Lou
Bowen Yu
Yaojie Lu
Hongyu Lin
ALM
590
9
0
18 Nov 2024
Drowning in Documents: Consequences of Scaling Reranker Inference
Mathew Jacob
Erik Lindgren
Matei A. Zaharia
Michael Carbin
Omar Khattab
Andrew Drozdov
OffRL
565
7
0
18 Nov 2024
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use
Siyuan Hu
Mingyu Ouyang
Difei Gao
Mike Zheng Shou
LM&Ro
LLMAG
215
45
0
15 Nov 2024
Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yuang Cai
Yuyu Yuan
Jinsheng Shi
Qinhong Lin
246
4
0
14 Nov 2024
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant
Neural Information Processing Systems (NeurIPS), 2024
Yujia Zhou
Zheng Liu
Zhicheng Dou
AIFin
LRM
RALM
162
5
0
11 Nov 2024
Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks
Adam Fourney
Gagan Bansal
Hussein Mozannar
Cheng Tan
Eduardo Salinas
...
Victor C. Dibia
Ahmed Hassan Awadallah
Ece Kamar
Rafah Hosn
Saleema Amershi
AI4CE
LRM
LLMAG
358
132
0
07 Nov 2024
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
Chenlu Ye
Quanquan Gu
Tong Zhang
OffRL
558
13
0
07 Nov 2024
Long Context RAG Performance of Large Language Models
Quinn Leng
Jacob P. Portes
Sam Havens
Matei A. Zaharia
Michael Carbin
AIFin
RALM
3DV
274
25
0
05 Nov 2024
Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Control
Neural Information Processing Systems (NeurIPS), 2024
Yuxin Xiao
Chaoqun Wan
Yonggang Zhang
Wenxiao Wang
Binbin Lin
Xiaofei He
Xu Shen
Jieping Ye
187
3
0
04 Nov 2024
Foundations and Recent Trends in Multimodal Mobile Agents: A Survey
Biao Wu
Yanda Li
Meng Fang
Zirui Song
Zhiwei Zhang
Yunchao Wei
LM&Ro
LLMAG
OffRL
AI4TS
426
19
0
04 Nov 2024
Sample-Efficient Alignment for LLMs
Zichen Liu
Changyu Chen
Chao Du
Wee Sun Lee
Min Lin
286
10
0
03 Nov 2024
Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models
Aliyah R. Hsu
James Zhu
Zhichao Wang
Bin Bi
Shubham Mehrotra
...
Sougata Chaudhuri
Regunathan Radhakrishnan
S. Asur
Claire Na Cheng
Bin Yu
ALM
LRM
695
1
0
03 Nov 2024
CORAG: A Cost-Constrained Retrieval Optimization System for Retrieval-Augmented Generation
Liang Luo
Haitao Yuan
Wei Dong
Gao Cong
Feifei Li
3DV
205
7
0
01 Nov 2024
Attention Tracker: Detecting Prompt Injection Attacks in LLMs
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Kuo-Han Hung
Ching-Yun Ko
Ambrish Rawat
I-Hsin Chung
Winston H. Hsu
Pin-Yu Chen
411
54
0
01 Nov 2024
GPT for Games: An Updated Scoping Review (2020-2024)
IEEE Transactions on Games (IEEE Trans. Games), 2024
Daijin Yang
Erica Kleinman
Casper Harteveld
LLMAG
AI4TS
AI4CE
577
15
0
01 Nov 2024
Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis
BigData Congress [Services Society] (BSS), 2024
Yu Pan
Jianxin Sun
Hongfeng Yu
Joe Luck
Geng Bai
Nipuna Chamara
Yufeng Ge
Tala Awada
254
4
0
31 Oct 2024
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Yifan Xu
Xiao Liu
Xingwu Sun
Siyi Cheng
Hao Yu
Hanyu Lai
Shudan Zhang
Dan Zhang
Jie Tang
Yuxiao Dong
LLMAG
296
50
0
31 Oct 2024
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models
Tanmay Parekh
Pradyot Prakash
Alexander Radovic
Akshay Shekher
Denis Savenkov
LRM
844
3
0
30 Oct 2024
A Perspective for Adapting Generalist AI to Specialized Medical AI Applications and Their Challenges
Zhenting Wang
Hanyin Wang
Benjamin Danek
Ying Li
Christina Mack
Hoifung Poon
Y. Wang
Pranav Rajpurkar
Jimeng Sun
LM&MA
308
11
0
28 Oct 2024
AutoGLM: Autonomous Foundation Agents for GUIs
Xiao Liu
Bo Qin
Dongzhu Liang
Guang Dong
Hanyu Lai
...
Yujia Wang
Yongjun Xu
Zehan Qi
Yuxiao Dong
Jie Tang
LLMAG
303
51
0
28 Oct 2024
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Zhixin Zhang
Yiyuan Zhang
Xiaohan Ding
Xiangyu Yue
229
12
0
28 Oct 2024
Fast Best-of-N Decoding via Speculative Rejection
Neural Information Processing Systems (NeurIPS), 2024
Hanshi Sun
Momin Haider
Ruiqi Zhang
Huitao Yang
Jiahao Qiu
Ming Yin
Mengdi Wang
Peter L. Bartlett
Andrea Zanette
BDL
378
101
0
26 Oct 2024
FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning
International Conference on AI in Finance (ICAF), 2024
Nicole Cho
Nishan Srishankar
Lucas Cecchi
William Watson
AIFin
221
5
0
25 Oct 2024
Previous
1
2
3
...
6
7
8
...
21
22
23
Next
Page 7 of 23
Page
of 23
Go