Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.09332
Cited By
v1
v2
v3 (latest)
WebGPT: Browser-assisted question-answering with human feedback
17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"WebGPT: Browser-assisted question-answering with human feedback"
50 / 1,123 papers shown
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Hongliang He
Wenlin Yao
Kaixin Ma
Wenhao Yu
Han Zhang
Tianqing Fang
Zhenzhong Lan
Dong Yu
LM&Ro
LLMAG
380
28
0
25 Oct 2024
Infogent: An Agent-Based Framework for Web Information Aggregation
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
R. Reddy
Sagnik Mukherjee
Jeonghwan Kim
Zhenhailong Wang
Dilek Z. Hakkani-Tür
Heng Ji
259
15
0
24 Oct 2024
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
Liwen Wang
Sheng Chen
Linnan Jiang
Shu Pan
Runze Cai
Sen Yang
Fei Yang
547
14
0
24 Oct 2024
IPL: Leveraging Multimodal Large Language Models for Intelligent Product Listing
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Kang Chen
Qingheng Zhang
Chengbao Lian
Yixin Ji
Xuwei Liu
Shuguang Han
Guoqiang Wu
Fei Huang
Jufeng Chen
136
6
0
22 Oct 2024
Beyond Retrieval: Generating Narratives in Conversational Recommender Systems
The Web Conference (WWW), 2024
Krishna Sayana
Raghavendra Vasudeva
Yuri Vasilevski
Kun Su
Liam Hebert
H. Pham
Ambarish Jash
Sukhdeep S. Sodhi
3DV
292
5
0
22 Oct 2024
AdvAgent: Controllable Blackbox Red-teaming on Web Agents
Chejian Xu
Mintong Kang
Jiawei Zhang
Zeyi Liao
Lingbo Mo
Mengqi Yuan
Huan Sun
Bo Li
AAML
176
15
0
22 Oct 2024
RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
International Conference on Learning Representations (ICLR), 2024
Yantao Liu
Zijun Yao
Rui Min
Yixin Cao
Lei Hou
Juanzi Li
OffRL
ALM
345
95
0
21 Oct 2024
ComPO: Community Preferences for Language Model Personalization
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Sachin Kumar
Chan Young Park
Yulia Tsvetkov
Noah A. Smith
Hannaneh Hajishirzi
263
13
0
21 Oct 2024
A Survey of Conversational Search
Fengran Mo
Kelong Mao
Ziliang Zhao
Hongjin Qian
Haonan Chen
Yiruo Cheng
Xiaochen Li
Yinlin Zhu
Zhicheng Dou
Jian-Yun Nie
KELM
552
23
0
21 Oct 2024
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Kaige Xie
Philippe Laban
Prafulla Kumar Choubey
Caiming Xiong
Chien-Sheng Wu
171
4
0
20 Oct 2024
Personalized Adaptation via In-Context Preference Learning
Allison Lau
Younwoo Choi
Vahid Balazadeh
Keertana Chidambaram
Vasilis Syrgkanis
Fahad Razak
VLM
OffRL
108
10
0
17 Oct 2024
ControlAgent: Automating Control System Design via Novel Integration of LLM Agents and Domain Expertise
Xingang Guo
Darioush Keivan
U. Syed
Lianhui Qin
Huan Zhang
Geir Dullerud
Peter M. Seiler
Bin Hu
168
18
0
17 Oct 2024
RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Zhuoran Liu
Danpei Zhao
Bo Yuan
322
9
0
17 Oct 2024
Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level Caching
Jie Peng
Zhang Cao
Huaizhi Qu
Zhengyu Zhang
Chang Guo
Yanyong Zhang
Zhichao Cao
Tianlong Chen
316
5
0
17 Oct 2024
Divide-Verify-Refine: Can LLMs Self-Align with Complex Instructions?
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Xianren Zhang
Xianfeng Tang
Hui Liu
Zongyu Wu
Qi He
Dongwon Lee
Suhang Wang
ALM
281
2
0
16 Oct 2024
On the Capacity of Citation Generation by Large Language Models
China Conference on Information Retrieval (CIR), 2024
Haosheng Qian
Yixing Fan
Ruqing Zhang
Jiafeng Guo
HILM
211
2
0
15 Oct 2024
Ada-K Routing: Boosting the Efficiency of MoE-based LLMs
Tongtian Yue
Longteng Guo
Jie Cheng
Xuange Gao
Qingbin Liu
MoE
292
8
0
14 Oct 2024
MisinfoEval: Generative AI in the Era of "Alternative Facts"
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Saadia Gabriel
Liang Lyu
James Siderius
Elisa Kreiss
Jacob Andreas
Asu Ozdaglar
271
9
0
13 Oct 2024
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jiachun Li
Pengfei Cao
Chenhao Wang
Zhuoran Jin
Yubo Chen
Kang Liu
Xiaojian Jiang
Jiexin Xu
Jun Zhao
LRM
KELM
203
1
0
12 Oct 2024
Retrieving Contextual Information for Long-Form Question Answering using Weak Supervision
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Philipp Christmann
Svitlana Vakulenko
Ionut Teodor Sorodoc
Bill Byrne
Adria de Gispert
RALM
204
0
0
11 Oct 2024
Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents
Priyanshu Kumar
Elaine Lau
Saranya Vijayakumar
Tu Trinh
Scale Red Team
...
Sean Hendryx
Shuyan Zhou
Matt Fredrikson
Summer Yue
Zifan Wang
LLMAG
244
49
0
11 Oct 2024
Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both
Abhijnan Nath
Changsoo Jung
Ethan Seefried
Nikhil Krishnaswamy
1.1K
6
0
11 Oct 2024
Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models
Sitao Cheng
Liangming Pan
Xunjian Yin
Xinyi Wang
William Yang Wang
KELM
242
10
0
10 Oct 2024
Agents Thinking Fast and Slow: A Talker-Reasoner Architecture
Konstantina Christakopoulou
Shibl Mourad
Maja Matarić
LLMAG
226
21
0
10 Oct 2024
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
International Conference on Learning Representations (ICLR), 2024
Amrith Rajagopal Setlur
Chirag Nagpal
Adam Fisch
Xinyang Geng
Jacob Eisenstein
Rishabh Agarwal
Alekh Agarwal
Jonathan Berant
Aviral Kumar
OffRL
LRM
396
161
0
10 Oct 2024
Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction
International Conference on Learning Representations (ICLR), 2024
Jarrid Rector-Brooks
Mohsin Hasan
Zhangzhi Peng
Zachary Quinn
Chenghao Liu
...
Michael Bronstein
Yoshua Bengio
Pranam Chatterjee
Alexander Tong
Avishek Joey Bose
DiffM
283
22
0
10 Oct 2024
AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Hongru Wang
Rui Wang
Boyang Xue
Heming Xia
Jingtao Cao
Zeming Liu
Jeff Z. Pan
Kam-Fai Wong
ALM
207
22
0
10 Oct 2024
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
International Conference on Learning Representations (ICLR), 2024
Changle Qu
Sunhao Dai
Xiaochi Wei
Hengyi Cai
Shuaiqiang Wang
D. Yin
Jun Xu
Ji-Rong Wen
355
26
0
10 Oct 2024
Self-Boosting Large Language Models with Synthetic Preference Data
International Conference on Learning Representations (ICLR), 2024
Qingxiu Dong
Li Dong
Xingxing Zhang
Zhifang Sui
Furu Wei
SyDa
248
30
0
09 Oct 2024
ClickAgent: Enhancing UI Location Capabilities of Autonomous Agents
Jakub Hoscilowicz
Bartosz Maj
Bartosz Kozakiewicz
Oleksii Tymoshchuk
Artur Janicki
LLMAG
328
10
0
09 Oct 2024
Bridging Today and the Future of Humanity: AI Safety in 2024 and Beyond
Shanshan Han
607
1
0
09 Oct 2024
Uncovering Factor Level Preferences to Improve Human-Model Alignment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Juhyun Oh
Eunsu Kim
Jiseon Kim
Wenda Xu
Inha Cha
William Yang Wang
Alice Oh
380
1
0
09 Oct 2024
TinyClick: Single-Turn Agent for Empowering GUI Automation
Pawel Pawlowski
Krystian Zawistowski
Wojciech Lapacz
Marcin Skorupa
Adam Wiacek
Sebastien Postansque
Jakub Hoscilowicz
LRM
LLMAG
MLLM
412
9
0
09 Oct 2024
ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities
Zhenchao Jin
Mengchen Liu
Dongdong Chen
Lingting Zhu
Yunsheng Li
Ziqiang Li
KELM
129
3
0
08 Oct 2024
Integrating Planning into Single-Turn Long-Form Text Generation
Yi Liang
You Wu
Honglei Zhuang
Li Chen
Jiaming Shen
...
Zhen Qin
Sumit Sanghai
Xuanhui Wang
Carl Yang
Michael Bendersky
239
7
0
08 Oct 2024
Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Bolei He
Nuo Chen
Xinran He
Lingyong Yan
Zhenkai Wei
Jinchang Luo
Zhen-Hua Ling
RALM
LRM
150
15
0
08 Oct 2024
AgentSquare: Automatic LLM Agent Search in Modular Design Space
International Conference on Learning Representations (ICLR), 2024
Yu Shang
Yu Li
Keyu Zhao
Likai Ma
Qingbin Liu
Fengli Xu
Yong Li
LLMAG
532
54
0
08 Oct 2024
Driving with Regulation: Trustworthy and Interpretable Decision-Making for Autonomous Driving with Retrieval-Augmented Reasoning
Tianhui Cai
Yifan Liu
Zewei Zhou
Haoxuan Ma
Seth Z. Zhao
Zhiwen Wu
Xu Han
Zhiyu Huang
Jiaqi Ma
418
20
0
07 Oct 2024
LRHP: Learning Representations for Human Preferences via Preference Pairs
Chenglong Wang
Yang Gan
Yifu Huo
Yongyu Mu
Qiaozhi He
Murun Yang
Tong Xiao
Chunliang Zhang
Tongran Liu
Jingbo Zhu
AI4TS
316
3
0
06 Oct 2024
Identification des paramètres dún modèle logistique en dynamique des populations avec sortie affine
Messaoud Souilah
Imene Sabira Soualah
104
0
0
06 Oct 2024
Aligning LLMs with Individual Preferences via Interaction
International Conference on Computational Linguistics (COLING), 2024
Shujin Wu
May Fung
Cheng Qian
Jeonghwan Kim
Dilek Z. Hakkani-Tür
Heng Ji
345
52
0
04 Oct 2024
CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning
Huimu Yu
Xing Wu
Weidong Yin
Debing Zhang
Songlin Hu
LRM
314
7
0
03 Oct 2024
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
International Conference on Learning Representations (ICLR), 2024
Yekun Chai
Haoran Sun
Huang Fang
Shuohuan Wang
Yu Sun
Hua Wu
984
5
0
03 Oct 2024
Evaluating Robustness of Reward Models for Mathematical Reasoning
Sunghwan Kim
Dongjin Kang
Taeyoon Kwon
Hyungjoo Chae
Jungsoo Won
Dongha Lee
Jinyoung Yeo
199
15
0
02 Oct 2024
HelpSteer2-Preference: Complementing Ratings with Preferences
International Conference on Learning Representations (ICLR), 2024
Zhilin Wang
Alexander Bukharin
Olivier Delalleau
Daniel Egert
Gerald Shen
Jiaqi Zeng
Oleksii Kuchaiev
Yi Dong
ALM
460
103
0
02 Oct 2024
HybridFlow: A Flexible and Efficient RLHF Framework
European Conference on Computer Systems (EuroSys), 2024
Guangming Sheng
Chi Zhang
Zilingfeng Ye
Xibin Wu
Wang Zhang
Ru Zhang
Size Zheng
Haibin Lin
Chuan Wu
AI4CE
659
1,008
0
28 Sep 2024
Align
2
^2
2
LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation
Hongzhe Huang
Zhewen Yu
Jiang Liu
Li Cai
Dian Jiao
...
Siliang Tang
Juncheng Li
Hao Jiang
Haoyuan Li
Yueting Zhuang
MLLM
ALM
108
0
0
27 Sep 2024
Open-World Evaluation for Retrieving Diverse Perspectives
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hung-Ting Chen
Eunsol Choi
363
3
0
26 Sep 2024
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
International Conference on Learning Representations (ICLR), 2024
Qining Zhang
Lei Ying
OffRL
483
10
0
25 Sep 2024
Analyzing Probabilistic Methods for Evaluating Agent Capabilities
Axel Højmark
Govind Pimpale
Arjun Panickssery
Marius Hobbhahn
Jérémy Scheurer
309
4
0
24 Sep 2024
Previous
1
2
3
...
7
8
9
...
21
22
23
Next
Page 8 of 23
Page
of 23
Go