Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.09332
Cited By
v1
v2
v3 (latest)
WebGPT: Browser-assisted question-answering with human feedback
17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"WebGPT: Browser-assisted question-answering with human feedback"
50 / 1,125 papers shown
Analyzing Probabilistic Methods for Evaluating Agent Capabilities
Axel Højmark
Govind Pimpale
Arjun Panickssery
Marius Hobbhahn
Jérémy Scheurer
312
4
0
24 Sep 2024
LLM With Tools: A Survey
Zhuocheng Shen
250
40
0
24 Sep 2024
LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Sihui Yang
Keping Bi
Wanqing Cui
Jiafeng Guo
Xueqi Cheng
276
5
0
23 Sep 2024
PROMPTFUZZ: Harnessing Fuzzing Techniques for Robust Testing of Prompt Injection in LLMs
Jiahao Yu
Yangguang Shao
Hanwen Miao
Junzheng Shi
SILM
AAML
473
20
0
23 Sep 2024
Backtracking Improves Generation Safety
Yiming Zhang
Jianfeng Chi
Hailey Nguyen
Kartikeya Upasani
Daniel M. Bikel
Jason Weston
Eric Michael Smith
SILM
310
24
0
22 Sep 2024
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal Reasoning with Large Language Models
Shengsheng Qian
Zuyi Zhou
Dizhan Xue
Bing Wang
Changsheng Xu
LRM
428
5
0
19 Sep 2024
Textualized Agent-Style Reasoning for Complex Tasks by Multiple Round LLM Generation
Chen Liang
Zhifan Feng
Zihe Liu
Wenbin Jiang
Jinan Xu
Yufeng Chen
Yong Wang
LLMAG
LRM
165
3
0
19 Sep 2024
From Lists to Emojis: How Format Bias Affects Model Alignment
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Xuanchang Zhang
Wei Xiong
Lichang Chen
Wanrong Zhu
Heng Huang
Tong Zhang
ALM
445
22
0
18 Sep 2024
CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration
Lei Li
Renjie Pi
Tianyang Han
Han Wu
Lanqing Hong
Lingpeng Kong
Xin Jiang
Zhenguo Li
313
19
0
17 Sep 2024
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
International Conference on Learning Representations (ICLR), 2024
Maojia Song
Shang Hong Sim
Rishabh Bhardwaj
Hai Leong Chieu
Navonil Majumder
Soujanya Poria
578
29
0
17 Sep 2024
Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs
Yifan Wang
David Stevens
Pranay Shah
Wenwen Jiang
Miao Liu
...
Boying Gong
Daniel Lee
Jiabo Hu
Ning Zhang
Bob Kamma
247
5
0
16 Sep 2024
StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models
Baolong Bi
Shenghua Liu
Yiwei Wang
Lingrui Mei
Hongcheng Gao
Junfeng Fang
Xueqi Cheng
KELM
221
10
0
16 Sep 2024
Trustworthiness in Retrieval-Augmented Generation Systems: A Survey
Yujia Zhou
Yan Liu
Xiaoxi Li
Jiajie Jin
Hongjin Qian
Zheng Liu
Chaozhuo Li
Zhicheng Dou
Tsung-Yi Ho
Philip S. Yu
3DV
RALM
282
83
0
16 Sep 2024
Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison
Judy Hanwen Shen
Archit Sharma
Jun Qin
190
13
0
15 Sep 2024
Policy Filtration for RLHF to Mitigate Noise in Reward Models
Chuheng Zhang
Wei Shen
Li Zhao
Xuyun Zhang
Xiaolong Xu
Wanchun Dou
Jiang Biang
OffRL
415
6
0
11 Sep 2024
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Han Wang
Archiki Prasad
Elias Stengel-Eskin
Joey Tianyi Zhou
485
19
0
11 Sep 2024
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Buhua Liu
Shitong Shao
Bao Li
Lichen Bai
Zhiqiang Xu
Haoyi Xiong
James Kwok
Sumi Helal
Bo Han
467
23
0
11 Sep 2024
AGR: Age Group fairness Reward for Bias Mitigation in LLMs
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Shuirong Cao
Ruoxi Cheng
Zhiqiang Wang
185
12
0
06 Sep 2024
On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yong Lin
Skyler Seto
Maartje ter Hoeve
Katherine Metcalf
B. Theobald
Xuan Wang
Yizhe Zhang
Chen Huang
Tong Zhang
360
23
0
05 Sep 2024
Towards a Unified View of Preference Learning for Large Language Models: A Survey
Bofei Gao
Feifan Song
Yibo Miao
Zefan Cai
Zhiyong Yang
...
Houfeng Wang
Zhifang Sui
Peiyi Wang
Baobao Chang
Baobao Chang
473
17
0
04 Sep 2024
From Grounding to Planning: Benchmarking Bottlenecks in Web Agents
Segev Shlomov
Ben Wiesel
Aviad Sela
Ido Levy
Liane Galanti
Roy Abitbol
LLMAG
340
8
0
03 Sep 2024
ContextCite: Attributing Model Generation to Context
Neural Information Processing Systems (NeurIPS), 2024
Benjamin Cohen-Wang
Harshay Shah
Kristian Georgiev
Aleksander Madry
LRM
361
62
0
01 Sep 2024
Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
Yuncheng Yang
Yulei Qin
Tong Wu
Zihan Xu
Gang Li
...
Yuchen Shi
Ke Li
Xing Sun
Jie Yang
Yun Gu
ALM
OffRL
MoE
356
1
0
28 Aug 2024
ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model
Lifan Jiang
Zhihui Wang
Siqi Yin
Guangxiao Ma
Peng Zhang
Boxi Wu
DiffM
350
19
0
28 Aug 2024
Enhancing and Accelerating Large Language Models via Instruction-Aware Contextual Compression
Haowen Hou
Fei Ma
Binwen Bai
Xinxin Zhu
Fei Yu
200
4
0
28 Aug 2024
How will advanced AI systems impact democracy?
Christopher Summerfield
Lisa Argyle
Michiel Bakker
Teddy Collins
Esin Durmus
...
Elizabeth Seger
Divya Siddarth
Henrik Skaug Sætra
MH Tessler
M. Botvinick
304
10
0
27 Aug 2024
AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems
Chi-Min Chan
Jianxuan Yu
Weize Chen
Chunyang Jiang
Xinyu Liu
Weijie Shi
Zhiyuan Liu
Wei Xue
Yike Guo
LLMAG
288
5
0
27 Aug 2024
Ancient Wisdom, Modern Tools: Exploring Retrieval-Augmented LLMs for Ancient Indian Philosophy
Priyanka Mandikal
RALM
VLM
217
1
0
21 Aug 2024
DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework
Zhifei Xie
Daniel Tang
Dingwei Tan
Jacques Klein
Tegawend F. Bissyand
Saad Ezzini
VGen
232
22
0
21 Aug 2024
Athena: Safe Autonomous Agents with Verbal Contrastive Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Tanmana Sadhu
Ali Pesaranghader
Yanan Chen
Dong Hoon Yi
ELM
LLMAG
AAML
67
5
0
20 Aug 2024
SysBench: Can Large Language Models Follow System Messages?
Yanzhao Qin
Tao Zhang
Tao Zhang
Yanjun Shen
Wenjing Luo
...
Yujing Qiao
Weipeng Chen
Guosheng Dong
Wentao Zhang
Bin Cui
ALM
397
16
0
20 Aug 2024
Minor DPO reject penalty to increase training robustness
Shiming Xie
Hong Chen
Fred Yu
Zeye Sun
Xiuyu Wu
Yingfan Hu
215
5
0
19 Aug 2024
HySem: A context length optimized LLM pipeline for unstructured tabular extraction
Narayanan PP
A. P. N. Iyer
252
2
0
18 Aug 2024
SEAL: Systematic Error Analysis for Value ALignment
AAAI Conference on Artificial Intelligence (AAAI), 2024
Manon Revel
Matteo Cargnelutti
Tyna Eloundou
Greg Leppert
287
6
0
16 Aug 2024
The Future of Open Human Feedback
Nature Machine Intelligence (Nat. Mach. Intell.), 2024
Shachar Don-Yehiya
Ben Burtenshaw
Ramon Fernandez Astudillo
Cailean Osborne
Mimansa Jaiswal
...
Omri Abend
Jennifer Ding
Sara Hooker
Hannah Rose Kirk
Leshem Choshen
VLM
ALM
298
9
0
15 Aug 2024
Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding
Xiner Li
Yulai Zhao
Chenyu Wang
Gabriele Scalia
Gökçen Eraslan
Surag Nair
Tommaso Biancalani
Aviv Regev
Sergey Levine
Masatoshi Uehara
422
81
0
15 Aug 2024
The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community
Shachar Don-Yehiya
Leshem Choshen
Omri Abend
280
5
0
15 Aug 2024
Automated Design of Agentic Systems
International Conference on Learning Representations (ICLR), 2024
Shengran Hu
Cong Lu
Jeff Clune
AI4CE
471
120
0
15 Aug 2024
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
Pranav Putta
Edmund Mills
Naman Garg
S. Motwani
Chelsea Finn
Divyansh Garg
Rafael Rafailov
LLMAG
LRM
291
147
0
13 Aug 2024
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Xiao-Yang Liu
Tianjie Zhang
Yu Gu
Iat Long Iong
Yifan Xu
...
Zhengxiao Du
Chan Hee Song
Yu Su
Yuxiao Dong
Jie Tang
VLM
LLMAG
253
68
0
12 Aug 2024
A Hybrid RAG System with Comprehensive Enhancement on Complex Reasoning
Ye Yuan
Chengwu Liu
Jingyang Yuan
Gongbo Sun
Siqi Li
Ming Zhang
LRM
432
17
0
09 Aug 2024
Learning Fine-Grained Grounded Citations for Attributed Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Lei Huang
Xiaocheng Feng
Weitao Ma
Yuxuan Gu
Weihong Zhong
...
Weijiang Yu
Weihua Peng
Duyu Tang
Dandan Tu
Bing Qin
HILM
281
7
0
08 Aug 2024
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation
Junde Wu
Jiayuan Zhu
Yunli Qi
274
109
0
08 Aug 2024
MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models
Yuchen Dong
Xiaoxiang Fang
Yuchen Hu
Renshuang Jiang
Zhe Jiang
242
0
0
07 Aug 2024
Making Long-Context Language Models Better Multi-Hop Reasoners
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yanyang Li
Shuo Liang
Michael R. Lyu
Liwei Wang
LLMAG
LRM
312
29
0
06 Aug 2024
Can DPO Learn Diverse Human Values? A Theoretical Scaling Law
Shawn Im
Yixuan Li
619
3
0
06 Aug 2024
A Framework for Fine-Tuning LLMs using Heterogeneous Feedback
Ryan Aponte
Ryan Rossi
Shunan Guo
Franck Dernoncourt
Tong Yu
Xiang Chen
Subrata Mitra
Nedim Lipka
OffRL
144
1
0
05 Aug 2024
DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation
European Conference on Computer Vision (ECCV), 2024
Rakshith Subramanyam
Kowshik Thopalli
V. Narayanaswamy
Jayaraman J.Thiagarajan
268
4
0
01 Aug 2024
Improving Retrieval Augmented Language Model with Self-Reasoning
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yuan Xia
Jingbo Zhou
Zhenhui Shi
Jun Chen
Hai-ting Huang
AIFin
LRM
ReLM
KELM
258
34
0
29 Jul 2024
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
International Conference on Learning Representations (ICLR), 2024
Zehui Chen
Kuikun Liu
Qiuchen Wang
Jiangning Liu
Wenwei Zhang
Kai Chen
Feng Zhao
LLMAG
389
44
0
29 Jul 2024
Previous
1
2
3
...
8
9
10
...
21
22
23
Next
Page 9 of 23
Page
of 23
Go