Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.09332
Cited By
v1
v2
v3 (latest)
WebGPT: Browser-assisted question-answering with human feedback
17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"WebGPT: Browser-assisted question-answering with human feedback"
50 / 1,125 papers shown
A Study on Training and Developing Large Language Models for Behavior Tree Generation
Fu Li
Xueying Wang
Bin Li
Yunlong Wu
Yanzhen Wang
Xiaodong Yi
260
10
0
16 Jan 2024
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)
International Conference on Machine Learning (ICML), 2024
Zongxin Yang
Guikun Chen
Xiaodi Li
Wenguan Wang
Yi Yang
LM&Ro
LLMAG
515
64
0
16 Jan 2024
The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey
Saurav Pawar
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Vasu Sharma
Amitava Das
218
43
0
15 Jan 2024
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation
Meng Cao
Lei Shu
Lei Yu
Yun Zhu
Nevan Wichers
Yinxiao Liu
Lei Meng
OffRL
ALM
327
18
0
14 Jan 2024
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Weizhou Shen
Chenliang Li
Hongzhan Chen
Ming Yan
Xiaojun Quan
Hehong Chen
Ji Zhang
Fei Huang
LLMAG
376
95
0
14 Jan 2024
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Wenqi Shi
Ran Xu
Yuchen Zhuang
Yue Yu
Jieyu Zhang
Hang Wu
Yuanda Zhu
Joyce C. Ho
Carl Yang
Hang Wu
204
76
0
13 Jan 2024
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yutao Zhu
Peitian Zhang
Chenghao Zhang
Yifei Chen
Binyu Xie
Zheng Liu
Ji-Rong Wen
Zhicheng Dou
246
28
0
12 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
ZuJie Wen
Ke Xu
Qi Li
337
101
0
11 Jan 2024
MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
Renjie Pi
Tianyang Han
Jianshu Zhang
Yueqi Xie
Boyao Wang
Qing Lian
Hanze Dong
Jipeng Zhang
Tong Zhang
AAML
372
109
0
05 Jan 2024
From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models
Na Liu
Liangyu Chen
Xiaoyu Tian
Wei Zou
Kaijiang Chen
Ming Cui
LLMAG
259
45
0
05 Jan 2024
Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding
Yuu Jinnai
Kaito Ariu
357
13
0
05 Jan 2024
Understanding LLMs: A Comprehensive Overview from Training to Inference
Yi-Hsueh Liu
Haoyang He
Tianle Han
Xu-Yao Zhang
Mengyuan Liu
...
Xiaoyan Cai
Tuo Zhang
Ning Qiang
Tianming Liu
Bao Ge
SyDa
465
125
0
04 Jan 2024
Theoretical guarantees on the best-of-n alignment policy
Ahmad Beirami
Alekh Agarwal
Jonathan Berant
Alex DÁmour
Jacob Eisenstein
Chirag Nagpal
A. Suresh
513
89
0
03 Jan 2024
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Ke Yang
Jiateng Liu
John Wu
Chaoqi Yang
Yi R. Fung
...
Xu Cao
Xingyao Wang
Yiquan Wang
Chenhui Xu
Chengxiang Zhai
LLMAG
ELM
476
115
0
01 Jan 2024
Enhancing Open-Domain Task-Solving Capability of LLMs via Autonomous Tool Integration from GitHub
Bohan Lyu
Xin Cong
Heyang Yu
Pan Yang
Yujia Qin
...
Zhong Zhang
Shi Yu
Y. Lin
Zhiyuan Liu
Maosong Sun
LLMAG
272
5
0
28 Dec 2023
ShennongAlpha: an AI-driven sharing and collaboration platform for intelligent curation, acquisition, and translation of natural medicinal material knowledge
Zijie YANG
Yongjing Yin
Chaojun Kong
Tiange Chi
Wufan Tao
Yue Zhang
Tian Xu
50
12
0
27 Dec 2023
LARP: Language-Agent Role Play for Open-World Games
Ming Yan
Ruihao Li
Hao Zhang
Hao Wang
Zhilan Yang
Ji Yan
LLMAG
LM&Ro
AI4CE
269
24
0
24 Dec 2023
MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA
Lang Yu
Qin Chen
Jie Zhou
Liang He
KELM
275
80
0
19 Dec 2023
Agent-based Learning of Materials Datasets from Scientific Literature
Mehrad Ansari
S. M. Moosavi
AI4CE
167
19
0
18 Dec 2023
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint
Wei Xiong
Hanze Dong
Chen Ye
Ziqi Wang
Han Zhong
Heng Ji
Nan Jiang
Tong Zhang
OffRL
379
298
0
18 Dec 2023
Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations
Zilin Wang
Hao-Wen Zhuang
Lu Li
Yinmin Zhang
Junjie Zhong
Jun Chen
Yu Yang
Boshi Tang
Zhiyong Wu
194
5
0
18 Dec 2023
Retrieval-Augmented Generation for Large Language Models: A Survey
Yunfan Gao
Yun Xiong
Xinyu Gao
Kangxiang Jia
Jinliu Pan
Yuxi Bi
Yi Dai
Jiawei Sun
Meng Wang
Haofen Wang
3DV
RALM
1.2K
2,795
1
18 Dec 2023
Let AI Entertain You: Increasing User Engagement with Generative AI and Rejection Sampling
Jingying Zeng
Jaewon Yang
Waleed Malik
Xiao Yan
Richard Huang
Qi He
148
2
0
16 Dec 2023
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Renat Aksitov
Sobhan Miryoosefi
Zong-xiao Li
Daliang Li
Sheila Babayan
...
Sushant Prakash
Pranesh Srinivasan
Manzil Zaheer
Felix X. Yu
Sanjiv Kumar
LRM
ReLM
LLMAG
KELM
272
75
0
15 Dec 2023
Towards Verifiable Text Generation with Evolving Memory and Self-Reflection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hao Sun
Hengyi Cai
Bo Wang
Yingyan Hou
Xiaochi Wei
Shuaiqiang Wang
Yan Zhang
D. Yin
358
17
0
14 Dec 2023
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuan Sui
Jiaru Zou
Mengyu Zhou
Xinyi He
Lun Du
Shi Han
Dongmei Zhang
LRM
LMTD
209
46
0
14 Dec 2023
LDM
2
^2
2
: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xingjin Wang
Linjing Li
D. Zeng
148
1
0
13 Dec 2023
AI capabilities can be significantly improved without expensive retraining
Tom Davidson
Jean-Stanislas Denain
Pablo Villalobos
Guillem Bas
OffRL
VLM
247
31
0
12 Dec 2023
On Diversified Preferences of Large Language Model Alignment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Dun Zeng
Yong Dai
Pengyu Cheng
Longyue Wang
Tianhao Hu
Wanshun Chen
Nan Du
Zenglin Xu
ALM
399
22
0
12 Dec 2023
Exploring Large Language Models to Facilitate Variable Autonomy for Human-Robot Teaming
Younes Lakhnati
Max Pascher
Jens Gerken
LLMAG
LM&Ro
309
13
0
12 Dec 2023
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Tao Gui
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
495
4
0
12 Dec 2023
Alignment for Honesty
Neural Information Processing Systems (NeurIPS), 2023
Yuqing Yang
Ethan Chern
Xipeng Qiu
Graham Neubig
Pengfei Liu
273
60
0
12 Dec 2023
"I Want It That Way": Enabling Interactive Decision Support Using Large Language Models and Constraint Programming
Connor Lawless
Jakob Schoeffer
Lindy Le
Kael Rowan
Shilad Sen
Cristina St. Hill
Jina Suh
Bahar Sarrafzadeh
337
27
0
12 Dec 2023
"What's important here?": Opportunities and Challenges of Using LLMs in Retrieving Information from Web Interfaces
Faria Huq
Jeffrey P. Bigham
Nikolas Martelaro
243
8
0
11 Dec 2023
KwaiAgents: Generalized Information-seeking Agent System with Large Language Models
Haojie Pan
Zepeng Zhai
Hao Yuan
Yaojia Lv
Ruiji Fu
Ming Liu
Zhongyuan Wang
Bing Qin
LLMAG
RALM
270
14
0
08 Dec 2023
Learning to Break: Knowledge-Enhanced Reasoning in Multi-Agent Debate System
Haotian Wang
Xiyuan Du
Weijiang Yu
Qianglong Chen
Kun Zhu
Zheng Chu
Lian Yan
Yi Guan
367
29
0
08 Dec 2023
LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent Ecosystem
Yingqiang Ge
Yujie Ren
Qingfeng Lan
Shuyuan Xu
Juntao Tan
Zelong Li
LLMAG
259
40
0
06 Dec 2023
Speculative Exploration on the Concept of Artificial Agents Conducting Autonomous Research
Shiro Takagi
256
1
0
06 Dec 2023
Rethinking E-Commerce Search
Haixun Wang
Taesik Na
188
8
0
06 Dec 2023
ULMA: Unified Language Model Alignment with Human Demonstration and Point-wise Preference
Tianchi Cai
Xierui Song
Jiyan Jiang
Fei Teng
Jinjie Gu
Guannan Zhang
ALM
200
8
0
05 Dec 2023
Explore, Select, Derive, and Recall: Augmenting LLM with Human-like Memory for Mobile Task Automation
ACM/IEEE International Conference on Mobile Computing and Networking (MobiCom), 2023
Sunjae Lee
Junyoung Choi
Jungjae Lee
Munim Hasan Wasi
Hojun Choi
Steven Y. Ko
Sangeun Oh
Insik Shin
RALM
352
6
0
04 Dec 2023
D-Bot: Database Diagnosis System using Large Language Models
Proceedings of the VLDB Endowment (PVLDB), 2023
Xuanhe Zhou
Guoliang Li
Zhaoyan Sun
Zhiyuan Liu
Weize Chen
Jianming Wu
Jiesi Liu
Ruohang Feng
Guoyang Zeng
LLMAG
217
34
0
03 Dec 2023
Nash Learning from Human Feedback
International Conference on Machine Learning (ICML), 2023
Rémi Munos
Michal Valko
Daniele Calandriello
M. G. Azar
Mark Rowland
...
Nikola Momchev
Olivier Bachem
D. Mankowitz
Doina Precup
Bilal Piot
574
190
0
01 Dec 2023
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
European Conference on Computer Vision (ECCV), 2023
Yufei Zhan
Yousong Zhu
Zhiyang Chen
Fan Yang
E. Goles
Jinqiao Wang
ObjD
242
30
0
24 Nov 2023
PrivateLoRA For Efficient Privacy Preserving LLM
Yiming Wang
Yu Lin
Xiaodong Zeng
Guannan Zhang
309
25
0
23 Nov 2023
DaG LLM ver 1.0: Pioneering Instruction-Tuned Language Modeling for Korean NLP
Dongjun Jang
Sangah Lee
Sungjoo Byun
Jinwoong Kim
Jean Seo
...
Soyeon Kim
Chaeyoung Oh
Jaeyoon Kim
Hyemi Jo
Hyopil Shin
ALM
235
0
0
23 Nov 2023
GAIA: a benchmark for General AI Assistants
Grégoire Mialon
Clémentine Fourrier
Craig Swift
Thomas Wolf
Yann LeCun
Thomas Scialom
AI4MH
ALM
ELM
RALM
460
473
0
21 Nov 2023
Unifying Corroborative and Contributive Attributions in Large Language Models
Theodora Worledge
Judy Hanwen Shen
Nicole Meister
Caleb Winston
Carlos Guestrin
TDI
320
13
0
20 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Zhuosheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAG
LM&Ro
LRM
363
94
0
20 Nov 2023
Towards Robust Text Retrieval with Progressive Learning
Tong Wu
Yulei Qin
Enwei Zhang
Zihan Xu
Yuting Gao
Ke Li
Xing Sun
RALM
VLM
173
1
0
20 Nov 2023
Previous
1
2
3
...
14
15
16
...
21
22
23
Next
Page 15 of 23
Page
of 23
Go