ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09332
  4. Cited By
WebGPT: Browser-assisted question-answering with human feedback
v1v2v3 (latest)

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
    ALMRALM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 1,125 papers shown
CoCoST: Automatic Complex Code Generation with Online Searching and
  Correctness Testing
CoCoST: Automatic Complex Code Generation with Online Searching and Correctness Testing
Xinyi He
Jiaru Zou
Yun Lin
Mengyu Zhou
Shi Han
Zejian Yuan
Dongmei Zhang
168
5
0
20 Mar 2024
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large
  Vision Language Models
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Tongtian Yue
Jie Cheng
Longteng Guo
Xingyuan Dai
Zijia Zhao
Xingjian He
Gang Xiong
Yisheng Lv
Jing Liu
216
13
0
20 Mar 2024
Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open
  Domain Multi-Hop Question Answering
Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question AnsweringInternational Conference on Language Resources and Evaluation (LREC), 2024
Yuan Gao
Yiheng Zhu
Yuanbin Cao
Yinzhi Zhou
Zhen Wu
Yujie Chen
Shenglan Wu
Haoyuan Hu
Xinyu Dai
LRM
216
5
0
19 Mar 2024
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Kevin Xu
Yeganeh Kordi
Kate Sanders
Yizhong Wang
Adam Byerly
Kate Sanders
Adam Byerly
Jingyu Zhang
Benjamin Van Durme
Daniel Khashabi
LLMAG
543
16
0
18 Mar 2024
JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented
  Fine-Tuning
JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning
Anique Tahir
Lu Cheng
Huan Liu
227
3
0
17 Mar 2024
Improving Dialogue Agents by Decomposing One Global Explicit Annotation
  with Local Implicit Multimodal Feedback
Improving Dialogue Agents by Decomposing One Global Explicit Annotation with Local Implicit Multimodal Feedback
Dong Won Lee
Hae Won Park
Yoon Kim
C. Breazeal
Louis-Philippe Morency
260
0
0
17 Mar 2024
Beyond Static Evaluation: A Dynamic Approach to Assessing AI Assistants'
  API Invocation Capabilities
Beyond Static Evaluation: A Dynamic Approach to Assessing AI Assistants' API Invocation Capabilities
Honglin Mu
Yang Xu
Yunlong Feng
Xiaofeng Han
Yitong Li
Yutai Hou
Wanxiang Che
ELM
186
4
0
17 Mar 2024
FlowMind: Automatic Workflow Generation with LLMs
FlowMind: Automatic Workflow Generation with LLMs
Zhen Zeng
William Watson
Nicole Cho
Saba Rahimi
Shayleen Reynolds
T. Balch
Manuela Veloso
205
49
0
17 Mar 2024
Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine
  Knowledge
Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge
Yizhen Li
Shaohan Huang
Jiaxing Qi
Lei Quan
Dongran Han
Zhongzhi Luan
LM&MAAI4MH
129
7
0
14 Mar 2024
Re-Search for The Truth: Multi-round Retrieval-augmented Large Language
  Models are Strong Fake News Detectors
Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors
Guanghua Li
Wensheng Lu
Wei Zhang
Defu Lian
Kezhong Lu
Rui Mao
Kai Shu
Hao Liao
HILM
201
15
0
14 Mar 2024
Strengthening Multimodal Large Language Model with Bootstrapped
  Preference Optimization
Strengthening Multimodal Large Language Model with Bootstrapped Preference OptimizationEuropean Conference on Computer Vision (ECCV), 2024
Renjie Pi
Tianyang Han
Wei Xiong
Jipeng Zhang
Runtao Liu
Boyao Wang
Tong Zhang
MLLM
381
76
0
13 Mar 2024
Bifurcated Attention: Accelerating Massively Parallel Decoding with
  Shared Prefixes in LLMs
Bifurcated Attention: Accelerating Massively Parallel Decoding with Shared Prefixes in LLMs
Ben Athiwaratkun
Sujan Kumar Gonugondla
Sanjay Krishna Gouda
Haifeng Qian
Hantian Ding
...
Liangfu Chen
Parminder Bhatia
Ramesh Nallapati
Sudipta Sengupta
Bing Xiang
267
5
0
13 Mar 2024
Human Alignment of Large Language Models through Online Preference
  Optimisation
Human Alignment of Large Language Models through Online Preference OptimisationInternational Conference on Machine Learning (ICML), 2024
Daniele Calandriello
Daniel Guo
Rémi Munos
Mark Rowland
Yunhao Tang
...
Michal Valko
Tianqi Liu
Rishabh Joshi
Zeyu Zheng
Bilal Piot
277
88
0
13 Mar 2024
BAGEL: Bootstrapping Agents by Guiding Exploration with Language
BAGEL: Bootstrapping Agents by Guiding Exploration with LanguageInternational Conference on Machine Learning (ICML), 2024
Shikhar Murty
Christopher D. Manning
Peter Shaw
Mandar Joshi
Kenton Lee
LM&RoLLMAG
335
28
0
12 Mar 2024
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
Beyond Text: Frozen Large Language Models in Visual Signal ComprehensionComputer Vision and Pattern Recognition (CVPR), 2024
Lei Zhu
Fangyun Wei
Yanye Lu
MLLMVLM
232
30
0
12 Mar 2024
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work
  Tasks?
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?International Conference on Machine Learning (ICML), 2024
Alexandre Drouin
Maxime Gasse
Massimo Caccia
I. Laradji
Manuel Del Verme
...
Megh Thakkar
Quentin Cappart
David Vazquez
Nicolas Chapados
Alexandre Lacoste
LLMAG
377
136
0
12 Mar 2024
Materials science in the era of large language models: a perspective
Materials science in the era of large language models: a perspectiveDigital Discovery (DD), 2024
Ge Lei
Ronan Docherty
Samuel J. Cooper
231
44
0
11 Mar 2024
Enhancing Data Quality in Federated Fine-Tuning of Foundation Models
Enhancing Data Quality in Federated Fine-Tuning of Foundation Models
Wanru Zhao
Yaxin Du
Nicholas D. Lane
Siheng Chen
Yanfeng Wang
222
4
0
07 Mar 2024
On the Essence and Prospect: An Investigation of Alignment Approaches
  for Big Models
On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models
Xinpeng Wang
Shitong Duan
Xiaoyuan Yi
Jing Yao
Shanlin Zhou
Zhihua Wei
Peng Zhang
Dongkuan Xu
Maosong Sun
Xing Xie
OffRL
403
24
0
07 Mar 2024
A Survey on Human-AI Collaboration with Large Foundation Models
A Survey on Human-AI Collaboration with Large Foundation Models
Vanshika Vats
Marzia Binta Nizam
Minghao Liu
Ziyuan Wang
Richard Ho
...
Celeste Shen
Rachel Shen
Nafisa Hussain
Kesav Ravichandran
James Davis
LM&MA
566
11
0
07 Mar 2024
Learning to Decode Collaboratively with Multiple Language Models
Learning to Decode Collaboratively with Multiple Language Models
Zejiang Shen
Hunter Lang
Bailin Wang
Yoon Kim
David Sontag
163
54
0
06 Mar 2024
Mixture-of-LoRAs: An Efficient Multitask Tuning for Large Language
  Models
Mixture-of-LoRAs: An Efficient Multitask Tuning for Large Language Models
Wenfeng Feng
Chuzhan Hao
Yuewei Zhang
Yu Han
Hao Wang
ALMMoE
185
21
0
06 Mar 2024
Reliable, Adaptable, and Attributable Language Models with Retrieval
Reliable, Adaptable, and Attributable Language Models with Retrieval
Akari Asai
Zexuan Zhong
Danqi Chen
Pang Wei Koh
Luke Zettlemoyer
Hanna Hajishirzi
Anuj Kumar
KELMRALM
340
82
0
05 Mar 2024
Learning to Use Tools via Cooperative and Interactive Agents
Learning to Use Tools via Cooperative and Interactive Agents
Zhengliang Shi
Shen Gao
Xiuyi Chen
Zhumin Chen
Lingyong Yan
Haibo Shi
D. Yin
Sudipta Singha Roy
Suzan Verberne
Zhaochun Ren
LLMAG
393
52
0
05 Mar 2024
WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search
  Results with Citations
WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations
Haolin Deng
Chang Wang
Xin Li
Dezhang Yuan
Junlang Zhan
Tianhua Zhou
Jin Ma
Jun Gao
Ruifeng Xu
HILM
250
7
0
04 Mar 2024
DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
Shanghaoran Quan
MoEOffRL
239
12
0
02 Mar 2024
Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based
  Search Engines
Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines
Lijia Ma
Xingchen Xu
Yong-Ming Tan
166
10
0
29 Feb 2024
PlanGPT: Enhancing Urban Planning with Tailored Language Model and
  Efficient Retrieval
PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval
He Zhu
Wenjia Zhang
Nuoxian Huang
Boyang Li
Luyao Niu
...
Yicheng Tao
Junyou Su
Zhaoya Gong
Chenyu Fang
Xing Liu
LLMAG
225
17
0
29 Feb 2024
ToolNet: Connecting Large Language Models with Massive Tools via Tool
  Graph
ToolNet: Connecting Large Language Models with Massive Tools via Tool Graph
Xukun Liu
Zhiyuan Peng
Xiaoyuan Yi
Xing Xie
Lirong Xiang
Yuchen Liu
Dongkuan Xu
CLLLLMAG
189
45
0
29 Feb 2024
Approaching Human-Level Forecasting with Language Models
Approaching Human-Level Forecasting with Language Models
Danny Halawi
Fred Zhang
Chen Yueh-Han
Jacob Steinhardt
270
56
0
28 Feb 2024
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist
  Autonomous Agents for Desktop and Web
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
Raghav Kapoor
Y. Butala
M. Russak
Jing Yu Koh
Kiran Kamble
Waseem Alshikh
Ruslan Salakhutdinov
LLMAG
498
109
0
27 Feb 2024
A Survey of Large Language Models in Cybersecurity
A Survey of Large Language Models in Cybersecurity
Gabriel de Jesus Coelho da Silva
Carlos Becker Westphall
264
14
0
26 Feb 2024
Evaluating Robustness of Generative Search Engine on Adversarial Factual
  Questions
Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions
Xuming Hu
Xiaochuan Li
Junzhe Chen
Hai-Tao Zheng
Yangning Li
...
Yasheng Wang
Qun Liu
Lijie Wen
Philip S. Yu
Zhijiang Guo
AAMLELM
250
8
0
25 Feb 2024
How Large Language Models Encode Context Knowledge? A Layer-Wise Probing
  Study
How Large Language Models Encode Context Knowledge? A Layer-Wise Probing Study
Tianjie Ju
Weiwei Sun
Wei Du
Xinwei Yuan
Zhaochun Ren
Gongshen Liu
KELM
236
58
0
25 Feb 2024
Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A
  Case-Study in E-Commerce Opinion Summarization
Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization
Swaroop Nath
Tejpalsingh Siledar
Sankara Sri Raghava Ravindra Muddu
Rupasai Rangaraju
H. Khadilkar
...
Suman Banerjee
Amey Patil
Sudhanshu Singh
M. Chelliah
Nikesh Garera
206
1
0
23 Feb 2024
AttributionBench: How Hard is Automatic Attribution Evaluation?
AttributionBench: How Hard is Automatic Attribution Evaluation?
Yifei Li
Xiang Yue
Zeyi Liao
Huan Sun
HILM
267
19
0
23 Feb 2024
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
Zicheng Lin
Zhibin Gou
Tian Liang
Ruilin Luo
Haowei Liu
Yujiu Yang
LRM
418
82
0
22 Feb 2024
GenSERP: Large Language Models for Whole Page Presentation
Zhenning Zhang
Yunan Zhang
Suyu Ge
Guangwei Weng
M. Narang
Xia Song
Saurabh Tiwary
KELM
729
2
0
22 Feb 2024
$\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens
∞\infty∞Bench: Extending Long Context Evaluation Beyond 100K Tokens
Xinrong Zhang
Yingfa Chen
Shengding Hu
Zihang Xu
Junhao Chen
...
Xu Han
Zhen Leng Thai
Shuo Wang
Zhiyuan Liu
Maosong Sun
RALMLRM
548
285
0
21 Feb 2024
Bayesian Reward Models for LLM Alignment
Bayesian Reward Models for LLM Alignment
Adam X. Yang
Maxime Robeyns
Thomas Coste
Zhengyan Shi
Jun Wang
Haitham Bou-Ammar
Laurence Aitchison
245
27
0
20 Feb 2024
A Survey on Knowledge Distillation of Large Language Models
A Survey on Knowledge Distillation of Large Language Models
Xiaohan Xu
Ming Li
Chongyang Tao
Tao Shen
Reynold Cheng
Jinyang Li
Can Xu
Dacheng Tao
Wanrong Zhu
KELMVLM
469
247
0
20 Feb 2024
Large Language Model-based Human-Agent Collaboration for Complex Task
  Solving
Large Language Model-based Human-Agent Collaboration for Complex Task Solving
Xueyang Feng
Zhiyuan Chen
Yujia Qin
Yankai Lin
Xu Chen
Zhiyuan Liu
Ji-Rong Wen
LLMAG
261
39
0
20 Feb 2024
Instruction-tuned Language Models are Better Knowledge Learners
Instruction-tuned Language Models are Better Knowledge Learners
Zhengbao Jiang
Zhiqing Sun
Weijia Shi
Pedro Rodriguez
Chunting Zhou
Graham Neubig
Xi Lin
Anuj Kumar
Srinivasan Iyer
KELM
303
55
0
20 Feb 2024
ARKS: Active Retrieval in Knowledge Soup for Code Generation
ARKS: Active Retrieval in Knowledge Soup for Code Generation
Hongjin Su
Shuyang Jiang
Yuhang Lai
Haoyuan Wu
Boao Shi
Che Liu
Qian Liu
Tao Yu
KELMRALM
49
10
0
19 Feb 2024
Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When
  and What to Retrieve for LLMs
Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs
Jiejun Tan
Zhicheng Dou
Yutao Zhu
Peidong Guo
Kun Fang
Ji-Rong Wen
396
54
0
19 Feb 2024
What Evidence Do Language Models Find Convincing?
What Evidence Do Language Models Find Convincing?
Alexander Wan
Eric Wallace
Dan Klein
533
59
0
19 Feb 2024
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific
  Data Visualization
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization
Zhiyu Yang
Zihan Zhou
Shuo Wang
Xin Cong
Xu Han
...
Pengyuan Liu
Dong Yu
Zhiyuan Liu
Xiaodong Shi
Maosong Sun
LLMAG
224
77
0
18 Feb 2024
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based
  Agents
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents
Wenkai Yang
Xiaohan Bi
Yankai Lin
Sishuo Chen
Jie Zhou
Xu Sun
LLMAGAAML
291
123
0
17 Feb 2024
KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning
  over Knowledge Graph
KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph
Jinhao Jiang
Kun Zhou
Wayne Xin Zhao
Yang Song
Chen Zhu
Hengshu Zhu
Ji-Rong Wen
LLMAGRALM
145
84
0
17 Feb 2024
BlendFilter: Advancing Retrieval-Augmented Large Language Models via
  Query Generation Blending and Knowledge Filtering
BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering
Haoyu Wang
Ruirui Li
Haoming Jiang
Jinjin Tian
Zhengyang Wang
Chen Luo
Xianfeng Tang
Monica Cheng
Tuo Zhao
Jing Gao
RALMKELM
235
36
0
16 Feb 2024
Previous
123...121314...212223
Next
Page 13 of 23
Pageof 23