ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09332
  4. Cited By
WebGPT: Browser-assisted question-answering with human feedback
v1v2v3 (latest)

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
    ALMRALM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 1,123 papers shown
Authenticated Delegation and Authorized AI Agents
Authenticated Delegation and Authorized AI Agents
Tobin South
Samuele Marro
Thomas Hardjono
Robert Mahari
Cedric Deslandes Whitney
Dazza Greenwood
Alan Chan
Alex Pentland
419
27
0
17 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
A Comprehensive Survey of Foundation Models in MedicineIEEE Reviews in Biomedical Engineering (RBME), 2024
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CELM&MAVLM
778
72
0
17 Jan 2025
WebWalker: Benchmarking LLMs in Web Traversal
WebWalker: Benchmarking LLMs in Web TraversalAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Jialong Wu
Wenbiao Yin
Yong Jiang
Zhenglin Wang
Zekun Xi
...
Linhai Zhang
Yulan He
Deyu Zhou
Pengjun Xie
Fei Huang
619
88
0
13 Jan 2025
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Hiroki Furuta
Yutaka Matsuo
Aleksandra Faust
Izzeddin Gur
CLL
608
19
0
03 Jan 2025
Enhancing Preference-based Linear Bandits via Human Response Time
Enhancing Preference-based Linear Bandits via Human Response TimeNeural Information Processing Systems (NeurIPS), 2024
Shen Li
Yuyang Zhang
Tongzheng Ren
Claire Liang
Na Li
J. Shah
490
1
0
03 Jan 2025
PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations
PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations
Ruosen Li
Teerth Patel
Xinya Du
LLMAGALM
553
126
0
03 Jan 2025
AutoPresent: Designing Structured Visuals from Scratch
AutoPresent: Designing Structured Visuals from ScratchComputer Vision and Pattern Recognition (CVPR), 2025
Jiaxin Ge
Zora Z. Wang
Xuhui Zhou
Yi-Hao Peng
Sanjay Subramanian
...
Maarten Sap
Alane Suhr
Daniel Fried
Graham Neubig
Trevor Darrell
278
8
0
01 Jan 2025
Zero-Indexing Internet Search Augmented Generation for Large Language Models
Zero-Indexing Internet Search Augmented Generation for Large Language Models
Guangxin He
Zonghong Dai
Jiangcheng Zhu
Binqiang Zhao
Qicheng Hu
Chenyue Li
You Peng
Chen Wang
Binhang Yuan
369
1
0
31 Dec 2024
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking AgentsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Weiwei Sun
Lingyong Yan
Xinyu Ma
Shuaiqiang Wang
Sudipta Singha Roy
Zhumin Chen
D. Yin
Zhaochun Ren
RALMALMELMLRMLM&MA
688
427
0
31 Dec 2024
Diverse and Effective Red Teaming with Auto-generated Rewards and
  Multi-step Reinforcement Learning
Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning
Alex Beutel
Kai Y. Xiao
Johannes Heidecke
Lilian Weng
AAML
183
17
0
24 Dec 2024
Lies, Damned Lies, and Distributional Language Statistics: Persuasion
  and Deception with Large Language Models
Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models
Cameron R. Jones
Benjamin Bergen
462
14
0
22 Dec 2024
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model
  Fine-tuning
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Ziang Ye
Zizhuo Zhang
Yang Zhang
Jianxin Ma
Junyang Lin
Fuli Feng
LRM
269
3
0
19 Dec 2024
Relational Programming with Foundation Models
Relational Programming with Foundation ModelsAAAI Conference on Artificial Intelligence (AAAI), 2024
Ziyang Li
Jiani Huang
Jason Liu
Felix Zhu
Eric Zhao
William Dodds
Neelay Velingker
Rajeev Alur
Mayur Naik
313
10
0
19 Dec 2024
LDC: Learning to Generate Research Idea with Dynamic Control
LDC: Learning to Generate Research Idea with Dynamic Control
Ruochen Li
Liqiang Jing
Chi Han
Jiawei Zhou
Xinya Du
LRM
304
13
0
19 Dec 2024
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented
  Generation for Preference Alignment
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Zhuoran Jin
Hongbang Yuan
Tianyi Men
Pengfei Cao
Yubo Chen
Kang Liu
Jun Zhao
ALM
441
22
0
18 Dec 2024
Context-DPO: Aligning Language Models for Context-Faithfulness
Context-DPO: Aligning Language Models for Context-FaithfulnessAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Baolong Bi
Shaohan Huang
Longji Xu
Tianchi Yang
Zihan Zhang
...
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
Shenghua Liu
312
33
0
18 Dec 2024
CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers
CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers
Dimitrios Mallis
Ahmet Serdar Karadeniz
Sebastian Cavada
Danila Rukhovich
Niki Maria Foteinopoulou
K. Cherenkova
Anis Kacem
Djamila Aouada
605
16
0
18 Dec 2024
EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents
EscapeBench: Towards Advancing Creative Intelligence of Language Model AgentsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Cheng Qian
Peixuan Han
Qinyu Luo
Bingxiang He
Xiusi Chen
...
Jiarui Yao
Xiaocheng Yang
Denghui Zhang
Yunzhu Li
Heng Ji
LLMAGLRM
520
3
0
18 Dec 2024
RareAgents: Autonomous Multi-disciplinary Team for Rare Disease Diagnosis and Treatment
RareAgents: Autonomous Multi-disciplinary Team for Rare Disease Diagnosis and Treatment
Xuanzhong Chen
Ye Jin
Xiaohao Mao
Lun Wang
Shuyang Zhang
Ting Chen
332
7
0
17 Dec 2024
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL
  Evaluation and LLM Enhancement
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
Junjie Lin
Jian Zhao
Lin Liu
Yue Deng
Youpeng Zhao
Lanxiao Huang
Xia Lin
Wengang Zhou
Haoyang Li
267
2
0
16 Dec 2024
Attention with Dependency Parsing Augmentation for Fine-Grained
  Attribution
Attention with Dependency Parsing Augmentation for Fine-Grained AttributionAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Qiang Ding
Lvzhou Luo
Yixuan Cao
Ping Luo
294
4
0
16 Dec 2024
Beyond the Binary: Capturing Diverse Preferences With Reward
  Regularization
Beyond the Binary: Capturing Diverse Preferences With Reward Regularization
Vishakh Padmakumar
Chuanyang Jin
Hannah Rose Kirk
He He
254
8
0
05 Dec 2024
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating
  RAG Systems
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems
Rafael Teixeira de Lima
Shubham Gupta
Cesar Berrospi
Lokesh Mishra
Michele Dolfi
Peter W. J. Staar
Panagiotis Vagenas
258
10
0
29 Nov 2024
Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation
Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented GenerationInternational Conference on Advanced Information Systems Engineering (CAiSE), 2024
Robin D. Pesl
Jerin G. Mathew
Massimo Mecella
Marco Aiello
272
6
0
29 Nov 2024
Automatic Evaluation for Text-to-image Generation: Task-decomposed
  Framework, Distilled Training, and Meta-evaluation Benchmark
Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation BenchmarkAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Rong-Cheng Tu
Zi-Ao Ma
Tian Lan
Yuehao Zhao
Heyan Huang
Xian-Ling Mao
MLLMVLMEGVM
362
11
0
23 Nov 2024
SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world
  Human-Machine Interactions
SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world Human-Machine Interactions
Yaqi Wang
Haipei Xu
LLMAG
235
1
0
21 Nov 2024
Value Imprint: A Technique for Auditing the Human Values Embedded in
  RLHF Datasets
Value Imprint: A Technique for Auditing the Human Values Embedded in RLHF DatasetsNeural Information Processing Systems (NeurIPS), 2024
Ike Obi
Rohan Pant
Srishti Shekhar Agrawal
Maham Ghazanfar
Aaron Basiletti
233
9
0
18 Nov 2024
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
Xinyan Guan
Yanjiang Liu
Xinyu Lu
Boxi Cao
Xianpei Han
...
Le Sun
Jie Lou
Bowen Yu
Yaojie Lu
Hongyu Lin
ALM
590
9
0
18 Nov 2024
Drowning in Documents: Consequences of Scaling Reranker Inference
Mathew Jacob
Erik Lindgren
Matei A. Zaharia
Michael Carbin
Omar Khattab
Andrew Drozdov
OffRL
565
7
0
18 Nov 2024
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer
  Use
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use
Siyuan Hu
Mingyu Ouyang
Difei Gao
Mike Zheng Shou
LM&RoLLMAG
215
45
0
15 Nov 2024
Approximated Variational Bayesian Inverse Reinforcement Learning for
  Large Language Model Alignment
Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model AlignmentAAAI Conference on Artificial Intelligence (AAAI), 2024
Yuang Cai
Yuyu Yuan
Jinsheng Shi
Qinhong Lin
246
4
0
14 Nov 2024
AssistRAG: Boosting the Potential of Large Language Models with an
  Intelligent Information Assistant
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information AssistantNeural Information Processing Systems (NeurIPS), 2024
Yujia Zhou
Zheng Liu
Zhicheng Dou
AIFinLRMRALM
162
5
0
11 Nov 2024
Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks
Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks
Adam Fourney
Gagan Bansal
Hussein Mozannar
Cheng Tan
Eduardo Salinas
...
Victor C. Dibia
Ahmed Hassan Awadallah
Ece Kamar
Rafah Hosn
Saleema Amershi
AI4CELRMLLMAG
358
132
0
07 Nov 2024
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
Chenlu Ye
Quanquan Gu
Tong Zhang
OffRL
558
13
0
07 Nov 2024
Long Context RAG Performance of Large Language Models
Long Context RAG Performance of Large Language Models
Quinn Leng
Jacob P. Portes
Sam Havens
Matei A. Zaharia
Michael Carbin
AIFinRALM3DV
274
25
0
05 Nov 2024
Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse
  Activation Control
Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation ControlNeural Information Processing Systems (NeurIPS), 2024
Yuxin Xiao
Chaoqun Wan
Yonggang Zhang
Wenxiao Wang
Binbin Lin
Xiaofei He
Xu Shen
Jieping Ye
187
3
0
04 Nov 2024
Foundations and Recent Trends in Multimodal Mobile Agents: A Survey
Foundations and Recent Trends in Multimodal Mobile Agents: A Survey
Biao Wu
Yanda Li
Meng Fang
Zirui Song
Zhiwei Zhang
Yunchao Wei
LM&RoLLMAGOffRLAI4TS
426
19
0
04 Nov 2024
Sample-Efficient Alignment for LLMs
Sample-Efficient Alignment for LLMs
Zichen Liu
Changyu Chen
Chao Du
Wee Sun Lee
Min Lin
286
10
0
03 Nov 2024
Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models
Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models
Aliyah R. Hsu
James Zhu
Zhichao Wang
Bin Bi
Shubham Mehrotra
...
Sougata Chaudhuri
Regunathan Radhakrishnan
S. Asur
Claire Na Cheng
Bin Yu
ALMLRM
695
1
0
03 Nov 2024
CORAG: A Cost-Constrained Retrieval Optimization System for
  Retrieval-Augmented Generation
CORAG: A Cost-Constrained Retrieval Optimization System for Retrieval-Augmented Generation
Liang Luo
Haitao Yuan
Wei Dong
Gao Cong
Feifei Li
3DV
205
7
0
01 Nov 2024
Attention Tracker: Detecting Prompt Injection Attacks in LLMs
Attention Tracker: Detecting Prompt Injection Attacks in LLMsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Kuo-Han Hung
Ching-Yun Ko
Ambrish Rawat
I-Hsin Chung
Winston H. Hsu
Pin-Yu Chen
411
54
0
01 Nov 2024
GPT for Games: An Updated Scoping Review (2020-2024)
GPT for Games: An Updated Scoping Review (2020-2024)IEEE Transactions on Games (IEEE Trans. Games), 2024
Daijin Yang
Erica Kleinman
Casper Harteveld
LLMAGAI4TSAI4CE
577
15
0
01 Nov 2024
Building Multi-Agent Copilot towards Autonomous Agricultural Data
  Management and Analysis
Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and AnalysisBigData Congress [Services Society] (BSS), 2024
Yu Pan
Jianxin Sun
Hongfeng Yu
Joe Luck
Geng Bai
Nipuna Chamara
Yufeng Ge
Tala Awada
254
4
0
31 Oct 2024
AndroidLab: Training and Systematic Benchmarking of Android Autonomous
  Agents
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Yifan Xu
Xiao Liu
Xingwu Sun
Siyi Cheng
Hao Yu
Hanyu Lai
Shudan Zhang
Dan Zhang
Jie Tang
Yuxiao Dong
LLMAG
296
50
0
31 Oct 2024
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models
Tanmay Parekh
Pradyot Prakash
Alexander Radovic
Akshay Shekher
Denis Savenkov
LRM
844
3
0
30 Oct 2024
A Perspective for Adapting Generalist AI to Specialized Medical AI
  Applications and Their Challenges
A Perspective for Adapting Generalist AI to Specialized Medical AI Applications and Their Challenges
Zhenting Wang
Hanyin Wang
Benjamin Danek
Ying Li
Christina Mack
Hoifung Poon
Y. Wang
Pranav Rajpurkar
Jimeng Sun
LM&MA
308
11
0
28 Oct 2024
AutoGLM: Autonomous Foundation Agents for GUIs
AutoGLM: Autonomous Foundation Agents for GUIs
Xiao Liu
Bo Qin
Dongzhu Liang
Guang Dong
Hanyu Lai
...
Yujia Wang
Yongjun Xu
Zehan Qi
Yuxiao Dong
Jie Tang
LLMAG
303
51
0
28 Oct 2024
Vision Search Assistant: Empower Vision-Language Models as Multimodal
  Search Engines
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Zhixin Zhang
Yiyuan Zhang
Xiaohan Ding
Xiangyu Yue
229
12
0
28 Oct 2024
Fast Best-of-N Decoding via Speculative Rejection
Fast Best-of-N Decoding via Speculative RejectionNeural Information Processing Systems (NeurIPS), 2024
Hanshi Sun
Momin Haider
Ruiqi Zhang
Huitao Yang
Jiahao Qiu
Ming Yin
Mengdi Wang
Peter L. Bartlett
Andrea Zanette
BDL
378
101
0
26 Oct 2024
FISHNET: Financial Intelligence from Sub-querying, Harmonizing,
  Neural-Conditioning, Expert Swarms, and Task Planning
FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task PlanningInternational Conference on AI in Finance (ICAF), 2024
Nicole Cho
Nishan Srishankar
Lucas Cecchi
William Watson
AIFin
221
5
0
25 Oct 2024
Previous
123...678...212223
Next
Page 7 of 23
Pageof 23