WebGPT: Browser-assisted question-answering with human feedback

17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Long Ouyang
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM RALM

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 1,125 papers shown
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?
Ori Yoran
S. Amouyal
Chaitanya Malaviya
Ben Bogin
Ofir Press
Jonathan Berant
LLMAG
361
75
0
22 Jul 2024
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization
Md Sultan al Nahian
R. Kavuluru
MedIm AI4CE
164
0
0
19 Jul 2024
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Peng Xu
Ming-Yu Liu
Xianchao Wu
Zihan Liu
Mohammad Shoeybi
Bryan Catanzaro
RALM
492
35
0
19 Jul 2024
Learning Goal-Conditioned Representations for Language Reward Models
Vaskar Nath
Dylan Slack
Jeff Da
Yuntao Ma
Hugh Zhang
Spencer Whitehead
Sean Hendryx
187
1
0
18 Jul 2024
Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems
Tamer Abuelsaad
Deepak Akkil
Prasenjit Dey
Ashish Jagmohan
Aditya Vempaty
Ravi Kokku
315
53
0
17 Jul 2024
Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
To Eun Kim
Alireza Salemi
Andrew Drozdov
Fernando Diaz
Hamed Zamani
376
11
0
17 Jul 2024
How Are LLMs Mitigating Stereotyping Harms? Learning from Search Engine Studies
Alina Leidinger
Richard Rogers
391
19
0
16 Jul 2024
Localizing and Mitigating Errors in Long-form Question Answering
Rachneet Sachdeva
Yixiao Song
Mohit Iyyer
Iryna Gurevych
HILM
320
0
0
16 Jul 2024
Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning
Yulong Wang
Tianhao Shen
Lifeng Liu
Jian Xie
LLMAG LRM
292
16
0
15 Jul 2024
Fine-grained Analysis of In-context Linear Estimation: Data, Architecture, and Beyond
Yingcong Li
A. S. Rawat
Samet Oymak
245
16
0
13 Jul 2024
A Survey on Symbolic Knowledge Distillation of Large Language Models
Kamal Acharya
Alvaro Velasquez
Haoze Song
SyDa
288
26
0
12 Jul 2024
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Biqing Qi
Kaiyan Zhang
Kai Tian
Haoxiang Li
Zhang-Ren Chen
Sihang Zeng
Ermo Hua
Hu Jinfang
Bowen Zhou
LM&MA
393
33
0
12 Jul 2024
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Weize Chen
Ziming You
Ran Li
Yitong Guan
Chen Qian
Chenyang Zhao
Cheng Yang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
LLMAG
335
67
0
09 Jul 2024
It Cannot Be Right If It Was Written by AI: On Lawyers' Preferences of Documents Perceived as Authored by an LLM vs a Human
Jakub Harasta
Tereza Novotná
Jaromír Šavelka
ELM
306
17
0
09 Jul 2024
Variational Best-of-N Alignment
Afra Amini
Tim Vieira
Robert Bamler
Ryan Cotterell
BDL
492
37
0
08 Jul 2024
Orchestrating LLMs with Different Personalizations
Jin Peng Zhou
Katie Z Luo
Jingwen Gu
Jason Yuan
Kilian Q. Weinberger
Wen Sun
137
4
0
04 Jul 2024
RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs
John Dang
Arash Ahmadian
Kelly Marchisio
Julia Kreutzer
Ahmet Üstün
Sara Hooker
249
43
0
02 Jul 2024
Concise and Precise Context Compression for Tool-Using Language Models
Yang Xu
Yunlong Feng
Honglin Mu
Yutai Hou
Yitong Li
...
Zhongyang Li
Dandan Tu
Qingfu Zhu
Hao Fei
Wanxiang Che
LLMAG
176
9
0
02 Jul 2024
LogEval: A Comprehensive Benchmark Suite for Large Language Models In Log Analysis
Tianyu Cui
Shiyu Ma
Ziang Chen
Tong Xiao
Shimin Tao
...
Changchang Liu
Yuzhe Cai
Weibin Meng
Yongqian Sun
Dan Pei
ELM
193
17
0
02 Jul 2024
DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging
Tzu-Han Lin
Chen-An Li
Hung-yi Lee
Yun-Nung Chen
VLM ALM
139
6
0
01 Jul 2024
$\text{Memory}^3$: Language Modeling with Explicit Memory
Hongkang Yang
Peng Liu
Wenjin Wang
Huayi Lai
Zhiyu Li
...
Yu Yu
Kai Chen
Feiyu Xiong
Linpeng Tang
Weinan E
238
33
0
01 Jul 2024
ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions
Jingheng Ye
Yong Jiang
Xiaobin Wang
Hai-Tao Zheng
Yangning Li
Hai-Tao Zheng
Pengjun Xie
Fei Huang
241
5
0
01 Jul 2024
Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation
Sirui Xia
Xintao Wang
Jiaqing Liang
Yifei Zhang
Weikang Zhou
Jiaji Deng
Fei Yu
Yanghua Xiao
RALM
441
15
0
01 Jul 2024
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning
Mingqian He
Yongliang Shen
Wenqi Zhang
Zeqi Tan
Weiming Lu
LRM
226
13
0
29 Jun 2024
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Sujan Dutta
Sayantan Mahinder
R. Anantha
Bortik Bandyopadhyay
ALM
212
12
0
28 Jun 2024
Scalable and Domain-General Abstractive Proposition Segmentation
Mohammad Javad Hosseini
Yang Gao
Tim Baumgärtner
Alex Fabrikant
Reinald Kim Amplayo
193
2
0
28 Jun 2024
Lifelong Robot Library Learning: Bootstrapping Composable and Generalizable Skills for Embodied Control with Language Models
Georgios Tziafas
Hamidreza Kasaei
KELM LM&Ro
327
14
0
26 Jun 2024
Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning
Sen Yang
Leyang Cui
Deng Cai
Xinting Huang
Shuming Shi
Wai Lam
208
10
0
25 Jun 2024
Reinforcement Learning via Auxiliary Task Distillation
Abhinav Harish
Larry Heck
Josiah P. Hanna
Z. Kira
Andrew Szot
256
1
0
24 Jun 2024
Towards Comprehensive Preference Data Collection for Reward Modeling
Yulan Hu
Qingyang Li
Sheng Ouyang
Ge Chen
Kaihui Chen
Lijun Mei
Xucheng Ye
Fuzheng Zhang
Yong Liu
SyDa
704
4
0
24 Jun 2024
Cascade Reward Sampling for Efficient Decoding-Time Alignment
Bolian Li
Yifan Wang
Anamika Lochab
A. Grama
Ruqi Zhang
AI4TS
591
29
0
24 Jun 2024
LOGIC-LM++: Multi-Step Refinement for Symbolic Formulations
Shashank Kirtania
Priyanshu Gupta
Arjun Radhakrishna
LRM
331
17
0
22 Jun 2024
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models
Xinrong Zhang
Yingfa Chen
Shengding Hu
Xu Han
Zihang Xu
Yuanwei Xu
Weilin Zhao
Maosong Sun
Zhiyuan Liu
210
20
0
22 Jun 2024
A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick
Nishant Balepur
Matthew Shu
Alexander Hoyle
Alison Robey
Shi Feng
Seraphina Goldfarb-Tarrant
Jordan Boyd-Graber
218
8
0
21 Jun 2024
Hybrid Alignment Training for Large Language Models
Chenglong Wang
Hang Zhou
Kaiyan Chang
Bei Li
Yongyu Mu
Tong Xiao
Tongran Liu
Jingbo Zhu
253
10
0
21 Jun 2024
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
Shilong Li
Yancheng He
Hangyu Guo
Xingyuan Bu
Ge Bai
...
Xingwei Qu
Yangguang Li
Wanli Ouyang
Yuchi Xu
Bo Zheng
RALM LLMAG
238
35
0
20 Jun 2024
FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering
Tianchi Cai
Zhiwen Tan
Xierui Song
Tao Sun
Jiyan Jiang
Yunqi Xu
Yinger Zhang
Jinjie Gu
297
18
0
19 Jun 2024
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jirui Qi
Gabriele Sarti
Raquel Fernández
Arianna Bisazza
RALM
287
14
0
19 Jun 2024
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents
Neural Information Processing Systems (NeurIPS), 2024
Edoardo Debenedetti
Jie Zhang
Mislav Balunović
Luca Beurer-Kellner
Marc Fischer
Florian Tramèr
LLMAG AAML
425
84
1
19 Jun 2024
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Honghua Dong
Qidong Su
Yubo Gao
Zhaoyu Li
Yangjun Ruan
Gennady Pekhimenko
Chris J. Maddison
Xujie Si
LLMAG
166
3
0
19 Jun 2024
Learning to Generate Answers with Citations via Factual Consistency Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Rami Aly
Zhiqiang Tang
Samson Tan
George Karypis
HILM
272
11
0
19 Jun 2024
Think-then-Act: A Dual-Angle Evaluated Retrieval-Augmented Generation
Yige Shen
Hao Jiang
Hua Qu
Jihong Zhao
RALM LRM
228
1
0
18 Jun 2024
LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document Summarization
Masafumi Enomoto
Kunihiro Takeoka
Kosuke Akimoto
Kiril Gashteovski
Masafumi Oyamada
RALM
170
1
0
18 Jun 2024
WebCanvas: Benchmarking Web Agents in Online Environments
Yichen Pan
Dehan Kong
Sida Zhou
Cheng Cui
Yifei Leng
...
Hangyu Liu
Yanyi Shang
Shuyan Zhou
Tongshuang Wu
Zhengyang Wu
404
72
0
18 Jun 2024
On the Exponential Convergence for Offline RLHF with Pairwise Comparisons
Zhirui Chen
Vincent Y. F. Tan
OffRL
244
1
0
18 Jun 2024
Satyrn: A Platform for Analytics Augmented Generation
Marko Sterbentz
Cameron Barrie
Shubham Shahi
Abhratanu Dutta
Donna Hooshmand
Harper Pack
Kristian J. Hammond
182
1
0
17 Jun 2024
Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
Kenneth Li
Yiming Wang
Fernanda Viégas
Martin Wattenberg
279
10
0
17 Jun 2024
KAOS: Large Model Multi-Agent Operating System
Zhao Zhuo
Rongzhen Li
Kai Liu
Huhai Zou
KaiMao Li
Jie Yu
Tianhao Sun
Qingbo Wu
VLM LLMAG
298
4
0
17 Jun 2024
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Xiaoxue Cheng
Junyi Li
Wayne Xin Zhao
Hongzhi Zhang
Fuzheng Zhang
Di Zhang
Kun Gai
Ji-Rong Wen
HILM LLMAG
228
21
0
17 Jun 2024
A Survey on Human Preference Learning for Large Language Models
Ruili Jiang
Kehai Chen
Xuefeng Bai
Zhixuan He
Juntao Li
Muyun Yang
Tiejun Zhao
Liqiang Nie
Min Zhang
285
16
0
17 Jun 2024