ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09332
  4. Cited By
WebGPT: Browser-assisted question-answering with human feedback
v1v2v3 (latest)

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
    ALMRALM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 1,126 papers shown
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?
Ori Yoran
S. Amouyal
Chaitanya Malaviya
Ben Bogin
Ofir Press
Jonathan Berant
LLMAG
380
87
0
22 Jul 2024
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by
  Direct Preference Optimization
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization
Md Sultan al Nahian
R. Kavuluru
MedImAI4CE
185
0
0
19 Jul 2024
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Peng Xu
Ming-Yu Liu
Xianchao Wu
Zihan Liu
Mohammad Shoeybi
Mohammad Shoeybi
Bryan Catanzaro
RALM
553
39
0
19 Jul 2024
Learning Goal-Conditioned Representations for Language Reward Models
Learning Goal-Conditioned Representations for Language Reward Models
Vaskar Nath
Dylan Slack
Jeff Da
Yuntao Ma
Hugh Zhang
Spencer Whitehead
Sean Hendryx
194
1
0
18 Jul 2024
Agent-E: From Autonomous Web Navigation to Foundational Design
  Principles in Agentic Systems
Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems
Tamer Abuelsaad
Deepak Akkil
Prasenjit Dey
Ashish Jagmohan
Aditya Vempaty
Ravi Kokku
350
60
0
17 Jul 2024
Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
To Eun Kim
Alireza Salemi
Andrew Drozdov
Fernando Diaz
Hamed Zamani
414
11
0
17 Jul 2024
How Are LLMs Mitigating Stereotyping Harms? Learning from Search Engine
  Studies
How Are LLMs Mitigating Stereotyping Harms? Learning from Search Engine Studies
Alina Leidinger
Richard Rogers
401
20
0
16 Jul 2024
Localizing and Mitigating Errors in Long-form Question Answering
Localizing and Mitigating Errors in Long-form Question Answering
Rachneet Sachdeva
Yixiao Song
Mohit Iyyer
Iryna Gurevych
HILM
362
0
0
16 Jul 2024
Sibyl: Simple yet Effective Agent Framework for Complex Real-world
  Reasoning
Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning
Yulong Wang
Tianhao Shen
Lifeng Liu
Jian Xie
LLMAGLRM
308
17
0
15 Jul 2024
Fine-grained Analysis of In-context Linear Estimation: Data,
  Architecture, and Beyond
Fine-grained Analysis of In-context Linear Estimation: Data, Architecture, and Beyond
Yingcong Li
A. S. Rawat
Samet Oymak
253
19
0
13 Jul 2024
A Survey on Symbolic Knowledge Distillation of Large Language Models
A Survey on Symbolic Knowledge Distillation of Large Language Models
Kamal Acharya
Alvaro Velasquez
Haoze Song
SyDa
326
31
0
12 Jul 2024
Large Language Models as Biomedical Hypothesis Generators: A
  Comprehensive Evaluation
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Biqing Qi
Kaiyan Zhang
Kai Tian
Haoxiang Li
Zhang-Ren Chen
Sihang Zeng
Ermo Hua
Hu Jinfang
Bowen Zhou
LM&MA
406
35
0
12 Jul 2024
Internet of Agents: Weaving a Web of Heterogeneous Agents for
  Collaborative Intelligence
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Weize Chen
Ziming You
Ran Li
Yitong Guan
Chen Qian
Chenyang Zhao
Cheng Yang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
LLMAG
352
73
0
09 Jul 2024
It Cannot Be Right If It Was Written by AI: On Lawyers' Preferences of
  Documents Perceived as Authored by an LLM vs a Human
It Cannot Be Right If It Was Written by AI: On Lawyers' Preferences of Documents Perceived as Authored by an LLM vs a Human
Jakub Harasta
Tereza Novotná
Jaromír Šavelka
ELM
319
18
0
09 Jul 2024
Variational Best-of-N Alignment
Variational Best-of-N Alignment
Afra Amini
Tim Vieira
Robert Bamler
Ryan Cotterell
BDL
520
41
0
08 Jul 2024
Orchestrating LLMs with Different Personalizations
Orchestrating LLMs with Different Personalizations
Jin Peng Zhou
Katie Z Luo
Jingwen Gu
Jason Yuan
Kilian Q. Weinberger
Wen Sun
145
5
0
04 Jul 2024
RLHF Can Speak Many Languages: Unlocking Multilingual Preference
  Optimization for LLMs
RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs
John Dang
Arash Ahmadian
Kelly Marchisio
Julia Kreutzer
Ahmet Üstün
Sara Hooker
312
43
0
02 Jul 2024
Concise and Precise Context Compression for Tool-Using Language Models
Concise and Precise Context Compression for Tool-Using Language Models
Yang Xu
Yunlong Feng
Honglin Mu
Yutai Hou
Yitong Li
...
Zhongyang Li
Dandan Tu
Qingfu Zhu
Hao Fei
Wanxiang Che
LLMAG
215
9
0
02 Jul 2024
LogEval: A Comprehensive Benchmark Suite for Large Language Models In
  Log Analysis
LogEval: A Comprehensive Benchmark Suite for Large Language Models In Log Analysis
Tianyu Cui
Shiyu Ma
Ziang Chen
Tong Xiao
Shimin Tao
...
Changchang Liu
Yuzhe Cai
Weibin Meng
Yongqian Sun
Dan Pei
ELM
199
17
0
02 Jul 2024
DogeRM: Equipping Reward Models with Domain Knowledge through Model
  Merging
DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging
Tzu-Han Lin
Chen-An Li
Hung-yi Lee
Yun-Nung Chen
VLMALM
165
6
0
01 Jul 2024
$\text{Memory}^3$: Language Modeling with Explicit Memory
Memory3\text{Memory}^3Memory3: Language Modeling with Explicit Memory
Hongkang Yang
Peng Liu
Wenjin Wang
Huayi Lai
Zhiyu Li
...
Yu Yu
Kai Chen
Feiyu Xiong
Linpeng Tang
Weinan E
263
36
0
01 Jul 2024
ProductAgent: Benchmarking Conversational Product Search Agent with
  Asking Clarification Questions
ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions
Jingheng Ye
Yong Jiang
Xiaobin Wang
Hai-Tao Zheng
Yangning Li
Hai-Tao Zheng
Pengjun Xie
Fei Huang
249
6
0
01 Jul 2024
Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation
Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation
Sirui Xia
Xintao Wang
Jiaqing Liang
Yifei Zhang
Weikang Zhou
Jiaji Deng
Fei Yu
Yanghua Xiao
RALM
494
15
0
01 Jul 2024
Advancing Process Verification for Large Language Models via Tree-Based
  Preference Learning
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning
Mingqian He
Yongliang Shen
Wenqi Zhang
Zeqi Tan
Weiming Lu
LRM
235
13
0
29 Jun 2024
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Sujan Dutta
Sayantan Mahinder
R. Anantha
Bortik Bandyopadhyay
ALM
260
13
0
28 Jun 2024
Scalable and Domain-General Abstractive Proposition Segmentation
Scalable and Domain-General Abstractive Proposition Segmentation
Mohammad Javad Hosseini
Yang Gao
Tim Baumgärtner
Alex Fabrikant
Reinald Kim Amplayo
202
2
0
28 Jun 2024
Lifelong Robot Library Learning: Bootstrapping Composable and
  Generalizable Skills for Embodied Control with Language Models
Lifelong Robot Library Learning: Bootstrapping Composable and Generalizable Skills for Embodied Control with Language Models
Georgios Tziafas
Hamidreza Kasaei
KELMLM&Ro
356
14
0
26 Jun 2024
Not All Preference Pairs Are Created Equal: A Recipe for
  Annotation-Efficient Iterative Preference Learning
Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning
Sen Yang
Leyang Cui
Deng Cai
Xinting Huang
Shuming Shi
Wai Lam
224
10
0
25 Jun 2024
Reinforcement Learning via Auxiliary Task Distillation
Reinforcement Learning via Auxiliary Task Distillation
Abhinav Harish
Larry Heck
Josiah P. Hanna
Z. Kira
Andrew Szot
289
1
0
24 Jun 2024
Towards Comprehensive Preference Data Collection for Reward Modeling
Towards Comprehensive Preference Data Collection for Reward Modeling
Yulan Hu
Qingyang Li
Sheng Ouyang
Ge Chen
Kaihui Chen
Lijun Mei
Xucheng Ye
Fuzheng Zhang
Yong Liu
SyDa
821
4
0
24 Jun 2024
Cascade Reward Sampling for Efficient Decoding-Time Alignment
Cascade Reward Sampling for Efficient Decoding-Time Alignment
Bolian Li
Yifan Wang
Anamika Lochab
A. Grama
Ruqi Zhang
AI4TS
639
31
0
24 Jun 2024
LOGIC-LM++: Multi-Step Refinement for Symbolic Formulations
LOGIC-LM++: Multi-Step Refinement for Symbolic Formulations
Shashank Kirtania
Priyanshu Gupta
Arjun Radhakirshna
LRM
350
18
0
22 Jun 2024
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models
Xinrong Zhang
Yingfa Chen
Shengding Hu
Xu Han
Zihang Xu
Yuanwei Xu
Weilin Zhao
Maosong Sun
Zhiyuan Liu
227
20
0
22 Jun 2024
A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student
  Feedback to Make Mnemonic Learning Stick
A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick
Nishant Balepur
Matthew Shu
Alexander Hoyle
Alison Robey
Shi Feng
Seraphina Goldfarb-Tarrant
Jordan Boyd-Graber
222
9
0
21 Jun 2024
Hybrid Alignment Training for Large Language Models
Hybrid Alignment Training for Large Language Models
Chenglong Wang
Hang Zhou
Kaiyan Chang
Bei Li
Yongyu Mu
Tong Xiao
Tongran Liu
Jingbo Zhu
298
10
0
21 Jun 2024
GraphReader: Building Graph-based Agent to Enhance Long-Context
  Abilities of Large Language Models
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
Shilong Li
Yancheng He
Hangyu Guo
Xingyuan Bu
Ge Bai
...
Xingwei Qu
Yangguang Li
Wanli Ouyang
Yuchi Xu
Bo Zheng
RALMLLMAG
267
41
0
20 Jun 2024
FoRAG: Factuality-optimized Retrieval Augmented Generation for
  Web-enhanced Long-form Question Answering
FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering
Tianchi Cai
Zhiwen Tan
Xierui Song
Tao Sun
Jiyan Jiang
Yunqi Xu
Yinger Zhang
Jinjie Gu
321
20
0
19 Jun 2024
Model Internals-based Answer Attribution for Trustworthy
  Retrieval-Augmented Generation
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jirui Qi
Gabriele Sarti
Raquel Fernández
Arianna Bisazza
RALM
292
16
0
19 Jun 2024
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for
  LLM Agents
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM AgentsNeural Information Processing Systems (NeurIPS), 2024
Edoardo Debenedetti
Jie Zhang
Mislav Balunović
Luca Beurer-Kellner
Marc Fischer
Florian Tramèr
LLMAGAAML
457
94
1
19 Jun 2024
APPL: A Prompt Programming Language for Harmonious Integration of
  Programs and Large Language Model Prompts
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model PromptsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Honghua Dong
Qidong Su
Yubo Gao
Zhaoyu Li
Yangjun Ruan
Gennady Pekhimenko
Chris J. Maddison
Xujie Si
LLMAG
174
3
0
19 Jun 2024
Learning to Generate Answers with Citations via Factual Consistency
  Models
Learning to Generate Answers with Citations via Factual Consistency ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Rami Aly
Zhiqiang Tang
Samson Tan
George Karypis
HILM
280
12
0
19 Jun 2024
Think-then-Act: A Dual-Angle Evaluated Retrieval-Augmented Generation
Think-then-Act: A Dual-Angle Evaluated Retrieval-Augmented Generation
Yige Shen
Hao Jiang
Hua Qu
Jihong Zhao
RALMLRM
241
1
0
18 Jun 2024
LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document
  Summarization
LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document Summarization
Masafumi Enomoto
Kunihiro Takeoka
Kosuke Akimoto
Kiril Gashteovski
Masafumi Oyamada
RALM
183
1
0
18 Jun 2024
WebCanvas: Benchmarking Web Agents in Online Environments
WebCanvas: Benchmarking Web Agents in Online Environments
Yichen Pan
Dehan Kong
Sida Zhou
Cheng Cui
Yifei Leng
...
Hangyu Liu
Yanyi Shang
Shuyan Zhou
Tongshuang Wu
Zhengyang Wu
416
83
0
18 Jun 2024
On the Exponential Convergence for Offline RLHF with Pairwise Comparisons
On the Exponential Convergence for Offline RLHF with Pairwise Comparisons
Zhirui Chen
Vincent Y. F. Tan
OffRL
257
1
0
18 Jun 2024
Satyrn: A Platform for Analytics Augmented Generation
Satyrn: A Platform for Analytics Augmented Generation
Marko Sterbentz
Cameron Barrie
Shubham Shahi
Abhratanu Dutta
Donna Hooshmand
Harper Pack
Kristian J. Hammond
255
1
0
17 Jun 2024
Dialogue Action Tokens: Steering Language Models in Goal-Directed
  Dialogue with a Multi-Turn Planner
Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
Kenneth Li
Yiming Wang
Fernanda Viégas
Martin Wattenberg
349
10
0
17 Jun 2024
KAOS: Large Model Multi-Agent Operating System
KAOS: Large Model Multi-Agent Operating System
Zhao Zhuo
Rongzhen Li
Kai Liu
Huhai Zou
KaiMao Li
Jie Yu
Tianhao Sun
Qingbo Wu
VLMLLMAG
359
4
0
17 Jun 2024
Small Agent Can Also Rock! Empowering Small Language Models as
  Hallucination Detector
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Xiaoxue Cheng
Junyi Li
Wayne Xin Zhao
Hongzhi Zhang
Fuzheng Zhang
Di Zhang
Kun Gai
Ji-Rong Wen
HILMLLMAG
236
25
0
17 Jun 2024
A Survey on Human Preference Learning for Large Language Models
A Survey on Human Preference Learning for Large Language Models
Ruili Jiang
Kehai Chen
Xuefeng Bai
Zhixuan He
Juntao Li
Muyun Yang
Tiejun Zhao
Liqiang Nie
Min Zhang
332
19
0
17 Jun 2024
Previous
123...91011...212223
Next
Page 10 of 23
Pageof 23