ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09332
  4. Cited By
WebGPT: Browser-assisted question-answering with human feedback
v1v2v3 (latest)

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
    ALMRALM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

23 / 1,123 papers shown
ASQA: Factoid Questions Meet Long-Form Answers
ASQA: Factoid Questions Meet Long-Form AnswersConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ivan Stelmakh
Yi Luan
Bhuwan Dhingra
Ming-Wei Chang
359
237
0
12 Apr 2022
Training a Helpful and Harmless Assistant with Reinforcement Learning
  from Human Feedback
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Yuntao Bai
Andy Jones
Kamal Ndousse
Amanda Askell
Anna Chen
...
Jack Clark
Sam McCandlish
C. Olah
Benjamin Mann
Jared Kaplan
960
3,520
0
12 Apr 2022
Single-Turn Debate Does Not Help Humans Answer Hard
  Reading-Comprehension Questions
Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions
Alicia Parrish
H. Trivedi
Ethan Perez
Angelica Chen
Nikita Nangia
Jason Phang
Sam Bowman
212
19
0
11 Apr 2022
Language Models that Seek for Knowledge: Modular Search & Generation for
  Dialogue and Prompt Completion
Language Models that Seek for Knowledge: Modular Search & Generation for Dialogue and Prompt CompletionConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Kurt Shuster
M. Komeili
Leonard Adolphs
Stephen Roller
Arthur Szlam
Jason Weston
KELM
232
143
0
24 Mar 2022
Teaching language models to support answers with verified quotes
Teaching language models to support answers with verified quotes
Jacob Menick
Maja Trebacz
Vladimir Mikulik
John Aslanides
Francis Song
...
Mia Glaese
Susannah Young
Lucy Campbell-Gillingham
G. Irving
Nat McAleese
ELMRALM
530
308
0
21 Mar 2022
How Do We Answer Complex Questions: Discourse Structure of Long-form
  Answers
How Do We Answer Complex Questions: Discourse Structure of Long-form AnswersAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Fangyuan Xu
Junyi Jessy Li
Eunsol Choi
189
22
0
21 Mar 2022
Internet-augmented language models through few-shot prompting for
  open-domain question answering
Internet-augmented language models through few-shot prompting for open-domain question answering
Angeliki Lazaridou
E. Gribovskaya
Wojciech Stokowiec
N. Grigorev
KELMLRM
244
159
0
10 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedbackNeural Information Processing Systems (NeurIPS), 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLMALM
2.1K
17,754
0
04 Mar 2022
Read before Generate! Faithful Long Form Question Answering with Machine
  Reading
Read before Generate! Faithful Long Form Question Answering with Machine ReadingFindings (Findings), 2022
Jane Polak Scowcroft
Xiaoguang Li
Jindi Zhang
Lifeng Shang
Xin Jiang
Qun Liu
Pascale Fung
HILM
216
72
0
01 Mar 2022
From Natural Language to Simulations: Applying GPT-3 Codex to Automate
  Simulation Modeling of Logistics Systems
From Natural Language to Simulations: Applying GPT-3 Codex to Automate Simulation Modeling of Logistics SystemsSocial Science Research Network (SSRN), 2022
I. Jackson
M. J. Sáenz
182
10
0
24 Feb 2022
Do Transformers know symbolic rules, and would we know if they did?
Do Transformers know symbolic rules, and would we know if they did?
Tommi Gröndahl
Yu-Wen Guo
Nirmal Asokan
420
0
0
19 Feb 2022
A data-driven approach for learning to control computers
A data-driven approach for learning to control computersInternational Conference on Machine Learning (ICML), 2022
Peter C. Humphreys
David Raposo
Tobias Pohlen
Gregory Thornton
Rachita Chhaparia
...
Josh Abramson
Petko Georgiev
Alex Goldin
Adam Santoro
Timothy Lillicrap
334
115
0
16 Feb 2022
Transformer Memory as a Differentiable Search Index
Transformer Memory as a Differentiable Search IndexNeural Information Processing Systems (NeurIPS), 2022
Yi Tay
Vinh Q. Tran
Mostafa Dehghani
Jianmo Ni
Dara Bahri
...
Zhe Zhao
Jai Gupta
Tal Schuster
William W. Cohen
Donald Metzler
434
368
0
14 Feb 2022
Survey of Hallucination in Natural Language Generation
Survey of Hallucination in Natural Language GenerationACM Computing Surveys (ACM CSUR), 2022
Ziwei Ji
Nayeon Lee
Rita Frieske
Tiezheng Yu
D. Su
...
Delong Chen
Wenliang Dai
Ho Shu Chan
Andrea Madotto
Pascale Fung
HILMLRM
915
3,544
0
08 Feb 2022
Describing Differences between Text Distributions with Natural Language
Describing Differences between Text Distributions with Natural LanguageInternational Conference on Machine Learning (ICML), 2022
Ruiqi Zhong
Charles Burton Snell
Dan Klein
Jacob Steinhardt
VLM
300
55
0
28 Jan 2022
LaMDA: Language Models for Dialog Applications
LaMDA: Language Models for Dialog Applications
R. Thoppilan
Daniel De Freitas
Jamie Hall
Noam M. Shazeer
Apoorv Kulshreshtha
...
Blaise Aguera-Arcas
Claire Cui
M. Croak
Ed H. Chi
Quoc Le
ALM
406
1,799
0
20 Jan 2022
Pretrained Language Models for Text Generation: A Survey
Pretrained Language Models for Text Generation: A SurveyACM Computing Surveys (ACM CSUR), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
525
268
0
14 Jan 2022
Open Domain Question Answering with A Unified Knowledge Interface
Open Domain Question Answering with A Unified Knowledge Interface
Kaixin Ma
Hao Cheng
Xiaodong Liu
Eric Nyberg
Jianfeng Gao
RALM
308
45
0
16 Oct 2021
TruthfulQA: Measuring How Models Mimic Human Falsehoods
TruthfulQA: Measuring How Models Mimic Human FalsehoodsAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Stephanie C. Lin
Jacob Hilton
Owain Evans
HILM
1.6K
2,728
0
08 Sep 2021
Boosting Search Engines with Interactive Agents
Boosting Search Engines with Interactive Agents
Leonard Adolphs
Benjamin Boerschinger
Christian Buck
Michelle Chen Huebscher
Massimiliano Ciaramita
...
Thomas Hofmann
Yannic Kilcher
Sascha Rothe
Pier Giuseppe Sessa
Lierni Sestorain Saralegui
LLMAG
332
24
0
01 Sep 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Offline Meta-Reinforcement Learning with Online Self-SupervisionInternational Conference on Machine Learning (ICML), 2021
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
374
77
0
08 Jul 2021
Uni-Encoder: A Fast and Accurate Response Selection Paradigm for
  Generation-Based Dialogue Systems
Uni-Encoder: A Fast and Accurate Response Selection Paradigm for Generation-Based Dialogue SystemsAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Chiyu Song
Hongliang He
Haofei Yu
Pengfei Fang
Leyang Cui
Zhenzhong Lan
245
9
0
02 Jun 2021
Text as Environment: A Deep Reinforcement Learning Text Readability
  Assessment Model
Text as Environment: A Deep Reinforcement Learning Text Readability Assessment Model
Hamid Reza Mohammadi
S. H. Khasteh
Tahereh Firoozi
Taha Samavati
271
21
0
12 Dec 2019
Previous
123...212223
Page 23 of 23
Pageof 23