ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09332
  4. Cited By
WebGPT: Browser-assisted question-answering with human feedback
v1v2v3 (latest)

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
    ALMRALM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 1,123 papers shown
General Intelligence Requires Rethinking Exploration
General Intelligence Requires Rethinking ExplorationRoyal Society Open Science (RSOS), 2022
Minqi Jiang
Tim Rocktaschel
Edward Grefenstette
LRM
227
26
0
15 Nov 2022
Metaphors We Learn By
Metaphors We Learn By
Roland Memisevic
201
0
0
11 Nov 2022
The CRINGE Loss: Learning what language not to model
The CRINGE Loss: Learning what language not to modelAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Leonard Adolphs
Tianyu Gao
Jing Xu
Kurt Shuster
Sainbayar Sukhbaatar
Jason Weston
MU
238
40
0
10 Nov 2022
Active Example Selection for In-Context Learning
Active Example Selection for In-Context LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yiming Zhang
Shi Feng
Chenhao Tan
SILMLRM
329
251
0
08 Nov 2022
PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation
PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation
Siqi Bao
H. He
Jun Xu
Hua Lu
Fan Wang
Hua Wu
Han Zhou
Wenquan Wu
Zheng-Yu Niu
Haifeng Wang
126
4
0
02 Nov 2022
Learning to Navigate Wikipedia by Taking Random Walks
Learning to Navigate Wikipedia by Taking Random WalksNeural Information Processing Systems (NeurIPS), 2022
Manzil Zaheer
Kenneth Marino
Will Grathwohl
John Schultz
Wendy Shang
Sheila Babayan
Arun Ahuja
Ishita Dasgupta
Christine Kaeser-Chen
Rob Fergus
97
8
0
31 Oct 2022
When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad
  Responses into Good Labels
When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad Responses into Good LabelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Weiyan Shi
Emily Dinan
Kurt Shuster
Jason Weston
Jing Xu
200
22
0
28 Oct 2022
Decoding a Neural Retriever's Latent Space for Query Suggestion
Decoding a Neural Retriever's Latent Space for Query SuggestionConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Leonard Adolphs
Michelle Chen Huebscher
Christian Buck
Sertan Girgin
Olivier Bachem
Massimiliano Ciaramita
Thomas Hofmann
RALM
194
9
0
21 Oct 2022
Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension
  Questions
Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension Questions
Alicia Parrish
H. Trivedi
Nikita Nangia
Vishakh Padmakumar
Jason Phang
Amanpreet Singh Saimbhi
Sam Bowman
165
15
0
19 Oct 2022
Scaling Laws for Reward Model Overoptimization
Scaling Laws for Reward Model OveroptimizationInternational Conference on Machine Learning (ICML), 2022
Leo Gao
John Schulman
Jacob Hilton
ALM
383
776
0
19 Oct 2022
N-Best Hypotheses Reranking for Text-To-SQL Systems
N-Best Hypotheses Reranking for Text-To-SQL SystemsSpoken Language Technology Workshop (SLT), 2022
Lu Zeng
S. Parthasarathi
Dilek Z. Hakkani-Tür
204
27
0
19 Oct 2022
RARR: Researching and Revising What Language Models Say, Using Language
  Models
RARR: Researching and Revising What Language Models Say, Using Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Luyu Gao
Zhuyun Dai
Panupong Pasupat
Anthony Chen
Arun Tejasvi Chaganty
...
Vincent Zhao
Ni Lao
Hongrae Lee
Da-Cheng Juan
Kelvin Guu
HILMKELM
709
282
0
17 Oct 2022
Understanding HTML with Large Language Models
Understanding HTML with Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Izzeddin Gur
Ofir Nachum
Yingjie Miao
Mustafa Safdari
Austin Huang
Aakanksha Chowdhery
Sharan Narang
Noah Fiedel
Aleksandra Faust
AI4CE
501
83
0
08 Oct 2022
LLMEffiChecker: Understanding and Testing Efficiency Degradation of
  Large Language Models
LLMEffiChecker: Understanding and Testing Efficiency Degradation of Large Language ModelsACM Transactions on Software Engineering and Methodology (TOSEM), 2022
Simin Chen
Cong Liu
Mirazul Haque
Wei Yang
254
32
0
07 Oct 2022
Measuring and Narrowing the Compositionality Gap in Language Models
Measuring and Narrowing the Compositionality Gap in Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ofir Press
Muru Zhang
Sewon Min
Ludwig Schmidt
Noah A. Smith
M. Lewis
ReLMKELMLRM
719
947
0
07 Oct 2022
Rainier: Reinforced Knowledge Introspector for Commonsense Question
  Answering
Rainier: Reinforced Knowledge Introspector for Commonsense Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hamish Ivison
Skyler Hallinan
Ximing Lu
Pengfei He
Sean Welleck
Hannaneh Hajishirzi
Yejin Choi
RALM
247
63
0
06 Oct 2022
ReAct: Synergizing Reasoning and Acting in Language Models
ReAct: Synergizing Reasoning and Acting in Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAGReLMLRM
2.6K
5,358
0
06 Oct 2022
Ask Me Anything: A simple strategy for prompting language models
Ask Me Anything: A simple strategy for prompting language modelsInternational Conference on Learning Representations (ICLR), 2022
Simran Arora
A. Narayan
Mayee F. Chen
Laurel J. Orr
Neel Guha
Kush S. Bhatia
Ines Chami
Frederic Sala
Christopher Ré
ReLMLRM
640
254
0
05 Oct 2022
Is Reinforcement Learning (Not) for Natural Language Processing:
  Benchmarks, Baselines, and Building Blocks for Natural Language Policy
  Optimization
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy
Prithviraj Ammanabrolu
Kianté Brantley
Jack Hessel
R. Sifa
Christian Bauckhage
Hannaneh Hajishirzi
Yejin Choi
OffRL
568
279
0
03 Oct 2022
Zero-Shot Retrieval with Search Agents and Hybrid Environments
Zero-Shot Retrieval with Search Agents and Hybrid Environments
Michelle Chen Huebscher
Christian Buck
Massimiliano Ciaramita
S. Rothe
335
9
0
30 Sep 2022
Improving alignment of dialogue agents via targeted human judgements
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese
Nat McAleese
Maja Trkebacz
John Aslanides
Vlad Firoiu
...
John F. J. Mellor
Demis Hassabis
Koray Kavukcuoglu
Lisa Anne Hendricks
G. Irving
ALMAAML
535
636
0
28 Sep 2022
FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation
FiD-Light: Efficient and Effective Retrieval-Augmented Text GenerationAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022
Sebastian Hofstatter
Jiecao Chen
K. Raman
Hamed Zamani
RALM
587
105
0
28 Sep 2022
Defining and Characterizing Reward Hacking
Defining and Characterizing Reward Hacking
Joar Skalse
Nikolaus H. R. Howe
Dmitrii Krasheninnikov
David M. Krueger
390
92
0
27 Sep 2022
Exploring Effective Information Utilization in Multi-Turn Topic-Driven
  Conversations
Exploring Effective Information Utilization in Multi-Turn Topic-Driven Conversations
Jiatong Li
Bin He
Fei Mi
206
4
0
01 Sep 2022
Faithful Reasoning Using Large Language Models
Faithful Reasoning Using Large Language Models
Antonia Creswell
Murray Shanahan
ReLMLRM
193
141
0
30 Aug 2022
Towards Boosting the Open-Domain Chatbot with Human Feedback
Towards Boosting the Open-Domain Chatbot with Human FeedbackAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Hua Lu
Siqi Bao
H. He
Fan Wang
Hua Wu
Haifeng Wang
ALM
167
20
0
30 Aug 2022
PEER: A Collaborative Language Model
PEER: A Collaborative Language ModelInternational Conference on Learning Representations (ICLR), 2022
Timo Schick
Jane Dwivedi-Yu
Zhengbao Jiang
Fabio Petroni
Patrick Lewis
Gautier Izacard
Qingfei You
Christoforos Nalmpantis
Edouard Grave
Sebastian Riedel
ALM
265
104
0
24 Aug 2022
Learning New Skills after Deployment: Improving open-domain
  internet-driven dialogue with human feedback
Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedbackAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Jing Xu
Megan Ung
M. Komeili
Kushal Arora
Y-Lan Boureau
Jason Weston
208
43
0
05 Aug 2022
BlenderBot 3: a deployed conversational agent that continually learns to
  responsibly engage
BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Kurt Shuster
Jing Xu
M. Komeili
Da Ju
Eric Michael Smith
...
Naman Goyal
Arthur Szlam
Y-Lan Boureau
Melanie Kambadur
Jason Weston
LM&RoKELM
452
275
0
05 Aug 2022
Discrete Key-Value Bottleneck
Discrete Key-Value BottleneckInternational Conference on Machine Learning (ICML), 2022
Frederik Trauble
Anirudh Goyal
Nasim Rahaman
Michael C. Mozer
Kenji Kawaguchi
Yoshua Bengio
Bernhard Schölkopf
CLL
307
23
0
22 Jul 2022
Language Model Cascades
Language Model Cascades
David Dohan
Winnie Xu
Aitor Lewkowycz
Jacob Austin
David Bieber
...
Henryk Michalewski
Rif A. Saurous
Jascha Narain Sohl-Dickstein
Kevin Patrick Murphy
Charles Sutton
ReLMLRM
297
109
0
21 Jul 2022
Language Models (Mostly) Know What They Know
Language Models (Mostly) Know What They Know
Saurav Kadavath
Tom Conerly
Amanda Askell
T. Henighan
Dawn Drain
...
Nicholas Joseph
Benjamin Mann
Sam McCandlish
C. Olah
Jared Kaplan
ELM
638
1,139
0
11 Jul 2022
WebShop: Towards Scalable Real-World Web Interaction with Grounded
  Language Agents
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language AgentsNeural Information Processing Systems (NeurIPS), 2022
Shunyu Yao
Howard Chen
John Yang
Karthik Narasimhan
LLMAGLM&Ro
781
754
0
04 Jul 2022
INSCIT: Information-Seeking Conversations with Mixed-Initiative
  Interactions
INSCIT: Information-Seeking Conversations with Mixed-Initiative InteractionsTransactions of the Association for Computational Linguistics (TACL), 2022
Zeqiu Wu
Ryu Parish
Hao Cheng
Sewon Min
Prithviraj Ammanabrolu
Mari Ostendorf
Hannaneh Hajishirzi
165
22
0
02 Jul 2022
Forecasting Future World Events with Neural Networks
Forecasting Future World Events with Neural NetworksNeural Information Processing Systems (NeurIPS), 2022
Andy Zou
Tristan Xiao
Ryan Jia
Joe Kwon
Mantas Mazeika
Richard Li
Dawn Song
Jacob Steinhardt
Owain Evans
Dan Hendrycks
356
38
0
30 Jun 2022
Actionable Guidance for High-Consequence AI Risk Management: Towards
  Standards Addressing AI Catastrophic Risks
Actionable Guidance for High-Consequence AI Risk Management: Towards Standards Addressing AI Catastrophic Risks
Anthony M. Barrett
Dan Hendrycks
Jessica Newman
Brandie Nonnecke
SILM
149
19
0
17 Jun 2022
DIRECTOR: Generator-Classifiers For Supervised Language Modeling
DIRECTOR: Generator-Classifiers For Supervised Language Modeling
Kushal Arora
Kurt Shuster
Sainbayar Sukhbaatar
Jason Weston
VLM
253
44
0
15 Jun 2022
Quark: Controllable Text Generation with Reinforced Unlearning
Quark: Controllable Text Generation with Reinforced UnlearningNeural Information Processing Systems (NeurIPS), 2022
Ximing Lu
Sean Welleck
Jack Hessel
Liwei Jiang
Lianhui Qin
Peter West
Prithviraj Ammanabrolu
Yejin Choi
MU
457
254
0
26 May 2022
NaturalProver: Grounded Mathematical Proof Generation with Language
  Models
NaturalProver: Grounded Mathematical Proof Generation with Language ModelsNeural Information Processing Systems (NeurIPS), 2022
Sean Welleck
Hamish Ivison
Ximing Lu
Hannaneh Hajishirzi
Yejin Choi
AIMatLRM
282
90
0
25 May 2022
RankGen: Improving Text Generation with Large Ranking Models
RankGen: Improving Text Generation with Large Ranking ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Kalpesh Krishna
Yapei Chang
John Wieting
Mohit Iyyer
AIMat
335
79
0
19 May 2022
Selection-Inference: Exploiting Large Language Models for Interpretable
  Logical Reasoning
Selection-Inference: Exploiting Large Language Models for Interpretable Logical ReasoningInternational Conference on Learning Representations (ICLR), 2022
Antonia Creswell
Murray Shanahan
I. Higgins
ReLMLRM
309
432
0
19 May 2022
Modeling Exemplification in Long-form Question Answering via Retrieval
Modeling Exemplification in Long-form Question Answering via RetrievalNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Shufan Wang
Fangyuan Xu
Laure Thompson
Eunsol Choi
Mohit Iyyer
130
11
0
19 May 2022
Dialog Inpainting: Turning Documents into Dialogs
Dialog Inpainting: Turning Documents into DialogsInternational Conference on Machine Learning (ICML), 2022
Zhuyun Dai
Arun Tejasvi Chaganty
Vincent Zhao
Aida Amini
Q. Rashid
Mike Green
Kelvin Guu
272
76
0
18 May 2022
A Generalist Agent
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&RoLLMAGAI4CE
450
977
0
12 May 2022
Asking for Knowledge: Training RL Agents to Query External Knowledge
  Using Language
Asking for Knowledge: Training RL Agents to Query External Knowledge Using LanguageInternational Conference on Machine Learning (ICML), 2022
Iou-Jen Liu
Xingdi Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
Alex Schwing
RALM
248
13
0
12 May 2022
ISA-bEL: Intelligent Search Algorithm based on Entity Linking
ISA-bEL: Intelligent Search Algorithm based on Entity Linking
Rubén González Sendino
Mónica Ortega
Carlos Carrasco
81
1
0
09 May 2022
OPT: Open Pre-trained Transformer Language Models
OPT: Open Pre-trained Transformer Language Models
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
...
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLMOSLMAI4CE
868
4,400
0
02 May 2022
Training Language Models with Language Feedback
Training Language Models with Language Feedback
Jérémy Scheurer
Jon Ander Campos
Jun Shern Chan
Angelica Chen
Dong Wang
Ethan Perez
ALM
507
55
0
29 Apr 2022
Which Discriminator for Cooperative Text Generation?
Which Discriminator for Cooperative Text Generation?Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022
Antoine Chaffin
Thomas Scialom
Sylvain Lamprier
Jacopo Staiano
Benjamin Piwowarski
Ewa Kijak
Vincent Claveau
172
4
0
25 Apr 2022
Autoregressive Search Engines: Generating Substrings as Document
  Identifiers
Autoregressive Search Engines: Generating Substrings as Document IdentifiersNeural Information Processing Systems (NeurIPS), 2022
Michele Bevilacqua
G. Ottaviano
Patrick Lewis
Anuj Kumar
Sebastian Riedel
Fabio Petroni
KELMRALM
343
193
0
22 Apr 2022
Previous
123...212223
Next