ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09332
  4. Cited By
WebGPT: Browser-assisted question-answering with human feedback

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
    ALM
    RALM
ArXivPDFHTML

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 905 papers shown
Title
WeaverBird: Empowering Financial Decision-Making with Large Language
  Model, Knowledge Base, and Search Engine
WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search Engine
Siqiao Xue
Fan Zhou
Y. Xu
Ming Jin
Qingsong Wen
...
Jun Zhou
Shuo Xie
D. Xiu
James Y. Zhang
Hongyuan Mei
RALM
AIFin
31
15
0
10 Aug 2023
TPTU: Large Language Model-based AI Agents for Task Planning and Tool
  Usage
TPTU: Large Language Model-based AI Agents for Task Planning and Tool Usage
Jingqing Ruan
Yihong Chen
Bin Zhang
Zhiwei Xu
Tianpeng Bao
...
Shiwei Shi
Hangyu Mao
Ziyue Li
Xingyu Zeng
Rui Zhao
LLMAG
LM&Ro
39
32
0
07 Aug 2023
Retroformer: Retrospective Large Language Agents with Policy Gradient
  Optimization
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Weiran Yao
Shelby Heinecke
Juan Carlos Niebles
Zhiwei Liu
Yihao Feng
...
Ran Xu
P. Mùi
Haiquan Wang
Caiming Xiong
Silvio Savarese
LLMAG
LM&Ro
31
70
0
04 Aug 2023
Curricular Transfer Learning for Sentence Encoded Tasks
Curricular Transfer Learning for Sentence Encoded Tasks
Jader Martins Camboim de Sá
Matheus Ferraroni Sanches
R. R. Souza
Júlio Cesar dos Reis
Leandro A. Villas
21
0
0
03 Aug 2023
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language
  Models
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh
Sibei Chen
Chun-Liang Li
Yasuhisa Fujii
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
LLMAG
SyDa
38
41
0
01 Aug 2023
HAGRID: A Human-LLM Collaborative Dataset for Generative
  Information-Seeking with Attribution
HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution
Ehsan Kamalloo
A. Jafari
Xinyu Crystina Zhang
Nandan Thakur
Jimmy J. Lin
24
41
0
31 Jul 2023
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world
  APIs
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
Yujia Qin
Shi Liang
Yining Ye
Kunlun Zhu
Lan Yan
...
Jie Zhou
Mark B. Gerstein
Dahai Li
Zhiyuan Liu
Maosong Sun
CLL
ALM
LLMAG
ELM
LM&MA
55
614
0
31 Jul 2023
Deception Abilities Emerged in Large Language Models
Deception Abilities Emerged in Large Language Models
Thilo Hagendorff
LLMAG
35
75
0
31 Jul 2023
When Large Language Models Meet Personalization: Perspectives of
  Challenges and Opportunities
When Large Language Models Meet Personalization: Perspectives of Challenges and Opportunities
Jin Chen
Zheng Liu
Xunpeng Huang
Chenwang Wu
Qi Liu
...
Yuxuan Lei
Xiaolong Chen
Xingmei Wang
Defu Lian
Enhong Chen
ALM
24
110
0
31 Jul 2023
WebArena: A Realistic Web Environment for Building Autonomous Agents
WebArena: A Realistic Web Environment for Building Autonomous Agents
Shuyan Zhou
Frank F. Xu
Hao Zhu
Xuhui Zhou
Robert Lo
...
Tianyue Ou
Yonatan Bisk
Daniel Fried
Uri Alon
Graham Neubig
LLMAG
36
382
0
25 Jul 2023
LLM Censorship: A Machine Learning Challenge or a Computer Security
  Problem?
LLM Censorship: A Machine Learning Challenge or a Computer Security Problem?
David Glukhov
Ilia Shumailov
Y. Gal
Nicolas Papernot
V. Papyan
AAML
ELM
26
56
0
20 Jul 2023
Information Retrieval Meets Large Language Models: A Strategic Report
  from Chinese IR Community
Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community
Qingyao Ai
Ting Bai
Zhao Cao
Yi-Ju Chang
Jiawei Chen
...
Peng-Zhen Zhang
Fan Zhang
Wei-na Zhang
M. Zhang
Xiaofei Zhu
52
58
0
19 Jul 2023
Thrust: Adaptively Propels Large Language Models with External Knowledge
Thrust: Adaptively Propels Large Language Models with External Knowledge
Xinran Zhao
Hongming Zhang
Xiaoman Pan
Wenlin Yao
Dong Yu
Jianshu Chen
KELM
48
4
0
19 Jul 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
90
10,977
0
18 Jul 2023
REX: Rapid Exploration and eXploitation for AI Agents
REX: Rapid Exploration and eXploitation for AI Agents
Rithesh Murthy
Shelby Heinecke
Juan Carlos Niebles
Zhiwei Liu
Le Xue
...
Ran Xu
P. Mùi
Haiquan Wang
Caiming Xiong
Silvio Savarese
OffRL
26
8
0
18 Jul 2023
Question Decomposition Improves the Faithfulness of Model-Generated
  Reasoning
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
Ansh Radhakrishnan
Karina Nguyen
Anna Chen
Carol Chen
Carson E. Denison
...
Zac Hatfield-Dodds
Jared Kaplan
J. Brauner
Sam Bowman
Ethan Perez
ReLM
LRM
HILM
27
84
0
17 Jul 2023
EasyTPP: Towards Open Benchmarking Temporal Point Processes
EasyTPP: Towards Open Benchmarking Temporal Point Processes
Siqiao Xue
X. Shi
Zhixuan Chu
Yan Wang
Hongyan Hao
...
Chenyuan Pan
James Y. Zhang
Qingsong Wen
Junqing Zhou
Hongyuan Mei
AI4TS
32
29
0
16 Jul 2023
A Survey of Techniques for Optimizing Transformer Inference
A Survey of Techniques for Optimizing Transformer Inference
Krishna Teja Chitty-Venkata
Sparsh Mittal
M. Emani
V. Vishwanath
Arun Somani
35
62
0
16 Jul 2023
GeoGPT: Understanding and Processing Geospatial Tasks through An
  Autonomous GPT
GeoGPT: Understanding and Processing Geospatial Tasks through An Autonomous GPT
Yifan Zhang
Cheng Wei
Shangyou Wu
Zhengting He
Wenhao Yu
29
27
0
16 Jul 2023
Learning to Retrieve In-Context Examples for Large Language Models
Learning to Retrieve In-Context Examples for Large Language Models
Liang Wang
Nan Yang
Furu Wei
RALM
30
38
0
14 Jul 2023
A Comprehensive Overview of Large Language Models
A Comprehensive Overview of Large Language Models
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Ajmal Saeed Mian
OffRL
57
523
0
12 Jul 2023
What Should Data Science Education Do with Large Language Models?
What Should Data Science Education Do with Large Language Models?
Xinming Tu
James Y. Zou
Weijie J. Su
Linjun Zhang
AI4Ed
37
32
0
06 Jul 2023
SCITUNE: Aligning Large Language Models with Scientific Multimodal
  Instructions
SCITUNE: Aligning Large Language Models with Scientific Multimodal Instructions
Sameera Horawalavithana
Sai Munikoti
Ian Stewart
Henry Kvinge
MLLM
19
20
0
03 Jul 2023
Preference Ranking Optimization for Human Alignment
Preference Ranking Optimization for Human Alignment
Feifan Song
Yu Bowen
Minghao Li
Haiyang Yu
Fei Huang
Yongbin Li
Houfeng Wang
ALM
21
235
0
30 Jun 2023
Query Understanding in the Age of Large Language Models
Query Understanding in the Age of Large Language Models
Avishek Anand
Venktesh V
Abhijit Anand
Vinay Setty
LRM
43
4
0
28 Jun 2023
A Survey on Multimodal Large Language Models
A Survey on Multimodal Large Language Models
Shukang Yin
Chaoyou Fu
Sirui Zhao
Ke Li
Xing Sun
Tong Bill Xu
Enhong Chen
MLLM
LRM
48
553
0
23 Jun 2023
ToolQA: A Dataset for LLM Question Answering with External Tools
ToolQA: A Dataset for LLM Question Answering with External Tools
Yuchen Zhuang
Yue Yu
Kuan-Chieh Jackson Wang
Haotian Sun
Chao Zhang
ELM
LLMAG
20
212
0
23 Jun 2023
DiversiGATE: A Comprehensive Framework for Reliable Large Language
  Models
DiversiGATE: A Comprehensive Framework for Reliable Large Language Models
Shima Imani
Ali Beyram
H. Shrivastava
13
1
0
22 Jun 2023
Learning to Generate Better Than Your LLM
Learning to Generate Better Than Your LLM
Jonathan D. Chang
Kianté Brantley
Rajkumar Ramamurthy
Dipendra Kumar Misra
Wen Sun
19
40
0
20 Jun 2023
Aligning Synthetic Medical Images with Clinical Knowledge using Human
  Feedback
Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback
Shenghuan Sun
Gregory M. Goldgof
A. Butte
Ahmed Alaa
MedIm
19
12
0
16 Jun 2023
ClinicalGPT: Large Language Models Finetuned with Diverse Medical Data
  and Comprehensive Evaluation
ClinicalGPT: Large Language Models Finetuned with Diverse Medical Data and Comprehensive Evaluation
Guangyu Wang
Guoxing Yang
Zongxin Du
Longjun Fan
Xiaohu Li
LM&MA
ELM
AI4MH
14
79
0
16 Jun 2023
Explaining Legal Concepts with Augmented Large Language Models (GPT-4)
Explaining Legal Concepts with Augmented Large Language Models (GPT-4)
Jaromír Šavelka
Kevin D. Ashley
Morgan A. Gray
Hannes Westermann
Huihui Xu
ELM
AILaw
38
42
0
15 Jun 2023
Explore, Establish, Exploit: Red Teaming Language Models from Scratch
Explore, Establish, Exploit: Red Teaming Language Models from Scratch
Stephen Casper
Jason Lin
Joe Kwon
Gatlen Culp
Dylan Hadfield-Menell
AAML
8
83
0
15 Jun 2023
Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large
  Language Models
Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large Language Models
Myles Foley
Ambrish Rawat
Taesung Lee
Yufang Hou
Gabriele Picco
Giulio Zizzo
DeLMO
30
5
0
15 Jun 2023
Propagating Knowledge Updates to LMs Through Distillation
Propagating Knowledge Updates to LMs Through Distillation
Shankar Padmanabhan
Yasumasa Onoe
Michael J.Q. Zhang
Greg Durrett
Eunsol Choi
KELM
10
18
0
15 Jun 2023
AssistGPT: A General Multi-modal Assistant that can Plan, Execute,
  Inspect, and Learn
AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn
Difei Gao
Lei Ji
Luowei Zhou
Kevin Lin
Joya Chen
Zihan Fan
Mike Zheng Shou
MLLM
27
71
0
14 Jun 2023
AVIS: Autonomous Visual Information Seeking with Large Language Model
  Agent
AVIS: Autonomous Visual Information Seeking with Large Language Model Agent
Ziniu Hu
Ahmet Iscen
Chen Sun
Kai-Wei Chang
Yizhou Sun
David A. Ross
Cordelia Schmid
Alireza Fathi
31
11
0
13 Jun 2023
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with
  Human Preferences
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
Xiao Liu
Hanyu Lai
Hao Yu
Yifan Xu
Aohan Zeng
Zhengxiao Du
Peng-Zhen Zhang
Yuxiao Dong
Jie Tang
17
94
0
13 Jun 2023
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer
  Control
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control
Longtao Zheng
R. Wang
Xinrun Wang
Bo An
LLMAG
22
57
0
13 Jun 2023
Boosting Language Models Reasoning with Chain-of-Knowledge Prompting
Boosting Language Models Reasoning with Chain-of-Knowledge Prompting
J. Wang
Qiushi Sun
Xiang Li
Ming Gao
ReLM
LRM
19
64
0
10 Jun 2023
Improving Open Language Models by Learning from Organic Interactions
Improving Open Language Models by Learning from Organic Interactions
Jing Xu
Da Ju
Joshua Lane
M. Komeili
Eric Michael Smith
...
Rashel Moritz
Sainbayar Sukhbaatar
Y-Lan Boureau
Jason Weston
Kurt Shuster
25
8
0
07 Jun 2023
Natural Language Commanding via Program Synthesis
Natural Language Commanding via Program Synthesis
Apurva Gandhi
Thong Q. Nguyen
Huitian Jiao
R. Steen
Ameya Bhatawdekar
19
7
0
06 Jun 2023
Inference-Time Intervention: Eliciting Truthful Answers from a Language
  Model
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
Kenneth Li
Oam Patel
Fernanda Viégas
Hanspeter Pfister
Martin Wattenberg
KELM
HILM
26
472
0
06 Jun 2023
Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For
  Scoring and Providing Actionable Insights on Classroom Instruction
Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom Instruction
Rose E. Wang
Dorottya Demszky
19
58
0
05 Jun 2023
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Subhabrata Mukherjee
Arindam Mitra
Ganesh Jawahar
Sahaj Agarwal
Hamid Palangi
Ahmed Hassan Awadallah
ELM
ALM
LRM
33
262
0
05 Jun 2023
Fine-Tuning Language Models with Advantage-Induced Policy Alignment
Fine-Tuning Language Models with Advantage-Induced Policy Alignment
Banghua Zhu
Hiteshi Sharma
Felipe Vieira Frujeri
Shi Dong
Chenguang Zhu
Michael I. Jordan
Jiantao Jiao
OSLM
23
39
0
04 Jun 2023
Question-Context Alignment and Answer-Context Dependencies for Effective
  Answer Sentence Selection
Question-Context Alignment and Answer-Context Dependencies for Effective Answer Sentence Selection
Minh Le Nguyen
K. Kishan
Toan Q. Nguyen
Thien Huu Nguyen
Ankit Chadha
Thuy Vu
18
0
0
03 Jun 2023
Fine-Grained Human Feedback Gives Better Rewards for Language Model
  Training
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Zeqiu Wu
Yushi Hu
Weijia Shi
Nouha Dziri
Alane Suhr
Prithviraj Ammanabrolu
Noah A. Smith
Mari Ostendorf
Hannaneh Hajishirzi
ALM
30
303
0
02 Jun 2023
TorchRL: A data-driven decision-making library for PyTorch
TorchRL: A data-driven decision-making library for PyTorch
Albert Bou
Matteo Bettini
Sebastian Dittert
Vikash Kumar
Shagun Sodhani
Xiaomeng Yang
Gianni de Fabritiis
Vincent Moens
OffRL
AI4CE
16
37
0
01 Jun 2023
Challenges and Remedies to Privacy and Security in AIGC: Exploring the
  Potential of Privacy Computing, Blockchain, and Beyond
Challenges and Remedies to Privacy and Security in AIGC: Exploring the Potential of Privacy Computing, Blockchain, and Beyond
Chuan Chen
Zhenpeng Wu
Yan-Hao Lai
Wen-chao Ou
Tianchi Liao
Zibin Zheng
22
32
0
01 Jun 2023
Previous
123...131415...171819
Next