ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09332
  4. Cited By
WebGPT: Browser-assisted question-answering with human feedback
v1v2v3 (latest)

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
    ALMRALM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 1,125 papers shown
Analyzing Probabilistic Methods for Evaluating Agent Capabilities
Analyzing Probabilistic Methods for Evaluating Agent Capabilities
Axel Højmark
Govind Pimpale
Arjun Panickssery
Marius Hobbhahn
Jérémy Scheurer
312
4
0
24 Sep 2024
LLM With Tools: A Survey
LLM With Tools: A Survey
Zhuocheng Shen
250
40
0
24 Sep 2024
LINKAGE: Listwise Ranking among Varied-Quality References for
  Non-Factoid QA Evaluation via LLMs
LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Sihui Yang
Keping Bi
Wanqing Cui
Jiafeng Guo
Xueqi Cheng
276
5
0
23 Sep 2024
PROMPTFUZZ: Harnessing Fuzzing Techniques for Robust Testing of Prompt Injection in LLMs
PROMPTFUZZ: Harnessing Fuzzing Techniques for Robust Testing of Prompt Injection in LLMs
Jiahao Yu
Yangguang Shao
Hanwen Miao
Junzheng Shi
SILMAAML
473
20
0
23 Sep 2024
Backtracking Improves Generation Safety
Backtracking Improves Generation Safety
Yiming Zhang
Jianfeng Chi
Hailey Nguyen
Kartikeya Upasani
Daniel M. Bikel
Jason Weston
Eric Michael Smith
SILM
310
24
0
22 Sep 2024
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal
  Reasoning with Large Language Models
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal Reasoning with Large Language Models
Shengsheng Qian
Zuyi Zhou
Dizhan Xue
Bing Wang
Changsheng Xu
LRM
428
5
0
19 Sep 2024
Textualized Agent-Style Reasoning for Complex Tasks by Multiple Round
  LLM Generation
Textualized Agent-Style Reasoning for Complex Tasks by Multiple Round LLM Generation
Chen Liang
Zhifan Feng
Zihe Liu
Wenbin Jiang
Jinan Xu
Yufeng Chen
Yong Wang
LLMAGLRM
165
3
0
19 Sep 2024
From Lists to Emojis: How Format Bias Affects Model Alignment
From Lists to Emojis: How Format Bias Affects Model AlignmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Xuanchang Zhang
Wei Xiong
Lichang Chen
Wanrong Zhu
Heng Huang
Tong Zhang
ALM
445
22
0
18 Sep 2024
CoCA: Regaining Safety-awareness of Multimodal Large Language Models
  with Constitutional Calibration
CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration
Lei Li
Renjie Pi
Tianyang Han
Han Wu
Lanqing Hong
Lingpeng Kong
Xin Jiang
Zhenguo Li
313
19
0
17 Sep 2024
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to RefuseInternational Conference on Learning Representations (ICLR), 2024
Maojia Song
Shang Hong Sim
Rishabh Bhardwaj
Hai Leong Chieu
Navonil Majumder
Soujanya Poria
578
29
0
17 Sep 2024
Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation
  with LLMs
Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs
Yifan Wang
David Stevens
Pranay Shah
Wenwen Jiang
Miao Liu
...
Boying Gong
Daniel Lee
Jiabo Hu
Ning Zhang
Bob Kamma
247
5
0
16 Sep 2024
StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge
  Editing for Large Language Models
StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models
Baolong Bi
Shenghua Liu
Yiwei Wang
Lingrui Mei
Hongcheng Gao
Junfeng Fang
Xueqi Cheng
KELM
221
10
0
16 Sep 2024
Trustworthiness in Retrieval-Augmented Generation Systems: A Survey
Trustworthiness in Retrieval-Augmented Generation Systems: A Survey
Yujia Zhou
Yan Liu
Xiaoxi Li
Jiajie Jin
Hongjin Qian
Zheng Liu
Chaozhuo Li
Zhicheng Dou
Tsung-Yi Ho
Philip S. Yu
3DVRALM
282
83
0
16 Sep 2024
Towards Data-Centric RLHF: Simple Metrics for Preference Dataset
  Comparison
Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison
Judy Hanwen Shen
Archit Sharma
Jun Qin
190
13
0
15 Sep 2024
Policy Filtration for RLHF to Mitigate Noise in Reward Models
Policy Filtration for RLHF to Mitigate Noise in Reward Models
Chuheng Zhang
Wei Shen
Li Zhao
Xuyun Zhang
Xiaolong Xu
Wanchun Dou
Jiang Biang
OffRL
415
6
0
11 Sep 2024
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric KnowledgeNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Han Wang
Archiki Prasad
Elias Stengel-Eskin
Joey Tianyi Zhou
485
19
0
11 Sep 2024
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Buhua Liu
Shitong Shao
Bao Li
Lichen Bai
Zhiqiang Xu
Haoyi Xiong
James Kwok
Sumi Helal
Bo Han
467
23
0
11 Sep 2024
AGR: Age Group fairness Reward for Bias Mitigation in LLMs
AGR: Age Group fairness Reward for Bias Mitigation in LLMsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Shuirong Cao
Ruoxi Cheng
Zhiqiang Wang
185
12
0
06 Sep 2024
On the Limited Generalization Capability of the Implicit Reward Model
  Induced by Direct Preference Optimization
On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference OptimizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yong Lin
Skyler Seto
Maartje ter Hoeve
Katherine Metcalf
B. Theobald
Xuan Wang
Yizhe Zhang
Chen Huang
Tong Zhang
360
23
0
05 Sep 2024
Towards a Unified View of Preference Learning for Large Language Models:
  A Survey
Towards a Unified View of Preference Learning for Large Language Models: A Survey
Bofei Gao
Feifan Song
Yibo Miao
Zefan Cai
Zhiyong Yang
...
Houfeng Wang
Zhifang Sui
Peiyi Wang
Baobao Chang
Baobao Chang
473
17
0
04 Sep 2024
From Grounding to Planning: Benchmarking Bottlenecks in Web Agents
From Grounding to Planning: Benchmarking Bottlenecks in Web Agents
Segev Shlomov
Ben Wiesel
Aviad Sela
Ido Levy
Liane Galanti
Roy Abitbol
LLMAG
340
8
0
03 Sep 2024
ContextCite: Attributing Model Generation to Context
ContextCite: Attributing Model Generation to ContextNeural Information Processing Systems (NeurIPS), 2024
Benjamin Cohen-Wang
Harshay Shah
Kristian Georgiev
Aleksander Madry
LRM
361
62
0
01 Sep 2024
Leveraging Open Knowledge for Advancing Task Expertise in Large Language
  Models
Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
Yuncheng Yang
Yulei Qin
Tong Wu
Zihan Xu
Gang Li
...
Yuchen Shi
Ke Li
Xing Sun
Jie Yang
Yun Gu
ALMOffRLMoE
356
1
0
28 Aug 2024
ConsistencyTrack: A Robust Multi-Object Tracker with a Generation
  Strategy of Consistency Model
ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model
Lifan Jiang
Zhihui Wang
Siqi Yin
Guangxiao Ma
Peng Zhang
Boxi Wu
DiffM
350
19
0
28 Aug 2024
Enhancing and Accelerating Large Language Models via Instruction-Aware
  Contextual Compression
Enhancing and Accelerating Large Language Models via Instruction-Aware Contextual Compression
Haowen Hou
Fei Ma
Binwen Bai
Xinxin Zhu
Fei Yu
200
4
0
28 Aug 2024
How will advanced AI systems impact democracy?
How will advanced AI systems impact democracy?
Christopher Summerfield
Lisa Argyle
Michiel Bakker
Teddy Collins
Esin Durmus
...
Elizabeth Seger
Divya Siddarth
Henrik Skaug Sætra
MH Tessler
M. Botvinick
304
10
0
27 Aug 2024
AgentMonitor: A Plug-and-Play Framework for Predictive and Secure
  Multi-Agent Systems
AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems
Chi-Min Chan
Jianxuan Yu
Weize Chen
Chunyang Jiang
Xinyu Liu
Weijie Shi
Zhiyuan Liu
Wei Xue
Yike Guo
LLMAG
288
5
0
27 Aug 2024
Ancient Wisdom, Modern Tools: Exploring Retrieval-Augmented LLMs for
  Ancient Indian Philosophy
Ancient Wisdom, Modern Tools: Exploring Retrieval-Augmented LLMs for Ancient Indian Philosophy
Priyanka Mandikal
RALMVLM
217
1
0
21 Aug 2024
DreamFactory: Pioneering Multi-Scene Long Video Generation with a
  Multi-Agent Framework
DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework
Zhifei Xie
Daniel Tang
Dingwei Tan
Jacques Klein
Tegawend F. Bissyand
Saad Ezzini
VGen
232
22
0
21 Aug 2024
Athena: Safe Autonomous Agents with Verbal Contrastive Learning
Athena: Safe Autonomous Agents with Verbal Contrastive LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Tanmana Sadhu
Ali Pesaranghader
Yanan Chen
Dong Hoon Yi
ELMLLMAGAAML
67
5
0
20 Aug 2024
SysBench: Can Large Language Models Follow System Messages?
SysBench: Can Large Language Models Follow System Messages?
Yanzhao Qin
Tao Zhang
Tao Zhang
Yanjun Shen
Wenjing Luo
...
Yujing Qiao
Weipeng Chen
Guosheng Dong
Wentao Zhang
Bin Cui
ALM
397
16
0
20 Aug 2024
Minor DPO reject penalty to increase training robustness
Minor DPO reject penalty to increase training robustness
Shiming Xie
Hong Chen
Fred Yu
Zeye Sun
Xiuyu Wu
Yingfan Hu
215
5
0
19 Aug 2024
HySem: A context length optimized LLM pipeline for unstructured tabular
  extraction
HySem: A context length optimized LLM pipeline for unstructured tabular extraction
Narayanan PP
A. P. N. Iyer
252
2
0
18 Aug 2024
SEAL: Systematic Error Analysis for Value ALignment
SEAL: Systematic Error Analysis for Value ALignmentAAAI Conference on Artificial Intelligence (AAAI), 2024
Manon Revel
Matteo Cargnelutti
Tyna Eloundou
Greg Leppert
287
6
0
16 Aug 2024
The Future of Open Human Feedback
The Future of Open Human FeedbackNature Machine Intelligence (Nat. Mach. Intell.), 2024
Shachar Don-Yehiya
Ben Burtenshaw
Ramon Fernandez Astudillo
Cailean Osborne
Mimansa Jaiswal
...
Omri Abend
Jennifer Ding
Sara Hooker
Hannah Rose Kirk
Leshem Choshen
VLMALM
298
9
0
15 Aug 2024
Derivative-Free Guidance in Continuous and Discrete Diffusion Models
  with Soft Value-Based Decoding
Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding
Xiner Li
Yulai Zhao
Chenyu Wang
Gabriele Scalia
Gökçen Eraslan
Surag Nair
Tommaso Biancalani
Aviv Regev
Sergey Levine
Masatoshi Uehara
422
81
0
15 Aug 2024
The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community
The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community
Shachar Don-Yehiya
Leshem Choshen
Omri Abend
280
5
0
15 Aug 2024
Automated Design of Agentic Systems
Automated Design of Agentic SystemsInternational Conference on Learning Representations (ICLR), 2024
Shengran Hu
Cong Lu
Jeff Clune
AI4CE
471
120
0
15 Aug 2024
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
Pranav Putta
Edmund Mills
Naman Garg
S. Motwani
Chelsea Finn
Divyansh Garg
Rafael Rafailov
LLMAGLRM
291
147
0
13 Aug 2024
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation
  Agents
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Xiao-Yang Liu
Tianjie Zhang
Yu Gu
Iat Long Iong
Yifan Xu
...
Zhengxiao Du
Chan Hee Song
Yu Su
Yuxiao Dong
Jie Tang
VLMLLMAG
253
68
0
12 Aug 2024
A Hybrid RAG System with Comprehensive Enhancement on Complex Reasoning
A Hybrid RAG System with Comprehensive Enhancement on Complex Reasoning
Ye Yuan
Chengwu Liu
Jingyang Yuan
Gongbo Sun
Siqi Li
Ming Zhang
LRM
432
17
0
09 Aug 2024
Learning Fine-Grained Grounded Citations for Attributed Large Language
  Models
Learning Fine-Grained Grounded Citations for Attributed Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Lei Huang
Xiaocheng Feng
Weitao Ma
Yuxuan Gu
Weihong Zhong
...
Weijiang Yu
Weihua Peng
Duyu Tang
Dandan Tu
Bing Qin
HILM
281
7
0
08 Aug 2024
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph
  Retrieval-Augmented Generation
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation
Junde Wu
Jiayuan Zhu
Yunli Qi
274
109
0
08 Aug 2024
MaxMind: A Memory Loop Network to Enhance Software Productivity based on
  Large Language Models
MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models
Yuchen Dong
Xiaoxiang Fang
Yuchen Hu
Renshuang Jiang
Zhe Jiang
242
0
0
07 Aug 2024
Making Long-Context Language Models Better Multi-Hop Reasoners
Making Long-Context Language Models Better Multi-Hop ReasonersAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Yanyang Li
Shuo Liang
Michael R. Lyu
Liwei Wang
LLMAGLRM
312
29
0
06 Aug 2024
Can DPO Learn Diverse Human Values? A Theoretical Scaling Law
Can DPO Learn Diverse Human Values? A Theoretical Scaling Law
Shawn Im
Yixuan Li
619
3
0
06 Aug 2024
A Framework for Fine-Tuning LLMs using Heterogeneous Feedback
A Framework for Fine-Tuning LLMs using Heterogeneous Feedback
Ryan Aponte
Ryan Rossi
Shunan Guo
Franck Dernoncourt
Tong Yu
Xiang Chen
Subrata Mitra
Nedim Lipka
OffRL
144
1
0
05 Aug 2024
DECIDER: Leveraging Foundation Model Priors for Improved Model Failure
  Detection and Explanation
DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and ExplanationEuropean Conference on Computer Vision (ECCV), 2024
Rakshith Subramanyam
Kowshik Thopalli
V. Narayanaswamy
Jayaraman J.Thiagarajan
268
4
0
01 Aug 2024
Improving Retrieval Augmented Language Model with Self-Reasoning
Improving Retrieval Augmented Language Model with Self-ReasoningAAAI Conference on Artificial Intelligence (AAAI), 2024
Yuan Xia
Jingbo Zhou
Zhenhui Shi
Jun Chen
Hai-ting Huang
AIFinLRMReLMKELM
258
34
0
29 Jul 2024
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
MindSearch: Mimicking Human Minds Elicits Deep AI SearcherInternational Conference on Learning Representations (ICLR), 2024
Zehui Chen
Kuikun Liu
Qiuchen Wang
Jiangning Liu
Wenwei Zhang
Kai Chen
Feng Zhao
LLMAG
389
44
0
29 Jul 2024
Previous
123...8910...212223
Next
Page 9 of 23
Pageof 23