v1v2v3 (latest)

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021

Tyna Eloundou

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 1,125 papers shown

Analyzing Probabilistic Methods for Evaluating Agent Capabilities

312

24 Sep 2024

LLM With Tools: A Survey

Zhuocheng Shen

250

24 Sep 2024

LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Jiafeng Guo

276

23 Sep 2024

PROMPTFUZZ: Harnessing Fuzzing Techniques for Robust Testing of Prompt Injection in LLMs

Jiahao Yu

473

23 Sep 2024

Backtracking Improves Generation Safety

310

22 Sep 2024

From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal Reasoning with Large Language Models

428

19 Sep 2024

Textualized Agent-Style Reasoning for Complex Tasks by Multiple Round LLM Generation

Jinan Xu

Yufeng Chen

Yong Wang

LLMAG LRM

165

19 Sep 2024

From Lists to Emojis: How Format Bias Affects Model AlignmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

445

18 Sep 2024

CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration

Lanqing Hong

Xin Jiang

Zhenguo Li

313

17 Sep 2024

Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to RefuseInternational Conference on Learning Representations (ICLR), 2024

Soujanya Poria

578

17 Sep 2024

Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs

Yifan Wang

...

Jiabo Hu

Ning Zhang

Bob Kamma

247

16 Sep 2024

StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models

Shenghua Liu

Lingrui Mei

221

16 Sep 2024

Trustworthiness in Retrieval-Augmented Generation Systems: A Survey

Yan Liu

Zheng Liu

Tsung-Yi Ho

282

16 Sep 2024

Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison

Judy Hanwen Shen

Archit Sharma

Jun Qin

190

15 Sep 2024

Policy Filtration for RLHF to Mitigate Noise in Reward Models

415

11 Sep 2024

AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric KnowledgeNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

485

11 Sep 2024

Alignment of Diffusion Models: Fundamentals, Challenges, and Future

467

11 Sep 2024

AGR: Age Group fairness Reward for Bias Mitigation in LLMsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Shuirong Cao

Ruoxi Cheng

Zhiqiang Wang

185

06 Sep 2024

On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference OptimizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Tong Zhang

360

05 Sep 2024

Towards a Unified View of Preference Learning for Large Language Models: A Survey

...

Houfeng Wang

Zhifang Sui

Peiyi Wang

Baobao Chang

473

04 Sep 2024

From Grounding to Planning: Benchmarking Bottlenecks in Web Agents

340

03 Sep 2024

ContextCite: Attributing Model Generation to ContextNeural Information Processing Systems (NeurIPS), 2024

Aleksander Madry

361

01 Sep 2024

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

Yuncheng Yang

Tong Wu

...

Ke Li

Xing Sun

Jie Yang

Yun Gu

ALM OffRL MoE

356

28 Aug 2024

ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model

Lifan Jiang

Zhihui Wang

Siqi Yin

Guangxiao Ma

Peng Zhang

Boxi Wu

DiffM

350

28 Aug 2024

Enhancing and Accelerating Large Language Models via Instruction-Aware Contextual Compression

Haowen Hou

Fei Ma

Binwen Bai

Xinxin Zhu

Fei Yu

200

28 Aug 2024

How will advanced AI systems impact democracy?

Christopher Summerfield

Michiel Bakker

...

304

27 Aug 2024

AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems

Chi-Min Chan

Zhiyuan Liu

Wei Xue

Yike Guo

LLMAG

288

27 Aug 2024

Ancient Wisdom, Modern Tools: Exploring Retrieval-Augmented LLMs for Ancient Indian Philosophy

Priyanka Mandikal

RALM VLM

217

21 Aug 2024

DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework

232

21 Aug 2024

Athena: Safe Autonomous Agents with Verbal Contrastive LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

20 Aug 2024

SysBench: Can Large Language Models Follow System Messages?

Yanzhao Qin

Tao Zhang

Yanjun Shen

Wenjing Luo

...

Yujing Qiao

Weipeng Chen

Guosheng Dong

Wentao Zhang

Bin Cui

ALM

397

20 Aug 2024

Minor DPO reject penalty to increase training robustness

215

19 Aug 2024

HySem: A context length optimized LLM pipeline for unstructured tabular extraction

Narayanan PP

A. P. N. Iyer

252

18 Aug 2024

SEAL: Systematic Error Analysis for Value ALignmentAAAI Conference on Artificial Intelligence (AAAI), 2024

287

16 Aug 2024

The Future of Open Human FeedbackNature Machine Intelligence (Nat. Mach. Intell.), 2024

Shachar Don-Yehiya

Ben Burtenshaw

Ramon Fernandez Astudillo

...

298

15 Aug 2024

Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding

422

15 Aug 2024

The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community

Shachar Don-Yehiya

Leshem Choshen

Omri Abend

280

15 Aug 2024

Automated Design of Agentic SystemsInternational Conference on Learning Representations (ICLR), 2024

Shengran Hu

Cong Lu

Jeff Clune

AI4CE

471

120

15 Aug 2024

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Chelsea Finn

291

147

13 Aug 2024

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Yifan Xu

...

Zhengxiao Du

Chan Hee Song

Yu Su

Yuxiao Dong

Jie Tang

VLM LLMAG

253

12 Aug 2024

A Hybrid RAG System with Comprehensive Enhancement on Complex Reasoning

Ming Zhang

432

09 Aug 2024

Learning Fine-Grained Grounded Citations for Attributed Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

...

Dandan Tu

Bing Qin

HILM

281

08 Aug 2024

Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation

Junde Wu

Jiayuan Zhu

Yunli Qi

274

109

08 Aug 2024

MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models

242

07 Aug 2024

Making Long-Context Language Models Better Multi-Hop ReasonersAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

312

06 Aug 2024

Can DPO Learn Diverse Human Values? A Theoretical Scaling Law

Shawn Im

Yixuan Li

619

06 Aug 2024

A Framework for Fine-Tuning LLMs using Heterogeneous Feedback

Ryan Aponte

Ryan Rossi

Shunan Guo

Franck Dernoncourt

Tong Yu

Xiang Chen

Subrata Mitra

Nedim Lipka

OffRL

144

05 Aug 2024

DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and ExplanationEuropean Conference on Computer Vision (ECCV), 2024

Rakshith Subramanyam

Kowshik Thopalli

V. Narayanaswamy

Jayaraman J.Thiagarajan

268

01 Aug 2024

Improving Retrieval Augmented Language Model with Self-ReasoningAAAI Conference on Artificial Intelligence (AAAI), 2024

258

29 Jul 2024

MindSearch: Mimicking Human Minds Elicits Deep AI SearcherInternational Conference on Learning Representations (ICLR), 2024

389

29 Jul 2024