v1v2v3 (latest)

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021

Tyna Eloundou

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 1,126 papers shown

Strong hallucinations from negation and how to fix them

Nicholas Asher

Swarnadeep Bhar

ReLM LRM

193

16 Feb 2024

A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents

Zeyi Liao

Huan Sun

296

15 Feb 2024

RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language Models

278

15 Feb 2024

A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Hiroki Furuta

262

15 Feb 2024

InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling

Liang Ding

391

14 Feb 2024

Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models

442

14 Feb 2024

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents

...

Jie Zhou

Yankai Lin

Zhiyuan Liu

Maosong Sun

282

14 Feb 2024

Discovering Sensorimotor Agency in Cellular Automata using Diversity Search

272

14 Feb 2024

AgentLens: Visual Analysis for Agent Behaviors in LLM-based Autonomous Systems

202

14 Feb 2024

Measuring and Controlling Instruction (In)Stability in Language Model Dialogs

375

13 Feb 2024

PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models

335

13 Feb 2024

ODIN: Disentangled Reward Mitigates Hacking in RLHFInternational Conference on Machine Learning (ICML), 2024

324

111

11 Feb 2024

Online Iterative Reinforcement Learning from Human Feedback with General Preference ModelNeural Information Processing Systems (NeurIPS), 2024

Wei Xiong

Tong Zhang

328

11 Feb 2024

How do Large Language Models Navigate Conflicts between Honesty and Helpfulness?International Conference on Machine Learning (ICML), 2024

155

11 Feb 2024

ScreenAgent: A Vision Language Model-driven Computer Control Agent

324

09 Feb 2024

Large Language Models: A Survey

915

834

09 Feb 2024

WebLINX: Real-World Website Navigation with Multi-Turn Dialogue

Xing Han Lù

Zdeněk Kasner

Siva Reddy

315

125

08 Feb 2024

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

...

Xuanjing Huang

270

08 Feb 2024

Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations

Cheng-Han Chiang

Hung-yi Lee

HILM

367

08 Feb 2024

JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs

386

08 Feb 2024

Pedagogical Alignment of Large Language Models

Richard G. Baraniuk

197

07 Feb 2024

InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory

Chaojun Xiao

Pengle Zhang

Xu Han

Guangxuan Xiao

Yankai Lin

Zhengyan Zhang

Zhiyuan Liu

Maosong Sun

LLMAG

373

122

07 Feb 2024

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

901

101

07 Feb 2024

Training Language Models to Generate Text with Citations via Fine-grained Rewards

Chengyu Huang

279

06 Feb 2024

Personalized Language Modeling from Personalized Human Feedback

439

115

06 Feb 2024

V-IRL: Grounding Virtual Intelligence in Real LifeEuropean Conference on Computer Vision (ECCV), 2024

Xiaojuan Qi

324

05 Feb 2024

Factuality of Large Language Models in the Year 2024

Yuxia Wang

Minghan Wang

Muhammad Arslan Manzoor

223

04 Feb 2024

Enhance Reasoning for Large Language Models in the Game Werewolf

322

04 Feb 2024

Affordable Generative Agents

327

03 Feb 2024

How well do LLMs cite relevant medical references? An evaluation framework and analyses

Patricia Shi Riantawan

Mark A. Lemley

James Zou

LM&MA ELM AI4MH

287

03 Feb 2024

TravelPlanner: A Benchmark for Real-World Planning with Language Agents

Yanghua Xiao

340

330

02 Feb 2024

Building Guardrails for Large Language Models

439

02 Feb 2024

AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback

210

02 Feb 2024

Rethinking the Role of Proxy Rewards in Language Model Alignment

Sungdong Kim

Minjoon Seo

SyDa ALM

305

02 Feb 2024

LLM-based NLG Evaluation: Current Status and Challenges

640

101

02 Feb 2024

Plan-Grounded Large Language Models for Dual Goal Conversational Settings

190

01 Feb 2024

Executable Code Actions Elicit Better LLM Agents

Heng Ji

951

366

01 Feb 2024

Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration

Shangbin Feng

Weijia Shi

Yike Wang

Wenxuan Ding

Vidhisha Balachandran

Yulia Tsvetkov

359

174

01 Feb 2024

Efficient Non-Parametric Uncertainty Quantification for Black-Box Large Language Models and Decision Planning

260

01 Feb 2024

Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF

Banghua Zhu

Michael I. Jordan

Jiantao Jiao

316

29 Jan 2024

PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

...

Lifeng Shang

Qun Liu

Linqi Song

310

26 Jan 2024

Can LLMs Evaluate Complex Attribution in QA? Automatic Benchmarking using Knowledge GraphsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

434

26 Jan 2024

WebVoyager: Building an End-to-End Web Agent with Large Multimodal ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Hongliang He

Wenlin Yao

Kaixin Ma

Wenhao Yu

Dong Yu

525

264

25 Jan 2024

UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems

Zezhong Wang

325

24 Jan 2024

AgentBoard: An Analytical Evaluation Board of Multi-turn LLM AgentsNeural Information Processing Systems (NeurIPS), 2024

Yujiu Yang

Lingpeng Kong

250

147

24 Jan 2024

ARGS: Alignment as Reward-Guided SearchInternational Conference on Learning Representations (ICLR), 2024

Maxim Khanov

Jirayu Burapacheep

Yixuan Li

466

23 Jan 2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and FeedbackInternational Conference on Machine Learning (ICML), 2024

Wei Shen

...

Yicheng Zou

Zhi Chen

Hang Yan

Tao Gui

Dahua Lin

254

21 Jan 2024

Reinforcement learning for question answering in programming domain using public community scoring as a human feedback

Alexey Gorbatovski

Sergey Kovalchuk

19 Jan 2024

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents

...

Rui Wang

431

157

18 Jan 2024

QAnswer: Towards Question Answering Search over WebsitesThe Web Conference (WWW), 2022

194

17 Jan 2024