v1v2v3 (latest)

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021

Tyna Eloundou

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 1,123 papers shown

Inference-Time Reward Hacking in Large Language Models

237

24 Jun 2025

Deep Research Agents: A Systematic Examination And Roadmap

...

Youssef Attia El Hili

Jun Wang

LLMAG

286

22 Jun 2025

Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models

407

21 Jun 2025

Relic: Enhancing Reward Model Generalization for Low-Resource Indic Languages with Few-Shot Examples

215

19 Jun 2025

Reranking-based Generation for Unbiased Perspective SummarizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

171

19 Jun 2025

Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective

230

19 Jun 2025

MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents

Bryan Kian Hsiang Low

Paul Liang

LLMAG OffRL LRM

394

18 Jun 2025

Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs

Jing Yang Lee

Kong-Aik Lee

Woon-Seng Gan

243

18 Jun 2025

OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents

Maksym Andriushchenko

LLMAG ELM

296

17 Jun 2025

Min-p, Max Exaggeration: A Critical Analysis of Min-p Sampling in Language Models

Rylan Schaeffer

Joshua Kazdan

Yegor Denisov-Blanch

315

16 Jun 2025

Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives

...

438

11 Jun 2025

Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization

347

08 Jun 2025

Human-assisted Robotic Policy Refinement via Action Preference Optimization

369

08 Jun 2025

C-SEO Bench: Does Conversational SEO Work?

789

06 Jun 2025

Truly Self-Improving Agents Require Intrinsic Metacognitive Learning

Tennison Liu

M. Schaar

AIFin LRM

390

05 Jun 2025

When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration

337

05 Jun 2025

Micro-Act: Mitigating Knowledge Conflict in LLM-based RAG via Actionable Self-Reasoning

346

05 Jun 2025

Kinetics: Rethinking Test-Time Scaling Laws

457

05 Jun 2025

TracLLM: A Generic Framework for Attributing Long Context LLMs

512

04 Jun 2025

AI Agents for Conversational Patient Triage: Preliminary Simulation-Based Evaluation with Real-World EHR Data

203

04 Jun 2025

Does Thinking More always Help? Mirage of Test-Time Scaling in Reasoning Models

387

04 Jun 2025

Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework

...

337

03 Jun 2025

DeepShop: A Benchmark for Deep Research Shopping Agents

337

03 Jun 2025

Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs

183

03 Jun 2025

Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights

...

196

03 Jun 2025

Psi-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models

364

02 Jun 2025

Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences

412

02 Jun 2025

HADA: Human-AI Agent Decision Alignment Architecture

Tapio Pitkäranta

Leena Pitkäranta

193

01 Jun 2025

MCP-Zero: Active Tool Discovery for Autonomous LLM Agents

505

01 Jun 2025

Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents

276

30 May 2025

When Large Multimodal Models Confront Evolving Knowledge:Challenges and Pathways

222

30 May 2025

Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data

315

29 May 2025

Proximalized Preference Optimization for Diverse Feedback Types: A Decomposed Perspective on DPO

Kaiyang Guo

Yinchuan Li

Zhitang Chen

352

29 May 2025

AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models

214

29 May 2025

Text2Grad: Reinforcement Learning from Natural Language Feedback

225

28 May 2025

RISE: Reasoning Enhancement via Iterative Self-Exploration in Multi-hop Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

250

28 May 2025

Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation

230

27 May 2025

CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

440

27 May 2025

Select, Read, and Write: A Multi-Agent Framework of Full-Text-based Related Work GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

264

26 May 2025

SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety

257

26 May 2025

Token-level Accept or Reject: A Micro Alignment Approach for Large Language ModelsInternational Joint Conference on Artificial Intelligence (IJCAI), 2025

...

483

26 May 2025

Understanding the Performance Gap in Preference Learning: A Dichotomy of RLHF and DPO

309

26 May 2025

ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment

Bryan Kian Hsiang Low

717

25 May 2025

ChartLens: Fine-grained Visual Attribution in ChartsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

212

25 May 2025

Retrieval-Augmented Generation for Service Discovery: Chunking Strategies and Benchmarking

229

25 May 2025

Dynamic Risk Assessments for Offensive Cybersecurity Agents

575

23 May 2025

VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models

...

409

21 May 2025

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

...

405

19 May 2025

Web Intellectual Property at Risk: Preventing Unauthorized Real-Time Retrieval by Large Language Models

279

19 May 2025

Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and ChallengesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

315

19 May 2025