WebGPT: Browser-assisted question-answering with human feedback

17 December 2021

Tyna Eloundou

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 1,126 papers shown

A Study on Training and Developing Large Language Models for Behavior Tree Generation

304

16 Jan 2024

DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)

International Conference on Machine Learning (ICML), 2024

Guikun Chen

553

16 Jan 2024

The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey

Saurav Pawar

S.M. Towhidul Islam Tonmoy

S. M. M. Zaman

Vinija Jain

Vasu Sharma

Amitava Das

239

15 Jan 2024

Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation

360

14 Jan 2024

Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Ming Yan

Ji Zhang

Fei Huang

LLMAG

456

107

14 Jan 2024

EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Ran Xu

236

13 Jan 2024

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Peitian Zhang

Zheng Liu

283

12 Jan 2024

Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems

...

Qi Li

372

106

11 Jan 2024

MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance

Tianyang Han

Tong Zhang

423

114

05 Jan 2024

From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models

Wei Zou

294

05 Jan 2024

Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding

Yuu Jinnai

Kaito Ariu

456

05 Jan 2024

Understanding LLMs: A Comprehensive Overview from Training to Inference

...

Tuo Zhang

Tianming Liu

483

136

04 Jan 2024

Theoretical guarantees on the best-of-n alignment policy

578

03 Jan 2024

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents

...

502

124

01 Jan 2024

Enhancing Open-Domain Task-Solving Capability of LLMs via Autonomous Tool Integration from GitHub

...

303

28 Dec 2023

ShennongAlpha: an AI-driven sharing and collaboration platform for intelligent curation, acquisition, and translation of natural medicinal material knowledge

Yue Zhang

27 Dec 2023

LARP: Language-Agent Role Play for Open-World Games

395

24 Dec 2023

MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA

Jie Zhou

300

19 Dec 2023

Agent-based Learning of Materials Datasets from Scientific Literature

Mehrad Ansari

S. M. Moosavi

AI4CE

182

18 Dec 2023

Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint

Wei Xiong

Tong Zhang

477

325

18 Dec 2023

Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations

Jun Chen

Boshi Tang

Zhiyong Wu

208

18 Dec 2023

Retrieval-Augmented Generation for Large Language Models: A Survey

1.3K

3,027

18 Dec 2023

Let AI Entertain You: Increasing User Engagement with Generative AI and Rejection Sampling

166

16 Dec 2023

ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

...

Sanjiv Kumar

301

15 Dec 2023

Towards Verifiable Text Generation with Evolving Memory and Self-Reflection

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

379

14 Dec 2023

TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yuan Sui

238

14 Dec 2023

LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Xingjin Wang

Linjing Li

D. Zeng

177

13 Dec 2023

AI capabilities can be significantly improved without expensive retraining

Tom Davidson

Jean-Stanislas Denain

Pablo Villalobos

Guillem Bas

OffRL VLM

264

12 Dec 2023

On Diversified Preferences of Large Language Model Alignment

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

431

12 Dec 2023

Exploring Large Language Models to Facilitate Variable Autonomy for Human-Robot Teaming

Younes Lakhnati

Max Pascher

Jens Gerken

LLMAG LM&Ro

334

12 Dec 2023

Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation

Yu Qiao

542

12 Dec 2023

Alignment for Honesty

Neural Information Processing Systems (NeurIPS), 2023

Yuqing Yang

Ethan Chern

Xipeng Qiu

Graham Neubig

Pengfei Liu

293

12 Dec 2023

"I Want It That Way": Enabling Interactive Decision Support Using Large Language Models and Constraint Programming

357

12 Dec 2023

"What's important here?": Opportunities and Challenges of Using LLMs in Retrieving Information from Web Interfaces

Faria Huq

Jeffrey P. Bigham

Nikolas Martelaro

280

11 Dec 2023

KwaiAgents: Generalized Information-seeking Agent System with Large Language Models

291

08 Dec 2023

Learning to Break: Knowledge-Enhanced Reasoning in Multi-Agent Debate System

411

08 Dec 2023

LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent Ecosystem

Juntao Tan

285

06 Dec 2023

Speculative Exploration on the Concept of Artificial Agents Conducting Autonomous Research

Shiro Takagi

320

06 Dec 2023

Rethinking E-Commerce Search

Haixun Wang

Taesik Na

197

06 Dec 2023

ULMA: Unified Language Model Alignment with Human Demonstration and Point-wise Preference

242

05 Dec 2023

Explore, Select, Derive, and Recall: Augmenting LLM with Human-like Memory for Mobile Task Automation

ACM/IEEE International Conference on Mobile Computing and Networking (MobiCom), 2023

439

04 Dec 2023

D-Bot: Database Diagnosis System using Large Language Models

Proceedings of the VLDB Endowment (PVLDB), 2023

Zhiyuan Liu

249

03 Dec 2023

Nash Learning from Human Feedback

International Conference on Machine Learning (ICML), 2023

Daniele Calandriello

...

Nikola Momchev

Olivier Bachem

D. Mankowitz

Doina Precup

Bilal Piot

690

201

01 Dec 2023

Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models

European Conference on Computer Vision (ECCV), 2023

253

24 Nov 2023

PrivateLoRA For Efficient Privacy Preserving LLM

367

23 Nov 2023

DaG LLM ver 1.0: Pioneering Instruction-Tuned Language Modeling for Korean NLP

...

260

23 Nov 2023

GAIA: a benchmark for General AI Assistants

Grégoire Mialon

533

591

21 Nov 2023

Unifying Corroborative and Contributive Attributions in Large Language Models

355

20 Nov 2023

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

...

Rui Wang

395

102

20 Nov 2023

Towards Robust Text Retrieval with Progressive Learning

Yulei Qin

182

20 Nov 2023