APRIL: Active Preference-learning based Reinforcement Learning

5 August 2012

Michèle Sebag

Papers citing "APRIL: Active Preference-learning based Reinforcement Learning"

50 / 61 papers shown

Combining Bayesian Inference and Reinforcement Learning for Agent Decision Making: A Review

536

12 May 2025

AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models

371

24 Mar 2025

Advances in Preference-based Reinforcement Learning: A ReviewIEEE International Conference on Systems, Man and Cybernetics (SMC), 2022

304

21 Aug 2024

Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization

Md Sultan al Nahian

R. Kavuluru

MedIm AI4CE

234

19 Jul 2024

Learning Human-Robot Handshaking Preferences for Quadruped Robots

Alessandra Chappuis

Guillaume Bellegarda

A. Ijspeert

322

28 Jun 2024

A Survey on Human Preference Learning for Large Language Models

Ruili Jiang

Kehai Chen

Xuefeng Bai

Zhixuan He

Juntao Li

Muyun Yang

Tiejun Zhao

Liqiang Nie

Min Zhang

364

17 Jun 2024

Reinforcement learning in large, structured action spaces: A simulation study of decision support for spinal cord injury rehabilitationIntelligent Medicine (IM), 2023

199

23 Oct 2023

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion ModelInternational Conference on Learning Representations (ICLR), 2023

Zibin Dong

Jianye Hao

Yan Zheng

Changjie Fan

338

03 Oct 2023

Reinforcement Learning with Human Feedback for Realistic Traffic SimulationIEEE International Conference on Robotics and Automation (ICRA), 2023

Yulong Cao

Boris Ivanovic

Chaowei Xiao

Marco Pavone

188

01 Sep 2023

Active Inverse Learning in Stackelberg Trajectory GamesAmerican Control Conference (ACC), 2023

Ufuk Topcu

173

15 Aug 2023

Toward Grounded Commonsense ReasoningIEEE International Conference on Robotics and Automation (ICRA), 2023

Dorsa Sadigh

314

14 Jun 2023

PAGAR: Taming Reward Misalignment in Inverse Reinforcement Learning-Based Imitation Learning with Protagonist Antagonist Guided Adversarial Reward

Weichao Zhou

Wenchao Li

360

02 Jun 2023

Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models

219

19 May 2023

Learning a Universal Human Prior for Dexterous Manipulation from Human Preference

Allen Z. Ren

269

10 Apr 2023

Vision-Language Models as Success Detectors

444

133

13 Mar 2023

Eliciting User Preferences for Personalized Multi-Objective Decision Making through Comparative FeedbackNeural Information Processing Systems (NeurIPS), 2023

362

07 Feb 2023

Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Arun Ahuja

...

Rui Zhu

262

21 Nov 2022

Efficient Meta Reinforcement Learning for Preference-based Fast AdaptationNeural Information Processing Systems (NeurIPS), 2022

242

20 Nov 2022

Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning

Katherine Metcalf

Miguel Sarabia

B. Theobald

OffRL

206

12 Nov 2022

Argumentative Reward Learning: Reasoning About Human Preferences

Francis Rhys Ward

Francesco Belardinelli

Francesca Toni

HAI

312

28 Sep 2022

Learning Latent Traits for Simulated Cooperative Driving Tasks

257

20 Jul 2022

Personalized Algorithmic Recourse with Preference Elicitation

607

27 May 2022

Invariance in Policy Optimisation and Partial Identifiability in Reward LearningInternational Conference on Machine Learning (ICML), 2022

Joar Skalse

Matthew Farrugia-Roberts

Stuart J. Russell

Alessandro Abate

Adam Gleave

337

14 Mar 2022

Uncertainty Estimation for Language Reward Models

Adam Gleave

G. Irving

UQLM

203

14 Mar 2022

Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive InterventionsInternational Statistical Review (ISR), 2022

314

04 Mar 2022

Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward FunctionsAdaptive Agents and Multi-Agent Systems (AAMAS), 2021

Tom Bewley

Freddy Lecue

OffRL

307

20 Dec 2021

Scientific Discovery and the Cost of Measurement -- Balancing Information and Cost in Reinforcement Learning

275

14 Dec 2021

Dueling RL: Reinforcement Learning with Trajectory Preferences

Aldo Pacchiano

Aadirupa Saha

Jonathan Lee

420

109

08 Nov 2021

Learning Multimodal Rewards from RankingsConference on Robot Learning (CoRL), 2021

Dorsa Sadigh

301

27 Sep 2021

PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-trainingInternational Conference on Machine Learning (ICML), 2021

Kimin Lee

Laura M. Smith

Pieter Abbeel

OffRL

536

377

09 Jun 2021

Information Directed Reward Learning for Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2021

David Lindner

M. Turchetta

Sebastian Tschiatschek

K. Ciosek

Andreas Krause

OffRL

278

24 Feb 2021

Open Problems in Cooperative AI

485

254

15 Dec 2020

Human-guided Robot Behavior Learning: A GAN-assisted Preference-based Reinforcement Learning Approach

Huixin Zhan

Feng Tao

Yongcan Cao

256

15 Oct 2020

Reward Machines: Exploiting Reward Function Structure in Reinforcement LearningJournal of Artificial Intelligence Research (JAIR), 2020

541

299

06 Oct 2020

Learning Reward Functions from Diverse Sources of Human Feedback: Optimally Integrating Demonstrations and Preferences

Dorsa Sadigh

412

140

24 Jun 2020

Active Measure Reinforcement Learning for Observation Cost Minimization

184

26 May 2020

Active Preference-Based Gaussian Process Regression for Reward Learning

Erdem Biyik

Nicolas Huynh

Mykel J. Kochenderfer

Dorsa Sadigh

359

131

06 May 2020

Reducing Non-Normative Text Generation from Language Models

257

23 Jan 2020

Learning Norms from Stories: A Prior for Value Aligned AgentsAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2019

198

07 Dec 2019

Reinforcing an Image Caption Generator Using Off-Line Human FeedbackAAAI Conference on Artificial Intelligence (AAAI), 2019

260

21 Nov 2019

Context-aware Active Multi-Step Reinforcement Learning

Gang Chen

Dingcheng Li

Ran Xu

167

11 Nov 2019

Asking Easy Questions: A User-Friendly Approach to Active Reward LearningConference on Robot Learning (CoRL), 2019

Dorsa Sadigh

213

137

10 Oct 2019

Scaling data-driven robotics with reward sketching and batch reinforcement learning

Serkan Cabi

Sergio Gomez Colmenarejo

...

366

26 Sep 2019

Reinforcement Learning in Healthcare: A SurveyACM Computing Surveys (ACM CSUR), 2019

801

733

22 Aug 2019

Dueling Posterior Sampling for Preference-Based Reinforcement LearningConference on Uncertainty in Artificial Intelligence (UAI), 2019

546

04 Aug 2019

Improving User Specifications for Robot Behavior through Active Preference Learning: Framework and Evaluation

Nils Wilde

Alex Blidaru

Stephen L. Smith

Dana Kulić

205

24 Jul 2019

Learning Reward Functions by Integrating Human Demonstrations and Preferences

Malayandi Palan

Nicholas C. Landolfi

Gleb Shevchuk

Dorsa Sadigh

172

146

21 Jun 2019

Batch Active Learning Using Determinantal Point Processes

Erdem Biyik

Kenneth Wang

Nima Anari

Dorsa Sadigh

320

19 Jun 2019

The Green Choice: Learning and Influencing Human Decisions on Shared Roads

Erdem Biyik

Daniel A. Lazar

Dorsa Sadigh

Ramtin Pedarsani

184

03 Apr 2019

Parenting: Safe Reinforcement Learning from Human Input

Christopher Frye

Ilya Feige

182

18 Feb 2019