Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation

29 May 2024

Bo Xu

Papers citing "Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation"

9 / 9 papers shown

STAIR: Addressing Stage Misalignment through Temporal-Aligned Preference Reinforcement Learning

28 Sep 2025

DexFlyWheel: A Scalable and Self-improving Data Generation Framework for Dexterous Manipulation

...

28 Sep 2025

Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance

181

30 Jul 2025

$β$-DQN: Improving Deep Q-Learning By Evolving the Behavior

β

-DQN: Improving Deep Q-Learning By Evolving the BehaviorAdaptive Agents and Multi-Agent Systems (AAMAS), 2025

374

01 Jan 2025

RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted BehaviorsAAAI Conference on Artificial Intelligence (AAAI), 2024

334

14 Dec 2024

Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction TuningAAAI Conference on Artificial Intelligence (AAAI), 2024

Runchuan Zhu

Zhipeng Ma

Jiang Wu

Junyuan Gao

Jiaqi Wang

Dahua Lin

Conghui He

180

09 Oct 2024

Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving

Junchi Yan

418

137

06 Jun 2024

PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic ManipulationInternational Conference on Machine Learning (ICML), 2023

350

06 Jun 2023

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Yuke Zhu

J. Wong

Ajay Mandlekar

Roberto Martín-Martín

Abhishek Joshi

Soroush Nasiriany

Yifeng Zhu

Soroush Nasiriany

Yifeng Zhu

525

562

25 Sep 2020