1908.08342
A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation
Neural Information Processing Systems (NeurIPS), 2019
21 August 2019
Runzhe Yang
Xingyuan Sun
Karthik Narasimhan
Papers citing
"A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation"
50 / 138 papers shown
Limitations of Scalarisation in MORL: A Comparative Study in Discrete Environments
Muhammad Sa'ood Shah
Asad Jeewa
170
0
0
20 Nov 2025
Parametric Pareto Set Learning for Expensive Multi-Objective Optimization
Ji Cheng
Bo Xue
Qingfu Zhang
127
1
0
08 Nov 2025
Iterative Foundation Model Fine-Tuning on Multiple Rewards
Pouya M. Ghari
Simone Sciabola
Ye Wang
OffRL
156
0
0
31 Oct 2025
Multi-Objective Reinforcement Learning with Max-Min Criterion: A Game-Theoretic Approach
Woohyeon Byeon
Giseung Park
Jongseong Chae
Amir Leshem
Y. Sung
223
1
0
23 Oct 2025
Game-Theoretic Understandings of Multi-Agent Systems with Multiple Objectives
Yue Wang
187
0
0
27 Sep 2025
Goals and the Structure of Experience
Nadav Amir
Stas Tiomkin
Angela Langdon
187
0
0
20 Aug 2025
Pareto Multi-Objective Alignment for Language Models
Qiang He
S. Maghsudi
163
5
0
11 Aug 2025
Multi-Policy Pareto Front Tracking Based Online and Offline Multi-Objective Reinforcement Learning
Zeyu Zhao
Yueling Che
Kaichen Liu
Jian Li
Junmei Yao
OffRL
238
0
0
04 Aug 2025
Reinforcement Learning for Multi-Objective Multi-Echelon Supply Chain Optimisation
Rifny Rachman
Josh Tingey
Richard Allmendinger
Pradyumn Shukla
Wei Pan
143
2
0
26 Jul 2025
BEAVER: Building Environments with Assessable Variation for Evaluating Multi-Objective Reinforcement Learning
Ruohong Liu
Jack Umenberger
Yize Chen
291
0
0
10 Jul 2025
Dual-Objective Reinforcement Learning with Novel Hamilton-Jacobi-Bellman Formulations
William Sharpless
Dylan Hirsch
S. Tonkens
Nikhil Shinde
Sylvia Herbert
223
4
0
19 Jun 2025
Dynamic Preference Multi-Objective Reinforcement Learning for Internet Network Management
DongNyeong Heo
Daniela N. Rim
Heeyoul Choi
155
1
0
16 Jun 2025
Interpretability by Design for Efficient Multi-Objective Reinforcement Learning
Qiyue Xia
J. Michael Herrmann
283
1
0
04 Jun 2025
AMOR: Adaptive Character Control through Multi-Objective Reinforcement Learning
Lucas N. Alegre
Agon Serifi
Ruben Grandia
David Müller
Espen Knoop
Moritz Bächer
294
3
0
29 May 2025
Diffusion Blend: Inference-Time Multi-Preference Alignment for Diffusion Models
Min Cheng
Fatemeh Doudi
D. Kalathil
Mohammad Ghavamzadeh
P. R. Kumar
357
2
0
24 May 2025
DSADF: Thinking Fast and Slow for Decision Making
Alex Zhihao Dou
Dongfei Cui
Jun Yan
Wei Wang
Benteng Chen
Haoming Wang
Zeke Xie
Shufei Zhang
OffRL
619
4
0
13 May 2025
Constructing an Optimal Behavior Basis for the Option Keyboard
L. N. Alegre
A. Bazzan
André Barreto
Bruno C. da Silva
306
2
0
01 May 2025
FAST-Q: Fast-track Exploration with Adversarially Balanced State Representations for Counterfactual Action Estimation in Offline Reinforcement Learning
The Web Conference (WWW), 2025
Pulkit Agrawal
Rukma Talwadker
Aditya Pareek
Tridib Mukherjee
OffRL
282
0
0
30 Apr 2025
HypRL: Reinforcement Learning of Control Policies for Hyperproperties
Tzu-Han Hsu
Arshia Rafieioskouei
Borzoo Bonakdarpour
579
2
0
07 Apr 2025
Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs
International Conference on Learning Representations (ICLR), 2025
Wei-Ting Hung
Shao-Hua Sun
Ping-Chun Hsieh
283
5
0
17 Mar 2025
SNPL: Simultaneous Policy Learning and Evaluation for Safe Multi-Objective Policy Improvement
Brian M Cho
Ana-Roxana Pop
Ariel Evince
Nathan Kallus
OffRL
282
0
0
17 Mar 2025
Incentivizing Multi-Tenant Split Federated Learning for Foundation Models at the Network Edge
Songyuan Li
Jia Hu
Geyong Min
Haojun Huang
FedML
932
1
0
06 Mar 2025
On Generalization Across Environments In Multi-Objective Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Jayden Teoh
Pradeep Varakantham
Peter Vamplew
OffRL
341
6
0
02 Mar 2025
Reward Dimension Reduction for Scalable Multi-Objective Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Giseung Park
Y. Sung
OffRL
248
0
0
28 Feb 2025
Multi-Objective Reinforcement Learning for Critical Scenario Generation of Autonomous Vehicles
Jiahui Wu
Chengjie Lu
Aitor Arrieta
Shaukat Ali
170
3
0
18 Feb 2025
Navigating the Social Welfare Frontier: Portfolios for Multi-objective Reinforcement Learning
Cheol Woo Kim
Jai Moondra
Shresth Verma
Madeleine Pollack
Lingkai Kong
Milind Tambe
Swati Gupta
353
1
0
13 Feb 2025
Mol-MoE: Training Preference-Guided Routers for Molecule Generation
Diego Calanzone
P. D'Oro
Pierre-Luc Bacon
335
3
0
08 Feb 2025
Pareto Set Learning for Multi-Objective Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2025
Erlong Liu
Yu-Chang Wu
Xiaobin Huang
Chengrui Gao
Ren-Jian Wang
Ke Xue
Chao Qian
OffRL
682
22
0
12 Jan 2025
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Helia Hashemi
J. Eisner
Corby Rosset
Benjamin Van Durme
Chris Kedzie
561
57
0
03 Jan 2025
Preference-Conditioned Gradient Variations for Multi-Objective Quality-Diversity
Hannah Janmohamed
Maxence Faldor
Thomas Pierrot
Antoine Cully
415
1
0
19 Nov 2024
Policy Aggregation
Neural Information Processing Systems (NeurIPS), 2024
Parand A. Alamdari
Soroush Ebadian
Ariel D. Procaccia
OffRL
287
9
0
06 Nov 2024
Unlocking the Potential of Global Human Expertise
Neural Information Processing Systems (NeurIPS), 2024
Elliot Meyerson
Olivier Francon
Darren Sargent
Babak Hodjat
Risto Miikkulainen
316
2
0
31 Oct 2024
How to Find the Exact Pareto Front for Multi-Objective MDPs?
International Conference on Learning Representations (ICLR), 2024
Yining Li
Peizhong Ju
Ness B. Shroff
994
3
0
21 Oct 2024
MFC-EQ: Mean-Field Control with Envelope Q-Learning for Moving Decentralized Agents in Formation
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Qiushi Lin
Hang Ma
238
1
0
15 Oct 2024
Domains as Objectives: Domain-Uncertainty-Aware Policy Optimization through Explicit Multi-Domain Convex Coverage Set Learning
Wendyam Eric Lionel Ilboudo
Taisuke Kobayashi
Takamitsu Matsubara
264
0
0
07 Oct 2024
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
306
5
0
03 Oct 2024
Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learning
Junlin Lu
Patrick Mannion
Karl Mason
257
2
0
30 Sep 2024
Stage-Wise Reward Shaping for Acrobatic Robots: A Constrained Multi-Objective Reinforcement Learning Approach
IEEE International Conference on Robotics and Automation (ICRA), 2024
Dohyeong Kim
Hyeokjin Kwon
Junseok Kim
Gunmin Lee
Songhwai Oh
234
13
0
24 Sep 2024
MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning
Yifu Yuan
Zhenrui Zheng
Zibin Dong
Jianye Hao
OffRL
415
6
0
28 Aug 2024
Thresholded Lexicographic Ordered Multiobjective Reinforcement Learning
European Conference on Artificial Intelligence (ECAI), 2024
Alperen Tercan
Vinayak S. Prabhu
209
9
0
24 Aug 2024
Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Woo Kyung Kim
Minjong Yoo
Honguk Woo
OffRL
228
0
0
22 Aug 2024
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards
Shresth Verma
Niclas Boehmer
Lingkai Kong
Milind Tambe
531
8
0
22 Aug 2024
Preference-Optimized Pareto Set Learning for Blackbox Optimization
Zhang Haishan
Chen Liang
Koji Tsuda
286
1
0
19 Aug 2024
Learning in Multi-Objective Public Goods Games with Non-Linear Utilities
European Conference on Artificial Intelligence (ECAI), 2024
Nicole Orzan
Erman Acar
Davide Grossi
Patrick Mannion
Roxana Rădulescu
176
0
0
01 Aug 2024
A Meta-Learning Approach for Multi-Objective Reinforcement Learning in Sustainable Home Environments
Junlin Lu
Patrick Mannion
Karl Mason
231
2
0
16 Jul 2024
Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees
Alexia Jolicoeur-Martineau
A. Baratin
Kisoo Kwon
Boris Knyazev
Yan Zhang
379
4
0
12 Jul 2024
Learning Pareto Set for Multi-Objective Continuous Robot Control
Tianye Shu
Ke Shang
Cheng Gong
Yang Nan
H. Ishibuchi
241
10
0
27 Jun 2024
OCCAM: Online Continuous Controller Adaptation with Meta-Learned Models
Hersh Sanghvi
Spencer Folk
Camillo J Taylor
264
5
0
25 Jun 2024
Training Greedy Policy for Proposal Batch Selection in Expensive Multi-Objective Combinatorial Optimization
Deokjae Lee
Hyun Oh Song
Kyunghyun Cho
OffRL
263
0
0
21 Jun 2024
The Max-Min Formulation of Multi-Objective Reinforcement Learning: From Theory to a Model-Free Algorithm
Giseung Park
Woohyeon Byeon
Seongmin Kim
Elad Havakuk
Amir Leshem
Youngchul Sung
264
6
0
12 Jun 2024