Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2306.07124
Cited By
v1
v2 (latest)
Diverse Projection Ensembles for Distributional Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
12 June 2023
Moritz A. Zanger
Wendelin Bohmer
M. Spaan
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Diverse Projection Ensembles for Distributional Reinforcement Learning"
4 / 4 papers shown
DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training
Dingwei Zhu
Zhiheng Xi
Shihan Dou
Yuhui Wang
Sixian Li
...
Caishuang Huang
Yunke Zhang
Demei Yan
Yuran Wang
Tao Gui
OffRL
153
0
0
03 Dec 2025
Universal Value-Function Uncertainties
Moritz A. Zanger
Max Weltevrede
Yaniv Oren
Pascal R. van der Vaart
Caroline Horsch
Wendelin Bohmer
M. Spaan
OffRL
291
1
0
27 May 2025
Epistemic Artificial Intelligence is Essential for Machine Learning Models to Truly 'Know When They Do Not Know'
Shireen Kudukkil Manchingal
Andrew Bradley
Julian F. P. Kooij
Keivan K1 Shariatmadar
Neil Yorke-Smith
Fabio Cuzzolin
587
1
0
08 May 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCV
BDL
1.1K
5
0
14 Mar 2025
1
Page 1 of 1