ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.07124
  4. Cited By
Diverse Projection Ensembles for Distributional Reinforcement Learning
v1v2 (latest)

Diverse Projection Ensembles for Distributional Reinforcement Learning

International Conference on Learning Representations (ICLR), 2023
12 June 2023
Moritz A. Zanger
Wendelin Bohmer
M. Spaan
ArXiv (abs)PDFHTML

Papers citing "Diverse Projection Ensembles for Distributional Reinforcement Learning"

4 / 4 papers shown
DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training
DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training
Dingwei Zhu
Zhiheng Xi
Shihan Dou
Yuhui Wang
Sixian Li
...
Caishuang Huang
Yunke Zhang
Demei Yan
Yuran Wang
Tao Gui
OffRL
153
0
0
03 Dec 2025
Universal Value-Function Uncertainties
Universal Value-Function Uncertainties
Moritz A. Zanger
Max Weltevrede
Yaniv Oren
Pascal R. van der Vaart
Caroline Horsch
Wendelin Bohmer
M. Spaan
OffRL
291
1
0
27 May 2025
Epistemic Artificial Intelligence is Essential for Machine Learning Models to Truly 'Know When They Do Not Know'
Epistemic Artificial Intelligence is Essential for Machine Learning Models to Truly 'Know When They Do Not Know'
Shireen Kudukkil Manchingal
Andrew Bradley
Julian F. P. Kooij
Keivan K1 Shariatmadar
Neil Yorke-Smith
Fabio Cuzzolin
587
1
0
08 May 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCVBDL
1.1K
5
0
14 Mar 2025
1
Page 1 of 1