ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.09510
  4. Cited By
Near-optimal Policy Identification in Active Reinforcement Learning

Near-optimal Policy Identification in Active Reinforcement Learning

19 December 2022
Xiang Li
Viraj Mehta
Johannes Kirschner
I. Char
W. Neiswanger
J. Schneider
Andreas Krause
Ilija Bogunovic
    OffRL
ArXivPDFHTML

Papers citing "Near-optimal Policy Identification in Active Reinforcement Learning"

7 / 7 papers shown
Title
Active Preference Optimization for Sample Efficient RLHF
Active Preference Optimization for Sample Efficient RLHF
Nirjhar Das
Souradip Chakraborty
Aldo Pacchiano
Sayak Ray Chowdhury
27
13
0
16 Feb 2024
Sample Efficient Preference Alignment in LLMs via Active Exploration
Sample Efficient Preference Alignment in LLMs via Active Exploration
Viraj Mehta
Vikramjeet Das
Ojash Neopane
Yijia Dai
Ilija Bogunovic
Ilija Bogunovic
W. Neiswanger
Stefano Ermon
Jeff Schneider
Willie Neiswanger
OffRL
25
12
0
01 Dec 2023
Distributionally Robust Model-based Reinforcement Learning with Large
  State Spaces
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Shyam Sundhar Ramesh
Pier Giuseppe Sessa
Yifan Hu
Andreas Krause
Ilija Bogunovic
OOD
26
10
0
05 Sep 2023
Multitask Learning with No Regret: from Improved Confidence Bounds to
  Active Learning
Multitask Learning with No Regret: from Improved Confidence Bounds to Active Learning
Pier Giuseppe Sessa
Pierre Laforgue
Nicolò Cesa-Bianchi
Andreas Krause
11
2
0
03 Aug 2023
Kernelized Offline Contextual Dueling Bandits
Kernelized Offline Contextual Dueling Bandits
Viraj Mehta
Ojash Neopane
Vikramjeet Das
Sen Lin
J. Schneider
W. Neiswanger
OffRL
12
3
0
21 Jul 2023
Exploration via Planning for Information about the Optimal Trajectory
Exploration via Planning for Information about the Optimal Trajectory
Viraj Mehta
I. Char
J. Abbate
R. Conlin
M. Boyer
Stefano Ermon
J. Schneider
W. Neiswanger
OffRL
8
6
0
06 Oct 2022
Misspecified Gaussian Process Bandit Optimization
Misspecified Gaussian Process Bandit Optimization
Ilija Bogunovic
Andreas Krause
49
41
0
09 Nov 2021
1