ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.00980
  4. Cited By
Zeroth Order Non-convex optimization with Dueling-Choice Bandits

Zeroth Order Non-convex optimization with Dueling-Choice Bandits

3 November 2019
Yichong Xu
Aparna R. Joshi
Aarti Singh
A. Dubrawski
ArXiv (abs)PDFHTML

Papers citing "Zeroth Order Non-convex optimization with Dueling-Choice Bandits"

5 / 5 papers shown
Title
Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds
Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds
Aya Kayal
Sattar Vakili
Laura Toni
Da-shan Shiu
A. Bernacchia
26
0
0
29 May 2025
Human-in-the-loop: Provably Efficient Preference-based Reinforcement
  Learning with General Function Approximation
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation
Xiaoyu Chen
Han Zhong
Zhuoran Yang
Zhaoran Wang
Liwei Wang
180
70
0
23 May 2022
Procrastinated Tree Search: Black-box Optimization with Delayed, Noisy,
  and Multi-Fidelity Feedback
Procrastinated Tree Search: Black-box Optimization with Delayed, Noisy, and Multi-Fidelity Feedback
Junxiong Wang
D. Basu
Immanuel Trummer
127
3
0
14 Oct 2021
Bias-Robust Bayesian Optimization via Dueling Bandits
Bias-Robust Bayesian Optimization via Dueling Bandits
Johannes Kirschner
Andreas Krause
48
11
0
25 May 2021
Preference-based Reinforcement Learning with Finite-Time Guarantees
Preference-based Reinforcement Learning with Finite-Time Guarantees
Yichong Xu
Ruosong Wang
Lin F. Yang
Aarti Singh
A. Dubrawski
106
60
0
16 Jun 2020
1