ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.05527
  4. Cited By
DOPL: Direct Online Preference Learning for Restless Bandits with
  Preference Feedback

DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback

7 October 2024
Guojun Xiong
Ujwal Dinesha
Debajoy Mukherjee
Jian Li
Srinivas Shakkottai
ArXivPDFHTML

Papers citing "DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback"

1 / 1 papers shown
Title
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal
  Reinforcement for Enhanced Financial Decision Making
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
Yangyang Yu
Zhiyuan Yao
Haohang Li
Zhiyang Deng
Yupeng Cao
...
Guojun Xiong
Yueru He
Jimin Huang
Dong Li
Qianqian Xie
AIFin
LLMAG
39
13
0
09 Jul 2024
1