ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.08457
  4. Cited By
Online Semi-Supervised Learning in Contextual Bandits with Episodic
  Reward
v1v2 (latest)

Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward

17 September 2020
Baihan Lin
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward"

3 / 3 papers shown
Title
Reinforcement Learning and Bandits for Speech and Language Processing:
  Tutorial, Review and Outlook
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Baihan Lin
OffRLAI4TS
133
27
0
24 Oct 2022
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy
  Treatment Strategies with Deep Reinforcement Learning
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
86
12
0
27 Aug 2022
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits
  and RL
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
Jenna M. Reinen
Irina Rish
OffRL
60
24
0
10 May 2020
1