v1v2 (latest)

Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward

17 September 2020

Papers citing "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward"

3 / 3 papers shown

Title
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook Baihan Lin OffRL AI4TS 133 27 0 24 Oct 2022
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning Baihan Lin Guillermo Cecchi Djallel Bouneffouf OffRL 86 12 0 27 Aug 2022
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL Baihan Lin Guillermo Cecchi Djallel Bouneffouf Jenna M. Reinen Irina Rish OffRL 60 24 0 10 May 2020