v1v2v3v4v5 (latest)

Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

10 May 2020

Papers citing "Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL"

21 / 21 papers shown

Title
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics Baihan Lin Guillermo Cecchi Djallel Bouneffouf OffRL AI4TS AI4MH 107 11 0 16 Mar 2023
TherapyView: Visualizing Therapy Sessions with Temporal Topic Modeling and AI-Generated Arts Baihan Lin Stefan Zecevic Djallel Bouneffouf Guillermo Cecchi DiffM 85 5 0 21 Feb 2023
Working Alliance Transformer for Psychotherapy Dialogue Classification Baihan Lin Guillermo Cecchi Djallel Bouneffouf 70 14 0 27 Oct 2022
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook Baihan Lin OffRL AI4TS 129 27 0 24 Oct 2022
Computational Inference in Cognitive Science: Operational, Societal and Ethical Considerations Baihan Lin AI4CE 77 8 0 24 Oct 2022
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning Baihan Lin Guillermo Cecchi Djallel Bouneffouf OffRL 86 12 0 27 Aug 2022
Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling Baihan Lin 65 4 0 26 Apr 2022
Neural Topic Modeling of Psychotherapy Sessions Baihan Lin Djallel Bouneffouf Guillermo Cecchi Ravi Tejwani BDL 124 16 0 13 Apr 2022
Deep Annotation of Therapeutic Working Alliance in Psychotherapy Baihan Lin Guillermo Cecchi Djallel Bouneffouf 98 16 0 12 Apr 2022
Optimal Epidemic Control as a Contextual Combinatorial Bandit with Budget Baihan Lin Djallel Bouneffouf 71 8 0 30 Jun 2021
Etat de lárt sur lápplication des bandits multi-bras Djallel Bouneffouf 61 0 0 04 Jan 2021
Online Semi-Supervised Learning with Bandit Feedback Sohini Upadhyay Mikhail Yurochkin Mayank Agarwal Y. Khazaeni Djallel Bouneffouf 73 7 0 23 Oct 2020
Predicting human decision making in psychological tasks with recurrent neural networks Baihan Lin Djallel Bouneffouf Guillermo Cecchi 62 21 0 22 Oct 2020
Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward Baihan Lin OffRL 66 14 0 17 Sep 2020
Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling Djallel Bouneffouf 47 1 0 21 Jul 2020
Computing the Dirichlet-Multinomial Log-Likelihood Function Djallel Bouneffouf 36 2 0 17 Jul 2020
Contextual Bandit with Missing Rewards Djallel Bouneffouf Sohini Upadhyay Y. Khazaeni OffRL 56 9 0 13 Jul 2020
Online learning with Corrupted context: Corrupted Contextual Bandits Djallel Bouneffouf 32 11 0 26 Jun 2020
Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior Baihan Lin Djallel Bouneffouf Guillermo Cecchi 80 23 0 09 Jun 2020
Speaker Diarization as a Fully Online Learning Problem in MiniVox Baihan Lin Xinxin Zhang 98 16 0 08 Jun 2020
A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry Baihan Lin Guillermo Cecchi Djallel Bouneffouf Jenna M. Reinen Irina Rish OffRL 91 36 0 21 Jun 2019