Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.04544
Cited By
v1
v2
v3
v4
v5 (latest)
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL
10 May 2020
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
Jenna M. Reinen
Irina Rish
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL"
21 / 21 papers shown
Title
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
AI4TS
AI4MH
107
11
0
16 Mar 2023
TherapyView: Visualizing Therapy Sessions with Temporal Topic Modeling and AI-Generated Arts
Baihan Lin
Stefan Zecevic
Djallel Bouneffouf
Guillermo Cecchi
DiffM
85
5
0
21 Feb 2023
Working Alliance Transformer for Psychotherapy Dialogue Classification
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
70
14
0
27 Oct 2022
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Baihan Lin
OffRL
AI4TS
129
27
0
24 Oct 2022
Computational Inference in Cognitive Science: Operational, Societal and Ethical Considerations
Baihan Lin
AI4CE
77
8
0
24 Oct 2022
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
86
12
0
27 Aug 2022
Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling
Baihan Lin
65
4
0
26 Apr 2022
Neural Topic Modeling of Psychotherapy Sessions
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
Ravi Tejwani
BDL
124
16
0
13 Apr 2022
Deep Annotation of Therapeutic Working Alliance in Psychotherapy
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
98
16
0
12 Apr 2022
Optimal Epidemic Control as a Contextual Combinatorial Bandit with Budget
Baihan Lin
Djallel Bouneffouf
71
8
0
30 Jun 2021
Etat de lárt sur lápplication des bandits multi-bras
Djallel Bouneffouf
61
0
0
04 Jan 2021
Online Semi-Supervised Learning with Bandit Feedback
Sohini Upadhyay
Mikhail Yurochkin
Mayank Agarwal
Y. Khazaeni
Djallel Bouneffouf
73
7
0
23 Oct 2020
Predicting human decision making in psychological tasks with recurrent neural networks
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
62
21
0
22 Oct 2020
Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward
Baihan Lin
OffRL
66
14
0
17 Sep 2020
Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling
Djallel Bouneffouf
47
1
0
21 Jul 2020
Computing the Dirichlet-Multinomial Log-Likelihood Function
Djallel Bouneffouf
36
2
0
17 Jul 2020
Contextual Bandit with Missing Rewards
Djallel Bouneffouf
Sohini Upadhyay
Y. Khazaeni
OffRL
56
9
0
13 Jul 2020
Online learning with Corrupted context: Corrupted Contextual Bandits
Djallel Bouneffouf
32
11
0
26 Jun 2020
Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
80
23
0
09 Jun 2020
Speaker Diarization as a Fully Online Learning Problem in MiniVox
Baihan Lin
Xinxin Zhang
98
16
0
08 Jun 2020
A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
Jenna M. Reinen
Irina Rish
OffRL
91
36
0
21 Jun 2019
1