ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.04544
  4. Cited By
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits
  and RL
v1v2v3v4v5 (latest)

Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

10 May 2020
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
Jenna M. Reinen
Irina Rish
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL"

21 / 21 papers shown
Title
Psychotherapy AI Companion with Reinforcement Learning Recommendations
  and Interpretable Policy Dynamics
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRLAI4TSAI4MH
107
11
0
16 Mar 2023
TherapyView: Visualizing Therapy Sessions with Temporal Topic Modeling
  and AI-Generated Arts
TherapyView: Visualizing Therapy Sessions with Temporal Topic Modeling and AI-Generated Arts
Baihan Lin
Stefan Zecevic
Djallel Bouneffouf
Guillermo Cecchi
DiffM
85
5
0
21 Feb 2023
Working Alliance Transformer for Psychotherapy Dialogue Classification
Working Alliance Transformer for Psychotherapy Dialogue Classification
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
70
14
0
27 Oct 2022
Reinforcement Learning and Bandits for Speech and Language Processing:
  Tutorial, Review and Outlook
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Baihan Lin
OffRLAI4TS
129
27
0
24 Oct 2022
Computational Inference in Cognitive Science: Operational, Societal and
  Ethical Considerations
Computational Inference in Cognitive Science: Operational, Societal and Ethical Considerations
Baihan Lin
AI4CE
77
8
0
24 Oct 2022
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy
  Treatment Strategies with Deep Reinforcement Learning
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
86
12
0
27 Aug 2022
Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling
Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling
Baihan Lin
65
4
0
26 Apr 2022
Neural Topic Modeling of Psychotherapy Sessions
Neural Topic Modeling of Psychotherapy Sessions
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
Ravi Tejwani
BDL
124
16
0
13 Apr 2022
Deep Annotation of Therapeutic Working Alliance in Psychotherapy
Deep Annotation of Therapeutic Working Alliance in Psychotherapy
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
98
16
0
12 Apr 2022
Optimal Epidemic Control as a Contextual Combinatorial Bandit with
  Budget
Optimal Epidemic Control as a Contextual Combinatorial Bandit with Budget
Baihan Lin
Djallel Bouneffouf
71
8
0
30 Jun 2021
Etat de lárt sur lápplication des bandits multi-bras
Etat de lárt sur lápplication des bandits multi-bras
Djallel Bouneffouf
61
0
0
04 Jan 2021
Online Semi-Supervised Learning with Bandit Feedback
Online Semi-Supervised Learning with Bandit Feedback
Sohini Upadhyay
Mikhail Yurochkin
Mayank Agarwal
Y. Khazaeni
Djallel Bouneffouf
73
7
0
23 Oct 2020
Predicting human decision making in psychological tasks with recurrent
  neural networks
Predicting human decision making in psychological tasks with recurrent neural networks
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
62
21
0
22 Oct 2020
Online Semi-Supervised Learning in Contextual Bandits with Episodic
  Reward
Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward
Baihan Lin
OffRL
66
14
0
17 Sep 2020
Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling
Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling
Djallel Bouneffouf
47
1
0
21 Jul 2020
Computing the Dirichlet-Multinomial Log-Likelihood Function
Computing the Dirichlet-Multinomial Log-Likelihood Function
Djallel Bouneffouf
36
2
0
17 Jul 2020
Contextual Bandit with Missing Rewards
Contextual Bandit with Missing Rewards
Djallel Bouneffouf
Sohini Upadhyay
Y. Khazaeni
OffRL
56
9
0
13 Jul 2020
Online learning with Corrupted context: Corrupted Contextual Bandits
Online learning with Corrupted context: Corrupted Contextual Bandits
Djallel Bouneffouf
32
11
0
26 Jun 2020
Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior
Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
80
23
0
09 Jun 2020
Speaker Diarization as a Fully Online Learning Problem in MiniVox
Speaker Diarization as a Fully Online Learning Problem in MiniVox
Baihan Lin
Xinxin Zhang
98
16
0
08 Jun 2020
A Story of Two Streams: Reinforcement Learning Models from Human
  Behavior and Neuropsychiatry
A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
Jenna M. Reinen
Irina Rish
OffRL
91
36
0
21 Jun 2019
1