2006.06580
Cited By
v1
v2
v3 (latest)
Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior
9 June 2020
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
Papers citing "Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior"
5 / 5 papers shown
Title
An analytical model of active inference in the Iterated Prisoner's Dilemma
Daphne Demekas
Conor Heins
Brennan Klein
59
1
0
27 Jun 2023
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Baihan Lin
OffRL
AI4TS
133
27
0
24 Oct 2022
Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward
Baihan Lin
OffRL
68
14
0
17 Sep 2020
Contextual Bandit with Missing Rewards
Djallel Bouneffouf
Sohini Upadhyay
Y. Khazaeni
OffRL
56
9
0
13 Jul 2020
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
Jenna M. Reinen
Irina Rish
OffRL
60
24
0
10 May 2020