ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.06580
  4. Cited By
Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior
v1v2v3 (latest)

Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior

9 June 2020
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
ArXiv (abs)PDFHTML

Papers citing "Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior"

14 / 14 papers shown
Survey: Multi-Armed Bandits Meet Large Language Models
Survey: Multi-Armed Bandits Meet Large Language Models
Djallel Bouneffouf
Raphael Feraud
398
4
0
19 May 2025
An analytical model of active inference in the Iterated Prisoner's
  Dilemma
An analytical model of active inference in the Iterated Prisoner's Dilemma
Daphne Demekas
Conor Heins
Brennan Klein
231
3
0
27 Jun 2023
A Reinforcement Learning Framework for Online Speaker Diarization
A Reinforcement Learning Framework for Online Speaker Diarization
Baihan Lin
Xinxin Zhang
OffRL
388
2
0
21 Feb 2023
Reinforcement Learning and Bandits for Speech and Language Processing:
  Tutorial, Review and Outlook
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and OutlookExpert systems with applications (ESWA), 2022
Baihan Lin
OffRLAI4TS
479
30
0
24 Oct 2022
Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling
Evolutionary Multi-Armed Bandits with Genetic Thompson SamplingIEEE Congress on Evolutionary Computation (CEC), 2022
Baihan Lin
190
6
0
26 Apr 2022
Optimal Epidemic Control as a Contextual Combinatorial Bandit with
  Budget
Optimal Epidemic Control as a Contextual Combinatorial Bandit with BudgetIEEE International Conference on Fuzzy Systems (FUZZ-IEEE), 2021
Baihan Lin
Djallel Bouneffouf
325
8
0
30 Jun 2021
Etat de lárt sur lápplication des bandits multi-bras
Etat de lárt sur lápplication des bandits multi-bras
Djallel Bouneffouf
347
0
0
04 Jan 2021
Online Semi-Supervised Learning in Contextual Bandits with Episodic
  Reward
Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward
Baihan Lin
OffRL
299
14
0
17 Sep 2020
Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling
Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling
Djallel Bouneffouf
252
1
0
21 Jul 2020
Computing the Dirichlet-Multinomial Log-Likelihood Function
Computing the Dirichlet-Multinomial Log-Likelihood Function
Djallel Bouneffouf
280
2
0
17 Jul 2020
Contextual Bandit with Missing Rewards
Contextual Bandit with Missing Rewards
Djallel Bouneffouf
Sohini Upadhyay
Y. Khazaeni
OffRL
280
10
0
13 Jul 2020
Online learning with Corrupted context: Corrupted Contextual Bandits
Online learning with Corrupted context: Corrupted Contextual Bandits
Djallel Bouneffouf
251
13
0
26 Jun 2020
Speaker Diarization as a Fully Online Learning Problem in MiniVox
Speaker Diarization as a Fully Online Learning Problem in MiniVox
Baihan Lin
Xinxin Zhang
444
16
0
08 Jun 2020
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits
  and RL
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
Jenna M. Reinen
Irina Rish
OffRL
493
25
0
10 May 2020
1
Page 1 of 1