Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2006.06580
Cited By

Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior

v1v2v3 (latest)

Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior

9 June 2020

Djallel Bouneffouf

Guillermo Cecchi

ArXiv (abs)PDF HTML

Papers citing "Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior"

14 / 14 papers shown

Survey: Multi-Armed Bandits Meet Large Language Models

Survey: Multi-Armed Bandits Meet Large Language Models

Djallel Bouneffouf

398

4

0

19 May 2025

An analytical model of active inference in the Iterated Prisoner's
Dilemma

An analytical model of active inference in the Iterated Prisoner's Dilemma

231

3

0

27 Jun 2023

A Reinforcement Learning Framework for Online Speaker Diarization

A Reinforcement Learning Framework for Online Speaker Diarization

388

2

0

21 Feb 2023

Reinforcement Learning and Bandits for Speech and Language Processing:
Tutorial, Review and Outlook

Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and OutlookExpert systems with applications (ESWA), 2022

479

30

0

24 Oct 2022

Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling

Evolutionary Multi-Armed Bandits with Genetic Thompson SamplingIEEE Congress on Evolutionary Computation (CEC), 2022

190

6

0

26 Apr 2022

Optimal Epidemic Control as a Contextual Combinatorial Bandit with
Budget

Optimal Epidemic Control as a Contextual Combinatorial Bandit with BudgetIEEE International Conference on Fuzzy Systems (FUZZ-IEEE), 2021

Djallel Bouneffouf

325

8

0

30 Jun 2021

Etat de lárt sur lápplication des bandits multi-bras

Etat de lárt sur lápplication des bandits multi-bras

Djallel Bouneffouf

347

0

0

04 Jan 2021

Online Semi-Supervised Learning in Contextual Bandits with Episodic
Reward

Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward

299

14

0

17 Sep 2020

Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling

Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling

Djallel Bouneffouf

252

1

0

21 Jul 2020

Computing the Dirichlet-Multinomial Log-Likelihood Function

Computing the Dirichlet-Multinomial Log-Likelihood Function

Djallel Bouneffouf

280

2

0

17 Jul 2020

Contextual Bandit with Missing Rewards

Contextual Bandit with Missing Rewards

Djallel Bouneffouf

Sohini Upadhyay

280

10

0

13 Jul 2020

Online learning with Corrupted context: Corrupted Contextual Bandits

Online learning with Corrupted context: Corrupted Contextual Bandits

Djallel Bouneffouf

251

13

0

26 Jun 2020

Speaker Diarization as a Fully Online Learning Problem in MiniVox

Speaker Diarization as a Fully Online Learning Problem in MiniVox

444

16

0

08 Jun 2020

Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits
and RL

Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

Guillermo Cecchi

Djallel Bouneffouf

Jenna M. Reinen

493

25

0

10 May 2020

Page 1 of 1