ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1410.4604
  4. Cited By
Domain-Independent Optimistic Initialization for Reinforcement Learning

Domain-Independent Optimistic Initialization for Reinforcement Learning

16 October 2014
Marlos C. Machado
S. Srinivasan
Michael Bowling
    LRMAI4CE
ArXiv (abs)PDFHTML

Papers citing "Domain-Independent Optimistic Initialization for Reinforcement Learning"

12 / 12 papers shown
Advancing Weight and Channel Sparsification with Enhanced Saliency
Advancing Weight and Channel Sparsification with Enhanced SaliencyIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Xinglong Sun
Maying Shen
Hongxu Yin
Lei Mao
Pavlo Molchanov
Jose M. Alvarez
268
1
0
05 Feb 2025
Cyclophobic Reinforcement Learning
Cyclophobic Reinforcement Learning
Stefan Sylvius Wagner
P. Arndt
Jan Robine
Stefan Harmeling
176
1
0
30 Aug 2023
An Empirical Study of Implicit Regularization in Deep Offline RL
An Empirical Study of Implicit Regularization in Deep Offline RL
Çağlar Gülçehre
Srivatsan Srinivasan
Jakub Sygnowski
Georg Ostrovski
Mehrdad Farajtabar
Matt Hoffman
Razvan Pascanu
Arnaud Doucet
OffRL
365
22
0
05 Jul 2022
Optimistic Temporal Difference Learning for 2048
Optimistic Temporal Difference Learning for 2048IEEE Transactions on Games (IEEE Trans. Games), 2021
Hung Guei
Lung-Pin Chen
I-Chen Wu
116
9
0
22 Nov 2021
Optimistic Reinforcement Learning by Forward Kullback-Leibler Divergence
  Optimization
Optimistic Reinforcement Learning by Forward Kullback-Leibler Divergence OptimizationNeural Networks (NN), 2021
Taisuke Kobayashi
241
18
0
27 May 2021
Multi-Armed Bandits for Minesweeper: Profiting from
  Exploration-Exploitation Synergy
Multi-Armed Bandits for Minesweeper: Profiting from Exploration-Exploitation SynergyIEEE Transactions on Games (IEEE Trans. Games), 2020
Igor Q. Lordeiro
D. B. Haddad
Douglas O. Cardoso
113
2
0
25 Jul 2020
A Comparison of Self-Play Algorithms Under a Generalized Framework
A Comparison of Self-Play Algorithms Under a Generalized Framework
Daniel Hernández
Kevin Denamganai
Sam Devlin
Spyridon Samothrakis
James Alfred Walker
206
13
0
08 Jun 2020
Optimistic Exploration even with a Pessimistic Initialisation
Optimistic Exploration even with a Pessimistic InitialisationInternational Conference on Learning Representations (ICLR), 2020
Tabish Rashid
Bei Peng
Wendelin Bohmer
Shimon Whiteson
OffRLOnRL
178
50
0
26 Feb 2020
Revisiting the Arcade Learning Environment: Evaluation Protocols and
  Open Problems for General Agents
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
446
601
0
18 Sep 2017
Agent-Agnostic Human-in-the-Loop Reinforcement Learning
Agent-Agnostic Human-in-the-Loop Reinforcement Learning
David Abel
J. Salvatier
Andreas Stuhlmuller
Owain Evans
197
65
0
15 Jan 2017
Unifying Count-Based Exploration and Intrinsic Motivation
Unifying Count-Based Exploration and Intrinsic MotivationNeural Information Processing Systems (NeurIPS), 2016
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
611
1,634
0
06 Jun 2016
State of the Art Control of Atari Games Using Shallow Reinforcement
  Learning
State of the Art Control of Atari Games Using Shallow Reinforcement Learning
Yitao Liang
Marlos C. Machado
Erik Talvitie
Michael Bowling
318
114
0
04 Dec 2015
1
Page 1 of 1