Reverb: A Framework For Experience Replay

9 February 2021
Albin Cassirer
Gabriel Barth-Maron
E. Brevdo
Sabela Ramos
Toby Boyd
Thibault Sottiaux
M. Kroiss
VLM, OffRL

Papers citing "Reverb: A Framework For Experience Replay"

23 / 23 papers shown
Laminar: A Scalable Asynchronous RL Post-Training Framework
Guangming Sheng
Yuxuan Tong
Borui Wan
W. Zhang
Chaobo Jia
...
Chi Zhang
Yanghua Peng
H. Lin
Xin Liu
Chuan Wu
174
13
0
14 Oct 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
AAAI Conference on Artificial Intelligence (AAAI), 2025
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
448
0
0
27 Feb 2025
Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding
European Conference on Computer Vision (ECCV), 2023
Talfan Evans
Shreya Pathak
Hamza Merzic
Jonathan Schwarz
Ryutaro Tanno
Olivier J. Hénaff
338
28
0
08 Dec 2023
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
International Conference on Machine Learning (ICML), 2023
Hanjing Wang
Man-Kit Sit
Cong He
Ying Wen
Weinan Zhang
Jun Wang
Yaodong Yang
Kai Zou
OffRL, VLM
268
5
0
08 Oct 2023
Curious Replay for Model-based Adaptation
International Conference on Machine Learning (ICML), 2023
Isaac Kauvar
Christopher Doyle
Linqi Zhou
Nick Haber
196
18
0
28 Jun 2023
marl-jax: Multi-Agent Reinforcement Leaning Framework
K. Mehta
Anuj Mahajan
Kiran Ravish
284
3
0
24 Mar 2023
Hierarchical Reinforcement Learning in Complex 3D Environments
Bernardo Avila-Pires
Feryal M. P. Behbahani
Hubert Soyer
Kyriacos Nikiforou
Thomas Keck
Satinder Singh
OffRL
199
0
0
28 Feb 2023
Zero-Shot Transfer of Haptics-Based Object Insertion Policies
IEEE International Conference on Robotics and Automation (ICRA), 2023
Samarth Brahmbhatt
A. Deka
Andrew Spielberg
M. Muller
403
7
0
29 Jan 2023
Understanding Self-Predictive Learning for Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Yunhao Tang
Z. Guo
Pierre Harvey Richemond
Bernardo Avila-Pires
Yash Chandak
...
S. Thakoor
Will Dabney
Bilal Piot
Daniele Calandriello
Michal Valko
SSL
342
45
0
06 Dec 2022
Event Tables for Efficient Experience Replay
Varun Kompella
Thomas J. Walsh
Samuel Barrett
Peter R. Wurman
Peter Stone
OffRL
274
6
0
01 Nov 2022
A Mixture of Surprises for Unsupervised Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Andrew Zhao
Matthieu Lin
Yangguang Li
Wenshu Fan
Gao Huang
261
14
0
13 Oct 2022
Phantom -- A RL-driven multi-agent framework to model complex systems
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Leo Ardon
Jared Vann
Deepeka Garg
Thomas Spooner
Sumitra Ganesh
256
9
0
12 Oct 2022
Human-level Atari 200x faster
International Conference on Learning Representations (ICLR), 2022
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
263
41
0
15 Sep 2022
Beyond Supervised Continual Learning: a Review
Benedikt Bagus
A. Gepperth
Timothée Lesort
BDL, CLL
318
14
0
30 Aug 2022
Open Source Vizier: Distributed Infrastructure and API for Reliable and Flexible Blackbox Optimization
Xingyou Song
Sagi Perel
Chansoo Lee
Greg Kochanski
Daniel Golovin
392
28
0
27 Jul 2022
MAD for Robust Reinforcement Learning in Machine Translation
Domenic Donato
Lei Yu
Wang Ling
Chris Dyer
MoE
277
8
0
18 Jul 2022
Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
Neural Information Processing Systems (NeurIPS), 2022
J. MacGlashan
Evan Archer
A. Devlic
Takuma Seno
Craig Sherstan
Peter R. Wurman
AI PeterStoneSony
239
10
0
24 Jun 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
377
12
0
01 Mar 2022
Distillation of RL Policies with Formal Guarantees via Variational Abstraction of Markov Decision Processes (Technical Report)
AAAI Conference on Artificial Intelligence (AAAI), 2021
Florent Delgrange
Ann Nowé
Guillermo A. Pérez
OffRL
322
13
0
17 Dec 2021
Evaluating model-based planning and planner amortization for continuous control
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
M. Berk Mirza
Alessandro Davide Ialongo
...
Jost Tobias Springenberg
A. Abdolmaleki
N. Heess
J. Merel
Martin Riedmiller
216
18
0
07 Oct 2021
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment & Operations
AI Redefined
S. Gottipati
Sagar Kurandwad
Clodéric Mars
Gregory Szriftgiser
François Chabot
240
9
0
21 Jun 2021
Launchpad: A Programming Model for Distributed Machine Learning Research
Fan Yang
Gabriel Barth-Maron
Piotr Stańczyk
Matthew Hoffman
Siqi Liu
M. Kroiss
Aedan Pope
Alban Rrustemi
162
25
0
07 Jun 2021
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
358
242
0
01 Jun 2020