ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.04516
  4. Cited By
Launchpad: A Programming Model for Distributed Machine Learning Research

Launchpad: A Programming Model for Distributed Machine Learning Research

7 June 2021
Fan Yang
Gabriel Barth-Maron
Piotr Stańczyk
Matthew Hoffman
Siqi Liu
M. Kroiss
Aedan Pope
Alban Rrustemi
ArXiv (abs)PDFHTMLGithub

Papers citing "Launchpad: A Programming Model for Distributed Machine Learning Research"

15 / 15 papers shown
Sample-Efficient Alignment for LLMs
Sample-Efficient Alignment for LLMs
Zichen Liu
Changyu Chen
Chao Du
Wee Sun Lee
Min Lin
315
14
0
03 Nov 2024
Bad Students Make Great Teachers: Active Learning Accelerates
  Large-Scale Visual Understanding
Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual UnderstandingEuropean Conference on Computer Vision (ECCV), 2023
Talfan Evans
Shreya Pathak
Hamza Merzic
Jonathan Schwarz
Ryutaro Tanno
Olivier J. Hénaff
339
29
0
08 Dec 2023
Reinforced Self-Training (ReST) for Language Modeling
Reinforced Self-Training (ReST) for Language Modeling
Çağlar Gülçehre
T. Paine
S. Srinivasan
Ksenia Konyushkova
L. Weerts
...
Chenjie Gu
Wolfgang Macherey
Arnaud Doucet
Orhan Firat
Nando de Freitas
OffRL
516
423
0
17 Aug 2023
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand
  Cores
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand CoresInternational Conference on Learning Representations (ICLR), 2023
Zhiyu Mei
Wei Fu
Jiaxuan Gao
Guang Wang
Huanchen Zhang
Yi Wu
OffRLLRM
458
10
0
29 Jun 2023
Co-Learning Empirical Games and World Models
Co-Learning Empirical Games and World Models
Max O. Smith
Michael P. Wellman
341
4
0
23 May 2023
marl-jax: Multi-Agent Reinforcement Leaning Framework
marl-jax: Multi-Agent Reinforcement Leaning Framework
K. Mehta
Anuj Mahajan
Kiran Ravish
284
3
0
24 Mar 2023
VeLO: Training Versatile Learned Optimizers by Scaling Up
VeLO: Training Versatile Learned Optimizers by Scaling Up
Luke Metz
James Harrison
C. Freeman
Amil Merchant
Lucas Beyer
...
Naman Agrawal
Ben Poole
Igor Mordatch
Adam Roberts
Jascha Narain Sohl-Dickstein
373
78
0
17 Nov 2022
Phantom -- A RL-driven multi-agent framework to model complex systems
Phantom -- A RL-driven multi-agent framework to model complex systemsAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Leo Ardon
Jared Vann
Deepeka Garg
Thomas Spooner
Sumitra Ganesh
258
9
0
12 Oct 2022
MAD for Robust Reinforcement Learning in Machine Translation
MAD for Robust Reinforcement Learning in Machine Translation
Domenic Donato
Lei Yu
Wang Ling
Chris Dyer
MoE
287
8
0
18 Jul 2022
The Frost Hollow Experiments: Pavlovian Signalling as a Path to
  Coordination and Communication Between Agents
The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents
P. Pilarski
Andrew Butcher
Elnaz Davoodi
Michael Bradley Johanson
Dylan J. A. Brenneis
Adam S. R. Parker
Leslie Acker
M. Botvinick
Joseph Modayil
Adam White
AI4CE
227
5
0
17 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
378
12
0
01 Mar 2022
Pavlovian Signalling with General Value Functions in Agent-Agent
  Temporal Decision Making
Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making
Andrew Butcher
Michael Bradley Johanson
Elnaz Davoodi
Dylan J. A. Brenneis
Leslie Acker
Adam S. R. Parker
Adam White
Joseph Modayil
P. Pilarski
AI4CE
361
3
0
11 Jan 2022
Temporally Abstract Partial Models
Temporally Abstract Partial ModelsNeural Information Processing Systems (NeurIPS), 2021
Khimya Khetarpal
Zafarali Ahmed
Gheorghe Comanici
Doina Precup
228
17
0
06 Aug 2021
Integrating Distributed Architectures in Highly Modular RL Libraries
Integrating Distributed Architectures in Highly Modular RL Libraries
Albert Bou
Sebastian Dittert
Gianni De Fabritiis
373
0
0
06 Jul 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
361
243
0
01 Jun 2020
1
Page 1 of 1