Launchpad: A Programming Model for Distributed Machine Learning Research

7 June 2021

Fan Yang

Matthew Hoffman

ArXiv (abs)PDF HTML Github

Papers citing "Launchpad: A Programming Model for Distributed Machine Learning Research"

15 / 15 papers shown

Sample-Efficient Alignment for LLMs

315

03 Nov 2024

Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual UnderstandingEuropean Conference on Computer Vision (ECCV), 2023

339

08 Dec 2023

Reinforced Self-Training (ReST) for Language Modeling

...

516

423

17 Aug 2023

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand CoresInternational Conference on Learning Representations (ICLR), 2023

458

29 Jun 2023

Co-Learning Empirical Games and World Models

Max O. Smith

Michael P. Wellman

341

23 May 2023

marl-jax: Multi-Agent Reinforcement Leaning Framework

K. Mehta

Anuj Mahajan

Kiran Ravish

284

24 Mar 2023

VeLO: Training Versatile Learned Optimizers by Scaling Up

...

Jascha Narain Sohl-Dickstein

373

17 Nov 2022

Phantom -- A RL-driven multi-agent framework to model complex systemsAdaptive Agents and Multi-Agent Systems (AAMAS), 2022

258

12 Oct 2022

MAD for Robust Reinforcement Learning in Machine Translation

287

18 Jul 2022

The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents

P. Pilarski

Andrew Butcher

Elnaz Davoodi

Michael Bradley Johanson

227

17 Mar 2022

Learning Robust Real-Time Cultural Transmission without Human Data

Cultural General Intelligence Team

...

Lei M. Zhang

378

01 Mar 2022

Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making

Andrew Butcher

Michael Bradley Johanson

361

11 Jan 2022

Temporally Abstract Partial ModelsNeural Information Processing Systems (NeurIPS), 2021

228

06 Aug 2021

Integrating Distributed Architectures in Highly Modular RL Libraries

Albert Bou

Sebastian Dittert

Gianni De Fabritiis

373

06 Jul 2020

Acme: A Research Framework for Distributed Reinforcement Learning

Nikola Momchev

...

Bilal Piot

361

243

01 Jun 2020