Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts
Neural Information Processing Systems (NeurIPS), 2020 · 10 February 2020 · arXiv:2002.04013
Max Ryabinin, Anton I. Gusev · FedML
Papers citing "Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts" (22 of 22 papers shown)
All is Not Lost: LLM Recovery without Checkpoints
Nikolay Blagoev, Oğuzhan Ersoy, Lydia Yiyu Chen · 18 Jun 2025

TAH-QUANT: Effective Activation Quantization in Pipeline Parallelism over Slow Network
Guangxin He, Yuan Cao, Yutong He, Tianyi Bai, Kun Yuan, Binhang Yuan · MQ · 02 Jun 2025

Protocol Models: Scaling Decentralized Training with Communication-Efficient Model Parallelism
Sameera Ramasinghe, Thalaiyasingam Ajanthan, Gil Avraham, Yan Zuo, Alexander Long · GNN · 02 Jun 2025

Achieving Peak Performance for Large Language Models: A Systematic Review
IEEE Access, 2024
Z. R. K. Rostam, Sándor Szénási, Gábor Kertész · 07 Sep 2024

Lower Bounds and Optimal Algorithms for Non-Smooth Convex Decentralized Optimization over Time-Varying Networks
D. Kovalev, Ekaterina Borodich, Alexander Gasnikov, Dmitrii Feoktistov · 28 May 2024

Video Relationship Detection Using Mixture of Experts
A. Shaabana, Zahra Gharaee, Paul Fieguth · 06 Mar 2024

Social Interpretable Reinforcement Learning
Leonardo Lucio Custode, Giovanni Iacca · OffRL · 27 Jan 2024

Direct Neural Machine Translation with Task-level Mixture of Experts models
Isidora Chara Tourni, Subhajit Naskar · MoE · 18 Oct 2023

Towards Open Federated Learning Platforms: Survey and Vision from Technical and Legal Perspectives
Moming Duan, Qinbin Li, Linshan Jiang, Bingsheng He · FedML · 05 Jul 2023

A Language Model of Java Methods with Train/Test Deduplication
Chia-Yi Su, Aakash Bansal, Vijayanta Jain, S. Ghanavati, Collin McMillan · SyDa, VLM · 15 May 2023

SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
International Conference on Machine Learning (ICML), 2023
Max Ryabinin, Tim Dettmers, Michael Diskin, Alexander Borzunov · MoE · 27 Jan 2023

Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization
International Conference on Machine Learning (ICML), 2022
Alexandre Ramé, Kartik Ahuja, Jianyu Zhang, Matthieu Cord, Léon Bottou, David Lopez-Paz · MoMe, OODD · 20 Dec 2022

Petals: Collaborative Inference and Fine-tuning of Large Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Alexander Borzunov, Dmitry Baranchuk, Tim Dettmers, Max Ryabinin, Younes Belkada, Artem Chumachenko, Pavel Samygin, Colin Raffel · VLM · 02 Sep 2022

Training Transformers Together
Neural Information Processing Systems (NeurIPS), 2022
Alexander Borzunov, Max Ryabinin, Tim Dettmers, Quentin Lhoest, Lucile Saulnier, Michael Diskin, Yacine Jernite, Thomas Wolf · ViT · 07 Jul 2022

Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees
Neural Information Processing Systems (NeurIPS), 2022
Jue Wang, Binhang Yuan, Luka Rimanic, Yongjun He, Tri Dao, Beidi Chen, Christopher Ré, Ce Zhang · AI4CE · 02 Jun 2022

Decentralized Training of Foundation Models in Heterogeneous Environments
Neural Information Processing Systems (NeurIPS), 2022
Binhang Yuan, Yongjun He, Jared Davis, Tianyi Zhang, Tri Dao, Beidi Chen, Abigail Z. Jacobs, Christopher Ré, Ce Zhang · 02 Jun 2022

Machines & Influence: An Information Systems Lens
Shashank Yadav · 26 Nov 2021

Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives
IEEE International Conference on Computer Vision (ICCV), 2021
Ben Saunders, Necati Cihan Camgöz, Richard Bowden · SLR · 23 Jul 2021

Secure Distributed Training at Scale
International Conference on Machine Learning (ICML), 2021
Eduard A. Gorbunov, Alexander Borzunov, Michael Diskin, Max Ryabinin · FedML · 21 Jun 2021

Distributed Deep Learning in Open Collaborations
Neural Information Processing Systems (NeurIPS), 2021
Michael Diskin, Alexey Bukhtiyarov, Max Ryabinin, Lucile Saulnier, Quentin Lhoest, ..., Denis Mazur, Ilia Kobelev, Yacine Jernite, Thomas Wolf, Gennady Pekhimenko · FedML · 18 Jun 2021

Distributed Deep Learning Using Volunteer Computing-Like Paradigm
IEEE International Symposium on Parallel & Distributed Processing, Workshops and PhD Forum (IPDPSW), 2021
Medha Atre, B. Jha, Ashwini Rao · 16 Mar 2021

Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices
Neural Information Processing Systems (NeurIPS), 2021
Max Ryabinin, Eduard A. Gorbunov, Vsevolod Plokhotnyuk, Gennady Pekhimenko · 04 Mar 2021