2203.12533
Cited By
Pathways: Asynchronous Distributed Dataflow for ML
23 March 2022
P. Barham
Aakanksha Chowdhery
J. Dean
Sanjay Ghemawat
Steven Hand
Dan Hurt
Michael Isard
Hyeontaek Lim
Ruoming Pang
Sudip Roy
Brennan Saeta
Parker Schuh
Ryan Sepassi
Laurent El Shafey
C. A. Thekkath
Yonghui Wu
GNN
MoE
Papers citing "Pathways: Asynchronous Distributed Dataflow for ML"
50 / 77 papers shown
Title
eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference
Suraiya Tairin
Shohaib Mahmud
Haiying Shen
Anand Iyer
MoE
88
0
0
10 Mar 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Erik Cambria
LM&MA
AILaw
93
151
0
28 Jan 2025
Revisiting Reliability in Large-Scale Machine Learning Research Clusters
Apostolos Kokolis
Michael Kuchnik
John Hoffman
Adithya Kumar
Parth Malani
Faye Ma
Zachary DeVito
S.
Kalyan Saladi
Carole-Jean Wu
89
7
0
29 Oct 2024
Improving Parallel Program Performance with LLM Optimizers via Agent-System Interface
Anjiang Wei
Allen Nie
Thiago S. F. X. Teixeira
Rohan Yadav
Wonchan Lee
Ke Wang
Alex Aiken
21
0
0
21 Oct 2024
Scalable Multi-Domain Adaptation of Language Models using Modular Experts
Peter Schafhalter
Shun Liao
Yanqi Zhou
Chih-Kuan Yeh
Arun Kandoor
James Laudon
MoE
24
1
0
14 Oct 2024
HybridFlow: A Flexible and Efficient RLHF Framework
Guangming Sheng
Chi Zhang
Zilingfeng Ye
Xibin Wu
Wang Zhang
Ru Zhang
Yanghua Peng
Haibin Lin
Chuan Wu
AI4CE
26
66
0
28 Sep 2024
Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping
Guanhua Wang
Chengming Zhang
Zheyu Shen
Ang Li
Olatunji Ruwase
18
3
0
23 Sep 2024
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey
Jiangfei Duan
Shuo Zhang
Zerui Wang
Lijuan Jiang
Wenwen Qu
...
Dahua Lin
Yonggang Wen
Xin Jin
Tianwei Zhang
Peng Sun
69
8
0
29 Jul 2024
Automatic Tracing in Task-Based Runtime Systems
Rohan Yadav
Michael Bauer
David Broman
Michael Garland
Alex Aiken
Fredrik Kjolstad
21
1
0
26 Jun 2024
Composing Distributed Computations Through Task and Kernel Fusion
Rohan Yadav
S. Sundram
Wonchan Lee
Michael Garland
Michael Bauer
Alex Aiken
Fredrik Kjolstad
26
1
0
26 Jun 2024
AI-coupled HPC Workflow Applications, Middleware and Performance
Wes Brewer
Ana Gainaru
Frédéric Suter
Feiyi Wang
M. Emani
S. Jha
30
10
0
20 Jun 2024
Tx-LLM: A Large Language Model for Therapeutics
Juan Manuel Zambrano Chaves
Eric Wang
Tao Tu
E. D. Vaishnav
Byron Lee
S. S. Mahdavi
Christopher Semturs
David Fleet
Vivek Natarajan
Shekoofeh Azizi
LM&MA
22
12
0
10 Jun 2024
Glauber Generative Model: Discrete Diffusion Models via Binary Classification
Harshit Varma
Dheeraj M. Nagaraj
Karthikeyan Shanmugam
VLM
62
2
0
27 May 2024
Mixture of Experts Soften the Curse of Dimensionality in Operator Learning
Anastasis Kratsios
Takashi Furuya
Jose Antonio Lara Benitez
Matti Lassas
Maarten V. de Hoop
34
13
0
13 Apr 2024
Large Language Models Are State-of-the-Art Evaluator for Grammatical Error Correction
Masamune Kobayashi
Masato Mita
Mamoru Komachi
ELM
40
3
0
26 Mar 2024
Design and Implementation of an Analysis Pipeline for Heterogeneous Data
A. Sarker
Aymen Alsaadi
Niranda Perera
Mills Staylor
G. V. Laszewski
...
Ozgur O. Kilic
Mikhail Titov
André Merzky
S. Jha
Geoffrey C. Fox
11
1
0
23 Mar 2024
Model Parallelism on Distributed Infrastructure: A Literature Review from Theory to LLM Case-Studies
Felix Brakel
Uraz Odyurt
A. Varbanescu
GNN
31
11
0
06 Mar 2024
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
Xupeng Miao
Gabriele Oliaro
Xinhao Cheng
Vineeth Kada
Ruohan Gao
...
April Yang
Yingcheng Wang
Mengdi Wu
Colin Unger
Zhihao Jia
MoE
88
9
0
29 Feb 2024
Addressing cognitive bias in medical language models
Samuel Schmidgall
Carl Harris
Ime Essien
Daniel Olshvang
Tawsifur Rahman
Ji Woong Kim
Rojin Ziaei
Jason Eshraghian
Peter M Abadir
Rama Chellappa
ELM
30
22
0
12 Feb 2024
Approximation Rates and VC-Dimension Bounds for (P)ReLU MLP Mixture of Experts
Anastasis Kratsios
Haitz Sáez de Ocáriz Borde
Takashi Furuya
Marc T. Law
MoE
28
1
0
05 Feb 2024
HAP: SPMD DNN Training on Heterogeneous GPU Clusters with Automated Program Synthesis
Shiwei Zhang
Lansong Diao
Chuan Wu
Zongyan Cao
Siyu Wang
Wei Lin
25
12
0
11 Jan 2024
Elastic Multi-Gradient Descent for Parallel Continual Learning
Fan Lyu
Wei Feng
Yuepan Li
Qing Sun
Fanhua Shang
Liang Wan
Liang Wang
15
2
0
02 Jan 2024
DynaLay: An Introspective Approach to Dynamic Layer Selection for Deep Networks
Mrinal Mathur
Sergey Plis
AI4CE
8
1
0
20 Dec 2023
Tenplex: Dynamic Parallelism for Deep Learning using Parallelizable Tensor Collections
Marcel Wagenlander
Guo Li
Bo Zhao
Luo Mai
Peter R. Pietzuch
20
6
0
08 Dec 2023
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models
Elias Frantar
Dan Alistarh
MQ
MoE
19
24
0
25 Oct 2023
Exponential Quantum Communication Advantage in Distributed Inference and Learning
H. Michaeli
D. Gilboa
Daniel Soudry
Jarrod R. McClean
FedML
16
0
0
11 Oct 2023
Modularity in Deep Learning: A Survey
Haozhe Sun
Isabelle Guyon
MoMe
23
2
0
02 Oct 2023
Language models are susceptible to incorrect patient self-diagnosis in medical applications
Rojin Ziaei
Samuel Schmidgall
ELM
LM&MA
23
8
0
17 Sep 2023
The Grand Illusion: The Myth of Software Portability and Implications for ML Progress
Fraser Mince
Dzung Dinh
Jonas Kgomo
Neil Thompson
Sara Hooker
12
6
0
12 Sep 2023
Saturn: An Optimized Data System for Large Model Deep Learning Workloads
Kabir Nagrecha
Arun Kumar
11
6
0
03 Sep 2023
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
Jiajia Li
Mingle Xu
Lirong Xiang
Dong Chen
Weichao Zhuang
Xunyuan Yin
Zhao Li
25
3
0
13 Aug 2023
Towards Generalist Biomedical AI
Tao Tu
Shekoofeh Azizi
Danny Driess
M. Schaekermann
Mohamed Amin
...
Yossi Matias
K. Singhal
Peter R. Florence
Alan Karthikesalingam
Vivek Natarajan
LM&MA
MedIm
AI4MH
33
241
0
26 Jul 2023
Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability
Jianing Zhu
Hengzhuang Li
Jiangchao Yao
Tongliang Liu
Jianliang Xu
Bo Han
OODD
22
12
0
06 Jun 2023
Auto-Differentiation of Relational Computations for Very Large Scale Machine Learning
Yu-Shuen Tang
Zhimin Ding
Dimitrije Jankov
Binhang Yuan
Daniel Bourgeois
C. Jermaine
BDL
24
6
0
31 May 2023
Large Language Models for User Interest Journeys
Konstantina Christakopoulou
Alberto Lalama
Cj Adams
Iris Qu
Yifat Amir
...
Dina Bseiso
Sarah Scodel
Lucas Dixon
Ed H. Chi
Minmin Chen
16
25
0
24 May 2023
SPARSEFIT: Few-shot Prompting with Sparse Fine-tuning for Jointly Generating Predictions and Natural Language Explanations
Jesus Solano
Oana-Maria Camburu
Pasquale Minervini
8
1
0
22 May 2023
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
58
1,138
0
17 May 2023
SAFE: Machine Unlearning With Shard Graphs
Yonatan Dukler
Benjamin Bowman
Alessandro Achille
Aditya Golatkar
A. Swaminathan
Stefano Soatto
MU
13
20
0
25 Apr 2023
Causal fault localisation in dataflow systems
Andrei Paleyes
Neil D. Lawrence
11
3
0
24 Apr 2023
Byzantine-Resilient Learning Beyond Gradients: Distributing Evolutionary Search
Andrei Kucharavy
M. Monti
R. Guerraoui
Ljiljana Dolamic
17
1
0
20 Apr 2023
Fundamentals of Generative Large Language Models and Perspectives in Cyber-Defense
Andrei Kucharavy
Z. Schillaci
Loic Maréchal
Maxime Wursch
Ljiljana Dolamic
Remi Sabonnadiere
Dimitri Percia David
Alain Mermoud
Vincent Lenders
ELM
AI4CE
22
31
0
21 Mar 2023
Universal Instance Perception as Object Discovery and Retrieval
B. Yan
Yi-Xin Jiang
Jiannan Wu
D. Wang
Ping Luo
Zehuan Yuan
Huchuan Lu
VOS
VLM
LRM
27
161
0
12 Mar 2023
OCCL: a Deadlock-free Library for GPU Collective Communication
Lichen Pan
Juncheng Liu
Jinhui Yuan
Rongkai Zhang
Pengze Li
Zhen Xiao
20
1
0
11 Mar 2023
Provable Pathways: Learning Multiple Tasks over Multiple Paths
Yingcong Li
Samet Oymak
MoE
11
4
0
08 Mar 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
W. Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
27
2
0
01 Mar 2023
Hulk: Graph Neural Networks for Optimizing Regionally Distributed Computing Systems
Zheng Yuan
HU Xue
Chaoyun Zhang
Yongming Liu
GNN
AI4CE
19
1
0
27 Feb 2023
Multipath agents for modular multitask ML systems
Andrea Gesmundo
10
1
0
06 Feb 2023
Ten Lessons We Have Learned in the New "Sparseland": A Short Handbook for Sparse Neural Network Researchers
Shiwei Liu
Zhangyang Wang
17
30
0
06 Feb 2023
SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction
Zhiqi Lin
Youshan Miao
Guodong Liu
Xiaoxiang Shi
Quanlu Zhang
...
Xu Cao
Cheng-Wu Li
Mao Yang
Lintao Zhang
Lidong Zhou
13
6
0
21 Jan 2023
Large Language Models Encode Clinical Knowledge
K. Singhal
Shekoofeh Azizi
T. Tu
S. S. Mahdavi
Jason W. Wei
...
A. Rajkomar
Joelle Barral
Christopher Semturs
Alan Karthikesalingam
Vivek Natarajan
LM&MA
ELM
AI4MH
19
2,154
0
26 Dec 2022