ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.15762
  4. Cited By
Hanayo: Harnessing Wave-like Pipeline Parallelism for Enhanced Large
  Model Training Efficiency

Hanayo: Harnessing Wave-like Pipeline Parallelism for Enhanced Large Model Training Efficiency

International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2023
30 August 2023
Ziming Liu
Shenggan Cheng
Hao Zhou
Yang You
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Hanayo: Harnessing Wave-like Pipeline Parallelism for Enhanced Large Model Training Efficiency"

17 / 17 papers shown
AdaPtis: Reducing Pipeline Bubbles with Adaptive Pipeline Parallelism on Heterogeneous Models
AdaPtis: Reducing Pipeline Bubbles with Adaptive Pipeline Parallelism on Heterogeneous Models
Jihu Guo
Tenghui Ma
Wei Gao
Peng Sun
Jiaxing Li
Xun Chen
Yuyang Jin
Dahua Lin
102
0
0
28 Sep 2025
Data-Centric Elastic Pipeline Parallelism for Efficient Long-Context LLM Training
Data-Centric Elastic Pipeline Parallelism for Efficient Long-Context LLM Training
Shiju Wang
Yujie Wang
Ao Sun
Fangcheng Fu
Z. Zhu
Huang Leng
Xu Han
Kaisheng Ma
157
0
0
25 Sep 2025
Kimi K2: Open Agentic Intelligence
Kimi K2: Open Agentic Intelligence
Kimi Team
Yifan Bai
Yiping Bao
Guanduo Chen
Jiahao Chen
...
Qifeng Teng
Chensi Wang
Dinglu Wang
Feng Wang
Haiming Wang
MoEVLMLRM
182
84
0
28 Jul 2025
Rethinking Dynamic Networks and Heterogeneous Computing with Automatic Parallelization
Rethinking Dynamic Networks and Heterogeneous Computing with Automatic ParallelizationAsia-Pacific Workshop on Networking (AN), 2025
Ruilong Wu
Xinjiao Li
Yisu Wang
Xinyu Chen
Dirk Kutscher
178
0
0
03 Jun 2025
Ferret: An Efficient Online Continual Learning Framework under Varying Memory Constraints
Ferret: An Efficient Online Continual Learning Framework under Varying Memory ConstraintsComputer Vision and Pattern Recognition (CVPR), 2025
Yuhao Zhou
Yuxin Tian
Jindi Lv
Mingjia Shi
Yuanxi Li
Qing Ye
Shuhao Zhang
Jiancheng Lv
CLL
283
1
0
15 Mar 2025
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization
Xinyi Wan
Penghui Qi
Guangxing Huang
Jialin Li
Jialin Li
221
3
0
03 Mar 2025
FreeRide: Harvesting Bubbles in Pipeline Parallelism
FreeRide: Harvesting Bubbles in Pipeline Parallelism
Jiashu Zhang
Zihan Pan
Molly
Xu
Khuzaima S. Daudjee
336
0
0
11 Sep 2024
Efficient Training of Large Language Models on Distributed
  Infrastructures: A Survey
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey
Jiangfei Duan
Shuo Zhang
Zerui Wang
Lijuan Jiang
Wenwen Qu
...
Dahua Lin
Yonggang Wen
Xin Jin
Tianwei Zhang
Yang Liu
369
32
0
29 Jul 2024
WallFacer: Guiding Transformer Model Training Out of the Long-Context
  Dark Forest with N-body Problem
WallFacer: Guiding Transformer Model Training Out of the Long-Context Dark Forest with N-body Problem
Ziming Liu
Shaoyu Wang
Shenggan Cheng
Zhongkai Zhao
Xuanlei Zhao
James Demmel
Yang You
219
1
0
30 Jun 2024
GraphPipe: Improving Performance and Scalability of DNN Training with
  Graph Pipeline Parallelism
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism
Byungsoo Jeon
Yingcheng Wang
Shiyi Cao
Sunghyun Kim
Sunghyun Park
...
Xupeng Miao
Mohammad Alizadeh
G. R. Ganger
Tianqi Chen
Zhihao Jia
GNNAI4CE
216
17
0
24 Jun 2024
Resource Allocation and Workload Scheduling for Large-Scale Distributed
  Deep Learning: A Survey
Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey
Feng Liang
Zhen Zhang
Haifeng Lu
Chengming Li
Victor C. M. Leung
Yanyi Guo
Xiping Hu
339
9
0
12 Jun 2024
2BP: 2-Stage Backpropagation
2BP: 2-Stage Backpropagation
Christopher Rae
Joseph K. L. Lee
James Richings
MoEMQ
123
0
0
28 May 2024
Pipeline Parallelism with Controllable Memory
Pipeline Parallelism with Controllable Memory
Penghui Qi
Xinyi Wan
Nyamdavaa Amar
Jialin Li
243
10
0
24 May 2024
SlipStream: Adapting Pipelines for Distributed Training of Large DNNs
  Amid Failures
SlipStream: Adapting Pipelines for Distributed Training of Large DNNs Amid FailuresSymposium on Operating Systems Principles (SOSP), 2024
Swapnil Gandhi
Mark Zhao
Athinagoras Skiadopoulos
Christos Kozyrakis
AI4CEGNN
210
1
0
22 May 2024
Checkpoint Merging via Bayesian Optimization in LLM Pretraining
Checkpoint Merging via Bayesian Optimization in LLM Pretraining
Deyuan Liu
Zecheng Wang
Bingning Wang
Weipeng Chen
Chunshan Li
Zhiying Tu
Dianhui Chu
Bo Li
Dianbo Sui
MoMe
300
26
0
28 Mar 2024
Training and Serving System of Foundation Models: A Comprehensive Survey
Training and Serving System of Foundation Models: A Comprehensive Survey
Jiahang Zhou
Yanyu Chen
Zicong Hong
Wuhui Chen
Yue Yu
Tao Zhang
Hui Wang
Chuan-fu Zhang
Zibin Zheng
ALM
227
14
0
05 Jan 2024
PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction
PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight PredictionIEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Lei Guan
Dongsheng Li
Jiye Liang
Wenjian Wang
Wenjian Wang
Xicheng Lu
311
3
0
01 Dec 2023
1
Page 1 of 1