Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.14158
Cited By
Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning
26 August 2024
Wei An
Xiao Bi
Guanting Chen
Shanhuang Chen
Chengqi Deng
Honghui Ding
Kai Dong
Qiushi Du
Wenjun Gao
Kang Guan
Jianzhong Guo
Yongqiang Guo
Zhe Fu
Ying He
Panpan Huang
Jiashi Li
Wenfeng Liang
Xiaodong Liu
Xin Liu
Yiyuan Liu
Yuxuan Liu
Shanghao Lu
Xuan Lu
Xiaotao Nie
Tian Pei
Junjie Qiu
Hui Qu
Z. Z. Ren
Zhangli Sha
Xuecheng Su
Xiaowen Sun
Yixuan Tan
Minghui Tang
Shiyu Wang
Yaohui Wang
Yongji Wang
Ziwei Xie
Yiliang Xiong
Yanhong Xu
Shengfeng Ye
Shuiping Yu
Yukun Zha
Liyue Zhang
Haowei Zhang
Mingchuan Zhang
Wentao Zhang
Yichao Zhang
Chenggang Zhao
Yao Zhao
Shangyan Zhou
Shunfeng Zhou
Yuheng Zou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning"
4 / 4 papers shown
Title
Inference-Time Scaling for Generalist Reward Modeling
Zijun Liu
P. Wang
R. Xu
Shirong Ma
Chong Ruan
Peng Li
Yang Janet Liu
Y. Wu
OffRL
LRM
46
9
0
03 Apr 2025
Tutel: Adaptive Mixture-of-Experts at Scale
Changho Hwang
Wei Cui
Yifan Xiong
Ziyue Yang
Ze Liu
...
Joe Chau
Peng Cheng
Fan Yang
Mao Yang
Y. Xiong
MoE
92
108
0
07 Jun 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,412
0
11 Nov 2021
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
243
1,815
0
17 Sep 2019
1