arXiv:2311.01635
RTP: Rethinking Tensor Parallelism with Memory Deduplication
2 November 2023
Cheng Luo
Tianle Zhong
Geoffrey C. Fox
Papers citing "RTP: Rethinking Tensor Parallelism with Memory Deduplication"
3 / 3 papers shown
Title
iServe: An Intent-based Serving System for LLMs
Dimitrios Liakopoulos
Tianrui Hu
Prasoon Sinha
N. Yadwadkar
VLM
95
0
0
08 Jan 2025
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey
Jiangfei Duan
Shuo Zhang
Zerui Wang
Lijuan Jiang
Wenwen Qu
...
Dahua Lin
Yonggang Wen
Xin Jin
Tianwei Zhang
Peng Sun
69
8
0
29 Jul 2024
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
243
1,815
0
17 Sep 2019