Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.10898
Cited By
Impact of RoCE Congestion Control Policies on Distributed Training of DNNs
22 July 2022
Tarannum Khan
Saeed Rashidi
Srinivas Sridharan
Pallavi Shurpali
Aditya Akella
T. Krishna
OOD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Impact of RoCE Congestion Control Policies on Distributed Training of DNNs"
2 / 2 papers shown
Title
Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine Learning
Jinsun Yoo
ChonLam Lao
Lianjie Cao
Bob Lantz
Minlan Yu
Tushar Krishna
Puneet Sharma
52
0
0
29 Apr 2025
LIBRA: Enabling Workload-aware Multi-dimensional Network Topology Optimization for Distributed Training of Large AI Models
William Won
Saeed Rashidi
S. Srinivasan
T. Krishna
AI4CE
12
7
0
24 Sep 2021
1