2304.01433
Cited By
TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
4 April 2023
N. Jouppi
George Kurian
Sheng R. Li
Peter C. Ma
R. Nagarajan
Lifeng Nai
Nishant Patil
Suvinay Subramanian
Andy Swing
Brian Towles
C. Young
Xiaoping Zhou
Zongwei Zhou
David A. Patterson
BDL
VLM
Papers citing
"TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings"
22 / 122 papers shown
Title
Retrospective: A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing
Junwhan Ahn
Sungpack Hong
Sungjoo Yoo
Onur Mutlu
Kiyoung Choi
GNN
11
1
0
27 Jun 2023
DGEMM on Integer Matrix Multiplication Unit
Hiroyuki Ootomo
K. Ozaki
Rio Yokota
4
12
0
21 Jun 2023
ArchGym: An Open-Source Gymnasium for Machine Learning Assisted Architecture Design
Srivatsan Krishnan
Amir Yazdanbakhsh
Shvetank Prakash
Jason J. Jabbour
Ikechukwu Uchendu
...
Behzad Boroujerdian
Daniel Richins
Devashree Tripathy
Aleksandra Faust
Vijay Janapa Reddi
43
11
0
15 Jun 2023
DORSal: Diffusion for Object-centric Representations of Scenes et al.
Allan Jabri
Sjoerd van Steenkiste
Emiel Hoogeboom
Mehdi S. M. Sajjadi
Thomas Kipf
19
17
0
13 Jun 2023
Augmenting Hessians with Inter-Layer Dependencies for Mixed-Precision Post-Training Quantization
Clemens J. S. Schaefer
Navid Lambert-Shirzad
Xiaofan Zhang
Chia-Wei Chou
T. Jablin
Jian Li
Elfie Guo
Caitlin Stanton
S. Joshi
Yu Emma Wang
MQ
13
2
0
08 Jun 2023
M3ICRO: Machine Learning-Enabled Compact Photonic Tensor Core based on PRogrammable Multi-Operand Multimode Interference
Jiaqi Gu
Hanqing Zhu
Chenghao Feng
Zixuan Jiang
Ray T. Chen
D. Pan
8
6
0
31 May 2023
NicePIM: Design Space Exploration for Processing-In-Memory DNN Accelerators with 3D-Stacked-DRAM
Junpeng Wang
Mengke Ge
Bo Ding
Qi Xu
Song Chen
Yi Kang
23
5
0
30 May 2023
Translatotron 3: Speech to Speech Translation with Monolingual Data
Eliya Nachmani
Alon Levkovitch
Yi-Yang Ding
Chulayutsh Asawaroengchai
Heiga Zen
Michelle Tadmor Ramanovich
13
14
0
27 May 2023
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
Eliya Nachmani
Alon Levkovitch
Roy Hirsch
Julián Salazar
Chulayutsh Asawaroengchai
Soroosh Mariooryad
Ehud Rivlin
RJ Skerry-Ryan
Michelle Tadmor Ramanovich
AuLLM
19
30
0
24 May 2023
Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems
Benjamin Coleman
Wang-Cheng Kang
Matthew Fahrbach
Ruoxi Wang
Lichan Hong
Ed H. Chi
D. Cheng
17
10
0
20 May 2023
SoundStorm: Efficient Parallel Audio Generation
Zalan Borsos
Matthew Sharifi
Damien Vincent
Eugene Kharitonov
Neil Zeghidour
Marco Tagliasacchi
15
97
0
16 May 2023
Symbol tuning improves in-context learning in language models
Jerry W. Wei
Le Hou
Andrew Kyle Lampinen
Xiangning Chen
Da Huang
...
Xinyun Chen
Yifeng Lu
Denny Zhou
Tengyu Ma
Quoc V. Le
LRM
28
72
0
15 May 2023
TACOS: Topology-Aware Collective Algorithm Synthesizer for Distributed Machine Learning
William Won
Midhilesh Elavazhagan
S. Srinivasan
A. Durg
Samvit Kaul
Swati Gupta
Tushar Krishna
17
6
0
11 Apr 2023
FengWu: Pushing the Skillful Global Medium-range Weather Forecast beyond 10 Days Lead
Kan Chen
Tao Han
Junchao Gong
Lei Bai
Fenghua Ling
...
Rui Su
Yuanzheng Ci
Bin Li
Xiaokang Yang
Wanli Ouyang
AI4Cl
AI4CE
15
165
0
06 Apr 2023
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
Eugene Kharitonov
Damien Vincent
Zalan Borsos
Raphaël Marinier
Sertan Girgin
Olivier Pietquin
Matthew Sharifi
Marco Tagliasacchi
Neil Zeghidour
13
189
0
07 Feb 2023
Tricking AI chips into Simulating the Human Brain: A Detailed Performance Analysis
Lennart P L Landsmeer
Max C. W. Engelen
Rene Miedema
Christos Strydis
6
4
0
31 Jan 2023
GPU-based Private Information Retrieval for On-Device Machine Learning Inference
Maximilian Lam
Jeff Johnson
Wenjie Xiong
Kiwan Maeng
Udit Gupta
...
Hsien-Hsin S. Lee
Vijay Janapa Reddi
Gu-Yeon Wei
David Brooks
Edward Suh
19
9
0
26 Jan 2023
Tensor Networks Meet Neural Networks: A Survey and Future Perspectives
Maolin Wang
Y. Pan
Zenglin Xu
Xiangli Yang
Guangxi Li
Andrzej Cichocki
43
19
0
22 Jan 2023
COMET: A Comprehensive Cluster Design Methodology for Distributed Deep Learning Training
D. Kadiyala
Saeed Rashidi
Taekyung Heo
A. Bambhaniya
T. Krishna
Alexandros Daglis
VLM
13
9
0
30 Nov 2022
tf.data service: A Case for Disaggregating ML Input Data Processing
Andrew Audibert
Yangrui Chen
D. Graur
Ana Klimovic
Jiří Šimša
C. A. Thekkath
37
16
0
26 Oct 2022
Efficient Direct-Connect Topologies for Collective Communications
Liangyu Zhao
Siddharth Pal
Tapan Chugh
Weiyang Wang
Jason Fantl
P. Basu
J. Khoury
Arvind Krishnamurthy
12
6
0
07 Feb 2022
LIBRA: Enabling Workload-aware Multi-dimensional Network Topology Optimization for Distributed Training of Large AI Models
William Won
Saeed Rashidi
S. Srinivasan
T. Krishna
AI4CE
8
7
0
24 Sep 2021