Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.10299
Cited By
HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models
16 May 2024
R. Sukthanker
Arber Zela
B. Staffler
Aaron Klein
Lennart Purucker
Jorg K. H. Franke
Frank Hutter
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models"
10 / 10 papers shown
Title
NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance
Raphael T. Husistein
Markus Reiher
Marco Eckhoff
73
1
0
20 Feb 2025
Structural Pruning of Pre-trained Language Models via Neural Architecture Search
Aaron Klein
Jacek Golebiowski
Xingchen Ma
Valerio Perrone
Cédric Archambeau
19
1
0
03 May 2024
Unsupervised Graph Neural Architecture Search with Disentangled Self-supervision
Zeyang Zhang
Xin Eric Wang
Ziwei Zhang
Guangyao Shen
Shiqi Shen
Wenwu Zhu
35
12
0
08 Mar 2024
Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods
Bo-Kyeong Kim
Geonmin Kim
Tae-Ho Kim
Thibault Castells
Shinkook Choi
Junho Shin
Hyoung-Kyu Song
49
28
0
05 Feb 2024
Towards Efficient Post-training Quantization of Pre-trained Language Models
Haoli Bai
Lu Hou
Lifeng Shang
Xin Jiang
Irwin King
M. Lyu
MQ
44
47
0
30 Sep 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
124
665
0
24 Jan 2021
AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data
Nick Erickson
Jonas W. Mueller
Alexander Shirkov
Hang Zhang
Pedro Larroy
Mu Li
Alex Smola
LMTD
84
576
0
13 Mar 2020
NAS-Bench-1Shot1: Benchmarking and Dissecting One-shot Neural Architecture Search
Arber Zela
Julien N. Siems
Frank Hutter
72
146
0
28 Jan 2020
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
243
1,791
0
17 Sep 2019
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
264
5,290
0
05 Nov 2016
1