v1v2 (latest)

AdaBERT: Task-Adaptive BERT Compression with Differentiable Neural Architecture Search

International Joint Conference on Artificial Intelligence (IJCAI), 2020

13 January 2020

Jingren Zhou

Papers citing "AdaBERT: Task-Adaptive BERT Compression with Differentiable Neural Architecture Search"

50 / 62 papers shown

Elastic Architecture Search for Efficient Language ModelsIEEE International Conference on Multimedia and Expo (ICME), 2025

Shang Wang

KELM

172

30 Oct 2025

EvoPress: Accurate Dynamic Model Compression via Evolutionary Search

561

18 Oct 2024

Automatic Pruning of Fine-tuning Datasets for Transformer-based Language Models

Mohammadreza Tayaranian

S. H. Mozafari

Brett H. Meyer

J. Clark

Warren J. Gross

178

11 Jul 2024

DIR-BHRNet: A Lightweight Network for Real-time Vision-based Multi-person Pose Estimation on Smartphones

Gongjin Lan

Yu Wu

Qi Hao

3DH

283

01 Jul 2024

Structural Pruning of Pre-trained Language Models via Neural Architecture Search

233

03 May 2024

Model Compression and Efficient Inference for Large Language Models: A Survey

378

15 Feb 2024

A Comprehensive Survey of Compression Algorithms for Language Models

390

27 Jan 2024

Vesper: A Compact and Effective Pretrained Model for Speech Emotion RecognitionIEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023

345

20 Jul 2023

A Survey of Techniques for Optimizing Transformer InferenceJournal of systems architecture (JSA), 2023

Krishna Teja Chitty-Venkata

399

137

16 Jul 2023

DDNAS: Discretized Differentiable Neural Architecture Search for Text ClassificationACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023

Kuan-Yu Chen

Cheng Li

Kuo-Jung Lee

257

12 Jul 2023

AutoML in the Age of Large Language Models: Current Challenges, Future Opportunities and Risks

...

Daphne Theodorakopoulos

Tanja Tornede

Henning Wachsmuth

Marius Lindauer

383

13 Jun 2023

SqueezeLLM: Dense-and-Sparse QuantizationInternational Conference on Machine Learning (ICML), 2023

Sehoon Kim

Coleman Hooper

Zhen Dong

564

297

13 Jun 2023

ALT: An Automatic System for Long Tail Scenario ModelingIEEE International Conference on Data Engineering (ICDE), 2023

Yue Zhang

178

19 May 2023

Auto-CARD: Efficient and Robust Codec Avatar Driving for Real-time Mobile TelepresenceComputer Vision and Pattern Recognition (CVPR), 2023

415

24 Apr 2023

Efficient Automation of Neural Network Design: A Survey on Differentiable Neural Architecture SearchACM Computing Surveys (ACM Comput. Surv.), 2023

371

11 Apr 2023

EdgeTran: Co-designing Transformers for Efficient Inference on Mobile Edge Platforms

Shikhar Tuli

N. Jha

364

24 Mar 2023

Neural Architecture Search for Effective Teacher-Student Knowledge Transfer in Language Models

Bishwaranjan Bhattacharjee

343

16 Mar 2023

Gradient-Free Structured Pruning with Unlabeled DataInternational Conference on Machine Learning (ICML), 2023

367

07 Mar 2023

Speculative Decoding with Big Little DecoderNeural Information Processing Systems (NeurIPS), 2023

Sehoon Kim

Suhong Moon

594

176

15 Feb 2023

Efficient Non-Parametric Optimizer Search for Diverse TasksNeural Information Processing Systems (NeurIPS), 2022

398

27 Sep 2022

Compressing Pre-trained Transformers via Low-Bit NxM Sparsity for Natural Language Understanding

Connor Holmes

Minjia Zhang

Yuxiong He

Bo Wu

189

30 Jun 2022

FlexiBERT: Are Current Transformer Architectures too Homogeneous and Rigid?Journal of Artificial Intelligence Research (JAIR), 2022

343

23 May 2022

Prompting to Distill: Boosting Data-Free Knowledge Distillation via Reinforced PromptInternational Joint Conference on Artificial Intelligence (IJCAI), 2022

174

16 May 2022

Meta Learning for Natural Language Processing: A SurveyNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

Hung-yi Lee

Shang-Wen Li

Ngoc Thang Vu

443

03 May 2022

Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Zhijian Liu

Song Han

288

136

25 Apr 2022

A Fast Post-Training Pruning Framework for TransformersNeural Information Processing Systems (NeurIPS), 2022

Sehoon Kim

282

212

29 Mar 2022

Fast Monte-Carlo Approximation of the Attention MechanismAAAI Conference on Artificial Intelligence (AAAI), 2022

Hyunjun Kim

Jeonggil Ko

335

30 Jan 2022

AutoMC: Automated Model Compression based on Domain Knowledge and Progressive search strategyIEEE International Conference on Data Engineering (ICDE), 2022

Chunnan Wang

Hongzhi Wang

Xiangyu Shi

174

24 Jan 2022

AutoDistill: an End-to-End Framework to Explore and Distill Hardware-Efficient Language Models

Xiaofan Zhang

Zongwei Zhou

Deming Chen

Yu Emma Wang

207

21 Jan 2022

Which Student is Best? A Comprehensive Knowledge Distillation Exam for Task-Specific BERT Models

Made Nindyatama Nityasya

Haryo Akbarianto Wibowo

Rendi Chevi

Radityo Eko Prasojo

Alham Fikri Aji

260

03 Jan 2022

RT-RCG: Neural Network and Accelerator Search Towards Effective and Real-time ECG Reconstruction from Intracardiac ElectrogramsACM Journal on Emerging Technologies in Computing Systems (JETC), 2021

209

04 Nov 2021

Differentiable NAS Framework and Application to Ads CTR Prediction

207

25 Oct 2021

SuperShaper: Task-Agnostic Super Pre-training of BERT Models with Variable Hidden Dimensions

Vinod Ganesan

Gowtham Ramesh

Pratyush Kumar

165

10 Oct 2021

Towards Efficient Post-training Quantization of Pre-trained Language Models

Haoli Bai

Lu Hou

Lifeng Shang

Xin Jiang

Irwin King

Michael R. Lyu

250

30 Sep 2021

Distiller: A Systematic Study of Model Distillation Methods in Natural Language Processing

Zha Sheng

George Karypis

177

23 Sep 2021

Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction NetworkAutomatic Speech Recognition & Understanding (ASRU), 2021

Takaaki Saeki

Shinnosuke Takamichi

Hiroshi Saruwatari

269

22 Sep 2021

RankNAS: Efficient Neural Architecture Search by Pairwise Ranking

Jingbo Zhu

262

15 Sep 2021

EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation

Hang Xu

Xiaodan Liang

214

15 Sep 2021

AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Yichun Yin

Cheng Chen

Lifeng Shang

Xin Jiang

Xiao Chen

Qun Liu

VLM

206

29 Jul 2021

AutoBERT-Zero: Evolving BERT Backbone from ScratchAAAI Conference on Artificial Intelligence (AAAI), 2021

Jiahui Gao

Hang Xu

Han Shi

Xiaozhe Ren

Philip L. H. Yu

Xiaodan Liang

Xin Jiang

Zhenguo Li

232

15 Jul 2021

Scene-adaptive Knowledge Distillation for Sequential Recommendation via Differentiable Architecture Search

Min Yang

233

15 Jul 2021

LV-BERT: Exploiting Layer Variety for BERTFindings (Findings), 2021

Weihao Yu

181

22 Jun 2021

RoSearch: Search for Robust Student Architectures When Distilling Pre-trained Language Models

Xin Guo

174

07 Jun 2021

You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Nature Gradient

Mengdi Wang

Shen Li

Jun Yang

Rongrong Ji

256

04 Jun 2021

NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture SearchKnowledge Discovery and Data Mining (KDD), 2021

Xu Tan

188

30 May 2021

Improved Customer Transaction Classification using Semi-Supervised Knowledge Distillation

Rohan Sukumaran

186

15 Feb 2021

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture SearchIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Xu Tan

Enhong Chen

161

08 Feb 2021

Model Compression for Domain Adaptation through Causal Effect EstimationTransactions of the Association for Computational Linguistics (TACL), 2021

283

18 Jan 2021

Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search

341

24 Dec 2020

Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across DomainsAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

Chengyu Wang

Yichang Zhang

261

02 Dec 2020