arXiv:2012.01266 (v2, latest)
Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
2 December 2020
Haojie Pan, Chengyu Wang, Minghui Qiu, Yichang Zhang, Yaliang Li, Yanjie Liang
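For context, the framework named in the title builds on standard teacher-student knowledge distillation. The sketch below shows the generic distillation loss (temperature-scaled soft-target KL plus hard-label cross-entropy) that Meta-KD extends with a cross-domain meta-teacher. It is a minimal PyTorch illustration, not the paper's actual objective; the temperature T, mixing weight alpha, and the toy tensors are assumptions made for demonstration.

    # Minimal sketch of a generic teacher-student distillation loss.
    # NOT Meta-KD's meta-teacher objective; T, alpha, and the toy
    # tensors below are illustrative assumptions.
    import torch
    import torch.nn.functional as F

    def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
        # Soft targets: KL between temperature-scaled teacher/student
        # distributions, rescaled by T^2 so gradient magnitudes stay
        # comparable across temperatures.
        soft = F.kl_div(
            F.log_softmax(student_logits / T, dim=-1),
            F.softmax(teacher_logits / T, dim=-1),
            reduction="batchmean",
        ) * (T * T)
        # Hard targets: ordinary cross-entropy against the gold labels.
        hard = F.cross_entropy(student_logits, labels)
        return alpha * soft + (1.0 - alpha) * hard

    # Toy usage: a batch of 4 examples over 3 classes.
    student_logits = torch.randn(4, 3, requires_grad=True)
    teacher_logits = torch.randn(4, 3)
    labels = torch.tensor([0, 2, 1, 0])
    distillation_loss(student_logits, teacher_logits, labels).backward()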
Papers citing "Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains" (27 papers)
Experts are all you need: A Composable Framework for Large Language Model Inference
S. Sridharan, Sourjya Roy, A. Raghunathan, Kaushik Roy. 28 Nov 2025. [MoE]
PrunedLoRA: Robust Gradient-Based structured pruning for Low-rank Adaptation in Fine-tuning
Xin Yu, Cong Xie, Ziyu Zhao, Tiantian Fan, Lingzhou Xue, Zhi-Li Zhang. 30 Sep 2025.
SPADE: Structured Pruning and Adaptive Distillation for Efficient LLM-TTS
T. Nguyen, Jaehun Kim, Ji-Hoon Kim, Shukjae Choi, Youshin Lim, Joon Son Chung. 25 Sep 2025.
DistilQwen2.5: Industrial Practices of Training Distilled Open Lightweight Language Models
Chengyu Wang, Junbing Yan, Yuanhao Yue, Yanjie Liang. 21 Apr 2025.
EvoP: Robust LLM Inference via Evolutionary Pruning
Shangyu Wu, Hongchao Du, Ying Xiong, Shuai Chen, Tei-Wei Kuo, Nan Guan, Chun Jason Xue. 19 Feb 2025.
A Hybrid Cross-Stage Coordination Pre-ranking Model for Online Recommendation Systems
The Web Conference (WWW), 2025
Binglei Zhao, Houying Qi, Guang Xu, Mian Ma, Xiwei Zhao, Feng Mei, Sulong Xu, Jinghe Hu. 17 Feb 2025.
MoDeGPT: Modular Decomposition for Large Language Model Compression
International Conference on Learning Representations (ICLR), 2024
Chi-Heng Lin, Shangqian Gao, James Seale Smith, Abhishek Patel, Shikhar Tuli, Yilin Shen, Hongxia Jin, Yen-Chang Hsu. 19 Aug 2024.
MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yan Li, So-Eon Kim, Seong-Bae Park, S. Han. 15 Aug 2024.
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
Diana-Nicoleta Grigore, Mariana-Iuliana Georgescu, J. A. Justo, T. Johansen, Andreea-Iuliana Ionescu, Radu Tudor Ionescu. 14 Apr 2024.
CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models
Je-Yong Lee, Donghyun Lee, Genghan Zhang, Mo Tiwari, Azalia Mirhoseini. 12 Apr 2024.
Hierarchical Skip Decoding for Efficient Autoregressive Text Generation
Yunqi Zhu, Xuebing Yang, Yuanyuan Wu, Wensheng Zhang. 22 Mar 2024.
CLLMs: Consistency Large Language Models
Siqi Kou, Lanxiang Hu, Zhe He, Zhijie Deng, Hao Zhang. 28 Feb 2024.
Model Compression and Efficient Inference for Large Language Models: A Survey
Wenxiao Wang, Wei Chen, Yicong Luo, Yongliu Long, Zhengkai Lin, Liye Zhang, Binbin Lin, Deng Cai, Xiaofei He. 15 Feb 2024. [MQ]
One-Shot Sensitivity-Aware Mixed Sparsity Pruning for Large Language Models
Hang Shao, Bei Liu, Bo Xiao, Ke Zeng, Guanglu Wan, Yanmin Qian. 14 Oct 2023.
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
International Conference on Machine Learning (ICML), 2023
Anna Rogers, A. Luccioni. 14 Aug 2023.
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shicheng Tan, Weng Lam Tam, Yuanchun Wang, Wenwen Gong, Shuo Zhao, Peng Zhang, Jie Tang. 11 Jun 2023. [VLM]
Domain Private Transformers for Multi-Domain Dialog Systems
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Anmol Kabra, Ethan R. Elenberg. 23 May 2023.
LLM-Pruner: On the Structural Pruning of Large Language Models
Neural Information Processing Systems (NeurIPS), 2023
Xinyin Ma, Gongfan Fang, Xinchao Wang. 19 May 2023.
Few-Shot Learning of Compact Models via Task-Specific Meta Distillation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Yong Wu, Shekhor Chanda, M. Hosseinzadeh, Zhi Liu, Yang Wang. 18 Oct 2022. [VLM]
Meta Learning for Natural Language Processing: A Survey
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Hung-yi Lee, Shang-Wen Li, Ngoc Thang Vu. 3 May 2022.
EasyNLP: A Comprehensive and Easy-to-use Toolkit for Natural Language Processing
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Chengyu Wang, Minghui Qiu, Chen Shi, Taolin Zhang, Tingting Liu, Lei Li, Jiadong Wang, Ming Wang, Yanjie Liang, W. Lin. 30 Apr 2022.
DistilCSE: Effective Knowledge Distillation For Contrastive Sentence Embeddings
Chaochen Gao, Xing Wu, Peng Wang, Jue Wang, Liangjun Zang, Zhongyuan Wang, Songlin Hu. 10 Dec 2021.
Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2021
Jiangchao Yao, Shengyu Zhang, Yang Yao, Feng Wang, Jianxin Ma, ..., Kun Kuang, Chao-Xiang Wu, Leilei Gan, Jingren Zhou, Hongxia Yang. 11 Nov 2021.
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression
Chenhe Dong, Yaliang Li, Ying Shen, Minghui Qiu. 16 Oct 2021. [VLM]
Learning to Teach with Student Feedback
Yitao Liu, Tianxiang Sun, Xipeng Qiu, Xuanjing Huang. 10 Sep 2021. [VLM]
BERT Learns to Teach: Knowledge Distillation with Meta Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Wangchunshu Zhou, Canwen Xu, Julian McAuley. 8 Jun 2021.
EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications
International Conference on Information and Knowledge Management (CIKM), 2020
Minghui Qiu, Peng Li, Chengyu Wang, Hanjie Pan, Yaliang Li, ..., Jun Yang, Yaliang Li, Yanjie Liang, Deng Cai, Jialin Li. 18 Nov 2020. [VLM] [SyDa]