Self-Knowledge Distillation in Natural Language Processing

Recent Advances in Natural Language Processing (RANLP), 2019

2 August 2019

Sangchul Hahn

Heeyoul Choi

ArXiv (abs)PDF HTML

Papers citing "Self-Knowledge Distillation in Natural Language Processing"

50 / 65 papers shown

Few-Shot Knowledge Distillation of LLMs With Counterfactual Explanations

189

24 Oct 2025

MMCD: Multi-Modal Collaborative Decision-Making for Connected Autonomy with Knowledge Distillation

174

19 Sep 2025

A Novel Compression Framework for YOLOv8: Achieving Real-Time Aerial Object Detection on Edge Devices via Structured Pruning and Channel-Wise Distillation

Melika Sabaghian

Mohammad Ali Keyvanrad

Seyyedeh Mahila Moghadami

243

16 Sep 2025

Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models

...

385

18 Apr 2025

Not All LoRA Parameters Are Essential: Insights on Inference Necessity

327

30 Mar 2025

CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems

358

25 Feb 2025

The Effect of Optimal Self-Distillation in Noisy Gaussian Mixture Model

Kaito Takanami

Takashi Takahashi

Ayaka Sakata

548

27 Jan 2025

Metric Learning with Progressive Self-Distillation for Audio-Visual Embedding LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

Donghuo Zeng

Kazushi Ikeda

SSL

247

17 Jan 2025

Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models

413

25 Nov 2024

SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Kumar Shridhar

221

24 Oct 2024

Collaborative Knowledge Distillation via a Learning-by-Education Node Community

Anestis Kaimakamidis

Ioannis Mademlis

Ioannis Pitas

395

30 Sep 2024

Mitigating the Negative Impact of Over-association for Conversational Query ProductionInformation Processing & Management (IPM), 2024

348

29 Sep 2024

Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic ModelsNeural Information Processing Systems (NeurIPS), 2024

466

19 Aug 2024

Tackling Noisy Clients in Federated Learning with End-to-end Label CorrectionInternational Conference on Information and Knowledge Management (CIKM), 2024

Xuefeng Jiang

367

08 Aug 2024

Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins

245

31 Jul 2024

Instance Temperature Knowledge Distillation

Jun Liu

502

27 Jun 2024

Decoupled Alignment for Robust Plug-and-Play Adaptation

Jerry Yao-Chieh Hu

420

03 Jun 2024

Beyond MOS: Subjective Image Quality Score Preprocessing Method Based on Perceptual Similarity

Lei Wang

Desen Yuan

253

30 Apr 2024

CTSM: Combining Trait and State Emotions for Empathetic Response ModelInternational Conference on Language Resources and Evaluation (LREC), 2024

222

22 Mar 2024

Non-Exchangeable Conformal Language Generation with Nearest Neighbors

Dennis Ulmer

Chrysoula Zerva

André F. T. Martins

401

01 Feb 2024

Learning with Noisy Low-Cost MOS for Image Quality Assessment via Dual-Bias CalibrationIEEE transactions on multimedia (IEEE TMM), 2023

Qingbo Wu

Fanman Meng

Linfeng Xu

181

27 Nov 2023

ViPE: Visualise Pretty-much EverythingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

188

16 Oct 2023

Data Upcycling Knowledge Distillation for Image Super-Resolution

373

25 Sep 2023

Teacher-Student Architecture for Knowledge Distillation: A Survey

Xue Liu

365

08 Aug 2023

Incorporating Graph Information in Transformer-based AMR ParsingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Mustafa Hajij

Pere-Lluís Huguet Cabot

Abelardo Carlos Martínez Lorenzo

Roberto Navigli

252

23 Jun 2023

UADB: Unsupervised Anomaly Detection BoosterIEEE International Conference on Data Engineering (ICDE), 2023

Jiang Bian

269

03 Jun 2023

Distilling Robustness into Natural Language Inference Models with Domain-Targeted AugmentationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Joe Stacey

Marek Rei

293

22 May 2023

Pseudo-Label Training and Model Inertia in Neural Machine TranslationInternational Conference on Learning Representations (ICLR), 2023

255

19 May 2023

Heterogeneous-Branch Collaborative Learning for Dialogue GenerationAAAI Conference on Artificial Intelligence (AAAI), 2023

Yiwei Li

Shaoxiong Feng

Bin Sun

Kan Li

162

21 Mar 2023

Improving Video Retrieval by Adaptive MarginAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021

309

09 Mar 2023

Topics in Contextualised Attention EmbeddingsEuropean Conference on Information Retrieval (ECIR), 2023

Mozhgan Talebpour

A. G. S. D. Herrera

Shoaib Jameel

235

11 Jan 2023

Filtering, Distillation, and Hard Negatives for Vision-Language Pre-TrainingComputer Vision and Pattern Recognition (CVPR), 2023

370

108

05 Jan 2023

Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness PredictionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

245

07 Nov 2022

Teacher-Student Architecture for Knowledge Learning: A Survey

Xue Liu

287

28 Oct 2022

A Novel Self-Knowledge Distillation Approach with Siamese Representation Learning for Action RecognitionVisual Communications and Image Processing (VCIP), 2021

Duc-Quang Vu

T. Phung

Jia-Ching Wang

165

03 Sep 2022

Towards Federated Learning against Noisy Labels via Local Self-RegularizationInternational Conference on Information and Knowledge Management (CIKM), 2022

217

25 Aug 2022

PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model AdaptationIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022

Liang Ding

Bo Du

234

22 Aug 2022

Label Semantic Knowledge Distillation for Unbiased Scene Graph Generation

Yi Yang

267

07 Aug 2022

Multi-Faceted Distillation of Base-Novel Commonality for Few-shot Object DetectionEuropean Conference on Computer Vision (ECCV), 2022

181

22 Jul 2022

End-to-end Spoken Conversational Question Answering: Task, Dataset and Model

211

29 Apr 2022

Robust Cross-Modal Representation Learning with Progressive Self-DistillationComputer Vision and Pattern Recognition (CVPR), 2022

287

10 Apr 2022

Adaptive Mixing of Auxiliary Losses in Supervised LearningAAAI Conference on Artificial Intelligence (AAAI), 2022

Ganesh Ramakrishnan

433

07 Feb 2022

Adaptive Image Inpainting

Maitreya Suin

Kuldeep Purohit

A. N. Rajagopalan

160

01 Jan 2022

Conditional Generative Data-free Knowledge DistillationImage and Vision Computing (IVC), 2021

434

31 Dec 2021

Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis

Liang Ding

Bo Du

295

26 Oct 2021

Language Modelling via Learning to Rank

A. Frydenlund

Gagandeep Singh

Frank Rudzicz

192

13 Oct 2021

Improving Question Answering Performance Using Knowledge Distillation and Active LearningEngineering applications of artificial intelligence (EAAI), 2021

Yasaman Boreshban

Seyed Morteza Mirbostani

Gholamreza Ghassem-Sani

Seyed Abolghasem Mirroshandel

Shahin Amiriparian

209

26 Sep 2021

Adversarial Training with Contrastive Learning in NLP

188

19 Sep 2021

Cross-Lingual Text Classification of Transliterated Hindi and Malayalam

Jitin Krishnan

Antonios Anastasopoulos

Hemant Purohit

Huzefa Rangwala

222

31 Aug 2021

Learning from Matured Dumb Teacher for Fine Generalization

197

12 Aug 2021