Self-Knowledge Distillation in Natural Language Processing

Recent Advances in Natural Language Processing (RANLP), 2019
2 August 2019
Sangchul Hahn
Heeyoul Choi

Papers citing "Self-Knowledge Distillation in Natural Language Processing"

50 / 65 papers shown
Few-Shot Knowledge Distillation of LLMs With Counterfactual Explanations
Faisal Hamman
Pasan Dissanayake
Yanjun Fu
Sanghamitra Dutta
189
1
0
24 Oct 2025
MMCD: Multi-Modal Collaborative Decision-Making for Connected Autonomy with Knowledge Distillation
Rui Liu
Zikang Wang
Peng Gao
Yu Shen
Pratap Tokekar
Ming-Chyuan Lin
174
4
0
19 Sep 2025
A Novel Compression Framework for YOLOv8: Achieving Real-Time Aerial Object Detection on Edge Devices via Structured Pruning and Channel-Wise Distillation
Melika Sabaghian
Mohammad Ali Keyvanrad
Seyyedeh Mahila Moghadami
243
0
0
16 Sep 2025
Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models
Junjie Yang
Junhao Song
Xudong Han
Ziqian Bi
Pohsun Feng
...
Yujiao Shi
Qian Niu
Cheng Fei
Keyu Chen
Ming Liu
VLM
385
4
0
18 Apr 2025
Not All LoRA Parameters Are Essential: Insights on Inference Necessity
Guanhua Chen
Yutong Yao
Ci-Jun Gao
Lidia S. Chao
Feng Wan
Yang Li
327
1
0
30 Mar 2025
CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems
Rui Liu
Yu-cui Shen
Peng Gao
Erfaun Noorani
Ming C. Lin
358
4
0
25 Feb 2025
The Effect of Optimal Self-Distillation in Noisy Gaussian Mixture Model
Kaito Takanami
Takashi Takahashi
Ayaka Sakata
548
4
0
27 Jan 2025
Metric Learning with Progressive Self-Distillation for Audio-Visual Embedding Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Donghuo Zeng
Kazushi Ikeda
SSL
247
2
0
17 Jan 2025
Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models
Y. Fu
Yin Yu
Xiaotian Han
Runchao Li
Xianxuan Long
Haotian Yu
Pan Li
SyDa
413
0
0
25 Nov 2024
SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Shivam Adarsh
Kumar Shridhar
Caglar Gulcehre
Nicholas Monath
Mrinmaya Sachan
LRM
221
5
0
24 Oct 2024
Collaborative Knowledge Distillation via a Learning-by-Education Node Community
Anestis Kaimakamidis
Ioannis Mademlis
Ioannis Pitas
395
1
0
30 Sep 2024
Mitigating the Negative Impact of Over-association for Conversational Query Production
Information Processing & Management (IPM), 2024
Ante Wang
Linfeng Song
Zijun Min
Ge Xu
Xiaoli Wang
Junfeng Yao
Jinsong Su
348
4
0
29 Sep 2024
Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models
Neural Information Processing Systems (NeurIPS), 2024
Aviv Bick
Kevin Y. Li
Eric P. Xing
J. Zico Kolter
Albert Gu
Mamba
466
54
0
19 Aug 2024
Tackling Noisy Clients in Federated Learning with End-to-end Label Correction
International Conference on Information and Knowledge Management (CIKM), 2024
Xuefeng Jiang
Sheng Sun
Jia Li
Jingjing Xue
Runhan Li
Zhiyuan Wu
Gang Xu
Yuwei Wang
Min Liu
FedML
367
25
0
08 Aug 2024
Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins
Lukas Gienapp
Niklas Deckers
Martin Potthast
Harrisen Scells
245
2
0
31 Jul 2024
Instance Temperature Knowledge Distillation
Zitao Gao
Yuxi Zhou
Jia Gong
Jun Liu
Zhigang Tu
502
5
0
27 Jun 2024
Decoupled Alignment for Robust Plug-and-Play Adaptation
Haozheng Luo
Jiahao Yu
Wenxin Zhang
Jialong Li
Jerry Yao-Chieh Hu
Xingyu Xing
Han Liu
420
12
0
03 Jun 2024
Beyond MOS: Subjective Image Quality Score Preprocessing Method Based on Perceptual Similarity
Lei Wang
Desen Yuan
253
2
0
30 Apr 2024
CTSM: Combining Trait and State Emotions for Empathetic Response Model
International Conference on Language Resources and Evaluation (LREC), 2024
Yufeng Wang
Chao Chen
Zhou Yang
Shuhui Wang
Xiangwen Liao
222
8
0
22 Mar 2024
Non-Exchangeable Conformal Language Generation with Nearest Neighbors
Dennis Ulmer
Chrysoula Zerva
André F. T. Martins
401
17
0
01 Feb 2024
Learning with Noisy Low-Cost MOS for Image Quality Assessment via Dual-Bias Calibration
IEEE Transactions on Multimedia (IEEE TMM), 2023
Lei Wang
Qingbo Wu
Desen Yuan
K. Ngan
Hongliang Li
Fanman Meng
Linfeng Xu
181
6
0
27 Nov 2023
ViPE: Visualise Pretty-much Everything
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hassan Shahmohammadi
Adhiraj Ghosh
Hendrik P. A. Lensch
DiffM
188
3
0
16 Oct 2023
Data Upcycling Knowledge Distillation for Image Super-Resolution
Yun-feng Zhang
Wei Li
Simiao Li
Hanting Chen
Zhaopeng Tu
Wenjun Wang
Bingyi Jing
Hai-lin Wang
Jie Hu
373
8
0
25 Sep 2023
Teacher-Student Architecture for Knowledge Distillation: A Survey
Chengming Hu
Xuan Li
Danyang Liu
Haolun Wu
Xi Chen
Ju Wang
Xue Liu
365
44
0
08 Aug 2023
Incorporating Graph Information in Transformer-based AMR Parsing
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Mustafa Hajij
Pere-Lluís Huguet Cabot
Abelardo Carlos Martínez Lorenzo
Roberto Navigli
252
20
0
23 Jun 2023
UADB: Unsupervised Anomaly Detection Booster
IEEE International Conference on Data Engineering (ICDE), 2023
Hangting Ye
Zhining Liu
Xinyi Shen
Wei Cao
Shun Zheng
Xiaofan Gui
Huishuai Zhang
Yi Chang
Jiang Bian
269
9
0
03 Jun 2023
Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Joe Stacey
Marek Rei
293
5
0
22 May 2023
Pseudo-Label Training and Model Inertia in Neural Machine Translation
International Conference on Learning Representations (ICLR), 2023
B. Hsu
Anna Currey
Xing Niu
Maria Nădejde
Georgiana Dinu
ODL
255
3
0
19 May 2023
Heterogeneous-Branch Collaborative Learning for Dialogue Generation
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yiwei Li
Shaoxiong Feng
Bin Sun
Kan Li
162
4
0
21 Mar 2023
Improving Video Retrieval by Adaptive Margin
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021
Feng He
Qi Wang
Zhifan Feng
Wenbin Jiang
Yajuan Lü
Yong Zhu
Xiao Tan
309
25
0
09 Mar 2023
Topics in Contextualised Attention Embeddings
European Conference on Information Retrieval (ECIR), 2023
Mozhgan Talebpour
A. G. S. D. Herrera
Shoaib Jameel
235
3
0
11 Jan 2023
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Computer Vision and Pattern Recognition (CVPR), 2023
Filip Radenovic
Abhimanyu Dubey
Abhishek Kadian
Todor Mihaylov
Simon Vandenhende
Yash J. Patel
Y. Wen
Vignesh Ramanathan
D. Mahajan
VLM
370
108
0
05 Jan 2023
Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Thong Nguyen
Xiaobao Wu
Anh Tuan Luu
Cong-Duy Nguyen
Zhen Hai
Lidong Bing
245
17
0
07 Nov 2022
Teacher-Student Architecture for Knowledge Learning: A Survey
Chengming Hu
Xuan Li
Dan Liu
Xi Chen
Ju Wang
Xue Liu
287
44
0
28 Oct 2022
A Novel Self-Knowledge Distillation Approach with Siamese Representation Learning for Action Recognition
Visual Communications and Image Processing (VCIP), 2021
Duc-Quang Vu
T. Phung
Jia-Ching Wang
165
10
0
03 Sep 2022
Towards Federated Learning against Noisy Labels via Local Self-Regularization
International Conference on Information and Knowledge Management (CIKM), 2022
Xue Jiang
Sheng Sun
Yuwei Wang
Min Liu
217
52
0
25 Aug 2022
PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
VLM
CLL
234
50
0
22 Aug 2022
Label Semantic Knowledge Distillation for Unbiased Scene Graph Generation
Lin Li
Long Chen
Hanrong Shi
Wenxiao Wang
Jian Shao
Yi Yang
Jun Xiao
VLM
267
30
0
07 Aug 2022
Multi-Faceted Distillation of Base-Novel Commonality for Few-shot Object Detection
European Conference on Computer Vision (ECCV), 2022
Shuang Wu
Wenjie Pei
Dianwen Mei
Fanglin Chen
Jiandong Tian
Guangming Lu
VLM
ObjD
181
42
0
22 Jul 2022
End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Chenyu You
Polydoros Giannouris
Fenglin Liu
Shen Ge
Xian Wu
Yuexian Zou
AuLLM
211
57
0
29 Apr 2022
Robust Cross-Modal Representation Learning with Progressive Self-Distillation
Computer Vision and Pattern Recognition (CVPR), 2022
A. Andonian
Shixing Chen
Raffay Hamid
VLM
287
68
0
10 Apr 2022
Adaptive Mixing of Auxiliary Losses in Supervised Learning
AAAI Conference on Artificial Intelligence (AAAI), 2022
D. Sivasubramanian
Ayush Maheshwari
Pradeep Shenoy
A. Prathosh
Ganesh Ramakrishnan
433
7
0
07 Feb 2022
Adaptive Image Inpainting
Maitreya Suin
Kuldeep Purohit
A. N. Rajagopalan
160
0
0
01 Jan 2022
Conditional Generative Data-free Knowledge Distillation
Image and Vision Computing (IVC), 2021
Xinyi Yu
Ling Yan
Yang Yang
Libo Zhou
Linlin Ou
434
8
0
31 Dec 2021
Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis
Juhua Liu
Qihuang Zhong
Liang Ding
Hua Jin
Bo Du
Dacheng Tao
295
32
0
26 Oct 2021
Language Modelling via Learning to Rank
A. Frydenlund
Gagandeep Singh
Frank Rudzicz
192
9
0
13 Oct 2021
Improving Question Answering Performance Using Knowledge Distillation and Active Learning
Engineering Applications of Artificial Intelligence (EAAI), 2021
Yasaman Boreshban
Seyed Morteza Mirbostani
Gholamreza Ghassem-Sani
Seyed Abolghasem Mirroshandel
Shahin Amiriparian
209
18
0
26 Sep 2021
Adversarial Training with Contrastive Learning in NLP
Daniela N. Rim
DongNyeong Heo
Heeyoul Choi
AAML
188
13
0
19 Sep 2021
Cross-Lingual Text Classification of Transliterated Hindi and Malayalam
Jitin Krishnan
Antonios Anastasopoulos
Hemant Purohit
Huzefa Rangwala
222
16
0
31 Aug 2021
Learning from Matured Dumb Teacher for Fine Generalization
Heeseung Jung
Kangil Kim
Hoyong Kim
Jong-Hun Shin
197
2
0
12 Aug 2021