
Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation

Annual Meeting of the Association for Computational Linguistics (ACL), 2023
14 May 2023
Songming Zhang
Yunlong Liang
Shuaibo Wang
Wenjuan Han
Jian Liu
Jinan Xu
ArXiv (abs) · PDF · HTML · GitHub (736★)

Papers citing "Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation"

6 papers shown
Multi-Hypothesis Distillation of Multilingual Neural Translation Models for Low-Resource Languages
Aarón Galiano-Jiménez
Juan Antonio Pérez-Ortiz
F. Sánchez-Martínez
Víctor M. Sánchez-Cartagena
29 Jul 2025
Distilled Circuits: A Mechanistic Study of Internal Restructuring in Knowledge Distillation
Reilly Haskins
Benjamin Adams
16 May 2025
A Dual-Space Framework for General Knowledge Distillation of Large Language Models
Wei Wei
Songming Zhang
Yunlong Liang
Fandong Meng
Yufeng Chen
Jinan Xu
Jie Zhou
15 Apr 2025
Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models
Jun Rao
Xuebo Liu
Zepeng Lin
Liang Ding
Jing Li
Dacheng Tao
Min Zhang
19 Sep 2024
Towards Lifelong Learning of Large Language Models: A Survey
Junhao Zheng
Shengjie Qiu
Chengming Shi
Qianli Ma
10 Jun 2024
D$^2$TV: Dual Knowledge Distillation and Target-oriented Vision Modeling for Many-to-Many Multimodal Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yunlong Liang
Fandong Meng
Jiaan Wang
Jinan Xu
Jie Zhou
22 May 2023