LLMR: Knowledge Distillation with a Large Language Model-Induced RewardInternational Conference on Language Resources and Evaluation (LREC), 2024 |
MiniLLM: Knowledge Distillation of Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023 |
Token Imbalance Adaptation for Radiology Report GenerationACM Conference on Health, Inference, and Learning (CHIL), 2023 |
Inverse Reinforcement Learning for Text SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |