Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling. International Conference on Learning Representations (ICLR), 2024
Co-training and Co-distillation for Quality Improvement and Compression of Language Models. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023