STORE: Semantic Tokenization, Orthogonal Rotation and Efficient Attention for Scaling Up Ranking ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2025 |
InfoPO: On Mutual Information Maximization for Large Language Model AlignmentNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025 |
Syntactic and Semantic Control of Large Language Models via Sequential Monte CarloInternational Conference on Learning Representations (ICLR), 2025 |
SimPER: A Minimalist Approach to Preference Alignment without HyperparametersInternational Conference on Learning Representations (ICLR), 2025 |
SR: Teaching LLMs to Self-verify and Self-correct via Reinforcement LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |