HSCR: Hierarchical Self-Contrastive Rewarding for Aligning Medical Vision Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Online Iterative Self-Alignment for Radiology Report GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Persona-judge: Personalized Alignment of Large Language Models via Token-level Self-judgmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Natural Language GenerationTheoretical Issues In Natural Language Processing (TINLP), 2018 |
Robust Multi-Objective Preference Alignment with Online DPOAAAI Conference on Artificial Intelligence (AAAI), 2025 |
The Rise of Darkness: Safety-Utility Trade-Offs in Role-Playing Dialogue AgentsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Rethinking Diverse Human Preference Learning through Principal Component AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Orbit: A Framework for Designing and Evaluating Multi-objective RankersInternational Conference on Intelligent User Interfaces (IUI), 2024 |
Comparison-based Active Preference Learning for Multi-dimensional PersonalizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 |
L3Ms -- Lagrange Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024 |
COS-DPO: Conditioned One-Shot Multi-Objective Fine-Tuning FrameworkConference on Uncertainty in Artificial Intelligence (UAI), 2024 |
Inference-Time Language Model Alignment via Integrated Value GuidanceConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |