
Title |
|---|
![]() Anyprefer: An Agentic Framework for Preference Data SynthesisInternational Conference on Learning Representations (ICLR), 2025 |
![]() What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token PatternsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Instruction-Tuning Data Synthesis from Scratch via Web ReconstructionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() REALM: A Dataset of Real-World LLM Use CasesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() ChatBench: From Static Benchmarks to Human-AI EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Navigating Rifts in Human-LLM Grounding: Study and BenchmarkAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() LLMs syntactically adapt their language use to their conversational partnerAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Synthesizing Post-Training Data for LLMs through Multi-Agent SimulationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 |
![]() SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast AsiaNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025 |
![]() Style Outweighs Substance: Failure Modes of LLM Judges in Alignment BenchmarkingInternational Conference on Learning Representations (ICLR), 2024 |