Knowing Before Saying: LLM Representations Encode Information About Chain-of-Thought Success Before CompletionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Can Large Language Models Predict Parallel Code Performance?IEEE International Symposium on High-Performance Parallel Distributed Computing (HPDC), 2025 |
StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
LEMMA: Learning from Errors for MatheMatical Advancement in LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |