Large Language Models for Predictive Analysis: How Far Are They?Annual Meeting of the Association for Computational Linguistics (ACL), 2025 |
NorEval: A Norwegian Language Understanding and Generation Evaluation BenchmarkAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Verification of Autonomous Neural Car Control with KeYmaera XInternational Conference on Abstract State Machines, Alloy, B, TLA, VDM, and Z (ABZ), 2025 |
Rubrik's Cube: Testing a New Rubric for Evaluating Explanations on the CUBE datasetAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Enhancing Product Search Interfaces with Sketch-Guided Diffusion and Language AgentsThe Web Conference (WWW), 2025 |