Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Is Large Language Model Performance on Reasoning Tasks Impacted by Different Ways Questions Are Asked?Annual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Logical forms complement probability in understanding language model (and human) performanceAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Large Language Models Meet Symbolic Provers for Logical Reasoning EvaluationInternational Conference on Learning Representations (ICLR), 2025 |
Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical ReasoningInternational Conference on Neural Information Processing (ICONIP), 2023 |