Structured, flexible, and robust: benchmarking and improving large
language models towards more human-like behavior in out-of-distribution
reasoning tasks

Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks

11 May 2022

Papers citing "Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks"

10 / 10 papers shown

Title
Distinguishing AI-Generated and Human-Written Text Through Psycholinguistic Analysis Chidimma Opara DeLMO 32 0 0 03 May 2025
A Survey of AI Agent Protocols Y. Yang Huacan Chai Y. Song S. Qi Muning Wen ... Gaowei Chang W. Liu Ying Wen Yong Yu W. Zhang LLMAG 56 1 0 23 Apr 2025
Testing the limits of fine-tuning to improve reasoning in vision language models Luca M. Schulze Buschoff Konstantinos Voudouris Elif Akata Matthias Bethge Joshua B. Tenenbaum Eric Schulz LRM VLM Presented at ResearchTrend Connect \| VLM on 14 Mar 2025 113 0 1 24 Feb 2025
Large Language Models and Cognitive Science: A Comprehensive Review of Similarities, Differences, and Challenges Qian Niu Junyu Liu Ziqian Bi Pohsun Feng Benji Peng ... Ming Li Lawrence KQ Yan Yichao Zhang Caitlyn Heqi Yin Cheng Fei 25 13 0 04 Sep 2024
People use fast, goal-directed simulation to reason about novel games Cedegao E. Zhang Katherine M. Collins L. Wong Adrian Weller Adrian Weller Joshua B. Tenenbaum LRM 24 0 0 19 Jul 2024
PDDLEGO: Iterative Planning in Textual Environments Li Zhang Peter Alexander Jansen Tianyi Zhang Peter Clark Chris Callison-Burch Niket Tandon LM&Ro 13 4 0 30 May 2024
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models Wei He Shichun Liu Jun Zhao Yiwen Ding Yi Lu Zhiheng Xi Tao Gui Qi Zhang Xuanjing Huang 29 1 0 01 Apr 2024
Distilling Script Knowledge from Large Language Models for Constrained Language Planning Siyu Yuan Jiangjie Chen Ziquan Fu Xuyang Ge Soham Shah C. R. Jankowski Yanghua Xiao Deqing Yang 25 46 0 09 May 2023
Dissociating language and thought in large language models Kyle Mahowald Anna A. Ivanova I. Blank Nancy Kanwisher J. Tenenbaum Evelina Fedorenko ELM ReLM 13 205 0 16 Jan 2023
Skill Induction and Planning with Latent Language Pratyusha Sharma Antonio Torralba Jacob Andreas LM&Ro 178 108 0 04 Oct 2021