Testing AI on language comprehension tasks reveals insensitivity to underlying meaning

23 February 2023

Elliot Murphy

Papers citing "Testing AI on language comprehension tasks reveals insensitivity to underlying meaning"

22 / 22 papers shown

Title | Authors | Tags | Counts | Date
Detecting Spelling and Grammatical Anomalies in Russian Poetry Texts | Ilya Koziev | - | 24 0 0 | 07 May 2025
Language Models at the Syntax-Semantics Interface: A Case Study of the Long-Distance Binding of Chinese Reflexive ziji | Xiulin Yang | - | 32 0 0 | 02 Apr 2025
Order Matters: Investigate the Position Bias in Multi-constraint Instruction Following | Jie Zeng, Qianyu He, Qingyu Ren, Jiaqing Liang, Yanghua Xiao, Weikang Zhou, Zeye Sun, Fei Yu | - | 84 0 0 | 24 Feb 2025
The Philosophical Foundations of Growing AI Like A Child | Dezhi Luo, Yijiang Li, Hokin Deng | ReLM, LRM | 39 1 0 | 15 Feb 2025
Human-like conceptual representations emerge from language prediction | Ningyu Xu, Qi Zhang, Chao Du, Qiang Luo, Xipeng Qiu, Xuanjing Huang, Menghan Zhang | - | 60 0 0 | 21 Jan 2025
Representation in large language models | Cameron C. Yetman | - | 41 1 0 | 03 Jan 2025
Digestion Algorithm in Hierarchical Symbolic Forests: A Fast Text Normalization Algorithm and Semantic Parsing Framework for Specific Scenarios and Lightweight Deployment | Kevin You | - | 64 0 0 | 18 Dec 2024
Non-native speakers of English or ChatGPT: Who thinks better? | Mohammed Q. Shormani | LRM | 70 0 0 | 30 Nov 2024
To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese | Yukiko Ishizuki, Tatsuki Kuribayashi, Yuichiroh Matsubayashi, Ryohei Sasano, Kentaro Inui | - | 13 2 0 | 17 Apr 2024
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance | Vishaal Udandarao, Ameya Prabhu, Adhiraj Ghosh, Yash Sharma, Philip H. S. Torr, Adel Bibi, Samuel Albanie, Matthias Bethge | VLM | 112 43 0 | 04 Apr 2024
A Comparative Investigation of Compositional Syntax and Semantics in DALL-E 2 | Elliot Murphy, Jill de Villiers, Sofia Morales | CoGe | 25 3 0 | 18 Mar 2024
What is a word? | Elliot Murphy | - | 25 0 0 | 19 Feb 2024
Language models align with human judgments on key grammatical constructions | Jennifer Hu, Kyle Mahowald, G. Lupyan, Anna A. Ivanova, Roger Levy | - | 30 10 0 | 19 Jan 2024
A Sentence is Worth a Thousand Pictures: Can Large Language Models Understand Human Language? | G. Marcus, Evelina Leivada, Elliot Murphy | ELM, VLM, ALM | 14 3 0 | 26 Jul 2023
Glamour muscles: why having a body is not what it means to be embodied | Shawn L. E. Beaulieu, Sam Kriegman | AI4CE | 19 0 0 | 17 Jul 2023
Prompting is not a substitute for probability measurements in large language models | Jennifer Hu, R. Levy | - | 20 36 0 | 22 May 2023
Large Linguistic Models: Investigating LLMs' metalinguistic abilities | G. Beguš, M. Dąbkowski, Ryan Rhodes | LRM | 32 18 0 | 01 May 2023
AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays | Steffen Herbold, Annette Hautli-Janisz, Ute Heuer, Zlata Kikteva, Alexander Trautsch | DeLMO | 69 22 0 | 24 Apr 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, J. Gehrke, Eric Horvitz, ..., Scott M. Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang | ELM, AI4MH, AI4CE, ALM | 203 2,232 0 | 22 Mar 2023
DALL-E 2 Fails to Reliably Capture Common Syntactic Processes | Evelina Leivada, Elliot Murphy, G. Marcus | - | 130 37 0 | 23 Oct 2022
The Debate Over Understanding in AI's Large Language Models | Melanie Mitchell, D. Krakauer | ELM | 70 196 0 | 14 Oct 2022
Training language models to follow instructions with human feedback | Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, ..., Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan J. Lowe | OSLM, ALM | 301 11,730 0 | 04 Mar 2022