arXiv:2302.12313
Testing AI on language comprehension tasks reveals insensitivity to underlying meaning
23 February 2023
Vittoria Dentella
Fritz Guenther
Elliot Murphy
G. Marcus
Evelina Leivada
ELM
Papers citing "Testing AI on language comprehension tasks reveals insensitivity to underlying meaning"
22 / 22 papers shown
Detecting Spelling and Grammatical Anomalies in Russian Poetry Texts
Ilya Koziev
24
0
0
07 May 2025
Language Models at the Syntax-Semantics Interface: A Case Study of the Long-Distance Binding of Chinese Reflexive ziji
Xiulin Yang
32
0
0
02 Apr 2025
Order Matters: Investigate the Position Bias in Multi-constraint Instruction Following
Jie Zeng
Qianyu He
Qingyu Ren
Jiaqing Liang
Yanghua Xiao
Weikang Zhou
Zeye Sun
Fei Yu
84
0
0
24 Feb 2025
The Philosophical Foundations of Growing AI Like A Child
Dezhi Luo
Yijiang Li
Hokin Deng
ReLM
LRM
39
1
0
15 Feb 2025
Human-like conceptual representations emerge from language prediction
Ningyu Xu
Qi Zhang
Chao Du
Qiang Luo
Xipeng Qiu
Xuanjing Huang
Menghan Zhang
60
0
0
21 Jan 2025
Representation in large language models
Cameron C. Yetman
41
1
0
03 Jan 2025
Digestion Algorithm in Hierarchical Symbolic Forests: A Fast Text Normalization Algorithm and Semantic Parsing Framework for Specific Scenarios and Lightweight Deployment
Kevin You
64
0
0
18 Dec 2024
Non-native speakers of English or ChatGPT: Who thinks better?
Mohammed Q. Shormani
LRM
70
0
0
30 Nov 2024
To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese
Yukiko Ishizuki
Tatsuki Kuribayashi
Yuichiroh Matsubayashi
Ryohei Sasano
Kentaro Inui
13
2
0
17 Apr 2024
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
Vishaal Udandarao
Ameya Prabhu
Adhiraj Ghosh
Yash Sharma
Philip H. S. Torr
Adel Bibi
Samuel Albanie
Matthias Bethge
VLM
112
43
0
04 Apr 2024
A Comparative Investigation of Compositional Syntax and Semantics in DALL-E 2
Elliot Murphy
Jill de Villiers
Sofia Morales
CoGe
25
3
0
18 Mar 2024
What is a word?
Elliot Murphy
25
0
0
19 Feb 2024
Language models align with human judgments on key grammatical constructions
Jennifer Hu
Kyle Mahowald
G. Lupyan
Anna A. Ivanova
Roger Levy
30
10
0
19 Jan 2024
A Sentence is Worth a Thousand Pictures: Can Large Language Models Understand Human Language?
G. Marcus
Evelina Leivada
Elliot Murphy
ELM
VLM
ALM
14
3
0
26 Jul 2023
Glamour muscles: why having a body is not what it means to be embodied
Shawn L. E. Beaulieu
Sam Kriegman
AI4CE
19
0
0
17 Jul 2023
Prompting is not a substitute for probability measurements in large language models
Jennifer Hu
R. Levy
20
36
0
22 May 2023
Large Linguistic Models: Investigating LLMs' metalinguistic abilities
G. Beguš
M. Dąbkowski
Ryan Rhodes
LRM
32
18
0
01 May 2023
AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays
Steffen Herbold
Annette Hautli-Janisz
Ute Heuer
Zlata Kikteva
Alexander Trautsch
DeLMO
69
22
0
24 Apr 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
203
2,232
0
22 Mar 2023
DALL-E 2 Fails to Reliably Capture Common Syntactic Processes
Evelina Leivada
Elliot Murphy
G. Marcus
130
37
0
23 Oct 2022
The Debate Over Understanding in AI's Large Language Models
Melanie Mitchell
D. Krakauer
ELM
70
196
0
14 Oct 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022