2005.00683
Birds have four legs?! NumerSense: Probing Numerical Commonsense Knowledge of Pre-trained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
2 May 2020
Bill Yuchen Lin
Seyeon Lee
Rahul Khanna
Xiang Ren
AIMat
Papers citing
"Birds have four legs?! NumerSense: Probing Numerical Commonsense Knowledge of Pre-trained Language Models"
50 / 109 papers shown
Beyond Plain Demos: A Demo-centric Anchoring Paradigm for In-Context Learning in Alzheimer's Disease Detection
Puzhen Su
Haoran Yin
Yongzhu Miao
Jintao Tang
Shasha Li
Ting Wang
133
0
0
10 Nov 2025
Retrieval-Constrained Decoding Reveals Underestimated Parametric Knowledge in Language Models
Rajaa El Hamdani
Samy Haffoudhi
Nils Holzenberger
Fabian M. Suchanek
Thomas Bonald
Fragkiskos D. Malliaros
KELM
188
0
0
27 Sep 2025
Intermediate Languages Matter: Formal Languages and LLMs affect Neurosymbolic Reasoning
International Conference on Semantic Systems (i-Semantics), 2025
Alexander Beiser
David Penz
Nysret Musliu
LRM
211
0
0
04 Sep 2025
Is Large Language Model Performance on Reasoning Tasks Impacted by Different Ways Questions Are Asked?
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Seok Hwan Song
Mohna Chakraborty
Qi Li
Wallapak Tavanapong
ELM
LRM
257
1
0
21 Jul 2025
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
I. Gevers
Victor De Marez
Luna De Bruyne
Walter Daelemans
392
2
0
31 Mar 2025
Commonsense Reasoning in Arab Culture
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Abdelrahman Boda Sadallah
Junior Cedric Tonga
Khalid Almubarak
Saeed Almheiri
Farah Atif
Chatrine Qwaider
Karima Kadaoui
Sara Shatnawi
Yaser Alesh
Fajri Koto
LRM
486
17
0
18 Feb 2025
Number Cookbook: Number Understanding of Language Models and How to Improve It
International Conference on Learning Representations (ICLR), 2024
Haotong Yang
Yi Hu
Shijia Kang
Zhouchen Lin
Muhan Zhang
LRM
608
42
0
06 Nov 2024
The Factuality of Large Language Models in the Legal Domain
International Conference on Information and Knowledge Management (CIKM), 2024
Rajaa El Hamdani
Thomas Bonald
Fragkiskos D. Malliaros
Nils Holzenberger
Fabian M. Suchanek
AILaw
HILM
381
13
0
18 Sep 2024
Towards a Generative Approach for Emotion Detection and Reasoning
Ankita Bhaumik
T. Strzalkowski
ReLM
LRM
259
5
0
09 Aug 2024
Development of Cognitive Intelligence in Pre-trained Language Models
Raj Sanjay Shah
Khushi Bhardwaj
Sashank Varma
481
3
0
01 Jul 2024
Paraphrase Types Elicit Prompt Engineering Capabilities
Jan Philip Wahle
Terry Ruas
Yang Xu
Bela Gipp
582
19
0
28 Jun 2024
RUPBench: Benchmarking Reasoning Under Perturbations for Robustness Evaluation in Large Language Models
Yuqing Wang
Yun Zhao
LRM
AAML
ELM
311
9
0
16 Jun 2024
Are LLMs classical or nonmonotonic reasoners? Lessons from generics
Alina Leidinger
R. Rooij
Ekaterina Shutova
LRM
362
9
0
05 Jun 2024
NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models
Ancheng Xu
Minghuan Tan
Lei Wang
Min Yang
Ruifeng Xu
LRM
215
1
0
05 Jun 2024
Can Large Language Models put 2 and 2 together? Probing for Entailed Arithmetical Relationships
D. Panas
S. Seth
V. Belle
ReLM
LRM
269
6
0
30 Apr 2024
Exploring Internal Numeracy in Language Models: A Case Study on ALBERT
Ulme Wennberg
G. Henter
MILM
361
2
0
25 Apr 2024
IndoCulture: Exploring Geographically-Influenced Cultural Commonsense Reasoning Across Eleven Indonesian Provinces
Transactions of the Association for Computational Linguistics (TACL), 2024
Fajri Koto
Rahmad Mahendra
Nurul Aisyah
Timothy Baldwin
LRM
407
43
0
02 Apr 2024
Rule or Story, Which is a Better Commonsense Expression for Talking with Large Language Models?
Ning Bian
Xianpei Han
Hongyu Lin
Yaojie Lu
Ben He
Le Sun
326
3
0
22 Feb 2024
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries
Jing Han Sun
Ali Emami
396
6
0
20 Feb 2024
Beyond Lines and Circles: Unveiling the Geometric Reasoning Gap in Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Spyridon Mouselinos
Henryk Michalewski
Mateusz Malinowski
LRM
251
14
0
06 Feb 2024
Temporal Blind Spots in Large Language Models
Web Search and Data Mining (WSDM), 2024
Jonas Wallat
Adam Jatowt
Avishek Anand
447
9
0
22 Jan 2024
In-context Learning with Retrieved Demonstrations for Language Models: A Survey
Man Luo
Xin Xu
Yue Liu
Panupong Pasupat
Mehran Kazemi
RALM
860
83
0
21 Jan 2024
Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models
Yuqing Wang
Yun Zhao
VLM
ReLM
LRM
378
27
0
29 Dec 2023
Enhancing Quantitative Reasoning Skills of Large Language Models through Dimension Perception
IEEE International Conference on Data Engineering (ICDE), 2023
Yuncheng Huang
Qi He
Jiaqing Liang
Sihang Jiang
Yanghua Xiao
Yunwen Chen
LRM
209
5
0
29 Dec 2023
CORECODE: A Common Sense Annotated Dialogue Dataset with Benchmark Tasks for Chinese Large Language Models
Dan Shi
Chaobin You
Jian-Tao Huang
Taihao Li
Deyi Xiong
LRM
250
2
0
20 Dec 2023
Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Mubashara Akhtar
Abhilash Shankarampeta
Vivek Gupta
Arpit Patil
O. Cocarascu
Elena Simperl
LRM
ReLM
LMTD
ELM
287
45
0
03 Nov 2023
ROME: Evaluating Pre-trained Vision-Language Models on Reasoning beyond Visual Common Sense
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kankan Zhou
Eason Lai
Wei Bin Au Yeong
K. Mouratidis
Jing Jiang
ReLM
LRM
VLM
254
25
0
30 Oct 2023
CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Mete Ismayilzada
Debjit Paul
Syrielle Montariol
Mor Geva
Antoine Bosselut
LRM
342
8
0
23 Oct 2023
GeoLLM: Extracting Geospatial Knowledge from Large Language Models
International Conference on Learning Representations (ICLR), 2023
Rohin Manvi
Samar Khanna
Gengchen Mai
Marshall Burke
David B. Lobell
Stefano Ermon
489
103
0
10 Oct 2023
Crystal: Introspective Reasoners Reinforced with Self-Feedback
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hamish Ivison
Ramakanth Pasunuru
Hannaneh Hajishirzi
Yejin Choi
Asli Celikyilmaz
LRM
ReLM
297
33
0
07 Oct 2023
Can NLP Models 'Identify', 'Distinguish', and 'Justify' Questions that Don't have a Definitive Answer?
Ayushi Agarwal
Nisarg Patel
Neeraj Varshney
Mihir Parmar
Pavan Mallina
Aryan Bhavin Shah
Srihari Sangaraju
Tirth Patel
Nihar Thakkar
Chitta Baral
ELM
256
4
0
08 Sep 2023
TaskLAMA: Probing the Complex Task Understanding of Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2023
Quan Yuan
Mehran Kazemi
Xinyuan Xu
Isaac Noble
Vaiva Imbrasaite
Deepak Ramachandran
LRM
258
22
0
29 Aug 2023
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
International Conference on Learning Representations (ICLR), 2023
Jian Xie
Kai Zhang
Jiangjie Chen
Renze Lou
Yu-Chuan Su
RALM
819
285
0
22 May 2023
The Web Can Be Your Oyster for Improving Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Jingyuan Wang
Jian-Yun Nie
Ji-Rong Wen
RALM
KELM
460
8
0
18 May 2023
Human Behavioral Benchmarking: Numeric Magnitude Comparison Effects in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Raj Sanjay Shah
Vijay Marupudi
Reba Koenen
Khushi Bhardwaj
Sashank Varma
414
11
0
18 May 2023
Completeness, Recall, and Negation in Open-World Knowledge Bases: A Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
Simon Razniewski
Hiba Arnaout
Tuan-Phong Nguyen
Fabian M. Suchanek
270
15
0
09 May 2023
Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hamish Ivison
Wenya Wang
Dianzhuo Wang
Noah A. Smith
Yejin Choi
Hannaneh Hajishirzi
VLM
341
66
0
05 May 2023
KitchenScale: Learning to predict ingredient quantities from recipe contexts
Expert systems with applications (ESWA), 2023
Donghee Choi
Mogan Gim
Samy Badreddine
Hajung Kim
Donghyeon Park
Jaewoo Kang
193
11
0
21 Apr 2023
ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models
International Conference on Language Resources and Evaluation (LREC), 2023
Ning Bian
Xianpei Han
Le Sun
Hongyu Lin
Yaojie Lu
Ben He
Shanshan Jiang
Bin Dong
KELM
ELM
AI4MH
LRM
372
99
0
29 Mar 2023
Language Model Behavior: A Comprehensive Survey
International Conference on Computational Logic (ICCL), 2023
Tyler A. Chang
Benjamin Bergen
VLM
LRM
LM&MA
536
157
0
20 Mar 2023
Can neural networks do arithmetic? A survey on the elementary numerical skills of state-of-the-art deep learning models
Applied Sciences (Appl. Sci.), 2023
Alberto Testolin
AIMat
278
30
0
14 Mar 2023
The Life Cycle of Knowledge in Big Language Models: A Survey
Machine Intelligence Research (MIR), 2023
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
KELM
306
31
0
14 Mar 2023
Class Cardinality Comparison as a Fermi Problem
The Web Conference (WWW), 2023
Tuan-Phong Nguyen
Simon Razniewski
Gerhard Weikum
164
2
0
08 Mar 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
844
17
0
17 Feb 2023
Learning to Initialize: Can Meta Learning Improve Cross-task Generalization in Prompt Tuning?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Chengwei Qin
Cunliang Kong
Ruochen Zhao
Shafiq Joty
VLM
LRM
437
17
0
16 Feb 2023
Commonsense Reasoning for Conversational AI: A Survey of the State of the Art
Christopher Richardson
Larry Heck
LRM
304
10
0
15 Feb 2023
Benchmarks for Automated Commonsense Reasoning: A Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
E. Davis
ELM
LRM
459
83
0
09 Feb 2023
Understanding Finetuning for Factual Knowledge Extraction from Language Models
Mehran Kazemi
Sid Mittal
Deepak Ramachandran
KELM
287
14
0
26 Jan 2023
A Survey of Deep Learning for Mathematical Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Pan Lu
Liang Qiu
Wenhao Yu
Sean Welleck
Kai-Wei Chang
ReLM
LRM
401
193
0
20 Dec 2022
Analogical Math Word Problems Solving with Enhanced Problem-Solution Association
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhenwen Liang
Jipeng Zhang
Xiangliang Zhang
AIMat
230
19
0
01 Dec 2022