Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2303.16421
Cited By
v1
v2
v3 (latest)
ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models
International Conference on Language Resources and Evaluation (LREC), 2023
29 March 2023
Ning Bian
Xianpei Han
Le Sun
Hongyu Lin
Yaojie Lu
Xianpei Han
Shanshan Jiang
Bin Dong
KELM
ELM
AI4MH
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models"
29 / 29 papers shown
It Depends: Resolving Referential Ambiguity in Minimal Contexts with Commonsense Knowledge
Lukas Ellinger
Georg Groh
146
0
0
19 Sep 2025
BF-Max: an Efficient Bit Flipping Decoder with Predictable Decoding Failure Rate
International Symposium on Information Theory (ISIT), 2025
Alessio Baldelli
Marco Baldi
F. Chiaraluce
Paolo Santini
426
2
0
11 Jun 2025
Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks
Advances in Artificial Intelligence and Machine Learning (AAIML), 2025
Haru-Tada Sato
Fuka Matsuzaki
Jun-ichiro Takahashi
UQCV
285
0
0
24 Apr 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Information Fusion (Inf. Fusion), 2023
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Xiaoshi Zhong
LM&MA
AILaw
882
293
0
28 Jan 2025
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates
Neural Information Processing Systems (NeurIPS), 2024
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
326
8
0
28 Oct 2024
On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
Weiqi Wang
Tianqing Fang
Haochen Shi
Baixuan Xu
Wenxuan Ding
...
Wei Fan
Jiaxin Bai
Haoran Li
Xin Liu
Yangqiu Song
LRM
523
4
0
16 Jun 2024
MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset
Weiqi Wang
Yangqiu Song
LRM
462
14
0
04 Jun 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
471
181
0
30 Mar 2024
Case-Based or Rule-Based: How Do Transformers Do the Math?
Yi Hu
Xiaojuan Tang
Haotong Yang
Muhan Zhang
LRM
510
33
0
27 Feb 2024
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models
Yougang Lyu
Lingyong Yan
Shuaiqiang Wang
Haibo Shi
D. Yin
Sudipta Singha Roy
Zhumin Chen
Maarten de Rijke
Zhaochun Ren
280
11
0
17 Feb 2024
Democratizing Fine-grained Visual Recognition with Large Language Models
International Conference on Learning Representations (ICLR), 2024
Mingxuan Liu
Subhankar Roy
Wenjing Li
Zhun Zhong
Andrii Zadaianchuk
Elisa Ricci
VLM
444
24
0
24 Jan 2024
Exploring the Capabilities of ChatGPT in Ancient Chinese Translation and Person Name Recognition
Shijing Si
Siqing Zhou
Le Tang
Xiaoqing Cheng
Yugui Zhang
288
3
0
23 Dec 2023
Receive, Reason, and React: Drive as You Say with Large Language Models in Autonomous Vehicles
IEEE Intelligent Transportation Systems Magazine (ITS), 2023
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Ziran Wang
280
135
0
12 Oct 2023
GeoLLM: Extracting Geospatial Knowledge from Large Language Models
International Conference on Learning Representations (ICLR), 2023
Rohin Manvi
Samar Khanna
Gengchen Mai
Marshall Burke
David B. Lobell
Stefano Ermon
459
102
0
10 Oct 2023
A New Dialogue Response Generation Agent for Large Language Models by Asking Questions to Detect User's Intentions
Siwei Wu
Xiangqing Shen
Rui Xia
197
6
0
05 Oct 2023
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
808
235
0
04 Oct 2023
An In-depth Survey of Large Language Model-based Artificial Intelligence Agents
Pengyu Zhao
Zijian Jin
Ning Cheng
LLMAG
260
44
0
23 Sep 2023
LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles
International Conference on Language Resources and Evaluation (LREC), 2023
Shulin Huang
Shirong Ma
Hai-Tao Zheng
Mengzuo Huang
Wuhe Zou
Weidong Zhang
Haitao Zheng
LLMAG
LRM
354
46
0
21 Aug 2023
How susceptible are LLMs to Logical Fallacies?
International Conference on Language Resources and Evaluation (LREC), 2023
Amirreza Payandeh
Dan Pluth
Jordan Hosier
Xuesu Xiao
V. Gurbani
LLMAG
LRM
ELM
207
26
0
18 Aug 2023
From Military to Healthcare: Adopting and Expanding Ethical Principles for Generative Artificial Intelligence
David Oniani
Jordan Hilsman
Yifan Peng
COL
C. R. K. Poropatich
C. J. C. Pamplin
L. G. L. Legault
Yanshan Wang
AI4TS
221
13
0
04 Aug 2023
ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on Class-level Code Generation
Xueying Du
Wentai Deng
Kaixin Wang
Juntao Li
Junwei Liu
Yixuan Chen
Jiayi Feng
Chaofeng Sha
Xin Peng
Xin Peng
ELM
ALM
254
216
0
03 Aug 2023
Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?
SIGKDD Explorations (SIGKDD Explor.), 2023
Amrita Bhattacharjee
Huang Liu
DeLMO
338
90
0
02 Aug 2023
An Overview Of Temporal Commonsense Reasoning and Acquisition
Georg Wenzel
Adam Jatowt
ReLM
LRM
492
13
0
28 Jul 2023
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Lin F. Yang
Hongyang Chen
Zhao Li
Xiao Ding
Xindong Wu
KELM
386
166
0
20 Jun 2023
The Two Word Test: A Semantic Benchmark for Large Language Models
Nicholas Riccardi
Rutvik H. Desai
ELM
181
6
0
07 Jun 2023
Enhancing In-Context Learning with Answer Feedback for Multi-Span Question Answering
Natural Language Processing and Chinese Computing (NLPCC), 2023
Zixian Huang
Jiaying Zhou
Gengyang Xiao
Gong Cheng
KELM
222
14
0
07 Jun 2023
Faith and Fate: Limits of Transformers on Compositionality
Neural Information Processing Systems (NeurIPS), 2023
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
704
552
0
29 May 2023
BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jie He
U. SimonChiLok
Víctor Gutiérrez-Basulto
Jeff Z. Pan
682
12
0
25 May 2023
Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness
Bo Li
Gexiang Fang
Yang Yang
Quansen Wang
Wei Ye
Wen Zhao
Shikun Zhang
ELM
AI4MH
647
210
0
23 Apr 2023
1
Page 1 of 1