ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.16421
  4. Cited By
ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of
  Commonsense Problem in Large Language Models
v1v2v3 (latest)

ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models

International Conference on Language Resources and Evaluation (LREC), 2023
29 March 2023
Ning Bian
Xianpei Han
Le Sun
Hongyu Lin
Yaojie Lu
Xianpei Han
Shanshan Jiang
Bin Dong
    KELMELMAI4MHLRM
ArXiv (abs)PDFHTML

Papers citing "ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models"

29 / 29 papers shown
It Depends: Resolving Referential Ambiguity in Minimal Contexts with Commonsense Knowledge
It Depends: Resolving Referential Ambiguity in Minimal Contexts with Commonsense Knowledge
Lukas Ellinger
Georg Groh
146
0
0
19 Sep 2025
BF-Max: an Efficient Bit Flipping Decoder with Predictable Decoding Failure Rate
BF-Max: an Efficient Bit Flipping Decoder with Predictable Decoding Failure RateInternational Symposium on Information Theory (ISIT), 2025
Alessio Baldelli
Marco Baldi
F. Chiaraluce
Paolo Santini
426
2
0
11 Jun 2025
Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks
Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching TasksAdvances in Artificial Intelligence and Machine Learning (AAIML), 2025
Haru-Tada Sato
Fuka Matsuzaki
Jun-ichiro Takahashi
UQCV
285
0
0
24 Apr 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and EthicsInformation Fusion (Inf. Fusion), 2023
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Xiaoshi Zhong
LM&MAAILaw
882
293
0
28 Jan 2025
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with
  Annual Updates
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual UpdatesNeural Information Processing Systems (NeurIPS), 2024
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
326
8
0
28 Oct 2024
On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
Weiqi Wang
Tianqing Fang
Haochen Shi
Baixuan Xu
Wenxuan Ding
...
Wei Fan
Jiaxin Bai
Haoran Li
Xin Liu
Yangqiu Song
LRM
523
4
0
16 Jun 2024
MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset
MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset
Weiqi Wang
Yangqiu Song
LRM
462
14
0
04 Jun 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept,
  Taxonomy, and Methods
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAGKELMOffRLLM&Ro
471
181
0
30 Mar 2024
Case-Based or Rule-Based: How Do Transformers Do the Math?
Case-Based or Rule-Based: How Do Transformers Do the Math?
Yi Hu
Xiaojuan Tang
Haotong Yang
Muhan Zhang
LRM
510
33
0
27 Feb 2024
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models
Yougang Lyu
Lingyong Yan
Shuaiqiang Wang
Haibo Shi
D. Yin
Sudipta Singha Roy
Zhumin Chen
Maarten de Rijke
Zhaochun Ren
280
11
0
17 Feb 2024
Democratizing Fine-grained Visual Recognition with Large Language Models
Democratizing Fine-grained Visual Recognition with Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024
Mingxuan Liu
Subhankar Roy
Wenjing Li
Zhun Zhong
Andrii Zadaianchuk
Elisa Ricci
VLM
444
24
0
24 Jan 2024
Exploring the Capabilities of ChatGPT in Ancient Chinese Translation and
  Person Name Recognition
Exploring the Capabilities of ChatGPT in Ancient Chinese Translation and Person Name Recognition
Shijing Si
Siqing Zhou
Le Tang
Xiaoqing Cheng
Yugui Zhang
288
3
0
23 Dec 2023
Receive, Reason, and React: Drive as You Say with Large Language Models
  in Autonomous Vehicles
Receive, Reason, and React: Drive as You Say with Large Language Models in Autonomous VehiclesIEEE Intelligent Transportation Systems Magazine (ITS), 2023
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Ziran Wang
280
135
0
12 Oct 2023
GeoLLM: Extracting Geospatial Knowledge from Large Language Models
GeoLLM: Extracting Geospatial Knowledge from Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023
Rohin Manvi
Samar Khanna
Gengchen Mai
Marshall Burke
David B. Lobell
Stefano Ermon
459
102
0
10 Oct 2023
A New Dialogue Response Generation Agent for Large Language Models by
  Asking Questions to Detect User's Intentions
A New Dialogue Response Generation Agent for Large Language Models by Asking Questions to Detect User's Intentions
Siwei Wu
Xiangqing Shen
Rui Xia
197
6
0
05 Oct 2023
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
808
235
0
04 Oct 2023
An In-depth Survey of Large Language Model-based Artificial Intelligence
  Agents
An In-depth Survey of Large Language Model-based Artificial Intelligence Agents
Pengyu Zhao
Zijian Jin
Ning Cheng
LLMAG
260
44
0
23 Sep 2023
LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete
  Information from Lateral Thinking Puzzles
LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking PuzzlesInternational Conference on Language Resources and Evaluation (LREC), 2023
Shulin Huang
Shirong Ma
Hai-Tao Zheng
Mengzuo Huang
Wuhe Zou
Weidong Zhang
Haitao Zheng
LLMAGLRM
354
46
0
21 Aug 2023
How susceptible are LLMs to Logical Fallacies?
How susceptible are LLMs to Logical Fallacies?International Conference on Language Resources and Evaluation (LREC), 2023
Amirreza Payandeh
Dan Pluth
Jordan Hosier
Xuesu Xiao
V. Gurbani
LLMAGLRMELM
207
26
0
18 Aug 2023
From Military to Healthcare: Adopting and Expanding Ethical Principles
  for Generative Artificial Intelligence
From Military to Healthcare: Adopting and Expanding Ethical Principles for Generative Artificial Intelligence
David Oniani
Jordan Hilsman
Yifan Peng
COL
C. R. K. Poropatich
C. J. C. Pamplin
L. G. L. Legault
Yanshan Wang
AI4TS
221
13
0
04 Aug 2023
ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on
  Class-level Code Generation
ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on Class-level Code Generation
Xueying Du
Wentai Deng
Kaixin Wang
Juntao Li
Junwei Liu
Yixuan Chen
Jiayi Feng
Chaofeng Sha
Xin Peng
Xin Peng
ELMALM
254
216
0
03 Aug 2023
Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?
Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?SIGKDD Explorations (SIGKDD Explor.), 2023
Amrita Bhattacharjee
Huang Liu
DeLMO
338
90
0
02 Aug 2023
An Overview Of Temporal Commonsense Reasoning and Acquisition
An Overview Of Temporal Commonsense Reasoning and Acquisition
Georg Wenzel
Adam Jatowt
ReLMLRM
492
13
0
28 Jul 2023
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs
  for Fact-aware Language Modeling
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language ModelingIEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Lin F. Yang
Hongyang Chen
Zhao Li
Xiao Ding
Xindong Wu
KELM
386
166
0
20 Jun 2023
The Two Word Test: A Semantic Benchmark for Large Language Models
The Two Word Test: A Semantic Benchmark for Large Language Models
Nicholas Riccardi
Rutvik H. Desai
ELM
181
6
0
07 Jun 2023
Enhancing In-Context Learning with Answer Feedback for Multi-Span
  Question Answering
Enhancing In-Context Learning with Answer Feedback for Multi-Span Question AnsweringNatural Language Processing and Chinese Computing (NLPCC), 2023
Zixian Huang
Jiaying Zhou
Gengyang Xiao
Gong Cheng
KELM
222
14
0
07 Jun 2023
Faith and Fate: Limits of Transformers on Compositionality
Faith and Fate: Limits of Transformers on CompositionalityNeural Information Processing Systems (NeurIPS), 2023
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLMLRM
704
552
0
29 May 2023
BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering
BUCA: A Binary Classification Approach to Unsupervised Commonsense Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Jie He
U. SimonChiLok
Víctor Gutiérrez-Basulto
Jeff Z. Pan
682
12
0
25 May 2023
Evaluating ChatGPT's Information Extraction Capabilities: An Assessment
  of Performance, Explainability, Calibration, and Faithfulness
Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness
Bo Li
Gexiang Fang
Yang Yang
Quansen Wang
Wei Ye
Wen Zhao
Shikun Zhang
ELMAI4MH
647
210
0
23 Apr 2023
1
Page 1 of 1