v1v2v3 (latest)

ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models

International Conference on Language Resources and Evaluation (LREC), 2023

29 March 2023

Xianpei Han

Yaojie Lu

Papers citing "ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models"

29 / 29 papers shown

It Depends: Resolving Referential Ambiguity in Minimal Contexts with Commonsense Knowledge

Lukas Ellinger

Georg Groh

146

19 Sep 2025

BF-Max: an Efficient Bit Flipping Decoder with Predictable Decoding Failure RateInternational Symposium on Information Theory (ISIT), 2025

426

11 Jun 2025

Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching TasksAdvances in Artificial Intelligence and Machine Learning (AAIML), 2025

285

24 Apr 2025

A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and EthicsInformation Fusion (Inf. Fusion), 2023

882

293

28 Jan 2025

NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual UpdatesNeural Information Processing Systems (NeurIPS), 2024

326

28 Oct 2024

On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions

...

523

16 Jun 2024

MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset

Weiqi Wang

Yangqiu Song

LRM

462

04 Jun 2024

Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods

LLMAG KELM OffRL LM&Ro

471

181

30 Mar 2024

Case-Based or Rule-Based: How Do Transformers Do the Math?

510

27 Feb 2024

KnowTuning: Knowledge-aware Fine-tuning for Large Language Models

Maarten de Rijke

280

17 Feb 2024

Democratizing Fine-grained Visual Recognition with Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024

444

24 Jan 2024

Exploring the Capabilities of ChatGPT in Ancient Chinese Translation and Person Name Recognition

288

23 Dec 2023

Receive, Reason, and React: Drive as You Say with Large Language Models in Autonomous VehiclesIEEE Intelligent Transportation Systems Magazine (ITS), 2023

Wenqian Ye

280

135

12 Oct 2023

GeoLLM: Extracting Geospatial Knowledge from Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023

459

102

10 Oct 2023

A New Dialogue Response Generation Agent for Large Language Models by Asking Questions to Detect User's Intentions

Siwei Wu

Xiangqing Shen

Rui Xia

197

05 Oct 2023

LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving

Li Chen

Ping Luo

Shengbo Eben Li

Masayoshi Tomizuka

Wei Zhan

Mingyu Ding

808

235

04 Oct 2023

An In-depth Survey of Large Language Model-based Artificial Intelligence Agents

260

23 Sep 2023

LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking PuzzlesInternational Conference on Language Resources and Evaluation (LREC), 2023

354

21 Aug 2023

How susceptible are LLMs to Logical Fallacies?International Conference on Language Resources and Evaluation (LREC), 2023

Xuesu Xiao

207

18 Aug 2023

From Military to Healthcare: Adopting and Expanding Ethical Principles for Generative Artificial Intelligence

221

04 Aug 2023

ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on Class-level Code Generation

Xin Peng

254

216

03 Aug 2023

Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?SIGKDD Explorations (SIGKDD Explor.), 2023

Amrita Bhattacharjee

Huang Liu

DeLMO

338

02 Aug 2023

An Overview Of Temporal Commonsense Reasoning and Acquisition

Georg Wenzel

Adam Jatowt

ReLM LRM

492

28 Jul 2023

Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language ModelingIEEE Transactions on Knowledge and Data Engineering (TKDE), 2023

386

166

20 Jun 2023

The Two Word Test: A Semantic Benchmark for Large Language Models

Nicholas Riccardi

Rutvik H. Desai

ELM

181

07 Jun 2023

Enhancing In-Context Learning with Answer Feedback for Multi-Span Question AnsweringNatural Language Processing and Chinese Computing (NLPCC), 2023

222

07 Jun 2023

Faith and Fate: Limits of Transformers on CompositionalityNeural Information Processing Systems (NeurIPS), 2023

Xiang Lorraine Li

...

Xiang Ren

Yejin Choi

704

552

29 May 2023

BUCA: A Binary Classification Approach to Unsupervised Commonsense Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Jie He

U. SimonChiLok

Víctor Gutiérrez-Basulto

Jeff Z. Pan

682

25 May 2023

Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

647

210

23 Apr 2023