v1v2v3v4v5v6v7v8 (latest)

A Categorical Archive of ChatGPT Failures

6 February 2023

Ali Borji

ELM

ArXiv (abs)PDF HTML

Papers citing "A Categorical Archive of ChatGPT Failures"

50 / 160 papers shown

Bridging Symbolic Control and Neural Reasoning in LLM Agents: Structured Cognitive Loop with a Governance Layer

Myung Ho Kim

188

21 Nov 2025

Vibe Learning: Education in the age of AI

Marcos Florencio

Francielle Prieto

151

03 Nov 2025

From Superficial Outputs to Superficial Learning: Risks of Large Language Models in Education

488

26 Sep 2025

A perishable ability? The future of writing in the face of generative artificial intelligence

Evandro L. T. P. Cunha

DeLMO

199

26 Aug 2025

Sword and Shield: Uses and Strategies of LLMs in Navigating Disinformation

Gionnieve Lim

Bryan Chen Zhengyu Tan

290

08 Jun 2025

Analysis of LLM Bias (Chinese Propaganda & Anti-US Sentiment) in DeepSeek-R1 vs. ChatGPT o3-mini-high

262

02 Jun 2025

Calibrating LLMs for Text-to-SQL Parsing by Leveraging Sub-clause Frequencies

Terrance Liu

Shuyi Wang

Daniel Preotiuc-Pietro

Yash Chandarana

Chirag Gupta

367

27 May 2025

UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking

Muhammad Ahsan Riaz Khan

Arham Riaz

Muhammad Arslan Manzoor

Yuxia Wang

Preslav Nakov

HILM ELM

695

21 May 2025

Reliable Collaborative Conversational Agent System Based on LLMs and Answer Set Programming

Yankai Zeng

Gopal Gupta

192

09 May 2025

Secure Coding with AI -- From Detection to Repair

Vladislav Belozerov

Peter J. Barclay

Ashkan Sami

241

29 Apr 2025

Information Retrieval in the Age of Generative AI: The RGB ModelAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

M. Garetto

Alessandro Cornacchia

904

29 Apr 2025

Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification

Jaime E. Cuellar

Oscar Moreno-Martinez

Paula Sofia Torres-Rodriguez

Jaime Andres Pavlich-Mariscal

Andres Felipe Mican-Castiblanco

Juan Guillermo Torres-Hurtado

256

16 Apr 2025

Increasing the Robustness of the Fine-tuned Multilingual Machine-Generated Text Detectors

422

19 Mar 2025

BAMBI: Developing Baby Language Models for Italian

317

12 Mar 2025

Can AI-Generated Text be Reliably Detected?

Vinu Sankar Sadasivan

1.1K

532

20 Jan 2025

Evaluation of LLM Vulnerabilities to Being Misused for Personalized Disinformation GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Katarina Marcincinova

Matus Mesarcik

467

18 Dec 2024

Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory TasksInternational Conference on Intelligent User Interfaces (IUI), 2024

Jue Zhang

Qi Zhang

197

31 Oct 2024

Stars, Stripes, and Silicon: Unravelling the ChatGPT's All-American, Monochrome, Cis-centric Bias

Federico Torrielli

309

02 Oct 2024

Constructive Apraxia: An Unexpected Limit of Instructible Vision-Language Models and Analog for Human Cognitive Disorders

David Noever

S. M. Noever

219

17 Sep 2024

Promises and challenges of generative artificial intelligence for human learningNature Human Behaviour (Nat Hum Behav), 2024

Lixiang Yan

Samuel Greiff

Ziwen Teuber

Dragan Gašević

505

22 Aug 2024

See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses

Yulong Chen

Yang Liu

Jianhao Yan

X. Bai

Ming Zhong

Yinghao Yang

Ziyi Yang

Chenguang Zhu

Yue Zhang

ALM ELM

233

16 Aug 2024

A Study on Bias Detection and Classification in Natural Language Processing

Ana Sofia Evans

Helena Moniz

Luísa Coheur

225

14 Aug 2024

Interactive embodied evolution for socially adept Artificial General Creatures

164

31 Jul 2024

Are LLMs Good Annotators for Discourse-level Event Relation Extraction?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Kangda Wei

Aayush Gautam

Ruihong Huang

571

28 Jul 2024

Unipa-GPT: Large Language Models for university-oriented QA in Italian

Irene Siragusa

Roberto Pirrone

298

19 Jul 2024

Auditing of AI: Legal, Ethical and Technical Approaches

Jakob Mokander

337

07 Jul 2024

Hallucination Detection: Robustly Discerning Reliable Answers in Large Language Models

Yanghua Xiao

260

131

04 Jul 2024

Satyrn: A Platform for Analytics Augmented Generation

289

17 Jun 2024

A Complete Survey on LLM-based AI Chatbots

Sumit Kumar Dam

Choong Seon Hong

Yu Qiao

Chaoning Zhang

319

146

17 Jun 2024

GPT-ology, Computational Models, Silicon Sampling: How should we think about LLMs in Cognitive Science?

Desmond C. Ong

330

13 Jun 2024

A Reality check of the benefits of LLM in business

Ming Cheung

353

09 Jun 2024

Reinterpreting 'the Company a Word Keeps': Towards Explainable and Ontologically Grounded Language Models

Walid S. Saba

140

06 Jun 2024

On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots

438

01 Jun 2024

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

Jing Gao

434

31 May 2024

Unlearning Climate Misinformation in Large Language Models

Antonios Anastasopoulos

Dimitrios Stamoulis

353

29 May 2024

Evaluating and Modeling Social Intelligence: A Comparative Study of Human and AI Capabilities

249

20 May 2024

OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs

523

09 May 2024

Attributions toward Artificial Agents in a modified Moral Turing TestScientific Reports (Sci Rep), 2024

313

03 Apr 2024

HILL: A Hallucination Identifier for Large Language ModelsInternational Conference on Human Factors in Computing Systems (CHI), 2024

260

11 Mar 2024

Enhancing Instructional Quality: Leveraging Computer-Assisted Textual Analysis to Generate In-Depth Insights from Educational Artifacts

232

06 Mar 2024

Should We Fear Large Language Models? A Structural Analysis of the Human Reasoning System for Elucidating LLM Capabilities and Risks Through the Lens of Heidegger's Philosophy

Jianqiiu Zhang

ELM

229

05 Mar 2024

RAGged Edges: The Double-Edged Sword of Retrieval-Augmented Chatbots

308

02 Mar 2024

What Generative Artificial Intelligence Means for Terminological Definitions

Antonio San Martín

339

25 Feb 2024

Exploring ChatGPT and its Impact on Society

Md. Asraful Haque

Shuai Li

SILM

350

21 Feb 2024

Mapping the Ethics of Generative AI: A Comprehensive Scoping Review

Thilo Hagendorff

284

13 Feb 2024

Why and When LLM-Based Assistants Can Go Wrong: Investigating the Effectiveness of Prompt-Based Interactions for Software Help-Seeking

Anjali Khurana

Hariharan Subramonyam

Parmit K. Chilana

229

12 Feb 2024

Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024

527

298

06 Feb 2024

APT-Pipe: A Prompt-Tuning Tool for Social Data Annotation using ChatGPTThe Web Conference (WWW), 2024

Lik-Hang Lee

444

24 Jan 2024

ChatGPT in the classroom. Exploring its potential and limitations in a Functional Programming courseInternational journal of human computer interactions (IJHCI), 2023

Dan-Matei Popovici

184

20 Jan 2024

Stability Analysis of ChatGPT-based Sentiment Analysis in AI Quality Assurance

Tinghui Ouyang

AprilPyone Maungmaung

Koichi Konishi

Yoshiki Seo

Isao Echizen

AI4MH

222

15 Jan 2024