ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.03494
  4. Cited By
A Categorical Archive of ChatGPT Failures
v1v2v3v4v5v6v7v8 (latest)

A Categorical Archive of ChatGPT Failures

6 February 2023
Ali Borji
    ELM
ArXiv (abs)PDFHTML

Papers citing "A Categorical Archive of ChatGPT Failures"

50 / 160 papers shown
Bridging Symbolic Control and Neural Reasoning in LLM Agents: Structured Cognitive Loop with a Governance Layer
Bridging Symbolic Control and Neural Reasoning in LLM Agents: Structured Cognitive Loop with a Governance Layer
Myung Ho Kim
188
0
0
21 Nov 2025
Vibe Learning: Education in the age of AI
Vibe Learning: Education in the age of AI
Marcos Florencio
Francielle Prieto
151
0
0
03 Nov 2025
From Superficial Outputs to Superficial Learning: Risks of Large Language Models in Education
From Superficial Outputs to Superficial Learning: Risks of Large Language Models in Education
Iris Delikoura
Yi.R
Fung
AI4Ed
488
5
0
26 Sep 2025
A perishable ability? The future of writing in the face of generative artificial intelligence
A perishable ability? The future of writing in the face of generative artificial intelligence
Evandro L. T. P. Cunha
DeLMO
199
0
0
26 Aug 2025
Sword and Shield: Uses and Strategies of LLMs in Navigating Disinformation
Sword and Shield: Uses and Strategies of LLMs in Navigating Disinformation
Gionnieve Lim
Bryan Chen Zhengyu Tan
Kellie Yu Hui Sim
Weiyan Shi
Ming Hui Chew
Ming Shan Hee
Roy Ka-wei Lee
S. Perrault
K. T. W. Choo
290
2
0
08 Jun 2025
Analysis of LLM Bias (Chinese Propaganda & Anti-US Sentiment) in DeepSeek-R1 vs. ChatGPT o3-mini-high
Analysis of LLM Bias (Chinese Propaganda & Anti-US Sentiment) in DeepSeek-R1 vs. ChatGPT o3-mini-high
PeiHsuan Huang
ZihWei Lin
Simon Imbot
WenCheng Fu
Ethan Tu
262
3
0
02 Jun 2025
Calibrating LLMs for Text-to-SQL Parsing by Leveraging Sub-clause Frequencies
Calibrating LLMs for Text-to-SQL Parsing by Leveraging Sub-clause Frequencies
Terrance Liu
Shuyi Wang
Daniel Preotiuc-Pietro
Yash Chandarana
Chirag Gupta
367
3
0
27 May 2025
UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking
UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking
Sarfraz Ahmad
Hasan Iqbal
Momina Ahsan
Numaan Naeem
Muhammad Ahsan Riaz Khan
Arham Riaz
Muhammad Arslan Manzoor
Yuxia Wang
Preslav Nakov
HILMELM
695
2
0
21 May 2025
Reliable Collaborative Conversational Agent System Based on LLMs and Answer Set Programming
Reliable Collaborative Conversational Agent System Based on LLMs and Answer Set Programming
Yankai Zeng
Gopal Gupta
192
0
0
09 May 2025
Secure Coding with AI -- From Detection to Repair
Secure Coding with AI -- From Detection to Repair
Vladislav Belozerov
Peter J. Barclay
Ashkan Sami
241
1
0
29 Apr 2025
Information Retrieval in the Age of Generative AI: The RGB Model
Information Retrieval in the Age of Generative AI: The RGB ModelAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
M. Garetto
Alessandro Cornacchia
Franco Galante
Emilio Leonardi
A. Nordio
A. Tarable
904
3
0
29 Apr 2025
Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification
Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification
Jaime E. Cuellar
Oscar Moreno-Martinez
Paula Sofia Torres-Rodriguez
Jaime Andres Pavlich-Mariscal
Andres Felipe Mican-Castiblanco
Juan Guillermo Torres-Hurtado
256
0
0
16 Apr 2025
Increasing the Robustness of the Fine-tuned Multilingual Machine-Generated Text Detectors
Increasing the Robustness of the Fine-tuned Multilingual Machine-Generated Text Detectors
Dominik Macko
Robert Moro
Ivan Srba
DeLMO
422
8
0
19 Mar 2025
BAMBI: Developing Baby Language Models for Italian
BAMBI: Developing Baby Language Models for Italian
Alice Suozzi
Luca Capone
Gianluca E. Lebani
Alessandro Lenci
317
4
0
12 Mar 2025
Can AI-Generated Text be Reliably Detected?
Can AI-Generated Text be Reliably Detected?
Vinu Sankar Sadasivan
Aounon Kumar
S. Balasubramanian
Wenxiao Wang
Soheil Feizi
DeLMO
1.1K
532
0
20 Jan 2025
Evaluation of LLM Vulnerabilities to Being Misused for Personalized Disinformation Generation
Evaluation of LLM Vulnerabilities to Being Misused for Personalized Disinformation GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Aneta Zugecova
Dominik Macko
Ivan Srba
Robert Moro
Jakub Kopal
Katarina Marcincinova
Matus Mesarcik
467
18
0
18 Dec 2024
Navigating the Unknown: A Chat-Based Collaborative Interface for
  Personalized Exploratory Tasks
Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory TasksInternational Conference on Intelligent User Interfaces (IUI), 2024
Yingzhe Peng
Xiaoting Qin
Zhiyang Zhang
Jue Zhang
Qingwei Lin
Xu Yang
Dongmei Zhang
Saravan Rajmohan
Qi Zhang
197
11
0
31 Oct 2024
Stars, Stripes, and Silicon: Unravelling the ChatGPT's All-American,
  Monochrome, Cis-centric Bias
Stars, Stripes, and Silicon: Unravelling the ChatGPT's All-American, Monochrome, Cis-centric Bias
Federico Torrielli
309
3
0
02 Oct 2024
Constructive Apraxia: An Unexpected Limit of Instructible
  Vision-Language Models and Analog for Human Cognitive Disorders
Constructive Apraxia: An Unexpected Limit of Instructible Vision-Language Models and Analog for Human Cognitive Disorders
David Noever
S. M. Noever
219
0
0
17 Sep 2024
Promises and challenges of generative artificial intelligence for human
  learning
Promises and challenges of generative artificial intelligence for human learningNature Human Behaviour (Nat Hum Behav), 2024
Lixiang Yan
Samuel Greiff
Ziwen Teuber
Dragan Gašević
505
0
0
22 Aug 2024
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering
  LLM Weaknesses
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Yulong Chen
Yang Liu
Jianhao Yan
X. Bai
Ming Zhong
Yinghao Yang
Ziyi Yang
Chenguang Zhu
Yue Zhang
ALMELM
233
20
0
16 Aug 2024
A Study on Bias Detection and Classification in Natural Language
  Processing
A Study on Bias Detection and Classification in Natural Language Processing
Ana Sofia Evans
Helena Moniz
Luísa Coheur
225
1
0
14 Aug 2024
Interactive embodied evolution for socially adept Artificial General
  Creatures
Interactive embodied evolution for socially adept Artificial General Creatures
Kevin Godin-Dubois
Olivier Weissl
Karine Miras
Anna V. Kononova
164
0
0
31 Jul 2024
Are LLMs Good Annotators for Discourse-level Event Relation Extraction?
Are LLMs Good Annotators for Discourse-level Event Relation Extraction?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Kangda Wei
Aayush Gautam
Ruihong Huang
571
11
0
28 Jul 2024
Unipa-GPT: Large Language Models for university-oriented QA in Italian
Unipa-GPT: Large Language Models for university-oriented QA in Italian
Irene Siragusa
Roberto Pirrone
298
1
0
19 Jul 2024
Auditing of AI: Legal, Ethical and Technical Approaches
Auditing of AI: Legal, Ethical and Technical Approaches
Jakob Mokander
337
84
0
07 Jul 2024
Hallucination Detection: Robustly Discerning Reliable Answers in Large
  Language Models
Hallucination Detection: Robustly Discerning Reliable Answers in Large Language Models
Yuyan Chen
Qiang Fu
Yichen Yuan
Zhihao Wen
Ge Fan
Dayiheng Liu
Dongmei Zhang
Zhixu Li
Yanghua Xiao
HILM
260
131
0
04 Jul 2024
Satyrn: A Platform for Analytics Augmented Generation
Satyrn: A Platform for Analytics Augmented Generation
Marko Sterbentz
Cameron Barrie
Shubham Shahi
Abhratanu Dutta
Donna Hooshmand
Harper Pack
Kristian J. Hammond
289
2
0
17 Jun 2024
A Complete Survey on LLM-based AI Chatbots
A Complete Survey on LLM-based AI Chatbots
Sumit Kumar Dam
Choong Seon Hong
Yu Qiao
Chaoning Zhang
319
146
0
17 Jun 2024
GPT-ology, Computational Models, Silicon Sampling: How should we think
  about LLMs in Cognitive Science?
GPT-ology, Computational Models, Silicon Sampling: How should we think about LLMs in Cognitive Science?
Desmond C. Ong
330
6
0
13 Jun 2024
A Reality check of the benefits of LLM in business
A Reality check of the benefits of LLM in business
Ming Cheung
353
11
0
09 Jun 2024
Reinterpreting 'the Company a Word Keeps': Towards Explainable and
  Ontologically Grounded Language Models
Reinterpreting 'the Company a Word Keeps': Towards Explainable and Ontologically Grounded Language Models
Walid S. Saba
140
1
0
06 Jun 2024
On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots
On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots
Christine Herlihy
Jennifer Neville
Tobias Schnabel
Adith Swaminathan
438
12
0
01 Jun 2024
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective
  Rationales
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Tianyang Xu
Shujin Wu
Shizhe Diao
Xiaoze Liu
Xingyao Wang
Yangyi Chen
Jing Gao
LRM
434
92
0
31 May 2024
Unlearning Climate Misinformation in Large Language Models
Unlearning Climate Misinformation in Large Language Models
Michael Fore
Simranjit Singh
Chaehong Lee
Amritanshu Pandey
Antonios Anastasopoulos
Dimitrios Stamoulis
MU
353
7
0
29 May 2024
Evaluating and Modeling Social Intelligence: A Comparative Study of
  Human and AI Capabilities
Evaluating and Modeling Social Intelligence: A Comparative Study of Human and AI Capabilities
Junqi Wang
Chunhui Zhang
Jiapeng Li
Yuxi Ma
Lixing Niu
Jiaheng Han
Yujia Peng
Yixin Zhu
Lifeng Fan
ELMALM
249
10
0
20 May 2024
OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs
OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs
Yuxia Wang
Minghan Wang
Hasan Iqbal
Georgi Georgiev
Fauzan Farooqui
Preslav Nakov
HILM
523
39
0
09 May 2024
Attributions toward Artificial Agents in a modified Moral Turing Test
Attributions toward Artificial Agents in a modified Moral Turing TestScientific Reports (Sci Rep), 2024
Eyal Aharoni
Sharlene Fernandes
Daniel J Brady
Caelan Alexander
Michael Criner
Kara Queen
Javier Rando
Eddy Nahmias
Victor Crespo
ELM
313
39
0
03 Apr 2024
HILL: A Hallucination Identifier for Large Language Models
HILL: A Hallucination Identifier for Large Language ModelsInternational Conference on Human Factors in Computing Systems (CHI), 2024
Florian Leiser
S. Eckhardt
Valentin Leuthe
Merlin Knaeble
Alexander Maedche
Gerhard Schwabe
Ali Sunyaev
HILM
260
45
0
11 Mar 2024
Enhancing Instructional Quality: Leveraging Computer-Assisted Textual
  Analysis to Generate In-Depth Insights from Educational Artifacts
Enhancing Instructional Quality: Leveraging Computer-Assisted Textual Analysis to Generate In-Depth Insights from Educational Artifacts
Zewei Tian
Min Sun
Alex Liu
Shawon Sarkar
Jing Liu
232
8
0
06 Mar 2024
Should We Fear Large Language Models? A Structural Analysis of the Human
  Reasoning System for Elucidating LLM Capabilities and Risks Through the Lens
  of Heidegger's Philosophy
Should We Fear Large Language Models? A Structural Analysis of the Human Reasoning System for Elucidating LLM Capabilities and Risks Through the Lens of Heidegger's Philosophy
Jianqiiu Zhang
ELM
229
4
0
05 Mar 2024
RAGged Edges: The Double-Edged Sword of Retrieval-Augmented Chatbots
RAGged Edges: The Double-Edged Sword of Retrieval-Augmented Chatbots
Philip G. Feldman
James R. Foulds
Shimei Pan
SILM
308
18
0
02 Mar 2024
What Generative Artificial Intelligence Means for Terminological
  Definitions
What Generative Artificial Intelligence Means for Terminological Definitions
Antonio San Martín
339
5
0
25 Feb 2024
Exploring ChatGPT and its Impact on Society
Exploring ChatGPT and its Impact on Society
Md. Asraful Haque
Shuai Li
SILM
350
57
0
21 Feb 2024
Mapping the Ethics of Generative AI: A Comprehensive Scoping Review
Mapping the Ethics of Generative AI: A Comprehensive Scoping Review
Thilo Hagendorff
284
97
0
13 Feb 2024
Why and When LLM-Based Assistants Can Go Wrong: Investigating the
  Effectiveness of Prompt-Based Interactions for Software Help-Seeking
Why and When LLM-Based Assistants Can Go Wrong: Investigating the Effectiveness of Prompt-Based Interactions for Software Help-Seeking
Anjali Khurana
Hariharan Subramonyam
Parmit K. Chilana
229
69
0
12 Feb 2024
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in
  Closed-Source LLMs
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
Simone Balloccu
Patrícia Schmidtová
Mateusz Lango
Ondrej Dusek
SILMELMPILM
527
298
0
06 Feb 2024
APT-Pipe: A Prompt-Tuning Tool for Social Data Annotation using ChatGPT
APT-Pipe: A Prompt-Tuning Tool for Social Data Annotation using ChatGPTThe Web Conference (WWW), 2024
Yiming Zhu
Zhizhuo Yin
Gareth Tyson
Ehsan-ul Haq
Lik-Hang Lee
Pan Hui
ALM
444
16
0
24 Jan 2024
ChatGPT in the classroom. Exploring its potential and limitations in a
  Functional Programming course
ChatGPT in the classroom. Exploring its potential and limitations in a Functional Programming courseInternational journal of human computer interactions (IJHCI), 2023
Dan-Matei Popovici
184
60
0
20 Jan 2024
Stability Analysis of ChatGPT-based Sentiment Analysis in AI Quality
  Assurance
Stability Analysis of ChatGPT-based Sentiment Analysis in AI Quality Assurance
Tinghui Ouyang
AprilPyone Maungmaung
Koichi Konishi
Yoshiki Seo
Isao Echizen
AI4MH
222
16
0
15 Jan 2024
1234
Next
Page 1 of 4