Beyond the Safeguards: Exploring the Security Risks of ChatGPT

13 May 2023

Papers citing "Beyond the Safeguards: Exploring the Security Risks of ChatGPT"

37 / 37 papers shown

LLM in the Middle: A Systematic Review of Threats and Mitigations to Real-World LLM-based Systems

Vitor Hugo Galhardo Moia

Igor Jochem Sanz

Gabriel Antonio Fontes Rebello

Rodrigo Duarte de Meneses

Briland Hitaj

Ulf Lindqvist

238

12 Sep 2025

Two Birds with One Stone: Multi-Task Detection and Attribution of LLM-Generated Text

176

19 Aug 2025

Securing Educational LLMs: A Generalised Taxonomy of Attacks on LLMs and DREAD Risk Assessment

174

12 Aug 2025

AI Ethics and Social Norms: Exploring ChatGPT's Capabilities From What to HowProceedings of the ACM on Human-Computer Interaction (PACMHCI), 2025

393

25 Apr 2025

SOK: Exploring Hallucinations and Security Risks in AI-Assisted Software Development with Insights for LLM Deployment

304

31 Jan 2025

AI Safety in Generative AI Large Language Models: A Survey

Lina Yao

349

06 Jul 2024

The Art of Saying No: Contextual Noncompliance in Language Models

Faeze Brahman

Sachin Kumar

Vidhisha Balachandran

...

Yejin Choi

Hannaneh Hajishirzi

288

02 Jul 2024

A Complete Survey on LLM-based AI Chatbots

Sumit Kumar Dam

Choong Seon Hong

Yu Qiao

Chaoning Zhang

279

124

17 Jun 2024

Is On-Device AI Broken and Exploitable? Assessing the Trust and Ethics in Small Language Models

Kalyan Nakka

Jimmy Dani

Nitesh Saxena

423

08 Jun 2024

Measure-Observe-Remeasure: An Interactive Paradigm for Differentially-Private Exploratory Analysis

236

04 Jun 2024

Towards Trustworthy AI: A Review of Ethical and Robust Large Language Models

398

01 Jun 2024

FreezeAsGuard: Mitigating Illegal Adaptation of Diffusion Models via Selective Tensor Freezing

Kai Huang

Wei Gao

230

24 May 2024

Tagengo: A Multilingual Chat Dataset

P. Devine

143

21 May 2024

Risks of Practicing Large Language Models in Smart Grid: Threat Modeling and Validation

Jiangnan Li

Yingyuan Yang

Jinyuan Stella Sun

351

10 May 2024

Large Language Models for Cyber Security: A Systematic Literature Review

587

106

08 May 2024

SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile

Wei Niu

Md. Musfiqur Rahman Sanim

190

21 Apr 2024

Risk and Response in Large Language Models: Evaluating Key Threat Categories

Bahareh Harandizadeh

A. Salinas

Fred Morstatter

222

22 Mar 2024

On Protecting the Data Privacy of Large Language Models (LLMs): A SurveyInternational Conference on Mathematics and Computing (ICMC), 2024

408

158

08 Mar 2024

Exploring the Potential of Large Language Models for Improving Digital Forensic Investigation Efficiency

Akila Wickramasekara

Frank Breitinger

Mark Scanlon

507

29 Feb 2024

Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction

Yinpeng Dong

247

105

28 Feb 2024

Farsight: Fostering Responsible AI Awareness During AI Application Prototyping

317

23 Feb 2024

Mapping the Ethics of Generative AI: A Comprehensive Scoping Review

Thilo Hagendorff

257

13 Feb 2024

Whispers in the Machine: Confidentiality in Agentic Systems

335

10 Feb 2024

Improving Dialog Safety using Socially Aware Contrastive Learning

Souvik Das

Rohini Srihari

219

01 Feb 2024

The Ethics of Interaction: Mitigating Security Threats in LLMs

281

22 Jan 2024

A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the UglyHigh-Confidence Computing (HC), 2023

591

920

04 Dec 2023

From Chatbots to PhishBots? -- Preventing Phishing scams created using ChatGPT, Google Bard and Claude

Sayak Saha Roy

Poojitha Thota

Krishna Vamsi Naragam

Shirin Nilizadeh

SILM

331

29 Oct 2023

Ask Again, Then Fail: Large Language Models' Vacillations in JudgmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

650

03 Oct 2023

Can LLM-Generated Misinformation Be Detected?International Conference on Learning Representations (ICLR), 2023

Canyu Chen

Kai Shu

DeLMO

782

241

25 Sep 2023

Efficient Avoidance of Vulnerabilities in Auto-completed Smart Contract Code Using Vulnerability-constrained DecodingIEEE International Symposium on Software Reliability Engineering (ISSRE), 2023

176

18 Sep 2023

Distilled GPT for Source Code SummarizationInternational Conference on Automated Software Engineering (ASE), 2023

Chia-Yi Su

Collin McMillan

259

28 Aug 2023

GPTEval: A Survey on Assessments of ChatGPT and GPT-4International Conference on Language Resources and Evaluation (LREC), 2023

185

147

24 Aug 2023

Using Large Language Models for Cybersecurity Capture-The-Flag Challenges and Certification Questions

209

21 Aug 2023

RatGPT: Turning online LLMs into Proxies for Malware Attacks

139

17 Aug 2023

Learning to Prompt in the Classroom to Understand AI Limits: A pilot studyInternational Conference of the Italian Association for Artificial Intelligence (AIxIA), 2023

...

Davinia Hernández Leo

220

04 Jul 2023

On the Detectability of ChatGPT Content: Benchmarking, Methodology, and Evaluation through the Lens of Academic WritingConference on Computer and Communications Security (CCS), 2023

222

07 Jun 2023

From Text to MITRE Techniques: Exploring the Malicious Use of Large Language Models for Generating Cyber Attack Payloads

180

24 May 2023