InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents

5 March 2024

ArXiv (abs) | PDF | HTML | Github (63★)

Papers citing "InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents"

50 / 135 papers shown

LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems

120

02 Dec 2025

Bias Injection Attacks on RAG Databases and Sanitization Defenses

Hao Wu

Prateek Saxena

AAML SILM

320

30 Nov 2025

BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents

634

25 Nov 2025

Z-Space: A Multi-Agent Tool Orchestration Framework for Enterprise-Grade LLM Automation

194

23 Nov 2025

ASTRA: Agentic Steerability and Risk Assessment Framework

22 Nov 2025

MURMUR: Using cross-user chatter to break collaborative language agents in groups

21 Nov 2025

Taxonomy, Evaluation and Exploitation of IPI-Centric LLM Agent Defense Frameworks

313

19 Nov 2025

TAMAS: Benchmarking Adversarial Risks in Multi-Agent LLM Systems

Ishan Kavathekar

Hemang Jain

Ameya Rathod

Ponnurangam Kumaraguru

Tanuja Ganu

LLMAG AAML

340

07 Nov 2025

ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations

120

07 Nov 2025

DRIP: Defending Prompt Injection via Token-wise Representation Editing and Residual Instruction Fusion

354

01 Nov 2025

SIRAJ: Diverse and Efficient Red-Teaming for LLM Agents via Distilled Structured Reasoning

30 Oct 2025

Who Grants the Agent Power? Defending Against Instruction Injection via Task-Centric Access Control

30 Oct 2025

Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges

276

27 Oct 2025

QueryIPI: Query-agnostic Indirect Prompt Injection on Coding Agents

100

27 Oct 2025

Breaking Agent Backbones: Evaluating the Security of Backbone LLMs in AI Agents

184

26 Oct 2025

Soft Instruction De-escalation Defense

128

24 Oct 2025

GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?

196

23 Oct 2025

Defending Against Prompt Injection with DataFilter

241

22 Oct 2025

Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming

21 Oct 2025

Breaking and Fixing Defenses Against Control-Flow Hijacking in Multi-Agent Systems

133

20 Oct 2025

Investigating the Impact of Dark Patterns on LLM-Based Web Agents

127

20 Oct 2025

Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents

...

154

20 Oct 2025

Prompt injections as a tool for preserving identity in GAI image descriptions

Kate Glazko

Jennifer Mankoff

17 Oct 2025

SoK: Taxonomy and Evaluation of Prompt Security in Large Language Models

229

17 Oct 2025

Metacognitive Self-Correction for Multi-Agent System via Prototype-Guided Next-Execution Reconstruction

...

Hossein Nourkhiz Mahjoub

Ehsan Moradi-Pari

Kwonjoon Lee

Tianlong Chen

185

16 Oct 2025

Breaking Guardrails, Facing Walls: Insights on Adversarial AI for Defenders & Researchers

Giacomo Bertollo

Naz Bodemir

Jonah Burgess

14 Oct 2025

MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

174

14 Oct 2025

CommandSans: Securing AI Agents with Surgical Precision Prompt Sanitization

153

09 Oct 2025

A Survey on Agentic Security: Applications, Threats and Defenses

141

07 Oct 2025

RL Is a Hammer and LLMs Are Nails: A Simple Reinforcement Learning Recipe for Strong Prompt Injection

175

06 Oct 2025

ECLipsE-Gen-Local: Efficient Compositional Local Lipschitz Estimates for Deep Neural Networks

Yuezhu Xu

S. Sivaranjani

06 Oct 2025

AgentTypo: Adaptive Typographic Prompt Injection Attacks against Black-box Multimodal Agents

128

05 Oct 2025

Backdoor-Powered Prompt Injection Attacks Nullify Defense Methods

207

04 Oct 2025

Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks

168

01 Oct 2025

STAC: When Innocent Tools Form Dangerous Chains to Jailbreak LLM Agents

112

30 Sep 2025

SecInfer: Preventing Prompt Injection via Inference-time Scaling

429

29 Sep 2025

Takedown: How It's Done in Modern Coding Agent Exploits

173

29 Sep 2025

SafeSearch: Automated Red-Teaming for the Safety of LLM-Based Search Agents

321

28 Sep 2025

ChatInject: Abusing Chat Templates for Prompt Injection in LLM Agents

134

26 Sep 2025

AI Kill Switch for malicious web-based LLM agent

Sechan Lee

Sangdon Park

LLMAG AAML

26 Sep 2025

Generalizability of Large Language Model-Based Agents: A Comprehensive Survey

178

19 Sep 2025

When Your Reviewer is an LLM: Biases, Divergence, and Prompt Injection Risks in Peer Review

123

12 Sep 2025

SafeToolBench: Pioneering a Prospective Benchmark to Evaluating Tool Utilization Safety in LLMs

09 Sep 2025

Transferable Direct Prompt Injection via Activation-Guided MCMC Sampling

136

09 Sep 2025

Network-Level Prompt and Trait Leakage in Local Research Agents

Hyejun Jeong

Mohammadreza Teymoorianfard

A. Kumar

Amir Houmansadr

Eugene Bagdasarian

145

27 Aug 2025

IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents

116

21 Aug 2025

MCPTox: A Benchmark for Tool Poisoning Attack on Real-World MCP Servers

272

19 Aug 2025

LM Agents May Fail to Act on Their Own Risk Knowledge

1.7K

19 Aug 2025

BlindGuard: Safeguarding LLM-based Multi-Agent Systems under Unknown Attacks

190

11 Aug 2025

Provably Secure Retrieval-Augmented Generation

170

01 Aug 2025
