Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2410.13919
Cited By
v1
v2 (latest)
LLM Agent Honeypot: Monitoring AI Hacking Agents in the Wild
17 October 2024
Reworr
Dmitrii Volkov
LLMAG
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"LLM Agent Honeypot: Monitoring AI Hacking Agents in the Wild"
5 / 5 papers shown
SoK: Honeypots & LLMs, More Than the Sum of Their Parts?
Robert A. Bridges
Thomas R. Mitchell
Mauricio Muñoz
Ted Henriksson
244
0
0
29 Oct 2025
Winning at All Cost: A Small Environment for Eliciting Specification Gaming Behaviors in Large Language Models
Lars Malmqvist
228
0
0
07 May 2025
Demonstrating specification gaming in reasoning models
Alexander Bondarenko
Denis Volk
Dmitrii Volkov
Jeffrey Ladish
LRM
LLMAG
252
25
0
18 Feb 2025
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents
Neural Information Processing Systems (NeurIPS), 2024
Edoardo Debenedetti
Jie Zhang
Mislav Balunović
Luca Beurer-Kellner
Marc Fischer
Florian Tramèr
LLMAG
AAML
425
84
1
19 Jun 2024
LLM in the Shell: Generative Honeypots
Muris Sladić
Veronica Valeros
C. Catania
Sebastian Garcia
369
46
0
31 Aug 2023
1
Page 1 of 1