arXiv:2505.06913
RedTeamLLM: an Agentic AI framework for offensive security
11 May 2025
Brian Challita
Pierre Parrend
LLMAG
Papers citing "RedTeamLLM: an Agentic AI framework for offensive security"
25 / 25 papers shown
Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming
Strahinja Janjusevic
Anna Baron Garcia
Sohrob Kazerounian
280
1
0
20 Nov 2025
Neuro-Symbolic AI for Cybersecurity: State of the Art, Challenges, and Opportunities
Safayat Bin Hakim
M. Adil
Alvaro Velasquez
Shouhuai Xu
Houbing Herbert Song
AAML
NAI
182
4
0
08 Sep 2025
Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements
User Modeling, Adaptation, and Personalization (UMAP), 2024
I. Isozaki
Manil Shrestha
Rick Console
Edward Kim
ELM
531
29
0
24 Feb 2025
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing
Lajos Muzsai
David Imolai
András Lukács
LLMAG
318
44
0
02 Dec 2024
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations
Neural Information Processing Systems (NeurIPS), 2024
Jia Li
Ge Li
Xuanming Zhang
Yunfei Zhao
Yihong Dong
Zhi Jin
Binhua Li
Fei Huang
Yongbin Li
ALM
ELM
303
47
0
30 Oct 2024
Countering Autonomous Cyber Threats
Kade M. Heckel
Adrian Weller
AAML
198
3
0
23 Oct 2024
Security Threats in Agentic AI System
Raihan Khan
Sayak Sarkar
Sainik Kumar Mahata
Edwin Jose
374
23
0
16 Oct 2024
CYBERSECEVAL 3: Advancing the Evaluation of Cybersecurity Risks and Capabilities in Large Language Models
Shengye Wan
Cyrus Nikolaidis
Daniel Song
David Molnar
James Crnkovich
...
Spencer Whitman
Stephanie Ding
Vlad Ionescu
Yue Li
Joshua Saxe
ELM
382
44
0
02 Aug 2024
LLM Agents can Autonomously Exploit One-day Vulnerabilities
Richard Fang
R. Bindu
Akul Gupta
Daniel Kang
SILM
LLMAG
510
137
0
11 Apr 2024
Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models
Arijit Ghosh Chowdhury
Md. Mofijul Islam
Vaibhav Kumar
F. H. Shezan
Vaibhav Kumar
Vinija Jain
Vasu Sharma
AAML
PILM
337
50
0
03 Mar 2024
AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks
Jiacen Xu
Jack W. Stokes
Geoff McDonald
Xuesong Bai
David Marshall
Siyue Wang
Adith Swaminathan
Zhou Li
376
130
0
02 Mar 2024
TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation
Yaoxiang Wang
Zhiyong Wu
Junfeng Yao
Jinsong Su
LLMAG
558
45
0
15 Feb 2024
LLM Agents can Autonomously Hack Websites
Richard Fang
R. Bindu
Akul Gupta
Qiusi Zhan
Daniel Kang
LLMAG
329
108
0
06 Feb 2024
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Chengshu Li
Jacky Liang
Andy Zeng
Xinyun Chen
Karol Hausman
Dorsa Sadigh
Sergey Levine
Fei-Fei Li
Fei Xia
Brian Ichter
LLMAG
LRM
380
154
0
07 Dec 2023
A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the Ugly
High-Confidence Computing (HC), 2023
Yifan Yao
Jinhao Duan
Kaidi Xu
Yuanfang Cai
Eric Sun
Yue Zhang
PILM
ELM
675
1,127
0
04 Dec 2023
ADaPT: As-Needed Decomposition and Planning with Language Models
Archiki Prasad
Alexander Koller
Mareike Hartmann
Peter Clark
Ashish Sabharwal
Mohit Bansal
Tushar Khot
LM&Ro
401
166
0
08 Nov 2023
Decoding the Threat Landscape: ChatGPT, FraudGPT, and WormGPT in Social Engineering Attacks
International Journal of Scientific Research in Computer Science Engineering and Information Technology (JCSEIT), 2023
Polra Victor Falade
AAML
198
54
0
09 Oct 2023
PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Simeng Sun
Yongxu Liu
Shuohang Wang
Chenguang Zhu
Mohit Iyyer
RALM
LRM
ReLM
248
78
0
23 May 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Neural Information Processing Systems (NeurIPS), 2023
Shunyu Yao
Dian Yu
Jeffrey Zhao
Izhak Shafran
Thomas Griffiths
Yuan Cao
Karthik Narasimhan
LM&Ro
LRM
AI4CE
766
3,713
0
17 May 2023
DarkBERT: A Language Model for the Dark Side of the Internet
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Youngjin Jin
Eugene Jang
Jian Cui
Jin-Woo Chung
Yongjae Lee
Seung-Eui Shin
221
50
0
15 May 2023
Structured Chain-of-Thought Prompting for Code Generation
ACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Jia Li
Ge Li
Yongming Li
Zhi Jin
LRM
518
307
0
11 May 2023
Automatic Chain of Thought Prompting in Large Language Models
International Conference on Learning Representations (ICLR), 2022
Zhuosheng Zhang
Aston Zhang
Mu Li
Alexander J. Smola
ReLM
LRM
660
932
0
07 Oct 2022
ReAct: Synergizing Reasoning and Acting in Language Models
International Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
3.4K
7,139
0
06 Oct 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
International Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
3.7K
6,409
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Neural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
2.8K
17,183
0
28 Jan 2022