ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.06913
  4. Cited By
RedTeamLLM: an Agentic AI framework for offensive security

RedTeamLLM: an Agentic AI framework for offensive security

11 May 2025
Brian Challita
Pierre Parrend
    LLMAG
ArXiv (abs)PDFHTMLGithub (14★)

Papers citing "RedTeamLLM: an Agentic AI framework for offensive security"

25 / 25 papers shown
Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming
Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming
Strahinja Janjusevic
Anna Baron Garcia
Sohrob Kazerounian
280
1
0
20 Nov 2025
Neuro-Symbolic AI for Cybersecurity: State of the Art, Challenges, and Opportunities
Neuro-Symbolic AI for Cybersecurity: State of the Art, Challenges, and Opportunities
Safayat Bin Hakim
M. Adil
Alvaro Velasquez
Shouhuai Xu
Houbing Herbert Song
AAMLNAI
182
4
0
08 Sep 2025
Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements
Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and ImprovementsUser Modeling, Adaptation, and Personalization (UMAP), 2024
I. Isozaki
Manil Shrestha
Rick Console
Edward Kim
ELM
531
29
0
24 Feb 2025
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration
  Testing
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing
Lajos Muzsai
David Imolai
András Lukács
LLMAG
318
44
0
02 Dec 2024
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific
  Evaluations
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific EvaluationsNeural Information Processing Systems (NeurIPS), 2024
Jia Li
Ge Li
Xuanming Zhang
Yunfei Zhao
Yihong Dong
Zhi Jin
Binhua Li
Fei Huang
Yongbin Li
ALMELM
303
47
0
30 Oct 2024
Countering Autonomous Cyber Threats
Countering Autonomous Cyber Threats
Kade M. Heckel
Adrian Weller
AAML
198
3
0
23 Oct 2024
Security Threats in Agentic AI System
Security Threats in Agentic AI System
Raihan Khan
Sayak Sarkar
Sainik Kumar Mahata
Edwin Jose
374
23
0
16 Oct 2024
CYBERSECEVAL 3: Advancing the Evaluation of Cybersecurity Risks and
  Capabilities in Large Language Models
CYBERSECEVAL 3: Advancing the Evaluation of Cybersecurity Risks and Capabilities in Large Language Models
Shengye Wan
Cyrus Nikolaidis
Daniel Song
David Molnar
James Crnkovich
...
Spencer Whitman
Stephanie Ding
Vlad Ionescu
Yue Li
Joshua Saxe
ELM
382
44
0
02 Aug 2024
LLM Agents can Autonomously Exploit One-day Vulnerabilities
LLM Agents can Autonomously Exploit One-day Vulnerabilities
Richard Fang
R. Bindu
Akul Gupta
Daniel Kang
SILMLLMAG
510
137
0
11 Apr 2024
Breaking Down the Defenses: A Comparative Survey of Attacks on Large
  Language Models
Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models
Arijit Ghosh Chowdhury
Md. Mofijul Islam
Vaibhav Kumar
F. H. Shezan
Vaibhav Kumar
Vinija Jain
Vasu Sharma
AAMLPILM
337
50
0
03 Mar 2024
AutoAttacker: A Large Language Model Guided System to Implement
  Automatic Cyber-attacks
AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks
Jiacen Xu
Jack W. Stokes
Geoff McDonald
Xuesong Bai
David Marshall
Siyue Wang
Adith Swaminathan
Zhou Li
376
130
0
02 Mar 2024
TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation
TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation
Yaoxiang Wang
Zhiyong Wu
Junfeng Yao
Jinsong Su
LLMAG
558
45
0
15 Feb 2024
LLM Agents can Autonomously Hack Websites
LLM Agents can Autonomously Hack Websites
Richard Fang
R. Bindu
Akul Gupta
Qiusi Zhan
Daniel Kang
LLMAG
329
108
0
06 Feb 2024
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Chengshu Li
Jacky Liang
Andy Zeng
Xinyun Chen
Karol Hausman
Dorsa Sadigh
Sergey Levine
Fei-Fei Li
Fei Xia
Brian Ichter
LLMAGLRM
380
154
0
07 Dec 2023
A Survey on Large Language Model (LLM) Security and Privacy: The Good,
  the Bad, and the Ugly
A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the UglyHigh-Confidence Computing (HC), 2023
Yifan Yao
Jinhao Duan
Kaidi Xu
Yuanfang Cai
Eric Sun
Yue Zhang
PILMELM
675
1,127
0
04 Dec 2023
ADaPT: As-Needed Decomposition and Planning with Language Models
ADaPT: As-Needed Decomposition and Planning with Language Models
Archiki Prasad
Alexander Koller
Mareike Hartmann
Peter Clark
Ashish Sabharwal
Mohit Bansal
Tushar Khot
LM&Ro
401
166
0
08 Nov 2023
Decoding the Threat Landscape : ChatGPT, FraudGPT, and WormGPT in Social
  Engineering Attacks
Decoding the Threat Landscape : ChatGPT, FraudGPT, and WormGPT in Social Engineering AttacksInternational Journal of Scientific Research in Computer Science Engineering and Information Technology (JCSEIT), 2023
Polra Victor Falade
AAML
198
54
0
09 Oct 2023
PEARL: Prompting Large Language Models to Plan and Execute Actions Over
  Long Documents
PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long DocumentsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Simeng Sun
Yongxu Liu
Shuohang Wang
Chenguang Zhu
Mohit Iyyer
RALMLRMReLM
248
78
0
23 May 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Tree of Thoughts: Deliberate Problem Solving with Large Language ModelsNeural Information Processing Systems (NeurIPS), 2023
Shunyu Yao
Dian Yu
Jeffrey Zhao
Izhak Shafran
Thomas Griffiths
Yuan Cao
Karthik Narasimhan
LM&RoLRMAI4CE
766
3,713
0
17 May 2023
DarkBERT: A Language Model for the Dark Side of the Internet
DarkBERT: A Language Model for the Dark Side of the InternetAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Youngjin Jin
Eugene Jang
Jian Cui
Jin-Woo Chung
Yongjae Lee
Seung-Eui Shin
221
50
0
15 May 2023
Structured Chain-of-Thought Prompting for Code Generation
Structured Chain-of-Thought Prompting for Code GenerationACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Jia Li
Ge Li
Yongming Li
Zhi Jin
LRM
518
307
0
11 May 2023
Automatic Chain of Thought Prompting in Large Language Models
Automatic Chain of Thought Prompting in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Zhuosheng Zhang
Aston Zhang
Mu Li
Alexander J. Smola
ReLMLRM
660
932
0
07 Oct 2022
ReAct: Synergizing Reasoning and Acting in Language Models
ReAct: Synergizing Reasoning and Acting in Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAGReLMLRM
3.4K
7,139
0
06 Oct 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-Consistency Improves Chain of Thought Reasoning in Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLMBDLLRMAI4CE
3.7K
6,409
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsNeural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&RoLRMAI4CEReLM
2.8K
17,183
0
28 Jan 2022
1
Page 1 of 1