ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.07238
  4. Cited By
Lessons From Red Teaming 100 Generative AI Products

Lessons From Red Teaming 100 Generative AI Products

13 January 2025
Blake Bullwinkel
Amanda Minnich
Shiven Chawla
Gary Lopez
Martin Pouliot
Whitney Maxwell
Joris de Gruyter
Katherine Pratt
Saphir Qi
Nina Chikanov
Roman Lutz
Raja Sekhar Rao Dheekonda
Bolor-Erdene Jagdagdorj
Eugenia Kim
Justin Song
Keegan Hines
Daniel Jones
Giorgio Severi
Richard Lundeen
Sam Vaughan
Victoria Westerhoff
Pete Bryan
Ram Shankar Siva Kumar
Yonatan Zunger
Chang Kawaguchi
Mark Russinovich
    AAMLVLM
ArXiv (abs)PDFHTML

Papers citing "Lessons From Red Teaming 100 Generative AI Products"

14 / 14 papers shown
Title
BlackIce: A Containerized Red Teaming Toolkit for AI Security Testing
BlackIce: A Containerized Red Teaming Toolkit for AI Security Testing
Caelin Kaplan
Alexander Warnecke
Neil Archibald
VLM
100
0
0
13 Oct 2025
A Framework for Rapidly Developing and Deploying Protection Against Large Language Model Attacks
A Framework for Rapidly Developing and Deploying Protection Against Large Language Model Attacks
Adam Swanda
Amy Chang
Alexander Chen
Fraser Burch
Paul Kassianik
Konstantin Berlin
89
0
0
25 Sep 2025
Responsible AI Technical Report
Responsible AI Technical Report
Soonmin Bae
Wanjin Park
Jeongyeop Kim
Yunjin Park
Jungwon Yoon
...
Sujin Kim
Youngchol Kim
Somin Lee
Wonyoung Lee
Minsung Noh
147
0
0
24 Sep 2025
From Firewalls to Frontiers: AI Red-Teaming is a Domain-Specific Evolution of Cyber Red-Teaming
From Firewalls to Frontiers: AI Red-Teaming is a Domain-Specific Evolution of Cyber Red-Teaming
Anusha Sinha
Keltin Grimes
James Lucassen
Michael Feffer
Nathan M. VanHoudnos
Zhiwei Steven Wu
Hoda Heidari
AAML
148
1
0
14 Sep 2025
LLM in the Middle: A Systematic Review of Threats and Mitigations to Real-World LLM-based Systems
LLM in the Middle: A Systematic Review of Threats and Mitigations to Real-World LLM-based Systems
Vitor Hugo Galhardo Moia
Igor Jochem Sanz
Gabriel Antonio Fontes Rebello
Rodrigo Duarte de Meneses
Briland Hitaj
Ulf Lindqvist
225
0
0
12 Sep 2025
The AI Model Risk Catalog: What Developers and Researchers Miss About Real-World AI Harms
The AI Model Risk Catalog: What Developers and Researchers Miss About Real-World AI Harms
Pooja S. B. Rao
S. Šćepanović
Dinesh Babu Jayagopi
Mauro Cherubini
Daniele Quercia
117
1
0
21 Aug 2025
Red Teaming AI Red Teaming
Red Teaming AI Red Teaming
Subhabrata Majumdar
Brian Pendleton
Abhishek Gupta
133
2
0
07 Jul 2025
Jailbreak Distillation: Renewable Safety Benchmarking
Jailbreak Distillation: Renewable Safety Benchmarking
Jingyu Zhang
Ahmed Elgohary
Xiawei Wang
A S M Iftekhar
Ahmed Magooda
Benjamin Van Durme
Daniel Khashabi
Kyle Jackson
AAMLALM
223
0
0
28 May 2025
Disentangling Reasoning and Knowledge in Medical Large Language Models
Disentangling Reasoning and Knowledge in Medical Large Language Models
Rahul Thapa
Qingyang Wu
Kevin Wu
Harrison Zhang
Angela Zhang
...
Joseph Boen
Shriya Reddy
Ben Athiwaratkun
Shuaiwen Leon Song
James Zou
ELMAI4MHLM&MALRM
393
7
0
16 May 2025
Red Teaming Large Language Models for Healthcare
Red Teaming Large Language Models for Healthcare
Vahid Balazadeh
Michael Cooper
David Pellow
Atousa Assadi
Jennifer Bell
...
Babak Taati
Balagopal Unnikrishnan
Iñigo Urteaga
Stephanie Williams
Fahad Razak
LM&MA
273
2
0
01 May 2025
Real-World Gaps in AI Governance Research
Real-World Gaps in AI Governance Research
Ilan Strauss
Isobel Moure
Tim O'Reilly
Sruly Rosenblat
571
3
0
30 Apr 2025
Understanding and Mitigating Risks of Generative AI in Financial Services
Understanding and Mitigating Risks of Generative AI in Financial ServicesConference on Fairness, Accountability and Transparency (FAccT), 2025
Sebastian Gehrmann
Claire Huang
Xian Teng
Sergei Yurovski
Iyanuoluwa Shode
...
Naveen Thomas
John Doucette
David S. Rosenberg
Mark Dredze
David Rabinowitz
SILM
146
4
0
25 Apr 2025
PolyGuard: A Multilingual Safety Moderation Tool for 17 Languages
PolyGuard: A Multilingual Safety Moderation Tool for 17 Languages
Priyanshu Kumar
Devansh Jain
Akhila Yerukola
Liwei Jiang
Himanshu Beniwal
Thomas Hartvigsen
Maarten Sap
324
12
0
06 Apr 2025
Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack
Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack
M. Russinovich
Ahmed Salem
Ronen Eldan
530
190
0
02 Apr 2024
1