2506.09600
Effective Red-Teaming of Policy-Adherent Agents
11 June 2025
Itay Nakash
George Kour
Koren Lazar
Matan Vetzler
Guy Uziel
Ateret Anaby-Tavor
AAML
Papers citing "Effective Red-Teaming of Policy-Adherent Agents"
3 / 3 papers shown
Title
ASTRA: Agentic Steerability and Risk Assessment Framework
Itay Hazan
Yael Mathov
Guy Shtar
Ron Bitton
Itsik Mantin
84
0
0
22 Nov 2025
Don't Pass@k: A Bayesian Framework for Large Language Model Evaluation
Mohsen Hariri
Amirhossein Samandar
Michael Hinczewski
Vipin Chaudhary
ALM
321
0
0
05 Oct 2025
Towards Enforcing Company Policy Adherence in Agentic Workflows
Naama Zwerdling
David Boaz
Ella Rabinovich
Guy Uziel
David Amid
Ateret Anaby-Tavor
151
0
0
22 Jul 2025