ResearchTrend.AI
Effective Red-Teaming of Policy-Adherent Agents

11 June 2025
Itay Nakash
George Kour
Koren Lazar
Matan Vetzler
Guy Uziel
Ateret Anaby-Tavor
    AAML

Papers citing "Effective Red-Teaming of Policy-Adherent Agents"

3 / 3 papers shown
ASTRA: Agentic Steerability and Risk Assessment Framework
Itay Hazan
Yael Mathov
Guy Shtar
Ron Bitton
Itsik Mantin
22 Nov 2025
Don't Pass@k: A Bayesian Framework for Large Language Model Evaluation
Mohsen Hariri
Amirhossein Samandar
Michael Hinczewski
Vipin Chaudhary
ALM
05 Oct 2025
Towards Enforcing Company Policy Adherence in Agentic Workflows
Naama Zwerdling
David Boaz
Ella Rabinovich
Guy Uziel
David Amid
Ateret Anaby-Tavor
22 Jul 2025