Red Teaming with Mind Reading: White-Box Adversarial Policies Against RL Agents
Versions: v1, v2, v3 (latest)

5 September 2022
Stephen Casper
Taylor Killian
Gabriel Kreiman
Dylan Hadfield-Menell
    AAML

Papers citing "Red Teaming with Mind Reading: White-Box Adversarial Policies Against RL Agents"

1 / 1 papers shown
Black-Box Access is Insufficient for Rigorous AI Audits
Conference on Fairness, Accountability and Transparency (FAccT), 2024
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
25 Jan 2024