AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

International Conference on Learning Representations (ICLR), 2024

Papers citing "AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents"

No papers found