arXiv:2510.20270
ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases
23 October 2025
Ziqian Zhong, Aditi Raghunathan, Nicholas Carlini
Papers citing
"ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases"
2 / 2 papers shown
Are Your Agents Upward Deceivers?
Dadi Guo, Qingyu Liu, Dongrui Liu, Qihan Ren, Shuai Shao, ..., Z. Chen, Jialing Tao, Yaodong Yang, Jing Shao, Xia Hu
LLMAG
04 Dec 2025
EvilGenie: A Reward Hacking Benchmark
Jonathan Gabor, Jayson Lynch, Jonathan Rosenfeld
ELM
26 Nov 2025