Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2510.20270
Cited By

ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases

ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases

23 October 2025

Aditi Raghunathan

Nicholas Carlini

ArXiv (abs)PDF HTML HuggingFace (5 upvotes)Github (1★)

Papers citing "ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases"

2 / 2 papers shown

Are Your Agents Upward Deceivers?

Are Your Agents Upward Deceivers?

...

146

0

0

04 Dec 2025

EvilGenie: A Reward Hacking Benchmark

EvilGenie: A Reward Hacking Benchmark

Jonathan Rosenfeld

123

0

0

26 Nov 2025