Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.04758
Cited By
Measuring Goal-Directedness
6 December 2024
Matt MacDermott
James Fox
Francesco Belardinelli
Tom Everitt
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Measuring Goal-Directedness"
3 / 3 papers shown
Title
Evaluating the Goal-Directedness of Large Language Models
Tom Everitt
Cristina Garbacea
Alexis Bellot
Jonathan G. Richens
Henry Papadatos
Simeon Campos
Rohin Shah
ELM
LM&MA
LM&Ro
LRM
72
0
0
16 Apr 2025
Robust agents learn causal world models
Jonathan G. Richens
Tom Everitt
OOD
119
36
0
16 Feb 2024
Honesty Is the Best Policy: Defining and Mitigating AI Deception
Francis Rhys Ward
Francesco Belardinelli
Francesca Toni
Tom Everitt
110
27
0
03 Dec 2023
1