Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.13763
Cited By
v1
v2 (latest)
Language Models Are Capable of Metacognitive Monitoring and Control of Their Internal Activations
19 May 2025
Li Ji-An
Hua-Dong Xiong
Robert C. Wilson
Marcelo G. Mattar
M. Benna
Re-assign community
ArXiv (abs)
PDF
HTML
Github
Papers citing
"Language Models Are Capable of Metacognitive Monitoring and Control of Their Internal Activations"
9 / 9 papers shown
Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning
Yang Li
Z. He
Y. Huang
Zhuhanling Xiao
Chao Yu
Meng Fang
Kun Shao
Jun Wang
LRM
VLM
156
0
0
28 Nov 2025
Monitor-Generate-Verify (MGV): Formalising Metacognitive Theory for Language Model Reasoning
Nick Oh
Fernand Gobet
LRM
198
1
0
06 Nov 2025
Automatic Minds: Cognitive Parallels Between Hypnotic States and Large Language Model Processing
Giuseppe Riva
B. Wiederhold
F. Mantovani
137
0
0
03 Nov 2025
Before you <think>, monitor: Implementing Flavell's metacognitive framework in LLMs
Nick Oh
LRM
134
0
0
18 Oct 2025
Learning to Interpret Weight Differences in Language Models
Avichal Goel
Yoon Kim
Nir Shavit
T. T. Wang
221
1
0
06 Oct 2025
Evidence for Limited Metacognition in LLMs
Christopher Ackerman
LRM
294
1
0
25 Sep 2025
An Approach to Technical AGI Safety and Security
Rohin Shah
Alex Irpan
Alexander Matt Turner
Anna Wang
Arthur Conmy
...
Shane Legg
Noah D. Goodman
Allan Dafoe
Four Flynn
Anca Dragan
351
31
0
02 Apr 2025
Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation
Bowen Baker
Joost Huizinga
Leo Gao
Zehao Dou
M. Guan
Aleksander Mądry
Wojciech Zaremba
J. Pachocki
David Farhi
LRM
440
126
0
14 Mar 2025
Internal Activation Revision: Safeguarding Vision Language Models Without Parameter Update
AAAI Conference on Artificial Intelligence (AAAI), 2025
Qing Li
Fauzan Farooqui
Zongxiong Chen
Kun Song
Lei Ma
Fakhri Karray
KELM
LLMSV
221
12
0
24 Jan 2025
1