Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.13382
Cited By
Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task
24 October 2022
Kenneth Li
Aspen K. Hopkins
David Bau
Fernanda Viégas
Hanspeter Pfister
Martin Wattenberg
MILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task"
50 / 200 papers shown
Title
Scaling Laws for State Dynamics in Large Language Models
Jacob X Li
Shreyas S Raman
Jessica Wan
Fahad Samman
Jazlyn Lin
14
0
0
20 May 2025
ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models
Matteo Merler
Nicola Dainese
Minttu Alakuijala
Giovanni Bonetta
Pietro Ferrazzi
Yu Tian
Bernardo Magnini
Pekka Marttinen
LM&Ro
VLM
27
0
0
19 May 2025
Reward Inside the Model: A Lightweight Hidden-State Reward Model for LLM's Best-of-N sampling
Jizhou Guo
Zhaomin Wu
Philip S. Yu
12
0
0
18 May 2025
Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis
Akarsh Kumar
Jeff Clune
Joel Lehman
Kenneth O. Stanley
OOD
26
0
0
16 May 2025
Evaluating Large Language Models for Real-World Engineering Tasks
Rene Heesch
Sebastian Eilermann
Alexander Windmann
Alexander Diedrich
Philipp Rosenthal
Oliver Niggemann
ELM
13
0
0
12 May 2025
Improving World Models using Deep Supervision with Linear Probes
Andrii Zahorodnii
28
0
0
04 Apr 2025
Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure
Boshi Wang
Huan Sun
49
2
0
02 Apr 2025
Improving Preference Extraction In LLMs By Identifying Latent Knowledge Through Classifying Probes
Sharan Maiya
Yinhong Liu
Ramit Debnath
Anna Korhonen
46
0
0
22 Mar 2025
MedAgent-Pro: Towards Evidence-based Multi-modal Medical Diagnosis via Reasoning Agentic Workflow
Ziyue Wang
Junde Wu
Linghan Cai
Chang Han Low
Xihong Yang
Qiaxuan Li
Yueming Jin
LRM
70
2
0
21 Mar 2025
Revisiting the Othello World Model Hypothesis
Yifei Yuan
Anders Søgaard
LRM
60
0
0
06 Mar 2025
The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems
Richard Ren
Arunim Agarwal
Mantas Mazeika
Cristina Menghini
Robert Vacareanu
...
Matias Geralnik
Adam Khoja
Dean Lee
Summer Yue
Dan Hendrycks
HILM
ALM
90
0
0
05 Mar 2025
(How) Do Language Models Track State?
Belinda Z. Li
Zifan Carl Guo
Jacob Andreas
LRM
57
2
0
04 Mar 2025
Implicit Search via Discrete Diffusion: A Study on Chess
Jiacheng Ye
Zhenyu Wu
Jiahui Gao
Zhiyong Wu
Xin Jiang
Zhiyu Li
Lingpeng Kong
DiffM
55
3
0
27 Feb 2025
Grandes modelos de lenguaje: de la predicción de palabras a la comprensión?
Carlos Gómez-Rodríguez
SyDa
AILaw
ELM
VLM
115
0
0
25 Feb 2025
Representation Engineering for Large-Language Models: Survey and Research Challenges
Lukasz Bartoszcze
Sarthak Munshi
Bryan Sukidi
Jennifer Yen
Zejia Yang
David Williams-King
Linh Le
Kosi Asuzu
Carsten Maple
102
0
0
24 Feb 2025
Text2World: Benchmarking Large Language Models for Symbolic World Model Generation
Mengkang Hu
Tianxing Chen
Yude Zou
Yuheng Lei
Qiguang Chen
Ming Li
Yao Mu
H. Zhang
Wenqi Shao
Ping Luo
LLMAG
52
2
0
18 Feb 2025
The Representation and Recall of Interwoven Structured Knowledge in LLMs: A Geometric and Layered Analysis
Ge Lei
Samuel J. Cooper
KELM
53
0
0
15 Feb 2025
MET-Bench: Multimodal Entity Tracking for Evaluating the Limitations of Vision-Language and Reasoning Models
Vanya Cohen
Raymond J. Mooney
52
0
0
15 Feb 2025
Emergent Stack Representations in Modeling Counter Languages Using Transformers
Utkarsh Tiwari
Aviral Gupta
Michael Hahn
259
0
0
03 Feb 2025
It's Not Just a Phase: On Investigating Phase Transitions in Deep Learning-based Side-channel Analysis
Sengim Karayalçin
Marina Krček
Stjepan Picek
AAML
82
0
0
01 Feb 2025
An Attempt to Unraveling Token Prediction Refinement and Identifying Essential Layers of Large Language Models
Jaturong Kongmanee
44
1
0
28 Jan 2025
Revisiting Rogers' Paradox in the Context of Human-AI Interaction
Katherine M. Collins
Umang Bhatt
Ilia Sucholutsky
61
1
0
16 Jan 2025
Representation in large language models
Cameron C. Yetman
46
1
0
03 Jan 2025
ICLR: In-Context Learning of Representations
Core Francisco Park
Andrew Lee
Ekdeep Singh Lubana
Yongyi Yang
Maya Okawa
Kento Nishi
Martin Wattenberg
Hidenori Tanaka
AIFin
125
4
0
29 Dec 2024
A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future
Shilin Sun
Wenbin An
Feng Tian
Fang Nan
Qidong Liu
Jing Liu
N. Shah
Ping Chen
104
3
0
18 Dec 2024
Transformers Use Causal World Models in Maze-Solving Tasks
Alex F Spies
William Edwards
Michael Ivanitskiy
Adrians Skapars
Tilman Rauker
Katsumi Inoue
A. Russo
Murray Shanahan
224
1
0
16 Dec 2024
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
Meng Cao
Haoran Tang
Haoze Zhao
Hangyu Guo
Jing Liu
Ge Zhang
Ruyang Liu
Qiang Sun
Ian Reid
Xiaodan Liang
106
2
0
02 Dec 2024
Think-to-Talk or Talk-to-Think? When LLMs Come Up with an Answer in Multi-Step Arithmetic Reasoning
Keito Kudo
Yoichi Aoki
Tatsuki Kuribayashi
Shusaku Sone
Masaya Taniguchi
Ana Brassard
Keisuke Sakaguchi
Kentaro Inui
ReLM
LRM
79
0
0
02 Dec 2024
COLD: Causal reasOning in cLosed Daily activities
Abhinav Joshi
A. Ahmad
Ashutosh Modi
LRM
ReLM
74
1
0
29 Nov 2024
Probing for Consciousness in Machines
Mathis Immertreu
A. Schilling
Andreas K. Maier
P. Krauss
AI4CE
77
1
0
25 Nov 2024
Towards Unifying Interpretability and Control: Evaluation via Intervention
Usha Bhalla
Suraj Srinivas
Asma Ghandeharioun
Himabindu Lakkaraju
47
5
0
07 Nov 2024
LLM Generated Distribution-Based Prediction of US Electoral Results, Part I
Caleb Bradshaw
Caelen Miller
Sean Warnick
46
0
0
05 Nov 2024
The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare
Souren Pashangpour
Goldie Nejat
LM&MA
58
7
0
05 Nov 2024
All or None: Identifiable Linear Properties of Next-token Predictors in Language Modeling
Emanuele Marconato
Sébastien Lachapelle
Sebastian Weichwald
Luigi Gresele
69
3
0
30 Oct 2024
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics
Yaniv Nikankin
Anja Reusch
Aaron Mueller
Yonatan Belinkov
AIFin
LRM
46
25
0
28 Oct 2024
Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
Zhengkai Lin
Z. Fu
Kai Liu
Liang Xie
Binbin Lin
Wenxiao Wang
D. Cai
Yue Wu
Jieping Ye
LRM
30
3
0
24 Oct 2024
Are Large Language Models Ready for Travel Planning?
Ruiping Ren
Xing Yao
Shu Cole
Haining Wang
33
0
0
22 Oct 2024
Chatting with Bots: AI, Speech Acts, and the Edge of Assertion
Iwan Williams
Tim Bayne
41
1
0
22 Oct 2024
Do LLMs "know" internally when they follow instructions?
Juyeon Heo
Christina Heinze-Deml
Oussama Elachqar
Shirley Ren
Udhay Nallasamy
Andy Miller
Kwan Ho Ryan Chan
Jaya Narain
54
6
0
18 Oct 2024
Automatic Mapping of Anatomical Landmarks from Free-Text Using Large Language Models: Insights from Llama-2
Mohamad Abdi
Gerardo Hermosillo Valadez
H. Yerebakan
MedIm
32
0
0
16 Oct 2024
Systems with Switching Causal Relations: A Meta-Causal Perspective
Moritz Willig
Tim Nelson Tobiasch
Florian Peter Busch
Jonas Seng
Devendra Singh Dhami
Kristian Kersting
CML
48
0
0
16 Oct 2024
Analyzing (In)Abilities of SAEs via Formal Languages
Abhinav Menon
Manish Shrivastava
David M. Krueger
Ekdeep Singh Lubana
50
7
0
15 Oct 2024
The Geometry of Concepts: Sparse Autoencoder Feature Structure
Yuxiao Li
Eric J. Michaud
David D. Baek
Joshua Engels
Xiaoqing Sun
Max Tegmark
58
9
0
10 Oct 2024
Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing
Zhuoran Zhang
Yongqian Li
Zijian Kan
Keyuan Cheng
Lijie Hu
Di Wang
KELM
31
5
0
08 Oct 2024
Chain and Causal Attention for Efficient Entity Tracking
Erwan Fagnou
Paul Caillon
Blaise Delattre
Alexandre Allauzen
33
3
0
07 Oct 2024
Organizing Unstructured Image Collections using Natural Language
Mingxuan Liu
Zhun Zhong
Jun Li
Gianni Franchi
Subhankar Roy
Elisa Ricci
VLM
52
3
0
07 Oct 2024
Latent Abstractions in Generative Diffusion Models
Giulio Franzese
Mattia Martini
Giulio Corallo
Paolo Papotti
Pietro Michiardi
DiffM
43
0
0
04 Oct 2024
On Logical Extrapolation for Mazes with Recurrent and Implicit Networks
Brandon Knutson
Amandin Chyba Rabeendran
Michael Ivanitskiy
Jordan Pettyjohn
Cecilia G. Diniz Behn
Samy Wu Fung
Daniel McKenzie
LRM
47
2
0
03 Oct 2024
Meta-Models: An Architecture for Decoding LLM Behaviors Through Interpreted Embeddings and Natural Language
Anthony Costarelli
Mat Allen
Severin Field
27
1
0
03 Oct 2024
Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations
Nick Jiang
Anish Kachinthaya
Suzie Petryk
Yossi Gandelsman
VLM
39
17
0
03 Oct 2024
1
2
3
4
Next